Research Scientist, Interpretability at Anthropic

San Francisco, California, United States

Apply Now

Not SpecifiedCompensation

Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level

Full TimeJob Type

UnknownVisa

Artificial Intelligence, AI ResearchIndustries

Requirements

Strong track record of scientific research (in any field), and have done some work on Interpretability
Enjoy team science – working collaboratively to make big discoveries
Comfortable with messy experimental science

Responsibilities

Develop methods for understanding LLMs by reverse engineering algorithms learned in their weights
Design and run robust experiments, both quickly in toy scenarios and at scale in large models
Create and analyze new interpretability features and circuits to better understand how models work
Build infrastructure for running experiments and visualizing results
Work with colleagues to communicate results internally and publicly

Skills

Mechanistic Interpretability

Neural Networks

Transformer Circuits

Reverse Engineering

AI Safety

Research

Machine Learning

PyTorch

Scaling Laws

Superposition

Anthropic

Develops reliable and interpretable AI systems

About Anthropic

Anthropic focuses on creating reliable and interpretable AI systems. Its main product, Claude, serves as an AI assistant that can manage tasks for clients across various industries. Claude utilizes advanced techniques in natural language processing, reinforcement learning, and code generation to perform its functions effectively. What sets Anthropic apart from its competitors is its emphasis on making AI systems that are not only powerful but also understandable and controllable by users. The company's goal is to enhance operational efficiency and improve decision-making for its clients through the deployment and licensing of its AI technologies.

San Francisco, CaliforniaHeadquarters

2021Year Founded

$11,482.1MTotal Funding

GROWTH_EQUITY_VCCompany Stage

Enterprise Software, AI & Machine LearningIndustries

1,001-5,000Employees

Benefits

Flexible Work Hours

Paid Vacation

Parental Leave

Hybrid Work Options

Company Equity

Risks

Ongoing lawsuit with Concord Music Group could lead to financial liabilities.

Technological lag behind competitors like OpenAI may impact market position.

Reliance on substantial funding rounds may indicate financial instability.

Differentiation

Anthropic focuses on AI safety, contrasting with competitors' commercial priorities.

Claude, Anthropic's AI assistant, is designed for tasks of any scale.

Partnerships with tech giants like Panasonic and Amazon enhance Anthropic's strategic positioning.

Upsides

Anthropic's $60 billion valuation reflects strong investor confidence and growth potential.

Collaborations like the Umi app with Panasonic tap into the growing wellness AI market.

Focus on AI safety aligns with increasing industry emphasis on ethical AI development.

Land your dream remote job 3x faster with AI

Try Jobo Free