Anthropic

Machine Learning Systems Engineer, RL Engineering

San Francisco, California, United States

$315,000 – $425,000Compensation
Junior (1 to 2 years)Experience Level
Full TimeJob Type
UnknownVisa
Artificial Intelligence, AI & Machine Learning, Research & DevelopmentIndustries

ML Systems Engineer

Position Overview

  • About Anthropic: Anthropic's mission is to create reliable, interpretable, and steerable AI systems. They aim for AI to be safe and beneficial for users and society. Anthropic is a rapidly growing team of researchers, engineers, policy experts, and business leaders building beneficial AI systems.
  • About the Role: This role involves building the systems that train AI models like Claude. The focus is on implementing and improving advanced machine learning techniques to create more capable, reliable, and steerable AI. The ML Systems Engineer will be responsible for the algorithms and infrastructure researchers depend on to train models, directly enabling breakthroughs in AI capabilities and safety. The primary goal is to improve the performance, robustness, and usability of these systems to accelerate research progress.

Requirements

  • 4+ years of software engineering experience
  • Experience with high-performance, large-scale distributed systems
  • Experience with large-scale LLM training
  • Proficiency in Python
  • Experience implementing LLM finetuning algorithms (e.g., RLHF)
  • A desire to learn more about machine learning research
  • A commitment to the societal impacts of AI work

Responsibilities

  • Profiling the reinforcement learning pipeline to identify opportunities for improvement.
  • Building a system for regularly launching training jobs in a test environment to quickly detect problems.
  • Making changes to finetuning systems to support new model architectures.
  • Building instrumentation to detect and eliminate Python GIL contention in training code.
  • Diagnosing and fixing issues causing training runs to slow down.
  • Implementing a stable and fast version of new training algorithms proposed by researchers.
  • Maintaining and improving the algorithms and systems used by finetuning researchers to train models.
  • Focusing on the speed, reliability, and ease-of-use of these systems.

Compensation

  • Salary: $315,000 - $425,000 USD (Annual)

Application Instructions

  • Deadline to Apply: None. Applications will be reviewed on a rolling basis.

Company Information

  • Anthropic's Mission: To create reliable, interpretable, and steerable AI systems.
  • Team: A quickly growing group of researchers, engineers, policy experts, and business leaders.

Skills

Python
Distributed Systems
Large-Scale LLM Training
Reinforcement Learning
Finetuning Algorithms
System Profiling
Performance Optimization
Training Infrastructure

Anthropic

Develops reliable and interpretable AI systems

About Anthropic

Anthropic focuses on creating reliable and interpretable AI systems. Its main product, Claude, serves as an AI assistant that can manage tasks for clients across various industries. Claude utilizes advanced techniques in natural language processing, reinforcement learning, and code generation to perform its functions effectively. What sets Anthropic apart from its competitors is its emphasis on making AI systems that are not only powerful but also understandable and controllable by users. The company's goal is to enhance operational efficiency and improve decision-making for its clients through the deployment and licensing of its AI technologies.

San Francisco, CaliforniaHeadquarters
2021Year Founded
$11,482.1MTotal Funding
GROWTH_EQUITY_VCCompany Stage
Enterprise Software, AI & Machine LearningIndustries
1,001-5,000Employees

Benefits

Flexible Work Hours
Paid Vacation
Parental Leave
Hybrid Work Options
Company Equity

Risks

Ongoing lawsuit with Concord Music Group could lead to financial liabilities.
Technological lag behind competitors like OpenAI may impact market position.
Reliance on substantial funding rounds may indicate financial instability.

Differentiation

Anthropic focuses on AI safety, contrasting with competitors' commercial priorities.
Claude, Anthropic's AI assistant, is designed for tasks of any scale.
Partnerships with tech giants like Panasonic and Amazon enhance Anthropic's strategic positioning.

Upsides

Anthropic's $60 billion valuation reflects strong investor confidence and growth potential.
Collaborations like the Umi app with Panasonic tap into the growing wellness AI market.
Focus on AI safety aligns with increasing industry emphasis on ethical AI development.

Land your dream remote job 3x faster with AI