Performance Engineer - GPU at Anthropic

Seattle, Washington, United States

Anthropic Logo
Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Artificial Intelligence, Machine LearningIndustries

Requirements

  • Bachelor's degree in a related field or equivalent experience
  • Deep experience with GPU programming and optimization at scale
  • Track record of delivering transformative GPU performance improvements in production ML systems
  • Ability to navigate complex systems from hardware interfaces to high-level ML frameworks
  • Impact-driven, passionate about delivering measurable performance breakthroughs
  • Enjoy collaborative problem-solving and pair programming
  • Thrive in ambiguous environments where you define the path forward

Responsibilities

  • Architect and implement foundational systems that power large language models
  • Maximize GPU utilization and performance at unprecedented scale
  • Develop cutting-edge optimizations to enable new model capabilities and improve inference efficiency
  • Implement state-of-the-art techniques from custom kernel development to distributed system architectures
  • Work across the stack from low-level tensor core optimizations to orchestrating thousands of GPUs
  • Co-design attention mechanisms and algorithms for next-generation hardware architectures
  • Develop custom kernels for emerging quantization formats and mixed-precision techniques
  • Design distributed communication strategies for multi-node GPU clusters
  • Optimize end-to-end training and inference pipelines for frontier language models
  • Build performance modeling frameworks to predict and optimize GPU utilization
  • Implement kernel fusion strategies to minimize memory bandwidth bottlenecks
  • Create resilient systems for planet-scale distributed training infrastructure
  • Profile and eliminate performance bottlenecks in production serving infrastructure
  • Partner with hardware vendors to influence future accelerator capabilities and software stacks

Skills

Key technologies and capabilities for this role

GPU ProgrammingGPU OptimizationCustom Kernel DevelopmentDistributed SystemsTensor Core OptimizationML InfrastructureHardware Interfaces

Questions & Answers

Common questions about this position

What skills are required for the GPU Performance Engineer role?

Strong candidates need deep experience with GPU programming and optimization at scale, the ability to navigate complex systems from hardware interfaces to high-level ML frameworks, and a track record of delivering transformative GPU performance improvements in production ML systems.

What experience is preferred for this position?

Preferred experience includes GPU Kernel Development with CUDA, Triton, CUTLASS; ML Compilers & Frameworks like PyTorch/JAX internals; Performance Engineering such as kernel fusion and profiling with Nsight; Distributed Systems with NCCL and NVLink; Low-Precision techniques; and Production Systems for large-scale training.

What is the company culture like at Anthropic?

Anthropic has a quickly growing team of committed researchers, engineers, policy experts, and business leaders focused on building safe and beneficial AI, with an emphasis on collaborative problem-solving, pair programming, and thriving in ambiguous environments.

What qualities make a strong candidate for this role?

Strong candidates are impact-driven, passionate about delivering measurable performance breakthroughs, excited to shape the future of AI infrastructure with world-class teams, enjoy collaborative problem-solving, and care about the societal impacts of their work.

Is this a remote position, or is there a location requirement?

This information is not specified in the job description.

Anthropic

Develops reliable and interpretable AI systems

About Anthropic

Anthropic focuses on creating reliable and interpretable AI systems. Its main product, Claude, serves as an AI assistant that can manage tasks for clients across various industries. Claude utilizes advanced techniques in natural language processing, reinforcement learning, and code generation to perform its functions effectively. What sets Anthropic apart from its competitors is its emphasis on making AI systems that are not only powerful but also understandable and controllable by users. The company's goal is to enhance operational efficiency and improve decision-making for its clients through the deployment and licensing of its AI technologies.

San Francisco, CaliforniaHeadquarters
2021Year Founded
$11,482.1MTotal Funding
GROWTH_EQUITY_VCCompany Stage
Enterprise Software, AI & Machine LearningIndustries
1,001-5,000Employees

Benefits

Flexible Work Hours
Paid Vacation
Parental Leave
Hybrid Work Options
Company Equity

Risks

Ongoing lawsuit with Concord Music Group could lead to financial liabilities.
Technological lag behind competitors like OpenAI may impact market position.
Reliance on substantial funding rounds may indicate financial instability.

Differentiation

Anthropic focuses on AI safety, contrasting with competitors' commercial priorities.
Claude, Anthropic's AI assistant, is designed for tasks of any scale.
Partnerships with tech giants like Panasonic and Amazon enhance Anthropic's strategic positioning.

Upsides

Anthropic's $60 billion valuation reflects strong investor confidence and growth potential.
Collaborations like the Umi app with Panasonic tap into the growing wellness AI market.
Focus on AI safety aligns with increasing industry emphasis on ethical AI development.

Land your dream remote job 3x faster with AI