Backend Engineer – Inference Optimization at Vercel

Seattle, Washington, United States

Vercel Logo
$150,000 – $250,000Compensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
AI, Machine Learning, TechnologyIndustries

Requirements

  • Must have:
  • Deep experience in optimizing model inference pipelines, model quantization and KV caching
  • Proficiency in backend systems and high-performance programming (Python, C++, or Rust)
  • Familiarity with distributed serving, GPU acceleration, and large-scale systems
  • Ability to debug complex performance issues across model, runtime, and hardware layers
  • Comfort working in fast-moving environments with ambitious technical goals
  • Nice to have:
  • Hands-on experience with vLLM or similar inference frameworks
  • Background in GPU kernel optimization (CUDA, Triton, ROCm)
  • Experience scaling inference across multi-node or heterogeneous clusters
  • Prior work in model compilation (e.g., TensorRT, TVM, ONNX Runtime)
  • Hands-on experience with model quantization

Responsibilities

  • Own the design and optimization of inference pipelines for large-scale models
  • Work closely with researchers and infrastructure engineers to identify bottlenecks
  • Implement advanced techniques like quantization and KV caching
  • Deploy high-performance serving systems in production

Skills

Key technologies and capabilities for this role

PythonC++Rustmodel quantizationKV cachingdistributed servingGPU accelerationvLLMCUDATritonROCm

Questions & Answers

Common questions about this position

What is the salary range for the Backend Engineer role?

The salary range is $150K - $250K plus equity.

Is this position remote or onsite?

The position is onsite at the company's Seattle HQ.

What skills are required for this Backend Engineer position?

Must have deep experience in optimizing model inference pipelines, model quantization and KV caching, proficiency in backend systems and high-performance programming (Python, C++, or Rust), familiarity with distributed serving, GPU acceleration, and large-scale systems, ability to debug complex performance issues, and comfort in fast-moving environments.

What is the company culture like?

The team is high-energy, impact-driven, with a track record of academic excellence including best paper awards and highly cited researchers.

What benefits are offered?

Benefits include health benefits, a 401(k) plan, and meaningful equity.

Vercel

Platform for building and deploying web applications

About Vercel

Vercel provides a platform for developers and businesses to build, deploy, and manage modern web applications. Its services include tools that enhance image and video workflows using AI features like smart cropping and object detection. Vercel simplifies the complexities of serverless architecture, allowing for global content delivery without extra infrastructure. The company ensures high security and uptime with features such as automatic HTTPS and DDoS protection. Unlike competitors, Vercel focuses on a managed global rendering layer and offers a subscription-based model tailored to various client needs, from individual developers to large enterprises. The goal of Vercel is to empower developers to create efficient and secure web applications.

San Francisco, CaliforniaHeadquarters
2015Year Founded
$547.6MTotal Funding
SERIES_ECompany Stage
Consumer Software, Enterprise Software, AI & Machine LearningIndustries
501-1,000Employees

Benefits

Health Insurance
Stock Options
Company Equity
Professional Development Budget
Unlimited Paid Time Off
Remote Work Options
Home Office Stipend

Risks

Increased competition in the cloud application platform space threatens Vercel's market share.
Rapid AI evolution may outpace Vercel's current offerings, risking competitive edge loss.
Reliance on a subscription model could be risky during economic downturns.

Differentiation

Vercel offers a managed global rendering layer for modern web applications.
The company provides advanced AI-powered tools for image and video optimization.
Vercel's platform supports full lifecycle media management with auto-tagging and access control.

Upsides

Vercel secured $250 million in Series E funding for growth and platform development.
The introduction of V0 enhances Vercel's offerings in AI-driven web development.
Recognition as a Visionary in Gartner's Magic Quadrant boosts Vercel's market position.

Land your dream remote job 3x faster with AI