AI Inference Engineer - San Francisco at Perplexity AI

San Francisco, California, United States

Perplexity AI Logo
$190,000 – $250,000Compensation
Junior (1 to 2 years)Experience Level
Full TimeJob Type
UnknownVisa
Artificial Intelligence, SoftwareIndustries

Skills

Key technologies and capabilities for this role

PythonTensorFlowPyTorchONNXCUDAGPU architecturesLLM inference optimizationKubernetesC++

Questions & Answers

Common questions about this position

What is the salary range for the AI Inference Engineer position?

The salary range is $190,000 - $250,000, with final offer amounts determined by factors including experience and expertise.

Is this position remote or does it require working in San Francisco?

The position is remote.

What skills are required for this AI Inference Engineer role?

Required skills include experience with ML systems and deep learning frameworks like PyTorch, TensorFlow, ONNX; familiarity with LLM architectures and inference optimizations such as continuous batching and quantization; and understanding of GPU architectures or CUDA programming. The current stack involves Python, C++, TensorRT-LLM, and Kubernetes.

What benefits are offered for this position?

Benefits include comprehensive health, dental, and vision insurance for you and your dependents, plus a 401(k) plan. Equity is also part of the total compensation package in addition to base salary.

What makes a strong candidate for this AI Inference Engineer role?

Strong candidates will have hands-on experience with ML systems, deep learning frameworks, LLM inference optimizations, and GPU programming, along with familiarity with the company's stack including Python, C++, TensorRT-LLM, and Kubernetes.

Perplexity AI

Advanced answer engine providing reliable information

About Perplexity AI

Perplexity AI provides an advanced answer engine that delivers accurate and reliable responses to user queries. The platform uses current sources to ensure the information is both precise and relevant. It caters to a wide audience, including individuals looking for quick answers and businesses needing detailed information. Unlike many competitors, Perplexity AI emphasizes high-quality, source-backed answers, making it a valuable resource for users seeking trustworthy data. The company's goal is to meet the increasing demand for immediate access to reliable information, generating revenue through subscription fees, advertising, and partnerships.

San Francisco, CaliforniaHeadquarters
2022Year Founded
$890MTotal Funding
LATE_VCCompany Stage
Data & Analytics, Consumer SoftwareIndustries
201-500Employees

Benefits

Health Insurance
Dental Insurance
Vision Insurance
401(k) Retirement Plan
Company Equity

Risks

Pending copyright infringement class action poses legal and financial challenges.
Competition from Google's AI Mode could impact user retention and market share.
Otterly.AI's brand visibility tool may pressure Perplexity to maintain high performance.

Differentiation

Perplexity AI integrates large language models with search engines for precise responses.
The platform offers an open-source environment, enhancing public access to AI tools.
Perplexity's strategic acquisition of Carbon boosts its data connectivity capabilities.

Upsides

Partnership with Tripadvisor enhances travel planning with personalized recommendations.
$500M funding round increases valuation to $9 billion, supporting growth and innovation.
Integration with FactSet attracts financial clients with enhanced data accessibility.

Land your dream remote job 3x faster with AI