Key technologies and capabilities for this role
Common questions about this position
The salary range is $190,000 - $240,000. Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed.
Yes, this is a fully remote role.
Required skills include experience with ML systems and deep learning frameworks (e.g., PyTorch, TensorFlow, ONNX), familiarity with common LLM architectures and inference optimization techniques (e.g., continuous batching, quantization), and experience deploying reliable, distributed, real-time model serving at scale. The stack includes Python, C++, TensorRT-LLM, and Kubernetes. Nice to have: an understanding of GPU architectures or experience with GPU kernel programming in CUDA.
Benefits include equity as part of the total compensation package, comprehensive health, dental, and vision insurance for you and your dependents, and a 401(k) plan.
Strong candidates have hands-on experience with ML systems, deep learning frameworks like PyTorch or TensorFlow, LLM inference optimization, and deploying scalable model serving systems.
Advanced answer engine providing reliable information
Perplexity AI provides an advanced answer engine that delivers accurate and reliable responses to user queries. The platform draws on current sources so that its answers are both precise and relevant. It serves a wide audience, from individuals looking for quick answers to businesses needing detailed information. Unlike many competitors, Perplexity AI emphasizes high-quality, source-backed answers, making it a valuable resource for users seeking trustworthy data. The company aims to meet the increasing demand for immediate access to reliable information, and it generates revenue through subscription fees, advertising, and partnerships.