AI ML Engineer
dv01Full Time
Senior (5 to 8 years)
Key technologies and capabilities for this role
Common questions about this position
Final offer amounts are determined by multiple factors, including experience and expertise. Equity may be part of the total compensation package in addition to the base salary.
This information is not specified in the job description.
Required skills include experience with ML systems and deep learning frameworks like PyTorch, TensorFlow, ONNX; familiarity with LLM architectures and inference optimizations such as continuous batching and quantization; and understanding of GPU architectures or CUDA kernel programming.
This is a growing team working on large-scale deployment of machine learning models for real-time inference, using a stack including Python, Rust, C++, PyTorch, Triton, CUDA, and Kubernetes.
Strong candidates will have experience with ML systems, deep learning frameworks like PyTorch, LLM inference optimizations, and GPU programming with CUDA, as these directly align with the responsibilities of developing APIs, benchmarking, and improving inference systems.
Advanced answer engine providing reliable information
Perplexity AI provides an advanced answer engine that delivers accurate and reliable responses to user queries. The platform uses current sources to ensure the information is both precise and relevant. It caters to a wide audience, including individuals looking for quick answers and businesses needing detailed information. Unlike many competitors, Perplexity AI emphasizes high-quality, source-backed answers, making it a valuable resource for users seeking trustworthy data. The company's goal is to meet the increasing demand for immediate access to reliable information, generating revenue through subscription fees, advertising, and partnerships.