AI Inference Engineer - San Francisco
Perplexity AICandidates should have experience with ML systems and deep learning frameworks such as PyTorch, TensorFlow, or ONNX, familiarity with common LLM architectures and inference optimization techniques, and an understanding of GPU architectures or experience with GPU kernel programming using CUDA.
$190,000 - $250,000/year
Full Time
Junior (1 to 2 years)