AI Engineer & Researcher, Inference
SpeechifyFull Time
Junior (1 to 2 years)
Key technologies and capabilities for this role
Common questions about this position
The salary range is $325K - $490K.
This information is not specified in the job description.
Required skills include experience building and scaling inference systems for LLMs or multimodal models, working with GPU-based ML workloads especially with images or audio, familiarity with inference tooling like vLLM or TensorRT-LLM, and handling systems spanning networking, distributed compute, and high-throughput data.
The team is small, fast-moving, and focused on delivering world-class developer experience while pushing AI boundaries, partnering closely with Research, and expanding into multimodal inference infrastructure.
Strong candidates have experience with inference systems for LLMs or multimodal models, GPU-based ML workloads with images/audio, enjoy experimental fast-evolving work, collaborate closely with research, own problems end-to-end, and thrive in ambiguous spaces.
Develops safe and beneficial AI technologies
OpenAI develops and deploys artificial intelligence technologies aimed at benefiting humanity. The company creates advanced AI models capable of performing various tasks, such as automating processes and enhancing creativity. OpenAI's products, like Sora, allow users to generate videos from text descriptions, showcasing the versatility of its AI applications. Unlike many competitors, OpenAI operates under a capped profit model, which limits the profits it can make and ensures that excess earnings are redistributed to maximize the social benefits of AI. This commitment to safety and ethical considerations is central to its mission of ensuring that artificial general intelligence (AGI) serves all of humanity.