AI Engineer & Researcher, Inference
SpeechifyFull Time
Junior (1 to 2 years)
Key technologies and capabilities for this role
Common questions about this position
The base salary range is $175,900 - $307,800.
The position is hybrid.
Proficiency in C++ is essential, with extensive hands-on experience in generative AI inference focused on speculative decoding. Additional requirements include understanding of Generative AI model architecture, PyTorch, distributed systems with MPI and Kubernetes, and strong analytical skills.
Groq fosters a culture of humility, collaboration, and a growth mindset where egos are checked at the door, teams make up the smartest person in the room together, and knowledge is shared generously.
A strong candidate has a Master’s degree in Computer Science or equivalent, extensive experience in generative AI inference with speculative decoding, C++ proficiency, and expertise in distributed systems, AI infrastructure, and providing technical leadership.
AI inference technology for scalable solutions
Groq specializes in AI inference technology, providing the Groq LPU™, which is known for its high compute speed, quality, and energy efficiency. The Groq LPU™ is designed to handle AI processing tasks quickly and effectively, making it suitable for both cloud and on-premises applications. Unlike many competitors, Groq's products are designed, fabricated, and assembled in North America, which helps maintain high standards of quality and performance. The company targets a variety of clients across different industries that require fast and efficient AI processing capabilities. Groq's goal is to deliver scalable AI inference solutions that meet the growing demands for rapid data processing in the AI and machine learning market.