Staff AI Researcher- Generative AI
AledadeFull Time
Senior (5 to 8 years), Expert & Leadership (9+ years)
Candidates should possess a Master’s degree in Computer Science, Electrical Engineering, or a related field, or equivalent industry experience. Extensive hands-on experience in generative AI inference with a specific focus on speculative decoding is required, along with proficiency in C++. Strong analytical and problem-solving skills, and a proven ability to work effectively in fast-paced, cross-functional environments are also necessary. Familiarity with the architecture of Generative AI models, PyTorch, and AI infrastructure challenges is desired.
The Staff Software Engineer will design, implement, and optimize speculative decoding algorithms and underlying models to enhance the speed and accuracy of Generative AI Inference. They will collaborate with cross-functional teams to integrate solutions into Groq’s production AI infrastructure, working in a multi data center production environment and Kubernetes environment. The role involves developing high-performance, scalable code primarily in C++ and Rust, ensuring efficient resource utilization and system stability, and staying up-to-date with the latest developments in generative AI and speculative decoding. Furthermore, the engineer will provide technical leadership and mentorship to team members, champion code quality, maintainability, observability, monitoring, and best practices, and work closely with teams across software engineering, research, and operations to drive improvements in post training, model evaluation, and overall system performance.
AI inference technology for scalable solutions
Groq specializes in AI inference technology, providing the Groq LPU™, which is known for its high compute speed, quality, and energy efficiency. The Groq LPU™ is designed to handle AI processing tasks quickly and effectively, making it suitable for both cloud and on-premises applications. Unlike many competitors, Groq's products are designed, fabricated, and assembled in North America, which helps maintain high standards of quality and performance. The company targets a variety of clients across different industries that require fast and efficient AI processing capabilities. Groq's goal is to deliver scalable AI inference solutions that meet the growing demands for rapid data processing in the AI and machine learning market.