Founding ML Engineer
Red Cell Partners
Full Time
Senior (5 to 8 years)
Common questions about this position
The salary is TBD.
This information is not specified in the job description.
Required skills include 5+ years in machine learning with a focus on model training; experience with transformer-based architectures such as LLaMA, Mistral, and Gemma; a deep understanding of speculative decoding and draft models; hands-on quantization-aware training (QAT) experience with PyTorch QAT or similar; and proficiency in Python and ML frameworks such as PyTorch, JAX, or TensorFlow.
Groq values humility; collaboration; a growth and giver mindset; curiosity and innovation; and passion, grit, and boldness.
Strong candidates have 5+ years of ML experience focused on training, proven work with transformers such as LLaMA, and expertise in speculative decoding and quantization-aware training (QAT). Preferred experience includes inference optimization, distributed training, and open-source contributions.
AI inference technology for scalable solutions
Groq specializes in AI inference technology, providing the Groq LPU™, which is known for its high compute speed, quality, and energy efficiency. The Groq LPU™ is designed to handle AI processing tasks quickly and effectively, making it suitable for both cloud and on-premises applications. Unlike many competitors, Groq's products are designed, fabricated, and assembled in North America, which helps maintain high standards of quality and performance. The company targets a variety of clients across different industries that require fast and efficient AI processing capabilities. Groq's goal is to deliver scalable AI inference solutions that meet the growing demands for rapid data processing in the AI and machine learning market.