AI Hardware Architect
d-MatrixFull Time
Expert & Leadership (9+ years)
Candidates should possess a background or experience in computer science, mathematics, or ECE, with strong fundamentals in computer architecture and deep knowledge of AI architectures. Proficiency in C/C++ and Python is required, along with experience in modeling AI/ML workloads and using cycle-accurate simulators. Experience with ASIC microarchitecture design and simulating RTL designs is a plus.
Develop and maintain performance models for Groq hardware using AI/ML workloads, analyze AI/ML algorithms to understand their requirements, and map them to hardware architectures. Lead cross-functional teams for software/hardware co-optimization, identify performance bottlenecks, and guide future chip architecture development. Evaluate new technologies, provide what-if scenarios to leadership, develop Design Space Exploration tools, and define custom hardware solutions for customers.
AI inference technology for scalable solutions
Groq specializes in AI inference technology, providing the Groq LPU™, which is known for its high compute speed, quality, and energy efficiency. The Groq LPU™ is designed to handle AI processing tasks quickly and effectively, making it suitable for both cloud and on-premises applications. Unlike many competitors, Groq's products are designed, fabricated, and assembled in North America, which helps maintain high standards of quality and performance. The company targets a variety of clients across different industries that require fast and efficient AI processing capabilities. Groq's goal is to deliver scalable AI inference solutions that meet the growing demands for rapid data processing in the AI and machine learning market.