Performance Modeling Engineer
Groq, Full Time
Expert & Leadership (9+ years)
Common questions about this position
The salary range is $196K - $300K.
This is a hybrid role requiring onsite presence 3 days per week at the Santa Clara, CA headquarters.
Candidates need 10+ years of experience across AI/ML hardware, software, and infrastructure; a strong background in deep learning and neural networks, especially generative AI; proven experience tuning inference on GPUs; and strong programming skills in C++ and Python.
The culture emphasizes respect, collaboration, inclusivity, humility, and direct communication, on the view that differing perspectives lead to better solutions.
A strong candidate has an engineering degree in a relevant field; 10+ years of AI/ML experience, including GPU performance tuning and generative AI; an academic background in computer architecture and performance modeling; and excellent communication skills.
AI compute platform for datacenters
d-Matrix focuses on improving the efficiency of AI computing for large datacenter customers. Its main product is the digital in-memory compute (DIMC) engine, which combines computing capabilities directly within programmable memory. This design helps reduce power consumption and enhances data processing speed while ensuring accuracy. d-Matrix differentiates itself from competitors by offering a modular and scalable approach, utilizing low-power chiplets that can be tailored for different applications. The company's goal is to provide high-performance, energy-efficient AI inference solutions to large-scale datacenter operators.