Sr. Staff Software Engineer – High Performance GPU Inference Systems
GroqFull Time
Expert & Leadership (9+ years)
Key technologies and capabilities for this role
Common questions about this position
The salary range is $142.5K - $230K.
This is a hybrid role requiring onsite presence at the Santa Clara, CA headquarters 3 days per week.
Strong proficiency in modern C++ (C++11 and above) and Python is required, along with experience in parallel and concurrent programming, software design patterns, CMake, Pytest, and familiarity with PyTorch internals or similar frameworks.
The culture is one of respect and collaboration, valuing humility and direct communication.
Candidates with a Bachelor’s degree and 6+ years or Master’s with 3+ years in C++ software development, experience architecting complex systems, distributed systems or HPC, and GPU programming knowledge stand out.
AI compute platform for datacenters
d-Matrix focuses on improving the efficiency of AI computing for large datacenter customers. Its main product is the digital in-memory compute (DIMC) engine, which combines computing capabilities directly within programmable memory. This design helps reduce power consumption and enhances data processing speed while ensuring accuracy. d-Matrix differentiates itself from competitors by offering a modular and scalable approach, utilizing low-power chiplets that can be tailored for different applications. The company's goal is to provide high-performance, energy-efficient AI inference solutions to large-scale datacenter operators.