Sr. Staff Software Engineer – High Performance GPU Inference Systems
GroqFull Time
Expert & Leadership (9+ years)
Key technologies and capabilities for this role
Common questions about this position
The salary range is $175K - $270K.
This is a hybrid role requiring onsite work at the Santa Clara, CA headquarters 3 days per week.
Candidates need a Bachelor’s or Master’s in EE, CE, or related field with 5+ years of hands-on experience with GPU server and rack-scale solution architecture, design, and bring-up. Preferred skills include experience with ODM/OEM vendors, PCIe PEX switches, mechanical/thermal/power design, BMC integration, and strong debugging across hardware, BIOS/firmware, BMC, and OS.
The culture emphasizes respect, collaboration, humility, direct communication, inclusivity, and diverse perspectives for better solutions. They seek passionate individuals driven by execution.
Strong candidates have 5+ years of hands-on experience with GPU server and rack-scale systems, plus preferred expertise in ODM/OEM collaboration, PCIe designs, thermal/power/mechanical design, BMC integration, and debugging across hardware to OS layers.
AI compute platform for datacenters
d-Matrix focuses on improving the efficiency of AI computing for large datacenter customers. Its main product is the digital in-memory compute (DIMC) engine, which combines computing capabilities directly within programmable memory. This design helps reduce power consumption and enhances data processing speed while ensuring accuracy. d-Matrix differentiates itself from competitors by offering a modular and scalable approach, utilizing low-power chiplets that can be tailored for different applications. The company's goal is to provide high-performance, energy-efficient AI inference solutions to large-scale datacenter operators.