Staff Software Engineer, Speculative Decoding
GroqFull Time
Senior (5 to 8 years), Expert & Leadership (9+ years)
Candidates should possess 10+ years of experience in customer engineering and field support for enterprise-level AI and datacenter products, with a focus on AI/ML software and generative AI inference. They require in-depth knowledge and hands-on experience with generative AI inference at scale, including the integration and deployment of AI models in production environments. Strong experience with automation tools and scripting is also necessary.
The AI Software Application Engineer – Technical Lead / Principal will provide expert guidance and support to customers deploying generative AI inference models, assisting with integration, troubleshooting, and optimizing AI/ML software stacks. They will work directly with customers to understand their needs and deliver solutions that maximize performance across their AI workloads, collaborating on technical collateral and leading the installation, configuration, and bring-up of d-Matrix’s AI software stack. Additionally, they will perform functional and performance validation testing and partner with internal engineering and product teams to produce developer guides and technical notes.
AI compute platform for datacenters
d-Matrix focuses on improving the efficiency of AI computing for large datacenter customers. Its main product is the digital in-memory compute (DIMC) engine, which combines computing capabilities directly within programmable memory. This design helps reduce power consumption and enhances data processing speed while ensuring accuracy. d-Matrix differentiates itself from competitors by offering a modular and scalable approach, utilizing low-power chiplets that can be tailored for different applications. The company's goal is to provide high-performance, energy-efficient AI inference solutions to large-scale datacenter operators.