Principal Engineer
NVIDIA- Full Time
- Expert & Leadership (9+ years)
Candidates should possess an MS or PhD in Computer Science, Electrical Engineering, Math, Physics, or a related degree, along with 10-12+ years of industry experience. A strong grasp of computer architecture, data structures, system software, and machine learning fundamentals is required. Proficiency in Python, C, or C++ development in a Linux environment is essential, as is experience with deep learning frameworks such as PyTorch or TensorFlow. Candidates should also have experience mapping NLP models to accelerators and a research background with a publication record in top-tier ML or computer architecture conferences is preferred.
The Principal Software Engineer will be responsible for developing, enhancing, and maintaining the development and testing infrastructure for next-generation AI hardware. They will help productize the software stack for the AI compute engine and leverage the d-Matrix ISA and dataflow architecture to build optimized implementations of state-of-the-art large language models. The role involves collaborating with a team of compiler, hardware architecture experts, and machine learning model researchers, as well as contributing to research on novel techniques for the machine learning software stack, models, and architecture.
AI compute platform for datacenters
d-Matrix focuses on improving the efficiency of AI computing for large datacenter customers. Its main product is the digital in-memory compute (DIMC) engine, which combines computing capabilities directly within programmable memory. This design helps reduce power consumption and enhances data processing speed while ensuring accuracy. d-Matrix differentiates itself from competitors by offering a modular and scalable approach, utilizing low-power chiplets that can be tailored for different applications. The company's goal is to provide high-performance, energy-efficient AI inference solutions to large-scale datacenter operators.