AI / ML System Software Engineer, Senior Staff at d-Matrix

Santa Clara, California, United States

Apply Now

$180,000 – $280,000Compensation

Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level

Full TimeJob Type

UnknownVisa

Artificial Intelligence, TechnologyIndustries

Requirements

BS in Computer Science, Engineering, Math, Physics or related degree with 8+ years of industry software development experience (MS in Computer Science, Engineering, Math, Physics or related degree preferred with 5+ years)
Strong grasp of computer architecture, data structures, system software, and machine learning fundamentals
Proficient in C/C++/Python development in Linux environment and using standard development tools
Experience with distributed, high performance software design and implementation
Self-motivated team player with a strong sense of ownership and leadership
Preferred
MS or PhD in Computer Science, Electrical Engineering, or related fields
Prior startup, small team or incubation experience
Work experience at a cloud provider or AI compute / sub-system company
Experience implementing SIMD algorithms on vector processors
Experience with open source ML compiler frameworks such as MLIR
Experience with deep learning frameworks (such as PyTorch, Tensorflow)
Experience with deep learning runtimes (such as ONNX Runtime, TensorRT)
Experience with inference servers/model serving frameworks (such as Triton, TFServ, KubeFlow)
Experience with distributed systems collectives such as NCCL, OpenMPI
Experience deploying ML workloads on distributed systems, in a multitenancy environment
Experience with MLOps from definition to deployment including training, quantization, sparsity, model preprocessing, and deployment
Experience training, tuning and deploying ML models for CV (ResNet), NLP (BERT, GPT), and/or Recommendation Systems (DLRM)

Responsibilities

Be part of the team that helps productize the SW stack for the AI compute engine
Develop, enhance, and maintain the next-generation AI Deployment software
Work across all aspects of the full stack tool chain, understanding nuances of hardware-software co-design optimization and trade-offs
Build and scale software deliverables in a tight development window
Work with a team of compiler experts to build out the compiler infrastructure
Collaborate closely with other software (ML, Systems) and hardware (mixed signal, DSP, CPU) experts in the company

Skills

Key technologies and capabilities for this role

C++CPythonLinuxComputer ArchitectureData StructuresSystem SoftwareMachine LearningCompilersHardware-Software Co-DesignAI Deployment

Questions & Answers

Common questions about this position

What is the salary range for this position?

The salary range is $180K - $280K.

Is this role remote or hybrid, and what are the location requirements?

This is a hybrid role requiring onsite work at the Santa Clara, CA headquarters 3 days per week.

What are the key skills required for this Senior Staff AI/ML System Software Engineer role?

Candidates need a strong grasp of computer architecture, data structures, system software, and machine learning fundamentals, proficiency in C/C++/Python in Linux, and experience with distributed, high performance software design.

What is the company culture like at d-Matrix?

The culture emphasizes respect, collaboration, humility, direct communication, inclusivity, and diverse perspectives for better solutions, while valuing passion for challenges and execution.

What makes a strong candidate for this role?

A strong candidate has 8+ years of industry software experience (or 5+ with MS), self-motivation, team collaboration, ownership, leadership, and preferably startup experience or expertise in ML compilers, deep learning frameworks, and distributed systems.

d-Matrix

AI compute platform for datacenters

About d-Matrix

d-Matrix focuses on improving the efficiency of AI computing for large datacenter customers. Its main product is the digital in-memory compute (DIMC) engine, which combines computing capabilities directly within programmable memory. This design helps reduce power consumption and enhances data processing speed while ensuring accuracy. d-Matrix differentiates itself from competitors by offering a modular and scalable approach, utilizing low-power chiplets that can be tailored for different applications. The company's goal is to provide high-performance, energy-efficient AI inference solutions to large-scale datacenter operators.

Santa Clara, CaliforniaHeadquarters

2019Year Founded

$149.8MTotal Funding

SERIES_BCompany Stage

Enterprise Software, AI & Machine LearningIndustries

201-500Employees