Machine Learning Engineer, Staff - Model Factory at d-Matrix

Santa Clara, California, United States

d-Matrix Logo
$155,000 – $250,000Compensation
Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
AI & Machine Learning, HardwareIndustries

Skills

Key technologies and capabilities for this role

PythonPyTorchTensorFlowJAXModel OptimizationQuantizationInference AccelerationTransformer ArchitecturesAttention MechanismsDistributed InferenceTensor ParallelPipeline ParallelSequence ParallelKV CachingRay

Questions & Answers

Common questions about this position

What is the salary range for this Machine Learning Engineer position?

The salary range is $155K - $250K.

Is this role remote or hybrid, and what's the location?

This is a hybrid role requiring onsite presence at the Santa Clara, CA headquarters 3-5 days per week.

What key skills are required for this Machine Learning Engineer role?

Required skills include strong programming in Python and experience with ML frameworks like PyTorch, TensorFlow, or JAX; hands-on experience with model optimization, quantization, and inference acceleration; deep understanding of Transformer architectures and distributed inference; knowledge of quantization techniques; and software engineering best practices like CI/CD, Docker, and Kubernetes.

What is the company culture like at d-Matrix?

The culture values humility, direct communication, and inclusivity, seeking individuals passionate about tackling challenges and driven by execution.

What makes a strong candidate for this Machine Learning Engineer position?

Strong candidates will have hands-on experience with model optimization, quantization, inference acceleration, Transformer architectures, distributed inference, and MLOps practices, along with strong Python skills and problem-solving abilities.

d-Matrix

AI compute platform for datacenters

About d-Matrix

d-Matrix focuses on improving the efficiency of AI computing for large datacenter customers. Its main product is the digital in-memory compute (DIMC) engine, which combines computing capabilities directly within programmable memory. This design helps reduce power consumption and enhances data processing speed while ensuring accuracy. d-Matrix differentiates itself from competitors by offering a modular and scalable approach, utilizing low-power chiplets that can be tailored for different applications. The company's goal is to provide high-performance, energy-efficient AI inference solutions to large-scale datacenter operators.

Santa Clara, CaliforniaHeadquarters
2019Year Founded
$149.8MTotal Funding
SERIES_BCompany Stage
Enterprise Software, AI & Machine LearningIndustries
201-500Employees

Benefits

Hybrid Work Options

Risks

Competition from Nvidia, AMD, and Intel may pressure d-Matrix's market share.
Complex AI chip design could lead to delays or increased production costs.
Rapid AI innovation may render d-Matrix's technology obsolete if not updated.

Differentiation

d-Matrix's DIMC engine integrates compute into memory, enhancing efficiency and accuracy.
The company offers scalable AI solutions through modular, low-power chiplets.
d-Matrix focuses on brain-inspired AI compute engines for diverse inferencing workloads.

Upsides

Growing demand for energy-efficient AI solutions boosts d-Matrix's low-power chiplets appeal.
Partnerships with companies like Microsoft could lead to strategic alliances.
Increasing adoption of modular AI hardware in data centers benefits d-Matrix's offerings.

Land your dream remote job 3x faster with AI