Staff Machine Learning Engineer
Angi
Full Time
Expert & Leadership (9+ years)
Common questions about this position
The salary range is $155K - $250K.
This is a hybrid role requiring onsite presence at the Santa Clara, CA headquarters 3-5 days per week.
Required skills include strong Python programming; experience with ML frameworks such as PyTorch, TensorFlow, or JAX; hands-on experience with model optimization, quantization techniques, and inference acceleration; a deep understanding of Transformer architectures and distributed inference; and software engineering best practices such as CI/CD, Docker, and Kubernetes.
The culture values humility, direct communication, and inclusivity. The company seeks people who are passionate about tackling challenges and driven by execution.
Strong candidates will have hands-on experience with model optimization, quantization, inference acceleration, Transformer architectures, distributed inference, and MLOps practices, along with strong Python skills and problem-solving abilities.
AI compute platform for datacenters
d-Matrix focuses on improving the efficiency of AI computing for large datacenter customers. Its main product is the digital in-memory compute (DIMC) engine, which performs computation directly within programmable memory. This design reduces power consumption and speeds up data processing while preserving accuracy. d-Matrix differentiates itself from competitors with a modular, scalable approach built on low-power chiplets that can be tailored to different applications. The company's goal is to provide high-performance, energy-efficient AI inference solutions to large-scale datacenter operators.