Staff AI Engineer, Inference & Optimization at Sonatus

Sunnyvale, California, United States

Sonatus Logo
Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Automotive, SoftwareIndustries

Requirements

Candidates must have a minimum of 7 years of work experience in MLOps or a similar role with a strong focus on high-performance machine learning systems. Proven experience with inference optimization techniques such as quantization, pruning, and model distillation is required, along with deep hands-on experience with hardware acceleration for machine learning, including familiarity with GPUs, TPUs, NPUs and related software ecosystems. Strong experience with AI compilers and runtime environments like TensorRT, OpenVINO, and TVM is necessary. Proven experience deploying and managing ML models on edge devices, strong experience in designing and building distributed systems, and proficiency with inter-process communication protocols like gRPC, message queuing systems like MQTT, and efficient data handling techniques such as buffering and callbacks are essential. Hands-on experience with popular ML frameworks such as PyTorch, TensorFlow, TFLite, and ONNX, proficiency in programming languages including Python and C++, and a solid understanding of machine learning concepts, the ML development lifecycle, and the challenges of deploying models at scale are required. Proficiency with containerization technologies (Docker, Kubernetes) and cloud platforms (AWS, Azure), and expertise in CI/CD principles and tools applied to machine learning workflows are also necessary. A Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field is required.

Responsibilities

The Staff AI Engineer will own the full lifecycle of model inference and hardware acceleration, from initial optimization to large-scale deployment. They will design, build, and maintain robust pipelines and runtime environments for deploying and serving machine learning models at the Edge, ensuring high availability, low latency, and efficient resource utilization. Responsibilities include collaborating with researchers and hardware engineers to optimize models for performance, latency, and power consumption on specific hardware, using AI compilers and specialized software stacks to accelerate model execution, and designing, building, and maintaining MLOps pipelines for deploying models to various edge devices with a focus on performance and efficiency constraints. The role involves implementing and maintaining monitoring and alerting systems, working with cloud platforms and on-device environments to provision and manage infrastructure, and proactively identifying and resolving issues related to model performance, deployment failures, and data discrepancies. The engineer will work closely with Machine Learning Engineers, Software Engineers, and Product Managers to bring models from design to high-performance production systems.

Skills

AI
Machine Learning
Model Optimization
Inference
Edge Computing
Quantization
Pruning
Knowledge Distillation
TensorRT
OpenVINO
TVM
GPUs
TPUs
NPUs
FPGAs
MLOps
Pipeline Design
High Availability
Low Latency
Resource Utilization

Sonatus

Platform for software-defined vehicle development

About Sonatus

Sonatus provides a platform for developing software-defined vehicles, focusing on a no-code solution that allows automotive companies to create flexible software architectures. This platform enables the collection and analysis of real-time diagnostic data from vehicles, which helps manufacturers like Hyundai Motor Group to continuously enhance vehicle quality and the ownership experience. Unlike competitors, Sonatus offers a comprehensive solution that spans the entire vehicle lifecycle, from design to after-sales services, allowing for faster innovation and cost reduction. The goal of Sonatus is to drive continuous improvement in automotive software, capitalizing on the increasing demand for software-defined vehicles globally.

Sunnyvale, CaliforniaHeadquarters
2018Year Founded
$107MTotal Funding
SERIES_ACompany Stage
Data & Analytics, Automotive & TransportationIndustries
51-200Employees

Risks

Competition from established firms like Bosch and Continental threatens market share.
Rapid technological advancements may outpace Sonatus' platform updates.
Expansion into new markets may bring regulatory compliance challenges.

Differentiation

Sonatus offers a no-code platform for adaptable vehicle software architectures.
The company partners with major automotive firms like Hyundai for real-time data diagnostics.
Sonatus' Updater solution manages over-the-air updates for software-defined vehicles.

Upsides

Sonatus raised $75 million to accelerate vehicle software innovation.
The company is expanding globally, including into the Japanese market.
Sonatus won two 2024 MotorTrend SDV Innovator Awards for software solutions.

Land your dream remote job 3x faster with AI