Senior Research Engineer - Enterprise Products
NVIDIA - Full Time
- Senior (5 to 8 years)
Baseten is a rapidly growing startup building inference infrastructure for AI deployment. Backed by investors including IVP, Spark Capital, Greylock, and Conviction, it works with leading AI innovators such as Descript, Bland.ai, Patreon, Writer, and Robust Intelligence. The company recently secured $75 million in Series C funding and aims to make AI accessible across all products. This role is for a Software Engineer focused on ML performance, ideal for someone passionate about LLM inference and building transformative solutions.
Platform for deploying and managing ML models
Baseten provides a platform that simplifies deploying and managing machine learning (ML) models for businesses. Users can select from a library of open-source foundation models and deploy them with just two clicks. The platform features autoscaling, which adjusts resources based on demand, and monitoring tools for tracking performance and troubleshooting. A key differentiator is Truss, Baseten's open-source model packaging framework, which lets users package and deploy custom models. Pricing is usage-based: clients pay only for the time their models are actively deployed, which helps them manage costs.