Key technologies and capabilities for this role
Common questions about this position
The salary range is $150K - $230K.
This is a hybrid position.
Candidates need 3+ years experience building and operating distributed systems or large-scale APIs, a proven track record of owning low-latency reliable backend services including rate-limiting auth quotas and metering, infra instincts with performance sensibilities like profiling tracing and SLO management, and comfort debugging complex systems.
You'll join a small, high-impact team on the Model Performance team operating at the intersection of product, model performance, and infra.
Strong candidates have 3+ years in distributed systems or large-scale APIs, experience owning low-latency backend services with rate-limiting and auth, performance optimization instincts including profiling and SLOs, and the ability to debug complex systems while producing clear design docs.
Platform for deploying and managing ML models
Baseten provides a platform for deploying and managing machine learning (ML) models, aimed at simplifying the process for businesses. Users can select from a library of open-source foundation models and deploy them with just two clicks, making it easier to implement ML solutions. The platform features autoscaling, which adjusts resources based on demand, and comprehensive monitoring tools for tracking performance and troubleshooting. A key differentiator is Baseten's open-source model packaging framework, Truss, which allows users to package and deploy custom models easily. The company operates on a usage-based pricing model, where clients pay only for the time their models are actively deployed, helping them manage costs effectively.