Applied Research Lead, Model Scaling
RunwayFull Time
Junior (1 to 2 years)
Key technologies and capabilities for this role
Common questions about this position
The salary range is $200K - $275K.
This information is not specified in the job description.
Required skills include 4+ years in software engineering focused on ML infrastructure or distributed systems, hands-on expertise in distributed training frameworks like FSDP, DDP, ZeRO and ML frameworks like PyTorch, Transformers, plus strong GPU optimization and large-scale systems experience.
This information is not specified in the job description.
A strong candidate has a Bachelor’s degree in Computer Science or equivalent, 4+ years in ML infrastructure or distributed systems, expertise in distributed training frameworks like FSDP/DDP/ZeRO and PyTorch, GPU optimization skills, and experience with large-scale production systems.
Platform for deploying and managing ML models
Baseten provides a platform for deploying and managing machine learning (ML) models, aimed at simplifying the process for businesses. Users can select from a library of open-source foundation models and deploy them with just two clicks, making it easier to implement ML solutions. The platform features autoscaling, which adjusts resources based on demand, and comprehensive monitoring tools for tracking performance and troubleshooting. A key differentiator is Baseten's open-source model packaging framework, Truss, which allows users to package and deploy custom models easily. The company operates on a usage-based pricing model, where clients pay only for the time their models are actively deployed, helping them manage costs effectively.