Baseten

Engineering Manager - Model Performance

San Francisco, California, United States

Compensation: $175,000 – $275,000
Experience Level: Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: AI & Machine Learning, Enterprise Software

Requirements

Candidates must hold a Bachelor's, Master's, or Ph.D. in Computer Science, Engineering, or a related field. They should have more than 5 years of professional experience in software engineering, with at least 2 years in a technical leadership role. Proven experience managing and mentoring engineering teams is required, along with expertise in programming languages such as Python, C++, or Go. An in-depth understanding of ML model performance optimization with tools such as PyTorch, TensorRT, and CUDA is essential. Strong knowledge of containerization (Docker) and orchestration systems (Kubernetes) is also necessary, as is experience with production-level AI/ML solutions for scaling and deploying large models. Candidates must demonstrate the ability to balance hands-on technical work with team leadership and project management.

Responsibilities

The Engineering Manager will lead, mentor, and manage a team of engineers focused on developing and optimizing ML model inference and performance. They will oversee technical strategy and architecture decisions, driving improvements across the engineering organization. Collaboration with cross-functional teams will be essential to ensure seamless integration and scalability of ML models in production environments. The manager will work directly in the code of tools such as TensorRT, PyTorch, and CUDA to identify and resolve complex performance bottlenecks. They will drive the development and deployment of large-scale optimization techniques for various ML models, particularly large language models (LLMs). They will also own the full lifecycle of projects from inception through delivery, including planning, execution, and resource management. Throughout, they are expected to foster a collaborative and inclusive team environment that encourages continuous learning and growth.

Skills

Python
C++
Go
TensorRT
PyTorch
CUDA
ML model performance optimization
Software Engineering
Leadership
Team Management

Baseten

Platform for deploying and managing ML models

About Baseten

Baseten provides a platform for deploying and managing machine learning (ML) models, aimed at simplifying the process for businesses. Users can select from a library of open-source foundation models and deploy them with just two clicks, making it easier to implement ML solutions. The platform features autoscaling, which adjusts resources based on demand, and comprehensive monitoring tools for tracking performance and troubleshooting. A key differentiator is Baseten's open-source model packaging framework, Truss, which allows users to package and deploy custom models easily. The company operates on a usage-based pricing model, where clients pay only for the time their models are actively deployed, helping them manage costs effectively.

Key Metrics

Headquarters: San Francisco, California
Year Founded: 2019
Total Funding: $58.4M
Company Stage: Series B
Industries: AI & Machine Learning
Employees: 51-200

Benefits

💰 Competitive compensation: We aim to provide 90th percentile (or better) salaries and equity grants for every team member commensurate with their experience.
🌎 Remote-first work environment: The Baseten team is welcome to work from wherever they want: fully remote, in our San Francisco office, or a mix of both. We provide a $1,000 stipend for you to make your home office comfortable and productive.
🏓 Regular in-person team summits: We get together as a team three times a year to plan, workshop, and most importantly, get to know each other better.
🌴 Unlimited PTO: We ask that everyone take at least 4 weeks of vacation. We also have a company-wide break between Christmas and New Year's Day.
🏥 Full healthcare coverage: Medical, dental and vision insurance for you and your family.
🍼 Paid parental leave: 16 weeks of fully paid parental leave (adoptive and non-birth parents included) and flexibility with schedules while returning to work.
📈 401(k): Company-sponsored 401(k) for you to contribute to.
🧠 Learning and development budget: We encourage you to take classes, attend conferences, and invest in your craft, and we'll cover expenses to make it happen.

Risks

Increased competition from specialized AI models tailored for specific industries.
Potential over-reliance on Google Cloud Marketplace may limit flexibility and control.
Rapid AI model development could render Baseten's offerings obsolete without continuous innovation.

Differentiation

Baseten offers a serverless backend for machine-learning applications with auto-scaling.
Truss, an open-source model packaging framework, allows seamless deployment of custom models.
Baseten's platform provides comprehensive monitoring tools for efficient model performance tracking.

Upsides

Integration with Google Cloud Marketplace boosts visibility and customer acquisition potential.
$40M Series B funding enhances Baseten's platform capabilities and market reach.
Chains framework positions Baseten for complex AI workflows, attracting sophisticated projects.
