Engineering Manager - Forward Deployed Engineering (LLM) at Baseten

San Francisco, California, United States

Apply Now

$220,000 – $285,000Compensation

Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level

Full TimeJob Type

UnknownVisa

AI, TechnologyIndustries

Requirements

Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or related field
4+ years of professional software engineering experience, including 1+ year in a leadership or mentorship capacity
Strong programming skills in Python, with production experience in building or optimizing ML inference

Responsibilities

Lead, mentor, and grow a team of Forward Deployed Engineers, providing guidance on technical direction, project execution, and professional development
Set clear goals and ensure timely, high-quality delivery across multiple customer-facing projects involving LLM deployment and inference optimization
Collaborate with leadership to align team priorities with company and customer goals, balancing short-term delivery, widely varying customer priorities, and long-term technical initiatives
Player-coach: Be a key driver on strategic product initiatives and customer engagements, remaining hands-on when needed
Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python
Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring), working with customers’ engineering teams at every stage of the customer journey including sales, implementation, and expansion
Deliver with velocity: Turn vague objectives into clear specs and well-defined PoCs to rapidly ship well-tested services and outcomes for customers
Optimize and enhance AI/ML projects, contributing to the continuous improvement of the technical stack, including developing features and PRDs with other engineering and product orgs
Own products and customer projects end-to-end, functioning as both an engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution

Skills

LLM

AI inference

engineering management

team leadership

Docker

ComfyUI

Whisper transcription

model deployment

low latency optimization

generative AI

production deployment

customer engineering

Baseten

Platform for deploying and managing ML models

About Baseten

Baseten provides a platform for deploying and managing machine learning (ML) models, aimed at simplifying the process for businesses. Users can select from a library of open-source foundation models and deploy them with just two clicks, making it easier to implement ML solutions. The platform features autoscaling, which adjusts resources based on demand, and comprehensive monitoring tools for tracking performance and troubleshooting. A key differentiator is Baseten's open-source model packaging framework, Truss, which allows users to package and deploy custom models easily. The company operates on a usage-based pricing model, where clients pay only for the time their models are actively deployed, helping them manage costs effectively.

San Francisco, CaliforniaHeadquarters

2019Year Founded

$58.4MTotal Funding

SERIES_BCompany Stage

AI & Machine LearningIndustries

51-200Employees

Benefits

💰 Competitive compensation: We aim to provide 90th percentile (or better) salaries and equity grants for every team member commensurate with their experience.

🌎 Remote-first work environment: The Baseten team is welcome to work from wherever they want; fully remote, in our San Francisco office, or a mix of both. We provide a $1,000 stipend for you to make your home office comfortable and productive.

🏓 Regular in-person team summits: We get together as a team three times a year to plan, workshop, and most importantly, get to know each other better.

🌴 Unlimited PTO: We ask that everyone take at least 4 weeks of vacation. And we have a company-wide break between Christmas and New Year's Day.

🏥 Full healthcare coverage: Medical, dental and vision insurance for you and your family.

🍼 Paid parental leave: 16-weeks fully paid parental leave (adoptive and non-birth parents included) and flexibility with schedules while returning to work.

📈 401(k): Company-sponsored 401(k) for you to contribute to.

🧠: Learning and development budget: We encourage you to take classes, attend conferences, and invest in your craft and we’ll cover expenses to make it happen.

Risks

Increased competition from specialized AI models tailored for specific industries.

Potential over-reliance on Google Cloud Marketplace may limit flexibility and control.

Rapid AI model development could render Baseten's offerings obsolete without continuous innovation.

Differentiation

Baseten offers a serverless backend for machine-learning applications with auto-scaling.

Truss, an open-source model packaging framework, allows seamless deployment of custom models.

Baseten's platform provides comprehensive monitoring tools for efficient model performance tracking.

Upsides

Integration with Google Cloud Marketplace boosts visibility and customer acquisition potential.

$40M Series B funding enhances Baseten's platform capabilities and market reach.

Chains framework positions Baseten for complex AI workflows, attracting sophisticated projects.

Land your dream remote job 3x faster with AI

Try Jobo Free