Tavus

Machine Learning Engineer

San Francisco, California, United States

Compensation: $160,000 – $250,000
Experience Level: Mid-level (3 to 4 years), Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: Artificial Intelligence, Biotechnology, SaaS

About Us

At Tavus, we're building the human layer of AI. Our mission is to make human-AI interaction as natural as face-to-face interaction, enabling the human touch where it has been previously unscalable. We achieve this through pioneering research in multi-modal AI models for human perception and understanding, combined with state-of-the-art human avatar rendering and communication models. Our models power everything from text-to-video AI avatars to real-time conversational video experiences across industries like healthcare, recruiting, sales, education, and more. By enabling AI to see, hear, and communicate with human-like authenticity, we're creating the foundation for the next generation of AI employees, assistants, and companions.

We're a Series A company backed by top investors, including Sequoia, Y Combinator, and Scale VC. Join us in driving the future of human-AI interaction.

The Role

As a Machine Learning Engineer at Tavus, you'll be at the forefront of developing our cutting-edge Conversational Video Interface (CVI). This role involves building and optimizing real-time, multimodal AI systems that enable lifelike digital twins to engage in natural conversations. You'll collaborate with cross-functional teams to enhance the CVI's capabilities, ensuring seamless integration of vision, speech, and emotional intelligence components.

Your Mission

  • Develop and Optimize CVI Components: Design and implement core components of the CVI, including WebRTC/video conferencing, vision processing, speech recognition (ASR), text-to-speech (TTS), and replica video output.
  • Integrate Multimodal AI Models: Work with in-house models like Phoenix-3 for lifelike avatar rendering, Sparrow-0 for conversational pacing, and Raven-0 for visual perception to create cohesive, responsive digital twins.
  • Ensure Low-Latency Performance: Optimize the CVI pipeline to achieve sub-600ms utterance-to-utterance latency, delivering real-time, natural conversations.
  • Collaborate with Cross-Functional Teams: Work closely with AI researchers, product managers, and UX designers to align technical development with user needs and product goals.
  • Maintain and Enhance API Infrastructure: Develop and maintain APIs that allow developers to easily integrate Tavus's CVI into their applications, ensuring scalability and reliability.

Requirements

  • Educational Background: Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field.
  • Technical Experience: 3+ years of experience in software engineering, with a focus on real-time systems, multimedia processing, or AI integration.
  • Programming Skills: Proficiency in languages such as Python, C++, or JavaScript, and experience with frameworks like TensorFlow or PyTorch.
  • Agentic Systems: Experience designing or building agentic systems, such as autonomous agents or AI-driven decision-making frameworks.
  • Knowledge of Multimedia Protocols: Familiarity with WebRTC, streaming protocols, and video/audio processing techniques.
  • AI Integration: Experience integrating machine learning models into production systems, particularly in areas related to speech, vision, or natural language processing.
  • Problem-Solving Abilities: Strong analytical skills and the ability to troubleshoot complex systems.

Bonus if you have:

  • Experience with Digital Avatars: Background in developing or working with digital human representations or avatars.
  • Multilingual Support: Understanding of building systems that support multiple languages and cultural nuances.
  • Cloud Infrastructure: Experience with deploying and managing applications on cloud platforms like AWS, GCP, or Azure.

Benefits

When you join Tavus, you're joining a family. Our work is driven by our team, and our success is shared by all. This position offers a flexible work schedule, unlimited PTO, competitive healthcare and gear stipends, and, of course, plenty of fun!

Job Details

  • Salary: $160K - $250K
  • Location Type: Hybrid
  • Employment Type: Full Time

Skills

Machine Learning
AI
WebRTC
Video Conferencing
Vision Processing
Speech Recognition
ASR
Text-to-Speech
TTS
Real-time Systems
Multimodal AI
Low-latency Optimization

Tavus

AI-driven video personalization platform for marketing

About Tavus

Tavus offers a video personalization platform that uses artificial intelligence to create customized videos for each customer. The platform takes a single recorded video and generates numerous versions, each tailored with unique voice variables to enhance customer loyalty and encourage repeat purchases. It caters to businesses of all sizes, recognizing the growing need for personalized content in digital marketing. Tavus stands out by enabling the auto-generation of hundreds or even millions of personalized videos, allowing businesses to scale their marketing efforts while maintaining quality interactions. The company likely operates on a subscription or usage-based pricing model, providing various plans to suit different needs. The main goal of Tavus is to help businesses build personal connections with their audience at scale, allowing them to focus on creativity and strategic initiatives.

Headquarters: San Francisco, California
Year Founded: 2020
Total Funding: $23.4M
Company Stage: Series A
Industries: Consumer Software, AI & Machine Learning
Employees: 11-50

Benefits

Health Insurance
Unlimited Paid Time Off
Flexible Work Hours

Risks

Emerging competition from AI video startups could dilute Tavus's market share.
Ethical concerns over deepfake capabilities may impact Tavus's operations.
Data privacy demands challenge Tavus's handling of sensitive customer information.

Differentiation

Tavus offers the world's fastest digital twin solution for real-time video conversations.
Their platform auto-generates millions of personalized videos from a single recording.
Tavus's AI avatar models create immersive digital experiences akin to face-to-face interactions.

Upsides

Tavus raised $18 million in Series A funding, boosting their growth potential.
The platform supports asynchronous communication, aligning with business trends.
Growing demand for AI-driven video personalization enhances Tavus's market position.
