[Remote] Senior+ AI Researcher (Multimodal Perception Models) at Tavus

San Francisco, California, United States

Compensation: Not Specified
Experience Level: N/A
Job Type: N/A
Visa: Not Specified
Industries: N/A

Requirements

  • A PhD plus 2–3+ years working hands-on with LLMs, VLMs, or multimodal systems
  • Previous experience leading research efforts or mentoring teams
  • Expertise in sequence modeling across video, audio, and text — with strong understanding of autoregressive, predictive, and diffusion frameworks
  • Experience with large-scale model training and optimization for performance and real-time generation
  • Proven ability to translate research ideas into production-grade systems
  • Publications in top-tier venues (CVPR, ICCV, NeurIPS, ECCV, ACMMM)
  • Strong PyTorch skills and comfort moving fluidly between research and engineering
  • Nice-to-Haves: Broad familiarity with generative AI paradigms and foundation models
  • Nice-to-Haves: Comfort working across the full research-to-deployment stack
  • Nice-to-Haves: A builder’s mindset (eager to experiment, iterate, and ship)

Responsibilities

  • Lead research on Foundational Multimodal Models for Conversational Avatars — systems that can perceive, reason, and generate across video, audio, and language
  • Build and train models using Autoregressive, Predictive (e.g., V-JEPA), and Diffusion-based architectures with a deep focus on temporal and sequential data
  • Design and execute experiments to predict and control the visual, auditory, and linguistic responses of avatars
  • Partner with the Applied ML team to bring research into real-world use cases
  • Mentor other researchers and drive excellence across the team

Tavus

AI-driven video personalization platform for marketing

About Tavus

Tavus offers a video personalization platform that uses artificial intelligence to create customized videos for each customer. The platform takes a single recorded video and generates numerous versions, each tailored with unique voice variables to enhance customer loyalty and encourage repeat purchases. It caters to businesses of all sizes, recognizing the growing need for personalized content in digital marketing. Tavus stands out by enabling the auto-generation of hundreds or even millions of personalized videos, allowing businesses to scale their marketing efforts while maintaining quality interactions. The company likely operates on a subscription or usage-based pricing model, providing various plans to suit different needs. The main goal of Tavus is to help businesses build personal connections with their audience at scale, allowing them to focus on creativity and strategic initiatives.

Headquarters: San Francisco, California
Year Founded: 2020
Total Funding: $23.4M
Company Stage: Series A
Industries: Consumer Software, AI & Machine Learning
Employees: 11-50

Benefits

Health Insurance
Unlimited Paid Time Off
Flexible Work Hours

Risks

Emerging competition from AI video startups could dilute Tavus's market share.
Ethical concerns over deepfake capabilities may impact Tavus's operations.
Data privacy demands challenge Tavus's handling of sensitive customer information.

Differentiation

Tavus offers the world's fastest digital twin solution for real-time video conversations.
Their platform auto-generates millions of personalized videos from a single recording.
Tavus's AI avatar models create immersive digital experiences akin to face-to-face interactions.

Upsides

Tavus raised $18 million in Series A funding, boosting their growth potential.
The platform supports asynchronous communication, aligning with business trends.
Growing demand for AI-driven video personalization enhances Tavus's market position.
