[Remote] Senior+ AI Researcher (Multimodal Perception Models) at Tavus

San Francisco, California, United States

Compensation: Not specified
Experience Level: Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: Artificial Intelligence, Software

Requirements

  • A PhD plus 2–3+ years working hands-on with LLMs, VLMs, or multimodal systems
  • Previous experience leading research efforts or mentoring teams
  • Expertise in sequence modeling across video, audio, and text — with strong understanding of autoregressive, predictive, and diffusion frameworks
  • Experience with large-scale model training and optimization for performance and real-time generation
  • Proven ability to translate research ideas into production-grade systems
  • Publications in top-tier venues (CVPR, ICCV, NeurIPS, ECCV, ACMMM)
  • Strong PyTorch skills and comfort moving fluidly between research and engineering
  • Nice-to-Haves: Broad familiarity with generative AI paradigms and foundation models
  • Nice-to-Haves: Comfort working across the full research–to–deployment stack
  • Nice-to-Haves: A builder’s mindset: eager to experiment, iterate, and ship

Responsibilities

  • Lead research on Foundational Multimodal Models for Conversational Avatars — systems that can perceive, reason, and generate across video, audio, and language
  • Build and train models using Autoregressive, Predictive (e.g., V-JEPA), and Diffusion-based architectures with a deep focus on temporal and sequential data
  • Design and execute experiments to predict and control the visual, auditory, and linguistic responses of avatars
  • Partner with the Applied ML team to bring research into real-world use cases
  • Mentor other researchers and drive excellence across the team

Skills

Key technologies and capabilities for this role

Multimodal Modeling, Conversational AI, Autoregressive Models, Predictive Modeling, Diffusion Models, Temporal Data, Sequential Data, V-JEPA

Questions & Answers

Common questions about this position

Is this role remote or onsite?

This is an onsite position.

What qualifications are required for this Senior AI Researcher role?

Candidates need a PhD plus 2–3+ years of hands-on work with LLMs, VLMs, or multimodal systems, along with experience leading research or mentoring teams. They should also bring expertise in sequence modeling across video, audio, and text using autoregressive, predictive, and diffusion frameworks; large-scale model training experience; a proven ability to translate research into production systems; publications in top venues such as CVPR or NeurIPS; and strong PyTorch skills.

What is the salary or compensation for this position?

This information is not specified in the job description.

What is the company culture like at Tavus?

Tavus is a research lab pioneering human computing, with a focus on building AI Humans for empathetic interactions, and is backed by top investors. Researchers there lead foundational work, steer technical direction, mentor teams, and partner with the Applied ML team.

What makes a strong candidate for this role?

A strong candidate has a PhD with hands-on multimodal experience, leadership in research, top publications, and the ability to bridge research to production, plus a builder's mindset for experimentation.

Tavus

AI-driven video personalization platform for marketing

About Tavus

Tavus offers a video personalization platform that uses artificial intelligence to create customized videos for each customer. The platform takes a single recorded video and generates numerous versions, each tailored with unique voice variables to enhance customer loyalty and encourage repeat purchases. It caters to businesses of all sizes, recognizing the growing need for personalized content in digital marketing. Tavus stands out by enabling the auto-generation of hundreds or even millions of personalized videos, allowing businesses to scale their marketing efforts while maintaining quality interactions. The company likely operates on a subscription or usage-based pricing model, providing various plans to suit different needs. The main goal of Tavus is to help businesses build personal connections with their audience at scale, allowing them to focus on creativity and strategic initiatives.

Headquarters: San Francisco, California
Year Founded: 2020
Total Funding: $23.4M
Company Stage: Series A
Industries: Consumer Software, AI & Machine Learning
Employees: 11-50

Benefits

Health Insurance
Unlimited Paid Time Off
Flexible Work Hours

Risks

Emerging competition from AI video startups could dilute Tavus's market share.
Ethical concerns over deepfake capabilities may impact Tavus's operations.
Data privacy demands challenge Tavus's handling of sensitive customer information.

Differentiation

Tavus offers the world's fastest digital twin solution for real-time video conversations.
Their platform auto-generates millions of personalized videos from a single recording.
Tavus's AI avatar models create immersive digital experiences akin to face-to-face interactions.

Upsides

Tavus raised $18 million in Series A funding, boosting their growth potential.
The platform supports asynchronous communication, aligning with business trends.
Growing demand for AI-driven video personalization enhances Tavus's market position.
