[Remote] Senior+ AI Researcher (Multimodal Perception Models) at Tavus

San Francisco, California, United States

Compensation: Not Specified

Experience Level: Senior (5 to 8 years)

Job Type: Full Time

Visa: Unknown

Industries: Artificial Intelligence, Software

Requirements

A PhD plus 2–3+ years working hands-on with LLMs, VLMs, or multimodal systems
Previous experience leading research efforts or mentoring teams
Expertise in sequence modeling across video, audio, and text — with strong understanding of autoregressive, predictive, and diffusion frameworks
Experience with large-scale model training and optimization for performance and real-time generation
Proven ability to translate research ideas into production-grade systems
Publications in top-tier venues (CVPR, ICCV, NeurIPS, ECCV, ACMMM)
Strong PyTorch skills and comfort moving fluidly between research and engineering
Nice-to-Haves: Broad familiarity with generative AI paradigms and foundation models
Nice-to-Haves: Comfort working across the full research–to–deployment stack
Nice-to-Haves: A builder’s mindset: eager to experiment, iterate, and ship

Responsibilities

Lead research on Foundational Multimodal Models for Conversational Avatars — systems that can perceive, reason, and generate across video, audio, and language
Build and train models using Autoregressive, Predictive (e.g., V-JEPA), and Diffusion-based architectures with a deep focus on temporal and sequential data
Design and execute experiments to predict and control the visual, auditory, and linguistic responses of avatars
Partner with the Applied ML team to bring research into real-world use cases
Mentor other researchers and drive excellence across the team

Skills

Key technologies and capabilities for this role

Multimodal Modeling, Conversational AI, Autoregressive Models, Predictive Modeling, Diffusion Models, Temporal Data, Sequential Data, V-JEPA

Questions & Answers

Common questions about this position

Is this role remote or onsite?

This is an onsite position.

What qualifications are required for this Senior AI Researcher role?

Candidates need a PhD plus 2–3+ years of hands-on work with LLMs, VLMs, or multimodal systems, along with experience leading research efforts or mentoring teams. The role also calls for expertise in sequence modeling across video, audio, and text (autoregressive, predictive, and diffusion frameworks), large-scale model training experience, a proven ability to translate research into production systems, publications in top venues such as CVPR or NeurIPS, and strong PyTorch skills.

What is the salary or compensation for this position?

This information is not specified in the job description.

What is the company culture like at Tavus?

Tavus is a research lab pioneering human computing, focused on building AI Humans for empathetic interactions and backed by top investors. Researchers there lead foundational work, steer technical direction, mentor teams, and partner with the Applied ML team.

What makes a strong candidate for this role?

A strong candidate has a PhD with hands-on multimodal experience, a record of research leadership, publications in top-tier venues, the ability to carry research through to production, and a builder's mindset for experimentation.

Tavus

AI-driven video personalization platform for marketing

About Tavus

Tavus offers a video personalization platform that uses artificial intelligence to create customized videos for each customer. The platform takes a single recorded video and generates numerous versions, each tailored with unique voice variables to enhance customer loyalty and encourage repeat purchases. It caters to businesses of all sizes, recognizing the growing need for personalized content in digital marketing. Tavus stands out by enabling the auto-generation of hundreds or even millions of personalized videos, allowing businesses to scale their marketing efforts while maintaining quality interactions. The company likely operates on a subscription or usage-based pricing model, providing various plans to suit different needs. The main goal of Tavus is to help businesses build personal connections with their audience at scale, allowing them to focus on creativity and strategic initiatives.

Headquarters: San Francisco, California

Year Founded: 2020

Total Funding: $23.4M

Company Stage: Series A

Industries: Consumer Software, AI & Machine Learning

Employees: 11-50

Benefits

Health Insurance

Unlimited Paid Time Off

Flexible Work Hours

Risks

Emerging competition from AI video startups could dilute Tavus's market share.

Ethical concerns over deepfake capabilities may impact Tavus's operations.

Data privacy demands challenge Tavus's handling of sensitive customer information.

Differentiation

Tavus offers the world's fastest digital twin solution for real-time video conversations.

Their platform auto-generates millions of personalized videos from a single recording.

Tavus's AI avatar models create immersive digital experiences akin to face-to-face interactions.

Upsides

Tavus raised $18 million in Series A funding, boosting their growth potential.

The platform supports asynchronous communication, aligning with business trends.

Growing demand for AI-driven video personalization enhances Tavus's market position.
