[Remote] AI Researcher (Multimodal Perception Models) at Tavus

San Francisco, California, United States

Not SpecifiedCompensation

Expert & Leadership (9+ years)Experience Level

Full TimeJob Type

UnknownVisa

Artificial Intelligence, TechnologyIndustries

Requirements

PhD (or near completion) in a relevant field, or equivalent hands-on research experience
Experience modeling human behavior and generation (facial expressions, affect, or speech), ideally in conversational or interactive settings
Deep understanding of sequence modeling in video/audio/language domains
Familiarity with large model training, especially LLMs or VLMs
Strong background in Deep Learning (from Transformers to Diffusion Models) and how to make them work in practice
Excellent programming skills, especially in PyTorch

Responsibilities

Conduct research on Foundational Multimodal Models in the context of Conversational Avatars (e.g., Neural Avatars, Talking-Heads)
Model video, audio, and language sequences using Autoregressive, Predictive Architectures (e.g., V-JEPA), and/or Diffusion paradigms with an emphasis on temporal and sequential data rather than static images
Collaborate with the Applied ML team to bring your work to life in production systems
Stay at the cutting edge of multimodal learning and help define what “cutting edge” means next

Skills

Key technologies and capabilities for this role

Multimodal ModelsAutoregressive ArchitecturesDiffusion ModelsV-JEPANeural AvatarsTalking HeadsVideo ModelingAudio ModelingLanguage ModelingConversational IntelligenceTemporal Data ProcessingSequential Data Modeling

Questions & Answers

Common questions about this position

What is the location or work arrangement for this role?

The role is onsite with a preference for San Francisco (hybrid) or London (office opening soon), and remote within the U.S. or Europe is available for exceptional candidates.

What are the required qualifications for the AI Researcher position?

Candidates need a PhD (or near completion) in a relevant field or equivalent experience, experience modeling human behavior like facial expressions or speech in conversational settings, deep understanding of sequence modeling in video/audio/language, familiarity with large model training like LLMs or VLMs, strong background in Deep Learning from Transformers to Diffusion Models, and excellent PyTorch programming skills.

What salary or compensation does this role offer?

This information is not specified in the job description.

What is the company culture like at Tavus?

Tavus is a fast-paced research lab pioneering human computing with AI Humans, backed by top investors, where team members thrive by turning ideas into code and exploring cutting-edge possibilities.

What makes a candidate stand out for this AI Researcher role?

Exceptional candidates may qualify for remote work, and nice-to-haves like publications in top conferences (CVPR, NeurIPS), understanding of generative AI, software engineering practices, and a curious flexible mindset strengthen applications.

Tavus

AI-driven video personalization platform for marketing

About Tavus

Tavus offers a video personalization platform that uses artificial intelligence to create customized videos for each customer. The platform takes a single recorded video and generates numerous versions, each tailored with unique voice variables to enhance customer loyalty and encourage repeat purchases. It caters to businesses of all sizes, recognizing the growing need for personalized content in digital marketing. Tavus stands out by enabling the auto-generation of hundreds or even millions of personalized videos, allowing businesses to scale their marketing efforts while maintaining quality interactions. The company likely operates on a subscription or usage-based pricing model, providing various plans to suit different needs. The main goal of Tavus is to help businesses build personal connections with their audience at scale, allowing them to focus on creativity and strategic initiatives.

San Francisco, CaliforniaHeadquarters

2020Year Founded

$23.4MTotal Funding

SERIES_ACompany Stage

Consumer Software, AI & Machine LearningIndustries

11-50Employees

Benefits

Health Insurance

Unlimited Paid Time Off

Flexible Work Hours

Risks

Emerging competition from AI video startups could dilute Tavus's market share.

Ethical concerns over deepfake capabilities may impact Tavus's operations.

Data privacy demands challenge Tavus's handling of sensitive customer information.

Differentiation

Tavus offers the world's fastest digital twin solution for real-time video conversations.

Their platform auto-generates millions of personalized videos from a single recording.

Tavus's AI avatar models create immersive digital experiences akin to face-to-face interactions.

Upsides

Tavus raised $18 million in Series A funding, boosting their growth potential.

The platform supports asynchronous communication, aligning with business trends.

Growing demand for AI-driven video personalization enhances Tavus's market position.

Land your dream remote job 3x faster with AI

Try Jobo Free