[Remote] AI Researcher (Multimodal Perception Models) at Tavus

San Francisco, California, United States

Tavus Logo
Not SpecifiedCompensation
N/AExperience Level
N/AJob Type
Not SpecifiedVisa
N/AIndustries

Requirements

  • A PhD (or near completion) in a relevant field, or equivalent hands-on research experience
  • Experience modeling human behavior and generation (facial expressions, affect, or speech), ideally in conversational or interactive settings
  • Deep understanding of sequence modeling in video/audio/language domains
  • Familiarity with large model training, especially LLMs or VLMs
  • Strong background in Deep Learning (from Transformers to Diffusion Models) and practical implementation
  • Excellent programming skills, especially in PyTorch
  • Nice-to-haves: Publications in top-tier conferences like CVPR, ICCV, NeurIPS, ECCV, or ACMMM
  • Nice-to-haves: Broader understanding of generative AI and multimodal architectures
  • Nice-to-haves: Familiarity with software engineering best practices
  • Nice-to-haves: Curiosity and a flexible mindset — you like building and experimenting

Responsibilities

  • Conduct research on Foundational Multimodal Models in the context of Conversational Avatars (e.g., Neural Avatars, Talking-Heads)
  • Model video, audio, and language sequences using Autoregressive, Predictive Architectures (e.g., V-JEPA), and/or Diffusion paradigms with an emphasis on temporal and sequential data rather than static images
  • Collaborate with the Applied ML team to bring your work to life in production systems
  • Stay at the cutting edge of multimodal learning and help define what “cutting edge” means next

Skills

Tavus

AI-driven video personalization platform for marketing

About Tavus

Tavus offers a video personalization platform that uses artificial intelligence to create customized videos for each customer. The platform takes a single recorded video and generates numerous versions, each tailored with unique voice variables to enhance customer loyalty and encourage repeat purchases. It caters to businesses of all sizes, recognizing the growing need for personalized content in digital marketing. Tavus stands out by enabling the auto-generation of hundreds or even millions of personalized videos, allowing businesses to scale their marketing efforts while maintaining quality interactions. The company likely operates on a subscription or usage-based pricing model, providing various plans to suit different needs. The main goal of Tavus is to help businesses build personal connections with their audience at scale, allowing them to focus on creativity and strategic initiatives.

San Francisco, CaliforniaHeadquarters
2020Year Founded
$23.4MTotal Funding
SERIES_ACompany Stage
Consumer Software, AI & Machine LearningIndustries
11-50Employees

Benefits

Health Insurance
Unlimited Paid Time Off
Flexible Work Hours

Risks

Emerging competition from AI video startups could dilute Tavus's market share.
Ethical concerns over deepfake capabilities may impact Tavus's operations.
Data privacy demands challenge Tavus's handling of sensitive customer information.

Differentiation

Tavus offers the world's fastest digital twin solution for real-time video conversations.
Their platform auto-generates millions of personalized videos from a single recording.
Tavus's AI avatar models create immersive digital experiences akin to face-to-face interactions.

Upsides

Tavus raised $18 million in Series A funding, boosting their growth potential.
The platform supports asynchronous communication, aligning with business trends.
Growing demand for AI-driven video personalization enhances Tavus's market position.

Land your dream remote job 3x faster with AI