Senior+ AI Researcher (Multimodal Perception Models)
TavusFull Time
Senior (5 to 8 years)
Key technologies and capabilities for this role
Common questions about this position
The base pay range is $200,000 - $300,000 per year in California, though base pay may vary.
This is a hybrid position.
Expertise in Python and PyTorch with full development pipeline experience is required, along with multimodal data experience processing large-scale text or interleaved audio/video/image/text data, and hands-on experience developing or benchmarking LLMs, Vision Language Models, Audio Language Models, or generative video models.
The role includes competitive equity packages in the form of stock options and a comprehensive benefits plan.
Candidates with significant experience solving hard problems in PyTorch, multimodal data, and distributed systems, plus hands-on work with LLMs, vision/audio language models, or generative video models, will be strongest.
Develops multimodal AI technologies for creativity
Luma AI develops multimodal artificial intelligence technologies that enhance human creativity and capabilities. Their main product, the Dream Machine, allows users to interact with various types of data, enabling creative professionals, businesses, and developers to explore innovative applications of AI. Unlike many competitors, Luma AI focuses on integrating multiple modes of interaction, which broadens the possibilities for users. The company operates on a subscription model, providing access to its AI tools and services, and aims to lead the way in AI-driven creativity and productivity.