Senior+ AI Researcher (Multimodal Perception Models)
Tavus
Full Time
Senior (5 to 8 years)
Common questions about this position
The salary range is $200K - $300K.
The position is hybrid.
Required experience includes strong programming skills in Python and PyTorch, experience with large-scale datasets and multimodal data processing pipelines, and an understanding of computer vision, audio processing, and/or natural language processing techniques.
The team treats data as a fundamental layer for unlocking advanced capabilities in foundation models, and works on how modalities such as vision, audio, and language can be combined into multimodal AI systems.
Strong candidates will have the required experience in Python, PyTorch, large-scale datasets, and multimodal processing, along with preferred expertise in interleaved multimodal data or hands-on work with Vision Language Models, Audio Language Models, or generative video models.
Develops multimodal AI technologies for creativity
Luma AI develops multimodal artificial intelligence technologies that enhance human creativity and capabilities. Their main product, the Dream Machine, allows users to interact with various types of data, enabling creative professionals, businesses, and developers to explore innovative applications of AI. Unlike many competitors, Luma AI focuses on integrating multiple modes of interaction, which broadens the possibilities for users. The company operates on a subscription model, providing access to its AI tools and services, and aims to lead the way in AI-driven creativity and productivity.