Luma AI

Research Scientist - Multimodal Language Models

Palo Alto, California, United States

$200,000 – $300,000Compensation
Senior (5 to 8 years), Junior (1 to 2 years)Experience Level
Full TimeJob Type
UnknownVisa
AI & Machine Learning, Consumer SoftwareIndustries

Requirements

Candidates should have significant experience solving complex problems in multimodal language models. Expertise in Python and Pytorch is required, along with practical experience in the full development pipeline from data processing to optimization. Experience working with large-scale text data is essential, and familiarity with interleaved data that includes audio, video, image, and text is a bonus. Hands-on experience in developing or benchmarking LLMs, Vision Language Models, Audio Language Models, or generative video models is also necessary.

Responsibilities

The Research Scientist will design and implement novel AI algorithms and architectures for multimodal language models. They will build tools to evaluate and benchmark these models, develop large-scale AI training and inference methods, and ensure efficient implementation for data processing and training. Additionally, they will build tools to analyze and process multimodal data, collaborate with research and engineering teams to transfer research into products and services, and implement cutting-edge product prototypes based on multimodal generative AI.

Skills

Python
Pytorch
Data Processing
Data Loading
Training
Inference
Optimization
LLMs
Vision Language Models
Audio Language Models
Generative Video Models

Luma AI

Develops multimodal AI technologies for creativity

About Luma AI

Luma AI develops multimodal artificial intelligence technologies that enhance human creativity and capabilities. Their main product, the Dream Machine, allows users to interact with various types of data, enabling creative professionals, businesses, and developers to explore innovative applications of AI. Unlike many competitors, Luma AI focuses on integrating multiple modes of interaction, which broadens the possibilities for users. The company operates on a subscription model, providing access to its AI tools and services, and aims to lead the way in AI-driven creativity and productivity.

Key Metrics

San Francisco, CaliforniaHeadquarters
2021Year Founded
$84.9MTotal Funding
LATE_VCCompany Stage
Consumer Software, AI & Machine LearningIndustries
51-200Employees

Benefits

Company Equity
Stock Options

Risks

Competition from Google's Veo 2 and OpenAI's Sora Turbo challenges Luma AI's market position.
AWS's cost-reducing features may make Luma AI's services less competitive.
AI-generated content controversies could harm Luma AI's reputation if outputs are misleading.

Differentiation

Luma AI transforms text into 3D models, enhancing user interaction with digital content.
The Dream Machine integrates multimodal AI, offering a unique creative platform for users.
Luma AI's Photon models provide high-quality, personalized image generation capabilities.

Upsides

Luma AI's $90 million funding boosts its capacity for innovation and market expansion.
Dream Machine 1.5's advanced text-to-video features attract creative professionals and businesses.
Expanding Dream Machine into a mobile app increases accessibility and user engagement.

Land your dream remote job 3x faster with AI