Luma AI

Senior Research engineer - Multimodal Language Models

Palo Alto, California, United States

Auto‑apply with AI Apply

$200,000 – $300,000Compensation

Senior (5 to 8 years)Experience Level

Full TimeJob Type

UnknownVisa

AI & Machine LearningIndustries

Requirements

Candidates should have significant experience solving hard problems in PyTorch, multimodal data, and distributed systems. Expertise in Python and PyTorch is required, along with practical experience working with the full development pipeline from data processing to training and inference. Experience processing large-scale text data is necessary, and familiarity with interleaved data spanning audio, video, image, and/or text is a bonus. Hands-on experience in developing or benchmarking LLMs, Vision Language Models, Audio Language Models, or generative video models is also essential. Experience in the design and development of annotation tools and synthetic data is good to have.

Responsibilities

The Senior Research Engineer will design and develop large-scale annotation efforts for model post-training. They will build tools to evaluate and benchmark multimodal language models and develop large-scale AI training and inference methods. Ensuring efficient implementation of models and systems for data processing and training is crucial. The engineer will also build tools to visualize, evaluate, and filter datasets, collaborate with research and engineering teams across Luma to transfer research to products and services, and implement cutting-edge product prototypes based on multimodal generative AI.

Skills

Python

PyTorch

LLMs

Vision Language Models

Audio Language Models

Generative Video Models

Data Processing

Data Loading

Training

Inference

Data Visualization

Data Filtering

Distributed Systems

Luma AI

Develops multimodal AI technologies for creativity

About Luma AI

Luma AI develops multimodal artificial intelligence technologies that enhance human creativity and capabilities. Their main product, the Dream Machine, allows users to interact with various types of data, enabling creative professionals, businesses, and developers to explore innovative applications of AI. Unlike many competitors, Luma AI focuses on integrating multiple modes of interaction, which broadens the possibilities for users. The company operates on a subscription model, providing access to its AI tools and services, and aims to lead the way in AI-driven creativity and productivity.

Key Metrics

San Francisco, CaliforniaHeadquarters

2021Year Founded

$84.9MTotal Funding

LATE_VCCompany Stage

Consumer Software, AI & Machine LearningIndustries

51-200Employees

Benefits

Company Equity

Stock Options

Risks

Competition from Google's Veo 2 and OpenAI's Sora Turbo challenges Luma AI's market position.

AWS's cost-reducing features may make Luma AI's services less competitive.

AI-generated content controversies could harm Luma AI's reputation if outputs are misleading.

Differentiation

Luma AI transforms text into 3D models, enhancing user interaction with digital content.

The Dream Machine integrates multimodal AI, offering a unique creative platform for users.

Luma AI's Photon models provide high-quality, personalized image generation capabilities.

Upsides

Luma AI's $90 million funding boosts its capacity for innovation and market expansion.

Dream Machine 1.5's advanced text-to-video features attract creative professionals and businesses.

Expanding Dream Machine into a mobile app increases accessibility and user engagement.

Land your dream remote job 3x faster with AI

Try Jobo Free