Research Scientist - Multimodal Language Models

Palo Alto, California, United States

Apply with AI Apply

$200,000 – $300,000Compensation

Senior (5 to 8 years), Junior (1 to 2 years)Experience Level

Full TimeJob Type

UnknownVisa

AI & Machine Learning, Consumer SoftwareIndustries

Requirements

Candidates should have significant experience solving complex problems in multimodal language models. Expertise in Python and Pytorch is required, along with practical experience in the full development pipeline from data processing to optimization. Experience working with large-scale text data is essential, and familiarity with interleaved data that includes audio, video, image, and text is a bonus. Hands-on experience in developing or benchmarking LLMs, Vision Language Models, Audio Language Models, or generative video models is also necessary.

Responsibilities

The Research Scientist will design and implement novel AI algorithms and architectures for multimodal language models. They will build tools to evaluate and benchmark these models, develop large-scale AI training and inference methods, and ensure efficient implementation for data processing and training. Additionally, they will build tools to analyze and process multimodal data, collaborate with research and engineering teams to transfer research into products and services, and implement cutting-edge product prototypes based on multimodal generative AI.

Skills

Python

Pytorch

Data Processing

Data Loading

Training

Inference

Optimization

LLMs

Vision Language Models

Audio Language Models

Generative Video Models

Luma AI

Develops multimodal AI technologies for creativity

About Luma AI

Luma AI develops multimodal artificial intelligence technologies that enhance human creativity and capabilities. Their main product, the Dream Machine, allows users to interact with various types of data, enabling creative professionals, businesses, and developers to explore innovative applications of AI. Unlike many competitors, Luma AI focuses on integrating multiple modes of interaction, which broadens the possibilities for users. The company operates on a subscription model, providing access to its AI tools and services, and aims to lead the way in AI-driven creativity and productivity.

San Francisco, CaliforniaHeadquarters

2021Year Founded

$84.9MTotal Funding

LATE_VCCompany Stage

Consumer Software, AI & Machine LearningIndustries

51-200Employees

Benefits

Company Equity

Stock Options

Risks

Competition from Google's Veo 2 and OpenAI's Sora Turbo challenges Luma AI's market position.

AWS's cost-reducing features may make Luma AI's services less competitive.

AI-generated content controversies could harm Luma AI's reputation if outputs are misleading.

Differentiation

Luma AI transforms text into 3D models, enhancing user interaction with digital content.

The Dream Machine integrates multimodal AI, offering a unique creative platform for users.

Luma AI's Photon models provide high-quality, personalized image generation capabilities.

Upsides

Luma AI's $90 million funding boosts its capacity for innovation and market expansion.

Dream Machine 1.5's advanced text-to-video features attract creative professionals and businesses.

Expanding Dream Machine into a mobile app increases accessibility and user engagement.

Land your dream remote job 3x faster with AI

Try Jobo Free