Luma AI

Senior Machine Learning Engineer - Hardware Abstractions & Performance Optimization

Remote

Compensation: $220,000 – $300,000
Experience Level: Senior (5 to 8 years), Mid-level (3 to 4 years)
Job Type: Full Time
Visa: Unknown
Industries: AI & Machine Learning, Hardware, Enterprise Software

Requirements

Candidates must have significant experience optimizing for memory, latency, and throughput in PyTorch, along with experience benchmarking and profiling GPU and CPU code for optimal device utilization. Experience with non-NVIDIA systems, with torch.compile or PyTorch/XLA (torch_xla), and with transformer models and attention implementations is required. Familiarity with high-performance Triton/CUDA and with writing custom PyTorch kernels is preferred, as is experience with parallel inference, particularly tensor parallelism and pipeline parallelism. Experience writing high-performance parallel C++ in an ML context and building inference or demo prototype code is a bonus.

Responsibilities

The Senior Machine Learning Engineer will ensure efficient implementation of models and systems by designing, maintaining, and writing abstractions that scale beyond NVIDIA/CUDA hardware. They will identify and remedy efficiency bottlenecks by profiling and implementing high-performance PyTorch code. Responsibilities also include benchmarking products across various hardware and software to inform optimal tradeoffs, collaborating with partners to identify bottlenecks, and working closely with the research team to ensure systems are efficient from start to finish while addressing potential hardware integration issues.

Skills

PyTorch
Tensor Parallelism
Pipeline Parallelism
GPU Profiling
CPU Profiling
Memory Optimization
Latency Optimization
Throughput Optimization
Triton
CUDA

Luma AI

Develops multimodal AI technologies for creativity

About Luma AI

Luma AI develops multimodal artificial intelligence technologies that enhance human creativity and capabilities. Its main product, the Dream Machine, allows users to interact with various types of data, enabling creative professionals, businesses, and developers to explore innovative applications of AI. Unlike many competitors, Luma AI focuses on integrating multiple modes of interaction, which broadens the possibilities for users. The company operates on a subscription model, providing access to its AI tools and services, and aims to lead the way in AI-driven creativity and productivity.

Key Metrics

Headquarters: San Francisco, California
Year Founded: 2021
Total Funding: $84.9M
Company Stage: Late-stage VC
Industries: Consumer Software, AI & Machine Learning
Employees: 51-200

Benefits

Company Equity
Stock Options

Risks

Competition from Google's Veo 2 and OpenAI's Sora Turbo challenges Luma AI's market position.
AWS's cost-reducing features may make Luma AI's services less competitive.
AI-generated content controversies could harm Luma AI's reputation if outputs are misleading.

Differentiation

Luma AI transforms text into 3D models, enhancing user interaction with digital content.
The Dream Machine integrates multimodal AI, offering a unique creative platform for users.
Luma AI's Photon models provide high-quality, personalized image generation capabilities.

Upsides

Luma AI's $90 million funding boosts its capacity for innovation and market expansion.
Dream Machine 1.5's advanced text-to-video features attract creative professionals and businesses.
Expanding Dream Machine into a mobile app increases accessibility and user engagement.
