GPU Performance Engineer at Genmo

San Francisco, California, United States

Compensation: Not Specified
Experience Level: Senior (5 to 8 years), Expert & Leadership (9+ years)
Job Type: Full Time
Visa: Unknown
Industries: AI, Machine Learning

Requirements

  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field
  • 5+ years of systems programming experience, including 3+ years focused on GPU optimization
  • Expert proficiency with GPU profiling tools (Nsight Systems, nvprof)
  • Strong CUDA programming skills with production kernel development
  • Deep understanding of GPU architecture (memory hierarchy, SMs, warps)
  • Track record of achieving significant performance improvements (5-10x)
  • Experience with Python and C++ in production environments

Responsibilities

  • Profile and optimize GPU workloads using Nsight Systems, nvprof, and custom instrumentation
  • Write high-performance CUDA and Triton kernels for critical model operations
  • Reduce cold start latency for serving infrastructure from seconds to milliseconds
  • Tune memory access patterns, kernel fusion, and GPU utilization
  • Collaborate with ML engineers to optimize model implementations
  • Debug performance issues across the full stack from application to hardware
  • Implement custom memory pooling and allocation strategies
  • Share optimization techniques and build a performance culture across teams

Skills

CUDA
Nsight Systems
nvprof
Triton
Python
C++
GPU Profiling
Kernel Development
Memory Optimization

Genmo

AI tools for multimedia content creation

About Genmo

Genmo.ai specializes in providing AI tools for generating and editing multimedia content, including images, videos, and presentations. Users can upload images and animate specific parts, like transforming a static sky into a timelapse, or create entire movies by refining ideas, generating scenes, and selecting transitions. The platform caters to both individual content creators and businesses, operating on a subscription model with various service tiers. Genmo.ai differentiates itself by continuously enhancing its technology and focusing on user intent, ensuring that clients have powerful tools to realize their creative projects.

Headquarters: San Francisco, California
Year Founded: N/A
Total Funding: $29.2M
Company Stage: Early VC
Industries: Consumer Software, AI & Machine Learning
Employees: 1-10

Risks

Server crashes during Mochi-1 launch could harm customer trust and satisfaction.
Open-source nature of Mochi-1 may lead to increased competition from developers.
Major tech players entering generative AI market could overshadow Genmo's offerings.

Differentiation

Genmo.ai offers unique AI tools for animating images and generating entire movies.
The platform supports both B2B and B2C models, catering to diverse client needs.
Genmo.ai's subscription model provides flexible access to advanced multimedia editing features.

Upsides

Launch of Mochi-1 model positions Genmo as a competitor to industry leaders.
Rising demand for AI-driven video editing boosts Genmo's market potential.
Subscription-based revenue model ensures steady income and opportunities for upselling.
