Genmo

Research Scientist (diffusion)

San Francisco, California, United States

Compensation: Not specified

Experience Level: Senior (5 to 8 years), Expert & Leadership (9+ years)

Job Type: Full Time

Visa: Unknown

Industries: AI & Machine Learning, Robotics & Automation

Requirements

Candidates must hold a Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a closely related field. A strong publication record in top-tier conferences focusing on generative models, especially diffusion models, is required. Extensive experience in implementing and optimizing large-scale generative models for image or video tasks is necessary, along with a deep understanding of state-of-the-art techniques in text-to-image and text-to-video generation. Proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow is essential, along with excellent communication skills and the ability to work collaboratively in a team environment. Ideal candidates should also have postdoctoral or industrial research experience in generative AI for video, hands-on experience with text-to-video generation projects, and expertise in other generative model architectures.

Responsibilities

The Research Scientist will lead research initiatives in advanced diffusion models for text-to-video generation, focusing on enhancing visual quality, temporal consistency, and semantic fidelity. They will develop and implement state-of-the-art algorithms for translating textual descriptions into dynamic video content, design and conduct rigorous experiments to validate new ideas and evaluate model performance, and collaborate with cross-functional teams to integrate research breakthroughs into the production pipeline. Staying current with the latest academic literature and attending top-tier conferences are crucial. Additionally, they will contribute to the research community through high-quality publications and open-source contributions, mentor junior researchers, and work closely with product teams to align research directions with user needs and market opportunities.

Skills

Python

PyTorch

TensorFlow

Diffusion Models

Generative Models

GANs

VAEs

Video Codecs

Deep Learning

Text-to-Video Generation

Text-to-Image Generation

Genmo

AI tools for multimedia content creation

About Genmo

Genmo.ai specializes in providing AI tools for generating and editing multimedia content, including images, videos, and presentations. Users can upload images and animate specific parts, like transforming a static sky into a timelapse, or create entire movies by refining ideas, generating scenes, and selecting transitions. The platform caters to both individual content creators and businesses, operating on a subscription model with various service tiers. Genmo.ai differentiates itself by continuously enhancing its technology and focusing on user intent, ensuring that clients have powerful tools to realize their creative projects.

Key Metrics

Headquarters: San Francisco, California

Year Founded: N/A

Total Funding: $29.2M

Company Stage: Early VC

Industries: Consumer Software, AI & Machine Learning

Employees: 1-10

Risks

Server crashes during Mochi-1 launch could harm customer trust and satisfaction.

Open-source nature of Mochi-1 may lead to increased competition from developers.

Major tech players entering generative AI market could overshadow Genmo's offerings.

Differentiation

Genmo.ai offers unique AI tools for animating images and generating entire movies.

The platform supports both B2B and B2C models, catering to diverse client needs.

Genmo.ai's subscription model provides flexible access to advanced multimedia editing features.

Upsides

Launch of Mochi-1 model positions Genmo as a competitor to industry leaders.

Rising demand for AI-driven video editing boosts Genmo's market potential.

Subscription-based revenue model ensures steady income and opportunities for upselling.
