NVIDIA

Senior Generative AI Research Engineer

California, United States

Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Artificial Intelligence, Computer Hardware, Software DevelopmentIndustries

Generative AI Engineering - Foundation Models

Employment Type: Full time

Position Overview

At NVIDIA, we're not just building the future, we're generating it. Our Cosmos generative AI engineering team is pushing the boundaries of what’s possible across multimodal learning, video generation, synthetic data, intelligent simulation, and agentic systems. We are looking for exceptionally driven engineers and applied scientists with deep experience in generative modeling to help define the next era of AI computing.

Responsibilities

  • Design, post-train, and optimize foundation models (e.g., LLMs, diffusion video models, VLMs, VLAs) for real-world applications.
  • Contribute to highly-collaborative development on large-scale training infrastructure, high-efficiency inference pipelines, and scalable data pipelines.
  • Work with teams in research, software, and product to bring world models from idea to deployment.
  • Collaborate on open-source and internal projects, author technical papers or patents, and mentor junior engineers.
  • Prototype and iterate rapidly on experiments across cutting-edge AI domains, including agentic systems, reinforcement learning, reasoning, and video generation.
  • Design and implement model distillation algorithms for size reduction and diffusion step optimization.
  • Profile and benchmark training and inference pipelines to achieve production-ready performance requirements.

Requirements

  • Experience: Minimum 8 years industry or 5+ years research/postdoc experience building and deploying generative AI systems.
  • Deep Learning Frameworks: Proficiency in PyTorch, JAX, or other deep learning frameworks is a must.
  • Generative Model Expertise: Expertise in one or more of the following: LLMs, coding agents, diffusion models, autoregressive models, VAE/GAN architectures, retrieval-augmented generation, neural rendering, or multi-agent systems.
  • Transformer Architectures: Intimate familiarity with all variants of transformer architectures and attention mechanisms.
  • Large-Scale Training & Data Processing: Hands-on experience with large-scale training (e.g., ZeRO, DDP, FSDP, TP, CP) and data processing (e.g., Ray, Spark).
  • Software Engineering: Production-quality software engineering skills in Python.
  • Education: MS or PhD or equivalent experience in Computer Science, Machine Learning, Applied Math, Physics, or a related field.
  • Total Experience: 12+ years of relevant software development experience.

Ways to Stand Out

  • Familiarity with high-performance computing and GPU acceleration.
  • Contributions to influential open-source libraries or influential conference publications (NeurIPS, ICML, CVPR, ICLR).
  • Experience working with multimodal data (e.g., vision-language, VLA, audio).
  • Prior work with NVIDIA GPU-based compute clusters or simulation environments.

Company Information

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate, and self-motivated, we want to hear from you! NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services.

Compensation:

  • Base Salary Range:
    • Level 5: $224,000 - $356,500 USD
    • Level 6: $272,000 - $425,500 USD
  • Eligibility: You will also be eligible for equity and benefits.

Application Deadline: Applications for this job will be accepted at least until July 29, 2025.

Equal Opportunity Employer: NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, or disability.

Skills

Generative AI
Multimodal Learning
Video Generation
Synthetic Data
Intelligent Simulation
Agentic Systems
Foundation Models
LLMs
Diffusion Models
VLMs
VLAs
PyTorch
JAX
Deep Learning Frameworks
Large-scale Training Infrastructure
High-efficiency Inference Pipelines
Scalable Data Pipelines
Model Distillation
Transformer Architectures
Attention Mechanisms
Coding Agents
Autoregressive Models
VAE/GAN Architectures
Retrieval-Augmented Generation
Neural Rendering
Multi-agent Systems
Reinforcement Learning
Reasoning

NVIDIA

Designs GPUs and AI computing solutions

About NVIDIA

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Santa Clara, CaliforniaHeadquarters
1993Year Founded
$19.5MTotal Funding
IPOCompany Stage
Automotive & Transportation, Enterprise Software, AI & Machine Learning, GamingIndustries
10,001+Employees

Benefits

Company Equity
401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.
Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.
Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.
NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.
Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.
Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

Land your dream remote job 3x faster with AI