Senior Research Engineer at NVIDIA

Santa Clara, California, United States

NVIDIA Logo
Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Artificial Intelligence, SoftwareIndustries

Requirements

  • BS, MS or PhD in Computer Science, AI, Applied Math, or related fields or equivalent experience
  • 3+ years of proven experience in machine learning, systems, distributed computing, or large-scale model training
  • Experience with AI Frameworks such as Pytorch or JAX
  • Experience with at least one inference and deployment environments such as vLLM, SGLang or TRT-LLM
  • Proficient in Python programming, software design, debugging, performance analysis, test design and documentation
  • Strong understanding of AI/Deep-Learning fundamentals and their practical applications

Responsibilities

  • Work with applied researchers to design, implement and test next generation of RL and pos-training algorithms
  • Contribute and advance open source by developing NeMo-RL, Megatron Core, and NeMo Framework and yet to be announced software
  • Solve large-scale, end-to-end AI training and inference challenges, spanning the full model lifecycle from initial orchestration, data pre-processing, running of model training and tuning, to model deployment
  • Work at the intersection of computer-architecture, libraries, frameworks, AI applications and the entire software stack
  • Performance tuning and optimizations, model training with mixed precision recipes on next-gen NVIDIA GPU architectures
  • Publish and present your results at academic and industry conferences

Skills

Machine Learning
Distributed Computing
Large-Scale Model Training
Pytorch
JAX
Python
Software Design
Debugging
Performance Analysis
Test Design
Documentation
AI/Deep-Learning Fundamentals
RL
NeMo-RL
Megatron Core
NeMo Framework
vLLM
SGLang
TRT-LLM

NVIDIA

Designs GPUs and AI computing solutions

About NVIDIA

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Santa Clara, CaliforniaHeadquarters
1993Year Founded
$19.5MTotal Funding
IPOCompany Stage
Automotive & Transportation, Enterprise Software, AI & Machine Learning, GamingIndustries
10,001+Employees

Benefits

Company Equity
401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.
Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.
Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.
NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.
Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.
Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

Land your dream remote job 3x faster with AI