Senior Deep Learning Software Engineer

Santa Clara, California, United States

Apply with AI Apply

Not SpecifiedCompensation

Expert & Leadership (9+ years)Experience Level

Full TimeJob Type

UnknownVisa

Computer Hardware, Artificial Intelligence, Software DevelopmentIndustries

Senior Deep Learning Software Engineer

Employment Type: Full-time Location Type: Hybrid

Position Overview

We are seeking a Senior Deep Learning Software Engineer to design and build our automated inference and deployment solution. This role is instrumental in defining a scalable architecture for Deep Learning (DL) inference, focusing on ease-of-use and compute efficiency. Your work will span multiple layers of the DL deployment stack, including developing features in high-level frameworks like PyTorch and JAX, designing and implementing a high-performance execution environment, low-level GPU optimizations, and developing custom GPU kernels in CUDA and/or Triton. This is an exceptional opportunity for passionate software engineers who bridge the boundaries of research and engineering, possessing a strong background in both machine learning fundamentals and software architecture & engineering.

What You'll Be Doing

Play a pivotal role in defining a modular, scalable platform to seamlessly bridge training and deployment workflows, enabling tight integration of deployment tooling with training frameworks such as Megatron and Nemo.
Leverage and build upon the PyTorch 2.0 ecosystem (TorchDynamo, torch.export, torch.compile, etc.) to analyze and extract standardized model graph representation from arbitrary PyTorch models for our automated deployment solution.
Develop support for inference optimization techniques such as speculative decoding and LoRA.
Collaborate with teams across NVIDIA to utilize performant kernel implementations within the automated deployment solution.
Analyze and profile GPU kernel-level performance to identify hardware and software optimization opportunities.
Continuously innovate on inference performance to ensure NVIDIA's inference software solutions (TRT, TRT-LLM, TRT Model Optimizer) maintain and increase their market leadership.

What We Need to See

Master's, PhD, or equivalent experience in Computer Science, AI, Applied Math, or a related field.
8+ years of relevant work or research experience in Deep Learning.
Excellent software design skills, including debugging, performance analysis, and test design.
Strong proficiency in Python, PyTorch, and related ML tools.
Strong algorithms and programming fundamentals.
Good written and verbal communication skills, with the ability to work independently and collaboratively in a fast-paced environment.

Ways to Stand Out

Contributions to PyTorch, JAX, or other Machine Learning Frameworks.
Knowledge of GPU architecture and compilation stack, and the capability to understand and debug end-to-end performance.
Familiarity with NVIDIA's deep learning SDKs such as TensorRT.
Prior experience in writing high-performance GPU kernels for machine learning workloads in frameworks such as CUDA, CUTLASS, or Triton.

NVIDIA

Designs GPUs and AI computing solutions

About NVIDIA

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Santa Clara, CaliforniaHeadquarters

1993Year Founded

$19.5MTotal Funding

IPOCompany Stage

Automotive & Transportation, Enterprise Software, AI & Machine Learning, GamingIndustries

10,001+Employees

Benefits

Company Equity

401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.

Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.

Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.

The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.

NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.

Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.

Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

Land your dream remote job 3x faster with AI

Try Jobo Free

Senior Deep Learning Software Engineer

Senior Deep Learning Software Engineer

Position Overview

What You'll Be Doing

What We Need to See

Ways to Stand Out

Company Information

Skills

NVIDIA

About NVIDIA

Benefits

Company Equity

401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.

Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.

Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.

The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.

NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.

Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.

Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

Related Jobs

Senior AI Engineer

Sr Lead Machine Learning Engineer

Principal Engineer - Machine Learning Architecture

Senior Software Engineer (AI Tools)

Principal Engineer - Behaviors

Senior Research Engineer - Training Efficiency

Senior Software Engineer: Numerical and High Performance Computing

Machine Learning Engineer

Staff Software Engineer, Speculative Decoding

AI Research Engineer

Sr. Staff Software Engineer – High Performance GPU Inference Systems

Software Engineer - Model Performance

Senior AI/ML Engineer

Senior AI Infrastructure Engineer

Applied AI Engineer

Land your dream remote job 3x faster with AI