NVIDIA

Distinguished Engineer, Cloud Architecture

California, United States

Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Semiconductors, Cloud Computing, AI Infrastructure, Data Center TechnologyIndustries

Technical Leader, AI Cloud Infrastructure

Employment Type: Full-time

Position Overview

Join the team that powers every breakthrough from autonomous vehicles to generative AI. NVIDIA’s DGX Cloud and GB200 superchip platforms are redefining what’s possible in the data center. We’re seeking a technical leader who can transform groundbreaking hardware, open software, and cloud-native practices into reference designs that anyone—from hyperscalers to research labs—can deploy with confidence and speed. If you enjoy teamwork, care about sustainable performance, and are passionate about mentoring emerging talent, this role is for you.

What You’ll Be Doing

  • Lead Architecture: Lead the architecture of next-generation GPU clouds on DGX/HGX/GB200 platforms, balancing performance, energy efficiency, and lifecycle simplicity.
  • Build Blueprints: Build modular blueprints that integrate NVSwitch, NVLink 5.0, Spectrum-X, InfiniBand, and BlueField DPUs into secure, multi-tenant “AI Factory” architectures.
  • Publish & Validate: Publish and validate reference designs for Bare-Metal-as-a-Service, virtualized, and Kubernetes-native stacks, partnering with engineering and operations teams to deploy them in production environments.
  • Develop Infrastructure Pipelines: Build comprehensive end-to-end infrastructure pipelines. Design and implement Redfish/IPMI/iPXE provisioning systems, Slurm and Kubernetes control planes, virtual networking solutions, and distributed storage architectures.
  • Influence Strategy: Influence strategy across silicon, networking, software, and AI infrastructure groups, ensuring a unified vision from chip to cloud.
  • Drive Partnerships & Industry Leadership: Represent NVIDIA at key industry forums including OCP, CNCF, UCF, and major cloud service providers. Lead co-design efforts that accelerate customer success while guiding critical trade-offs in power consumption, cost optimization, density requirements, and operational simplicity, always prioritizing business impact and sustainability.

Requirements

  • Experience: 15+ years of proven experience crafting and shipping large-scale systems or cloud infrastructure with measurable real-world impact.
  • Technical Expertise:
    • Hands-on expertise with GPU platforms, high-bandwidth fabrics (InfiniBand/Spectrum/RoCE), distributed storage, and workload orchestration.
    • Production experience with Kubernetes, Slurm, Terraform, and BMaaS tooling across bare-metal, VM, and container ecosystems.
    • Proven track record delivering secure, scalable, multi-tenant infrastructure in production environments.
  • Education: Bachelor’s, Master’s, or PhD in Computer Science, Electrical/Computer Engineering, or related field (or equivalent experience).
  • Communication: Excellent communication skills—able to build technical architecture documentation and work optimally with both executives and engineers.

Ways to Stand Out from the Crowd

  • Direct experience with GB200, NVSwitch 5.0, Magnum IO, DOCA, or Spectrum-X deployments.
  • Deep understanding of AI/ML training and inference pipelines, GPU scheduling, and multi-tenant cluster management.
  • Hands-on design experience across AWS, Azure, OCI, or GCP plus hybrid-cloud architecture patterns.
  • Active contribution to OCP, UCF, Kubernetes SIGs, or similar industry standards bodies.
  • Leadership experience mentoring senior engineers and leading cross-organizational architecture programs with VP/C-suite visibility.

Company Information

With competitive salaries and a generous benefits package (www.nvidiabenefits.com), we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!

Compensation & Benefits

  • Salary Range: $308,000 - $471,500 USD (base salary, determined by location, experience, and pay of similar positions).
  • Additional Compensation: Eligible for equity.
  • Benefits: Comprehensive benefits package.

Application Instructions

NVIDIA accepts applications on an ongoing basis.

About NVIDIA

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer.

Skills

GPU Cloud Architecture
Hardware Design
Open Software
Cloud-Native Practices
NVSwitch
NVLink 5.0
Spectrum-X
InfiniBand
BlueField DPUs
Bare-Metal-as-a-Service
Kubernetes
Redfish
IPMI
iPXE
Slurm
Virtual Networking
Distributed Storage
System Integration
Infrastructure Pipelines
Industry Standards (OCP, CNCF, UCF)

NVIDIA

Designs GPUs and AI computing solutions

About NVIDIA

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Santa Clara, CaliforniaHeadquarters
1993Year Founded
$19.5MTotal Funding
IPOCompany Stage
Automotive & Transportation, Enterprise Software, AI & Machine Learning, GamingIndustries
10,001+Employees

Benefits

Company Equity
401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.
Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.
Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.
NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.
Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.
Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

Land your dream remote job 3x faster with AI