NVIDIA

Principal System Cloud Architect

California, United States

Not SpecifiedCompensation
Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Semiconductors, Cloud Computing, Data Center Infrastructure, AI & Machine LearningIndustries

Principal System Architect - GPU Infrastructure-as-a-Service (IaaS)

Employment Type: Full time

Position Overview

NVIDIA is seeking a Principal System Architect to lead the architectural vision for our GPU Infrastructure-as-a-Service (IaaS) offerings. This strategic role focuses on defining reference architectures and system blueprints that integrate NVIDIA’s latest innovations — including GB200 Grace Blackwell systems, Spectrum-X, Bluefield, InfiniBand, Storage (Block, File, Object), and AI Enterprise software stacks — into scalable, high-performance cloud infrastructure for on-prem, Neo Clouds, and CSPs. This role requires deep engagements across hardware, networking, orchestration, and partner ecosystems to define the future of GPU cloud services.

What You’ll Be Doing

Architect Future-Ready GPU Infrastructure

  • Define scalable, secure, and efficient architectures for GPU-based IaaS using NVIDIA’s full stack: DGX/HGX, GB200, NVLink/NVSwitch, InfiniBand, and Spectrum-X.

Lead Reference Architecture Development

  • Work with internal engineering, cloud partners, and OEMs to define and publish validated reference architectures covering bare-metal provisioning, virtualization, storage fabrics, and networking.

Drive End-to-End Cloud Infrastructure Strategy

  • Architect solutions for bare-metal-as-a-service, VMaaS, and container orchestration (Kubernetes), integrated with virtual networking (VPCs), Infiniband fabrics, high-performance storage, and AI workloads.

Influence Product Strategy Across Domains

  • Partner with silicon, platform, networking, and software teams to ensure alignment of architecture with NVIDIA’s roadmap for GPU, DPU, and AI services.

Engage with Ecosystem Partners

  • Represent NVIDIA in joint solution development with CSPs, OEMs, and hyperscale customers to align infrastructure strategies and deployment practices.

Evaluate Trade-Offs and Drive Decisions

  • Make high-impact architectural decisions across performance, scalability, multi-tenancy, power efficiency, and manageability.

What We Need to See

  • 15+ years in system architecture, with deep experience in cloud-scale infrastructure, HPC, or AI platforms.
  • Proven expertise in GPU platforms, data center networking (InfiniBand, RoCE, Spectrum), virtual networking, storage, and orchestration technologies.
  • Strong understanding of Kubernetes, VM provisioning, bare-metal provisioning, and infrastructure automation.
  • MS or PhD in Computer Engineering, Electrical Engineering, related field, or equivalent experience.
  • Demonstrated ability to define, document, and present architectural designs and influence cross-functional teams.

Ways to Stand Out from the Crowd

  • Experience with NVIDIA technologies such as DGX, HGX, GB200, NVLink, NVSwitch, BlueField, Magnum IO, and Spectrum-X.
  • Deep knowledge of AI/ML workloads, distributed training architectures, and GPU scheduler integration.
  • Familiarity with CSP environments (AWS, Azure, OCI, GCP) and hybrid/multi-cloud architectures.
  • Participation in open standards and industry bodies (e.g., OCP, CNCF, Kubernetes SIGs).

Company Information

With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us, and due to unparalleled growth, our outstanding teams are rapidly growing.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected.

Compensation & Benefits

  • Salary Range: $272,000 USD - $425,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
  • Eligibility: You will also be eligible for equity and benefits.

Application Instructions

NVIDIA accepts applications on an ongoing basis.

Skills

System Architecture
GPU Infrastructure
Cloud Infrastructure
Reference Architecture Development
Bare-metal provisioning
Virtualization
Storage Fabrics
Networking
Kubernetes
Virtual Networking
High-Performance Storage
AI Workloads
Hardware
Networking
Orchestration
Partner Ecosystems
Trade-Off Analysis
Performance Optimization
Scalability
Multi-tenancy
Power Efficiency
Manageability

NVIDIA

Designs GPUs and AI computing solutions

About NVIDIA

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Santa Clara, CaliforniaHeadquarters
1993Year Founded
$19.5MTotal Funding
IPOCompany Stage
Automotive & Transportation, Enterprise Software, AI & Machine Learning, GamingIndustries
10,001+Employees

Benefits

Company Equity
401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.
Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.
Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.
NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.
Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.
Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

Land your dream remote job 3x faster with AI