Distinguished Engineer – Data Center System Software Architect at NVIDIA

Santa Clara, California, United States

NVIDIA Logo
Not SpecifiedCompensation
Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Technology, Semiconductors, Data CentersIndustries

Requirements

  • Deep expertise in scalable and performant server system architecture, focusing on SW/HW interfaces
  • Extensive experience with complex system software for accelerators (GPUs, DPUs, FPGAs)
  • Mastery of system firmware (SBIOS, OpenBMC), embedded systems, and Linux kernel internals
  • Proficiency in Out-of-Band and In-Band management architectures, device management protocols (e.g., MCTP, PLDM, SPDM, RDE), and system management protocols (Redfish, IPMI)
  • Extensive knowledge of networking technologies and protocols, including TCP/IP, Ethernet, InfiniBand, as well as advanced switching and routing concepts
  • Experience collaborating with platform security experts to define tradeoffs between security and ease of use
  • Demonstrated success in leading complex, cross-functional projects to completion, showcasing the ability to influence and achieve results without direct authority in large-scale, collaborative environments
  • Demonstrable experience in implementing left shift strategy to de-risk program execution
  • BS or MS degree in Computer Science, Electrical Engineering, or a related field (or equivalent experience)
  • 20+ years in the area of System architecture and design

Responsibilities

  • Own the end-to-end architecture of NVIDIA’s data center systems (DGX and HGX) at the system software level, including firmware, kernel drivers, operating systems, and user mode drivers
  • Collaborate with internal component leads and engage with industry-leading cloud service providers to bring products to market
  • Serve as the primary technical point of contact for major customers, leading technological discussions, defining KPIs, gathering requirements, and addressing complex technical queries
  • Lead technical innovation and strategic collaborations with major hyperscalers to architect next-generation data center products
  • Align NVIDIA’s roadmap with major customers’ requirements through direct engagement
  • Develop and drive adoption of new technologies and protocols
  • Make critical technical decisions in ambiguous situations, mitigating risks through left-shift strategies

Skills

Linux Kernel
OpenBMC
SBIOS
MCTP
PLDM
SPDM
RDE
Redfish
IPMI
TCP/IP
Ethernet
InfiniBand
GPUs
DPUs
FPGAs
Embedded Systems

NVIDIA

Designs GPUs and AI computing solutions

About NVIDIA

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Santa Clara, CaliforniaHeadquarters
1993Year Founded
$19.5MTotal Funding
IPOCompany Stage
Automotive & Transportation, Enterprise Software, AI & Machine Learning, GamingIndustries
10,001+Employees

Benefits

Company Equity
401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.
Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.
Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.
NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.
Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.
Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

Land your dream remote job 3x faster with AI