[Remote] Senior Software Developer, HPC Cluster Management at NVIDIA

Netherlands

NVIDIA Logo
Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Technology, High Performance ComputingIndustries

Requirements

  • Degree in Computer Science or related field (or equivalent experience)
  • 7+ years of experience in software development and/or related roles
  • Very familiar with the Linux operating system, particularly networking concepts in Linux
  • Good practical knowledge about the most common software installed as part of a typical Linux installation
  • Proficient in Python and intimately familiar with object-oriented software design, design patterns, and concurrent programming techniques
  • Emphasis on high quality of work and producing clean code
  • Eager to learn and use new technologies
  • Ways to stand out
  • Experience with Ansible
  • Experience with high-performance computing and system administration
  • Knowledge of Kubernetes, AWS, Azure, GCE, OpenStack, Jenkins, and distributed programming
  • Proficiency in C++

Responsibilities

  • Development of the head node and compute node installation and provisioning processes
  • Work on functionality in the area of edge site deployment
  • Integrating product with the latest hardware (e.g., GPUs, DPUs, accelerators, high-speed interconnects such as InfiniBand)
  • Develop new features in firmware management and network configuration for existing and next generation of Nvidia platforms
  • Develop functionality that makes Bright clusters usable for a wider range of workloads and increases scalability to allow clusters to scale to huge number of nodes
  • Adding support for new Linux distributions
  • Improving support for alternative CPU architectures such as ARM
  • Work on adding features to Ansible collections for Cluster Installation and Management
  • Assist support team with customer support requests in the above mentioned features and help customers use the product more efficiently

Skills

Linux
Ansible
HPC
Cluster Management
Bare Metal Provisioning
GPUs
DPUs
InfiniBand
Firmware Management
Network Configuration
ARM

NVIDIA

Designs GPUs and AI computing solutions

About NVIDIA

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Santa Clara, CaliforniaHeadquarters
1993Year Founded
$19.5MTotal Funding
IPOCompany Stage
Automotive & Transportation, Enterprise Software, AI & Machine Learning, GamingIndustries
10,001+Employees

Benefits

Company Equity
401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.
Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.
Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.
NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.
Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.
Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

Land your dream remote job 3x faster with AI