NVIDIA

Senior Software Architect - Deep Learning and HPC Communications

Germany

Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Semiconductors, High Performance Computing, Artificial Intelligence, Computer HardwareIndustries

Requirements

Candidates should possess a Master’s or Doctoral degree in Computer Science or Electrical Engineering, or equivalent practical experience. They must have at least five years of relevant experience in software architecture, with a focus on high-performance computing and communication technologies. Strong programming skills in C/C++ are essential, along with expertise in parallel programming models such as MPI, SHMEM, NCCL, NVSHMEM, OpenSHMEM, UCX, and UCC. A deep understanding of operating systems, computer architecture, and network architecture is required, including familiarity with concepts like topology, algorithms, and communication scaling. Experience with Linux is also necessary.

Responsibilities

The Senior Software Architect will investigate opportunities to improve communication performance by identifying bottlenecks in existing systems. They will design and implement new communication technologies to accelerate AI and HPC workloads, exploring innovative solutions in both hardware and software for next-generation platforms as part of co-design efforts. This role involves building proofs-of-concept, conducting experiments, and performing quantitative modeling to evaluate and drive new innovations. The architect will utilize simulation to explore the performance of large GPU clusters, and collaborate with GPU, networking, and software architects to deliver platforms that break performance barriers and realize NVIDIA’s vision.

Skills

Deep Learning
High Performance Computing
GPU Communication
NCCL
NVSHMEM
GPUDirect
Interconnects (NVLink, PCIe)
Networking (InfiniBand, Ethernet)
Hardware-Software Co-design
Quantitative Modeling
Performance Optimization

NVIDIA

Designs GPUs and AI computing solutions

About NVIDIA

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Santa Clara, CaliforniaHeadquarters
1993Year Founded
$19.5MTotal Funding
IPOCompany Stage
Automotive & Transportation, Enterprise Software, AI & Machine Learning, GamingIndustries
10,001+Employees

Benefits

Company Equity
401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.
Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.
Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.
NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.
Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.
Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

Land your dream remote job 3x faster with AI