Architect - AI
3cloudFull Time
Expert & Leadership (9+ years)
Candidates should possess a Ph.D. or equivalent industry experience in computer science, computer engineering, or a related field, along with 2+ years of experience in systems programming, parallel or distributed computing, or high-performance data movement. Strong programming skills in C++, Python, and ideally CUDA or other GPU programming models are required, and familiarity with AI frameworks such as PyTorch, TensorFlow, and understanding of how they utilize communication libraries is essential.
The HPC and AI Inference Software Architect will design and prototype scalable software systems to optimize distributed AI training and inference, focusing on throughput, latency, and memory efficiency. They will develop and evaluate enhancements to communication libraries like NCCL, UCX, and UCC, collaborating with AI framework teams to improve integration and performance. Furthermore, the Architect will co-design hardware features to accelerate data movement and enable new capabilities, contribute to the evolution of runtime systems and communication libraries, and ultimately, shape the future of scalable AI infrastructure.
Designs GPUs and AI computing solutions
NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.