AI Architect
bswiftFull Time
Expert & Leadership (9+ years)
Key technologies and capabilities for this role
Common questions about this position
The role requires 8+ years of experience building large-scale distributed systems or performance-critical software.
Candidates need a deep understanding of deep learning systems, GPU acceleration, AI model execution flows, and/or high performance networking, along with solid software engineering skills in C++ and/or Python, and familiarity with CUDA or similar platforms.
A Bachelor’s, Master’s, or PhD in Computer Science, Electrical Engineering, or equivalent experience is required.
This information is not specified in the job description.
This information is not specified in the job description.
Stand out with experience in LLM training or inference pipelines, transformer model optimization, model-parallel deployments, profiling and optimizing performance bottlenecks, AI accelerators, distributed communication patterns, and a proven track record of optimizing complex systems at scale.
You'll join the dynamic E2E Architecture group, working with top engineers, researchers, and partners across NVIDIA on cutting-edge systems for generative AI workloads.
Designs GPUs and AI computing solutions
NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.