Senior Solutions Architect, HPC Systems Engineer
NVIDIA- Full Time
- Senior (5 to 8 years)
Candidates should have a strong background in low-level performance-critical software development, with experience in C++ and CUDA. Familiarity with distributed algorithms using RDMA and network simulation techniques is preferred, along with the ability to write performance-sensitive CPU and/or GPU code.
The Software Engineer, Networking will design and implement custom networking collectives integrated into the training stack. They will collaborate with ML researchers to create efficient collective operations, ensure optimal use of network transports in supercomputers for large training jobs, and work on simulations to inform future supercomputer network designs.
Develops safe and beneficial AI technologies
OpenAI develops and deploys artificial intelligence technologies aimed at benefiting humanity. The company creates advanced AI models capable of performing various tasks, such as automating processes and enhancing creativity. OpenAI's products, like Sora, allow users to generate videos from text descriptions, showcasing the versatility of its AI applications. Unlike many competitors, OpenAI operates under a capped profit model, which limits the profits it can make and ensures that excess earnings are redistributed to maximize the social benefits of AI. This commitment to safety and ethical considerations is central to its mission of ensuring that artificial general intelligence (AGI) serves all of humanity.