Senior Software Engineer-Distributed Inference
NVIDIA- Full Time
- Senior (5 to 8 years)
Candidates should have experience working with multi-modal machine learning pipelines and possess strong software engineering skills, particularly in Python. A deep understanding of optimizing training kernels and stable training dynamics is essential, along with a passion for improving system performance and maintainability.
The Distributed Training Engineer will collaborate with researchers to develop systems-efficient video models and architectures. They will apply the latest techniques to enhance the internal training framework for hardware efficiency and profile and optimize the training framework.
Develops safe and beneficial AI technologies
OpenAI develops and deploys artificial intelligence technologies aimed at benefiting humanity. The company creates advanced AI models capable of performing various tasks, such as automating processes and enhancing creativity. OpenAI's products, like Sora, allow users to generate videos from text descriptions, showcasing the versatility of its AI applications. Unlike many competitors, OpenAI operates under a capped profit model, which limits the profits it can make and ensures that excess earnings are redistributed to maximize the social benefits of AI. This commitment to safety and ethical considerations is central to its mission of ensuring that artificial general intelligence (AGI) serves all of humanity.