Senior Software Engineer-Distributed Inference
NVIDIA- Full Time
- Senior (5 to 8 years)
Candidates must have expertise in CUDA or OpenCL, with demonstrated experience developing CUDA kernels or equivalent technologies. Proficiency in Python is required for AI and performance optimization tasks. Experience with deep learning frameworks such as PyTorch or TensorFlow is necessary, along with a strong understanding of CPU and GPU architecture to analyze and optimize performance at the hardware level.
The Senior AI Performance Engineer will optimize inference engines to improve performance and scalability. They will enhance scalable AI infrastructure by implementing optimizations that accelerate AI inference, impacting efficiency and revenue generation. The role involves developing and deploying CUDA kernels for deep learning workloads, conducting performance analysis to identify and resolve bottlenecks, and engaging with the AI research community to track developments and contribute to open-source projects. Additionally, the engineer will improve onboarding and documentation to streamline workflows and collaborate cross-functionally with AI researchers, engineers, and infrastructure teams.
Utilizes wasted energy for computing power
Crusoe Energy Systems Inc. provides digital infrastructure that focuses on using wasted, stranded, or clean energy sources to power high-performance computing and artificial intelligence. The company helps clients in the technology and energy sectors by offering scalable computing solutions that aim to reduce greenhouse gas emissions and support the transition to cleaner energy. Crusoe's approach involves converting excess natural gas and renewable energy into computing power, which allows them to maximize resource efficiency while minimizing environmental impact. Unlike many competitors, Crusoe specifically targets the intersection of energy and technology, generating revenue by supplying computing resources to enterprises that need significant computational power for applications like AI and machine learning, along with providing technical support.