Senior 3D AI Generalist
Global WorldwideFull Time
Mid-level (3 to 4 years)
Candidates should possess a Bachelor's, Master's, or PhD degree in Computer Science, Computer Architecture, or a related technical field, along with 3+ years of experience in Deep Learning model implementation and software development. Extensive knowledge of at least one Deep Learning framework, with practical experience in PyTorch, is required, as is a deep understanding of transformer architectures, attention mechanisms, Visual Generative AI foundational model architectures (e.g., U-Net, DiT), and inference bottlenecks. Excellent Python programming skills, strong problem-solving and analytical skills, fundamental knowledge of algorithms and Deep Learning, and familiarity with Docker containerization are also necessary. Experience with performance measurements, profiling, model optimization and serving frameworks like TensorRT, TensorRT-LLM, vLLM, SGLang, and ONNX, as well as a deep understanding of distributed systems for large-scale model inference and serving, will help candidates stand out.
The Senior Deep Learning Engineer will optimize deep learning models for low-latency, high-throughput inference, focusing on Diffusion models for Visual Generative AI applications. Responsibilities include converting, deploying, and optimizing models using frameworks such as TensorRT, TensorRT-LLM, and vLLM, and understanding, analyzing, profiling, and optimizing the performance of deep learning workloads on NVIDIA GPU hardware and software platforms. The engineer will collaborate with research scientists and software engineers to ensure seamless integration of AI models from training to deployment and contribute to the development of automation and tooling for NVIDIA Inference Microservices (NIMs) and inference optimization, including creating automated benchmarks to track performance regressions.
Designs GPUs and AI computing solutions
NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.