Solutions Architect, AI/ML
SnowflakeFull Time
Senior (5 to 8 years)
Key technologies and capabilities for this role
Common questions about this position
Candidates need an MS/PhD or equivalent experience in Computer Science, Data Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields, plus 5+ years of work or research experience with Python/C++/other software development.
Proficiency in tools like TRT LLM, vLLM, SGLang or similar is required, along with strong systems knowledge, experience with neural network inference techniques such as speculative decoding, request scheduler optimizations, FP4 quantization, and understanding of modern NLP including transformer, state space, diffusion, MOE model architectures.
Excellent verbal, written communication, and technical presentation skills in English are required.
This information is not specified in the job description.
This information is not specified in the job description.
Designs GPUs and AI computing solutions
NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.