NVIDIA

Senior Solutions Architect, Gen AI

New York, New York, United States

Compensation: Not Specified
Experience Level: Expert & Leadership (9+ years)
Job Type: Full Time
Visa: Unknown
Industries: Hospitality, Travel, Automotive

Solutions Architect, Generative AI (Hospitality & Travel)

Employment Type: Full-time

Salary Range: $184,000 - $356,500 USD (based on location, experience, and pay equity)

Location Type: Not Specified

Position Overview

NVIDIA is seeking a Solutions Architect skilled in full-cycle Generative AI development and deployment to join our Hospitality and Travel Solutions Architecture team. You will assist key customers in adopting NVIDIA's full-stack technologies and building/deploying solutions around Generative AI and other GPU-accelerated technologies. This role involves direct engagement with developers, researchers, data scientists, and business/engineering teams at strategic customer accounts, influencing product strategy and providing technical expertise.

Responsibilities

  • Provide hands-on technical mentorship to partners and customers on the NVIDIA GenAI stack.
  • Guide customers in developing and deploying Agentic AI workflows on NVIDIA platforms, quantifying the benefits of accelerated computing.
  • Build demonstrations and Proofs of Concept (POCs) for solutions addressing critical customer business needs.
  • Assist in drafting requirements for missing features to facilitate customer and partner progress.
  • Educate customers on new NVIDIA GenAI technologies and platforms through presentations and workshops.
  • Develop industry-specific collateral, including notebooks and blog posts.
  • Partner with NVIDIA engineering, product, and sales teams to secure design wins.
  • Enable the development and growth of NVIDIA product features through customer feedback and POC evaluations.

Requirements

  • Master's or Ph.D. in Computer Science, Artificial Intelligence, or equivalent experience.
  • 8+ years of hands-on experience in a technical AI role, with a strong emphasis on Generative AI.
  • Proficiency in the latest model architectures and an understanding of their computational complexities.
  • Proven track record of deploying and optimizing Large Language Models (LLMs) for inference in production environments using inferencing engines (e.g., vLLM, TRT-LLM, SGLang).
  • Expertise in training and fine-tuning LLMs using popular frameworks (e.g., TensorFlow, PyTorch, Hugging Face Transformers).
  • Solid understanding of GPU cluster architecture and parallel processing for accelerated model training and inference.
  • Experience with basic DevOps tools (e.g., Docker, Kubernetes, GitLab, Linux Command Line, Shell).
  • Excellent communication and teamwork skills, with the ability to explain complex technical concepts to diverse audiences.
  • Experience leading workshops, training sessions, and presenting technical solutions.

Ways to Stand Out

  • Experience with Agentic AI frameworks, tools, and protocols (e.g., LangChain, LangGraph, MCP).
  • Understanding of Multimodal LLMs, Vision-Language Models (VLMs), etc.
  • Experience deploying LLMs at scale on mainstream cloud providers (e.g., AWS, Azure, GCP).
  • Proven ability to profile and optimize inference latency, throughput, memory, and I/O utilization.
  • Mathematical understanding of different parallelization techniques in Generative AI.

Company Information

NVIDIA is building the world's leading AI company. We are committed to fostering a diverse work environment and are proud to be an equal opportunity employer. We highly value diversity in our current and future employees and do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Application Instructions: NVIDIA accepts applications on an ongoing basis.

Skills

Generative AI
AI development
AI deployment
NVIDIA GenAI stack
Agentic AI workflows
Accelerated computing
GPU-accelerated technologies
Computer Science
Artificial Intelligence
Technical mentorship
Proofs of Concept (POCs)
Technical training
Workshops
NVIDIA engineering
NVIDIA product features

NVIDIA

Designs GPUs and AI computing solutions

About NVIDIA

NVIDIA designs and manufactures graphics processing units (GPUs) and system-on-a-chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Its products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, which allows it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Headquarters: Santa Clara, California
Year Founded: 1993
Total Funding: $19.5M
Company Stage: IPO
Industries: Automotive & Transportation, Enterprise Software, AI & Machine Learning, Gaming
Employees: 10,001+

Benefits

Company Equity
401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.
Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.
Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.
NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.
Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.
Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.
