NVIDIA

Solution Architect - OEM AI Software

Texas, United States

Not SpecifiedCompensation
Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Artificial Intelligence, SoftwareIndustries

Solutions Architect - OEM Enterprise AI

Employment Type: Full time

Salary Range: $120,000 - $235,750 USD (based on location, experience, and pay equity)

Location Type: [Information not provided]

Position Overview

NVIDIA is seeking outstanding Solutions Architects to help grow our OEM enterprise AI business. In this role, you will work across different teams, assisting partners and customers with the latest Accelerated Computing and Generative AI, and AI Factory deployments. We are looking for talented individuals to join us at the forefront of technological advancement.

As a trusted technical advisor to our OEM partners, you will work on exciting software solutions focused on enabling enterprise Generative AI workflows. You should be comfortable in a multifaceted environment and possess experience with Generative AI, LLMs, Deep Learning, and GPU technologies. This is an excellent opportunity to collaborate within an interdisciplinary team utilizing the latest NVIDIA technologies.

What You Will Be Doing

  • Working with OEM partners to architect enterprise-grade, end-to-end generative AI software solutions.
  • Collaborating closely with OEM partners' software development teams to craft world-class joint AI solutions.
  • Supporting pre-sales activities, including technical presentations and demonstrations of Generative AI capabilities, in collaboration with sales and business development teams.
  • Working closely with NVIDIA engineering teams to provide feedback and contribute to the evolution of generative AI software.
  • Engaging directly with customers and partners to understand their requirements and challenges.
  • Leading workshops and design sessions to define and refine generative AI solutions, with a strong emphasis on enterprise workflows.
  • Implementing strategies for efficient and effective training of LLMs to achieve peak performance.
  • Designing and implementing RAG-based workflows to improve content generation and information retrieval.

What We Need To See

  • 3-5+ years of hands-on experience as a Solutions Architect or similar role, with a specific focus on AI solutions.
  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering, or related fields (or equivalent experience).
  • Proven track record of successfully deploying and optimizing Generative AI models for inference in production environments.
  • Expertise in training and fine-tuning LLMs using popular frameworks such as TensorFlow, PyTorch, or Hugging Face Transformers.
  • Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms, with a focus on GPUs.
  • Solid understanding of GPU cluster architecture and the ability to leverage parallel processing for accelerated model training and inference.
  • Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to both technical and non-technical collaborators.
  • Experience leading workshops, training sessions, and presenting technical solutions to diverse audiences.

Ways To Stand Out From The Crowd

  • Experience deploying Generative AI models in cloud environments and on-premises infrastructures.
  • Experience with NVIDIA GPUs and software libraries, such as NVIDIA NIM, NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM.
  • Proven ability to optimize LLM models for inference speed, memory efficiency, and resource utilization.
  • Familiarity with Docker or equivalent experience in containerization technologies and Kubernetes for scalable and efficient model deployment.
  • Deep understanding of GPU cluster architecture, parallel computing, and distributed computing concepts.

Company Information

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We highly value diversity in our current and future employees. NVIDIA accepts applications on an ongoing basis.

You will also be eligible for equity and benefits.

Skills

Generative AI
LLMs
Deep Learning
GPU technologies
RAG
Solution Architecture
Software Development
Technical Presentations
AI Solutions

NVIDIA

Designs GPUs and AI computing solutions

About NVIDIA

NVIDIA designs and manufactures graphics processing units (GPUs) and system on a chip units (SoCs) for various markets, including gaming, professional visualization, data centers, and automotive. Their products include GPUs tailored for gaming and professional use, as well as platforms for artificial intelligence (AI) and high-performance computing (HPC) that cater to developers, data scientists, and IT administrators. NVIDIA generates revenue through the sale of hardware, software solutions, and cloud-based services, such as NVIDIA CloudXR and NGC, which enhance experiences in AI, machine learning, and computer vision. What sets NVIDIA apart from competitors is its strong focus on research and development, allowing it to maintain a leadership position in a competitive market. The company's goal is to drive innovation and provide advanced solutions that meet the needs of a diverse clientele, including gamers, researchers, and enterprises.

Santa Clara, CaliforniaHeadquarters
1993Year Founded
$19.5MTotal Funding
IPOCompany Stage
Automotive & Transportation, Enterprise Software, AI & Machine Learning, GamingIndustries
10,001+Employees

Benefits

Company Equity
401(k) Company Match

Risks

Increased competition from AI startups like xAI could challenge NVIDIA's market position.
Serve Robotics' expansion may divert resources from NVIDIA's core GPU and AI businesses.
Integration of VinBrain may pose challenges and distract from NVIDIA's primary operations.

Differentiation

NVIDIA leads in AI and HPC solutions with cutting-edge GPU technology.
The company excels in diverse markets, including gaming, data centers, and autonomous vehicles.
NVIDIA's cloud services, like CloudXR, offer scalable solutions for AI and machine learning.

Upsides

Acquisition of VinBrain enhances NVIDIA's AI capabilities in the healthcare sector.
Investment in Nebius Group boosts NVIDIA's AI infrastructure and cloud platform offerings.
Serve Robotics' expansion, backed by NVIDIA, highlights growth in autonomous delivery services.

Land your dream remote job 3x faster with AI