Platform Architect-GenAI at KLA

Ann Arbor, Michigan, United States

KLA Logo
Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Semiconductor, Information TechnologyIndustries

Requirements

  • Bachelor's Degree or equivalent experience in Computer Science or related IT field
  • Eight (8) years of implementing and maintaining AI/ML Infrastructure in an On-Prem environment
  • Strong experience with AI/ML infrastructure and tools, including GPU clusters and Kubernetes
  • Proficiency in deploying and managing open-source GenAI components and vector databases
  • Hands-on experience with high-performance computing (HPC) environments
  • Expertise in designing and managing on-premises, cloud, and hybrid-based ML platforms
  • Strong Linux system administration and scripting skills

Responsibilities

  • Design, deploy, and manage scalable AI/ML infrastructure supporting hybrid cloud and on-prem environments
  • Work extensively with open-source MLOps platforms (e.g., Kubeflow, MLflow, Flyte) to streamline model development, deployment, and lifecycle management
  • Architect and optimize GenAI infrastructure, including integration of vector databases and large language model serving frameworks
  • Implement and manage high-performance shared storage systems (e.g., Ceph, MinIO) for distributed AI workloads
  • Set up and maintain InfiniBand networking for low-latency, high-throughput GPU cluster communication
  • Collaborate with ML engineers, data scientists, and DevOps teams to build a cohesive and efficient AI/ML ecosystem
  • Monitor and enhance infrastructure performance, ensuring scalability, reliability, and security
  • Evaluate and integrate emerging GenAI tools and frameworks to continuously improve platform capabilities

Skills

Key technologies and capabilities for this role

GenAILLMOpsAI/MLCloudDevOpsHybrid Platforms

Questions & Answers

Common questions about this position

What is the employment type for this position?

The position is full-time employment.

Is this role remote or does it require on-site work?

This information is not specified in the job description.

What key skills are required for the Platform Architect-GenAI role?

Required skills include experience with open-source MLOps platforms like Kubeflow, MLflow, Flyte; architecting GenAI infrastructure with vector databases and LLM serving frameworks; managing shared storage systems like Ceph, MinIO; and setting up InfiniBand networking for GPU clusters.

What is the company culture like at KLA?

KLA offers an exciting work environment where teams thrive on tackling hard problems, with a strong focus on innovation through investing 15% of sales into R&D, and collaborative expert teams of physicists, engineers, data scientists, and problem-solvers.

What makes a strong candidate for this GenAI Platform Architect role?

A strong candidate is highly skilled and motivated, with expertise in designing scalable AI/ML infrastructure for hybrid cloud and on-prem environments, operationalizing LLMOps, and collaborating with ML engineers, data scientists, and DevOps teams.

KLA

Provides process control and yield management solutions

About KLA

KLA provides process control and yield management solutions primarily for semiconductor manufacturers. The company offers advanced inspection tools, metrology systems, and computational analytics that help manufacturers identify and fix defects during production. This process enhances the quality and reliability of electronic devices, leading to higher production yields. KLA distinguishes itself from competitors by focusing on high-precision equipment and software that are essential for defect detection in semiconductor manufacturing. The company's goal is to promote sustainability by committing to using 100% renewable electricity in its operations by 2030.

Milpitas, CaliforniaHeadquarters
1975Year Founded
IPOCompany Stage
Industrial & Manufacturing, EnergyIndustries
5,001-10,000Employees

Benefits

Health Insurance
Dental Insurance
Vision Insurance
Life Insurance
401(k) Retirement Plan
401(k) Company Match
Employee Stock Purchase Plan
Student Loan Assistance
Tuition Reimbursement
Wellness Program
Mental Health Support
Paid Vacation
Paid Holidays
Parental Leave

Risks

Emerging competition in solid-state battery technology may impact market share.
Rapid innovation in semiconductor processes may outpace KLA's current technology.
Potential delays in achieving renewable electricity goals could affect brand reputation.

Differentiation

KLA specializes in advanced inspection tools and metrology systems for semiconductors.
The company integrates computational analytics to enhance defect detection and yield management.
KLA is committed to sustainability, aiming for 100% renewable electricity by 2030.

Upsides

Rising demand for advanced inspection tools due to AI and IoT growth.
AI-driven predictive maintenance reduces downtime in semiconductor manufacturing.
Collaborations with cloud providers enhance data analytics and process optimization.

Land your dream remote job 3x faster with AI