Platform Architect-GenAI at KLA

Ann Arbor, Michigan, United States

KLA Logo
Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Semiconductor, Information TechnologyIndustries

Requirements

  • Bachelor's Degree or equivalent experience in Computer Science or related IT field
  • Eight (8) years of implementing and maintaining AI/ML Infrastructure in an On-Prem environment
  • Strong experience with AI/ML infrastructure and tools, including GPU clusters and Kubernetes
  • Proficiency in deploying and managing open-source GenAI components and vector databases
  • Hands-on experience with high-performance computing (HPC) environments
  • Expertise in designing and managing on-premises, cloud, and hybrid-based ML platforms
  • Strong Linux system administration and scripting skills

Responsibilities

  • Design, deploy, and manage scalable AI/ML infrastructure supporting hybrid cloud and on-prem environments
  • Work extensively with open-source MLOps platforms (e.g., Kubeflow, MLflow, Flyte) to streamline model development, deployment, and lifecycle management
  • Architect and optimize GenAI infrastructure, including integration of vector databases and large language model serving frameworks
  • Implement and manage high-performance shared storage systems (e.g., Ceph, MinIO) for distributed AI workloads
  • Set up and maintain InfiniBand networking for low-latency, high-throughput GPU cluster communication
  • Collaborate with ML engineers, data scientists, and DevOps teams to build a cohesive and efficient AI/ML ecosystem
  • Monitor and enhance infrastructure performance, ensuring scalability, reliability, and security
  • Evaluate and integrate emerging GenAI tools and frameworks to continuously improve platform capabilities

Skills

GenAI
LLMOps
AI/ML
Cloud
DevOps
Hybrid Platforms

KLA

Provides process control and yield management solutions

About KLA

KLA provides process control and yield management solutions primarily for semiconductor manufacturers. The company offers advanced inspection tools, metrology systems, and computational analytics that help manufacturers identify and fix defects during production. This process enhances the quality and reliability of electronic devices, leading to higher production yields. KLA distinguishes itself from competitors by focusing on high-precision equipment and software that are essential for defect detection in semiconductor manufacturing. The company's goal is to promote sustainability by committing to using 100% renewable electricity in its operations by 2030.

Milpitas, CaliforniaHeadquarters
1975Year Founded
IPOCompany Stage
Industrial & Manufacturing, EnergyIndustries
5,001-10,000Employees

Benefits

Health Insurance
Dental Insurance
Vision Insurance
Life Insurance
401(k) Retirement Plan
401(k) Company Match
Employee Stock Purchase Plan
Student Loan Assistance
Tuition Reimbursement
Wellness Program
Mental Health Support
Paid Vacation
Paid Holidays
Parental Leave

Risks

Emerging competition in solid-state battery technology may impact market share.
Rapid innovation in semiconductor processes may outpace KLA's current technology.
Potential delays in achieving renewable electricity goals could affect brand reputation.

Differentiation

KLA specializes in advanced inspection tools and metrology systems for semiconductors.
The company integrates computational analytics to enhance defect detection and yield management.
KLA is committed to sustainability, aiming for 100% renewable electricity by 2030.

Upsides

Rising demand for advanced inspection tools due to AI and IoT growth.
AI-driven predictive maintenance reduces downtime in semiconductor manufacturing.
Collaborations with cloud providers enhance data analytics and process optimization.

Land your dream remote job 3x faster with AI