Groq

Systems Quality and Reliability Lead

San Jose, California, United States

Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Semiconductor, Artificial Intelligence, AI & Machine LearningIndustries

Position Overview

  • Location Type: Hybrid
  • Job Type: Full-time
  • Salary: $186,915 - $305,900

Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed and scale they need. Headquartered in Silicon Valley, we are on a mission to make high performance AI compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast.

Requirements

  • Education: BS/MS in Electrical Engineering, Physics, or a related degree
  • Experience: 7+ years of hands-on systems test and/or validation engineering experience
  • Management Experience: Proven hands-on management and leadership experience
  • Technical Skills:
    • Competence using lab equipment such as oscilloscopes, logic analyzers, power analyzers, etc.
    • Deeply cognizant of the differences between System test vs ATE test
    • Experience with enabling reliability tests such as HTOL and quality tests such as Burn In.
    • Working knowledge of Failure analysis techniques and tools such as FIB, SEM, TDR, VNA, and CSAM
    • Working knowledge of Fault Isolation techniques such as OBIRCH, DLS/LADA, LVP and LVI
    • Proficiency with high speed interfaces (Serdes, PCIe, DDR)
    • Experience testing power sub-sections (e.g. POLs, VRMs, etc.)
    • Familiarity with lower speed interfaces like SPI, I2C, CAN bus, etc.
    • Proficiency in Python, Perl, C++, or other languages on UNIX/Linux

Responsibilities

  • Conduct and lead debug and root-cause analysis of field RMAs.
  • Collaborate with Systems Engineers, Hardware Engineers, Software Engineers, and Operations Engineers as required.
  • Scale Root Cause Failure Analysis capabilities within the organization.
  • Create Failure Analysis result reports that align with standard 8D or similar processes.
  • Develop and optimize RMA testing strategy to improve timeliness and effectiveness of characterization process.
  • Analyze RMA, Failure Analysis, and Repair data. Identify trends and raise quality alerts when necessary. Drive resolution, containment, and mitigation plans for such quality alerts.
  • Oversee hardware quality performance, monitoring field quality data and associated metrics including RMA Rates, MTBF, and Reliability Ratio.
  • Manage operational performance of Failure Analysis at contract manufacturer(s), ensuring partner(s) achieve key performance indicators, including FA cycle times, fault duplication rates, and fault isolation rates.
  • Drive learning’s from RMA / FA back into Manufacturing, Engineering, and Support teams.
  • Oversee the set-up of new products into Failure Analysis operations.

Application Instructions

  • Apply with your resume and cover letter.

Company Information

  • About Groq: Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed and scale they need. Headquartered in Silicon Valley, we are on a mission to make high performance AI compute more accessible and affordable. When real-time AI is within reach, anything is possible.
  • Compensation: At Groq, a competitive base salary is part of our comprehensive compensation package, which includes equity and benefits. For this role, the base salary range is $186,915 to $305,900, determined by your skills, qualifications, experience and internal benchmarks.
  • Location: Some roles may require being located near or on our primary sites, as indicated in the job description.
  • Attributes of a Groqster:
    • Humility - Egos are checked at the door
    • Collaborative & Team Savvy - We make up the smartest person in the room, together
    • Growth & Giver Mindset - Learn it all versus know it all, we share knowledge generously
    • Curious & Innovative - Take a creative approach to projects, problems, and design
    • Passion, Grit, & Boldness - no limit thinking, fueling informed risk taking

If this sounds like you, we’d love to hear from you.

Skills

Electrical Engineering
Physics
Systems Test
Validation Engineering
Root Cause Analysis
Failure Analysis
8D Methodology
Data Analysis
RMA
MTBF
Reliability Ratio

Groq

AI inference technology for scalable solutions

About Groq

Groq specializes in AI inference technology, providing the Groq LPU™, which is known for its high compute speed, quality, and energy efficiency. The Groq LPU™ is designed to handle AI processing tasks quickly and effectively, making it suitable for both cloud and on-premises applications. Unlike many competitors, Groq's products are designed, fabricated, and assembled in North America, which helps maintain high standards of quality and performance. The company targets a variety of clients across different industries that require fast and efficient AI processing capabilities. Groq's goal is to deliver scalable AI inference solutions that meet the growing demands for rapid data processing in the AI and machine learning market.

Mountain View, CaliforniaHeadquarters
2016Year Founded
$1,266.5MTotal Funding
SERIES_DCompany Stage
AI & Machine LearningIndustries
201-500Employees

Benefits

Remote Work Options
Company Equity

Risks

Increased competition from SambaNova Systems and Gradio in high-speed AI inference.
Geopolitical risks in the MENA region may affect the Saudi Arabia data center project.
Rapid expansion could strain Groq's operational capabilities and supply chain.

Differentiation

Groq's LPU offers exceptional compute speed and energy efficiency for AI inference.
The company's products are designed and assembled in North America, ensuring high quality.
Groq emphasizes deterministic performance, providing predictable outcomes in AI computations.

Upsides

Groq secured $640M in Series D funding, boosting its expansion capabilities.
Partnership with Aramco Digital aims to build the world's largest inferencing data center.
Integration with Touchcast's Cognitive Caching enhances Groq's hardware for hyper-speed inference.

Land your dream remote job 3x faster with AI