Groq

Systems Quality and Reliability Lead

San Jose, California, United States

Compensation: Not Specified
Experience Level: Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: Semiconductor, Artificial Intelligence, AI & Machine Learning

Requirements

Candidates should possess a Bachelor’s or Master’s degree in Electrical Engineering, Physics, or a related field, along with 7+ years of hands-on experience in systems test and/or validation engineering. Demonstrated management and leadership experience is required, alongside competence using lab equipment such as oscilloscopes, logic analyzers, and power analyzers. Deep knowledge of failure analysis techniques and tools including FIB, SEM, TDR, VNA, and CSAM is also necessary, as is proficiency with high-speed interfaces like SerDes, PCIe, and DDR. Experience testing power sub-sections and familiarity with lower-speed interfaces such as SPI, I2C, and CAN bus are required, as well as proficiency in Python, Perl, C++, or other languages on UNIX/Linux. Experience in failure analysis for microprocessors, complex SoC devices, AI systems, servers, or network systems is preferred.

Responsibilities

The Systems Quality and Reliability Lead will own, build, and manage RMA and FA debug and root-cause analysis for existing and new Groq AI/ML products. This includes conducting and leading debug and root-cause analysis of field RMAs, collaborating with various engineering teams, scaling root-cause failure analysis capabilities, and creating failure analysis reports. The role also covers analyzing RMA and FA data to identify trends and drive resolution plans, overseeing hardware quality performance, managing the operational performance of Failure Analysis at contract manufacturers, driving learnings from RMA/FA back into the relevant teams, and overseeing the setup of new products into Failure Analysis operations.

Skills

Electrical Engineering
Physics
Systems Test
Validation Engineering
Root Cause Analysis
Failure Analysis
8D Methodology
Data Analysis
RMA
MTBF
Reliability Ratio

Groq

AI inference technology for scalable solutions

About Groq

Groq specializes in AI inference technology, providing the Groq LPU™, which is known for its high compute speed, quality, and energy efficiency. The Groq LPU™ is designed to handle AI processing tasks quickly and effectively, making it suitable for both cloud and on-premises applications. Unlike many competitors, Groq's products are designed, fabricated, and assembled in North America, which helps maintain high standards of quality and performance. The company targets a variety of clients across different industries that require fast and efficient AI processing capabilities. Groq's goal is to deliver scalable AI inference solutions that meet the growing demands for rapid data processing in the AI and machine learning market.

Key Metrics

Headquarters: Mountain View, California
Year Founded: 2016
Total Funding: $1,266.5M
Company Stage: Series D
Industries: AI & Machine Learning
Employees: 201-500

Benefits

Remote Work Options
Company Equity

Risks

Increased competition from SambaNova Systems and Gradio in high-speed AI inference.
Geopolitical risks in the MENA region may affect the Saudi Arabia data center project.
Rapid expansion could strain Groq's operational capabilities and supply chain.

Differentiation

Groq's LPU offers exceptional compute speed and energy efficiency for AI inference.
The company's products are designed and assembled in North America, ensuring high quality.
Groq emphasizes deterministic performance, providing predictable outcomes in AI computations.

Upsides

Groq secured $640M in Series D funding, boosting its expansion capabilities.
Partnership with Aramco Digital aims to build the world's largest inferencing data center.
Integration with Touchcast's Cognitive Caching enhances Groq's hardware for hyper-speed inference.
