Groq

Senior Infrastructure Engineer

Palo Alto, California, United States

Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Information Technology & ServicesIndustries

Senior Infrastructure Engineer

About Groq

Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed and scale they need. Headquartered in Silicon Valley, we are on a mission to make high performance AI compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast.

Position Overview

At Groq, we’re building a custom cloud from the ground up — one data center at a time. Our Infrastructure Platform team owns the systems that turn racks of bare metal into production-ready Kubernetes clusters powering the next generation of AI workloads.

We’re looking for an Infrastructure Engineer to help us scale this effort. This is a hands-on role focused on provisioning, automation, and working closely with our data center and networking teams to bring new sites online. If you’re passionate about infrastructure, enjoy debugging things close to the metal, and want to grow your skills across Linux, Kubernetes, and distributed systems — we’d love to talk.

Responsibilities & Opportunities

  • Support the provisioning and deployment of Kubernetes clusters on bare metal servers.
  • Help build and maintain tooling for bare metal provisioning — including DHCP, DNS, PXE/iPXE/HTTPBoot, and Talos Linux Machine Configuration.
  • Write and maintain scripts and services (Go, Python, Bash) to automate deployment workflows across new and existing sites.
  • Partner with data center operations and networking teams to ensure hardware is correctly configured, connected, and ready for use.
  • Manage infrastructure configuration using tools like Git, Flux, and Terraform.
  • Contribute to system documentation, runbooks, and tooling that makes our infrastructure reliable and repeatable.

Ideal Candidate Profile

Experience & Skills:

  • Experience with Linux / Kubernetes systems and comfort working in a terminal.
  • Familiarity with infrastructure-as-code and Git-based workflows (e.g., Terraform, Flux, Kustomize).
  • Ability to write and maintain basic tooling in Go, Python, or Bash.
  • Understanding of networking fundamentals (IPAM, VLANs, DHCP, DNS).
  • Working knowledge of storage concepts (block vs object, NFS, RAID, etc.).
  • Strong sense of ownership and a willingness to dive into hardware, firmware, or low-level provisioning issues.

Nice to Have:

  • Experience provisioning physical machines in a data center environment.
  • Exposure to Talos Linux, Kubernetes bootstrapping, or Kubernetes platform engineering.
  • Previous collaboration with facilities, hardware, or network teams in an operational role.

Attributes of a Groqster:

  • Humility: Egos are checked at the door.
  • Collaborative & Team Savvy: We make up the smartest person in the room, together.
  • Growth & Giver Mindset: Learn it all versus know it all, we share knowledge generously.
  • Curious & Innovative: Take a creative approach to projects, problems, and design.
  • Passion, Grit, & Boldness: No limit thinking, fueling informed risk taking.

Compensation & Benefits

  • Salary Range: $132,100 - $279,800 (determined by skills, qualifications, experience, and internal benchmarks).
  • Package: Competitive base salary, equity, and benefits.

Location

Some roles may require being located near or on our primary sites, as indicated in the job description.

Company Information

Groq is an Equal Opportunity Employer that is committed to inclusion and diversity. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status. Our goal is to hire and promote an exceptional workforce as diverse as the global populations we serve. Groq is an equal opportunity employer committed to diversity, inclusion, and belonging in all aspects of our organization. We value and celebrate diversity in thought, beliefs, talent, expression, and backgrounds. We know that our individual differences make us better.

Skills

Linux
Kubernetes
Bare Metal Provisioning
DHCP
DNS
PXE
iPXE
HTTPBoot
Talos Linux
Go
Python
Bash
Infrastructure-as-code
Git
Terraform

Groq

AI inference technology for scalable solutions

About Groq

Groq specializes in AI inference technology, providing the Groq LPU™, which is known for its high compute speed, quality, and energy efficiency. The Groq LPU™ is designed to handle AI processing tasks quickly and effectively, making it suitable for both cloud and on-premises applications. Unlike many competitors, Groq's products are designed, fabricated, and assembled in North America, which helps maintain high standards of quality and performance. The company targets a variety of clients across different industries that require fast and efficient AI processing capabilities. Groq's goal is to deliver scalable AI inference solutions that meet the growing demands for rapid data processing in the AI and machine learning market.

Mountain View, CaliforniaHeadquarters
2016Year Founded
$1,266.5MTotal Funding
SERIES_DCompany Stage
AI & Machine LearningIndustries
201-500Employees

Benefits

Remote Work Options
Company Equity

Risks

Increased competition from SambaNova Systems and Gradio in high-speed AI inference.
Geopolitical risks in the MENA region may affect the Saudi Arabia data center project.
Rapid expansion could strain Groq's operational capabilities and supply chain.

Differentiation

Groq's LPU offers exceptional compute speed and energy efficiency for AI inference.
The company's products are designed and assembled in North America, ensuring high quality.
Groq emphasizes deterministic performance, providing predictable outcomes in AI computations.

Upsides

Groq secured $640M in Series D funding, boosting its expansion capabilities.
Partnership with Aramco Digital aims to build the world's largest inferencing data center.
Integration with Touchcast's Cognitive Caching enhances Groq's hardware for hyper-speed inference.

Land your dream remote job 3x faster with AI