Infrastructure Engineer at Docker
Salary: €113.2K - €155.7K
Location Type: Remote
Employment Type: Full-Time
Position Overview
At Docker, we make app development easier so developers can focus on what matters. Our remote-first team spans the globe, united by a passion for innovation and great developer experiences. With over 20 million monthly users and 20 billion image pulls, Docker is the #1 tool for building, sharing, and running apps—trusted by startups and Fortune 100s alike. We’re growing fast and just getting started. Come join us for a whale of a ride!
The Infrastructure Engineering team writes software and operates the cloud-native platform that powers Docker products such as Docker Hub, Docker Build Cloud, and Docker Scout. We design resilient services, automate everything, and measure what matters so hundreds of engineers can ship features to millions of users every day.
Responsibilities
1. Ship & Operate Cloud Services
- Design, develop, and ship internal platform services (e.g. provisioning, cost insights, rate-limiting) in Go or Python.
- Partner with product and engineering teams to provide paved-road patterns for deployment, observability, and security.
2. Infrastructure as Code & Reliability
- Codify infrastructure with Terraform and Go; champion GitOps best practices.
- Define SLOs, lead on-call rotations, conduct blameless post-mortems, and implement preventive actions.
3. Platform Foundations (Kubernetes & Networking)
- Evolve Docker’s ingress stack—Envoy Gateway, ALB/NLB, AWS VPC CNI—to deliver secure, reliable, and cost-efficient request routing.
- Operate and scale multi-tenant EKS clusters; guide the evaluation and adoption of new infrastructure technologies.
How We Work
- Code first: We tackle infra problems with software, design docs, and rigorous code review.
- Async & remote-first: Decisions are documented in RFCs; incident reviews are blameless and written.
- Cross-functional: Platform, product, and security engineers collaborate daily to unblock each other.
- Continuous improvement: We ship small, measure impact, and iterate quickly.
Qualifications
Core Engineering Skills (Must-Have)
- Strong software development skills in Go, Python, or similar (design, testing, and code review).
- Significant experience shipping and operating cloud applications/services in production (typically 5+ years of relevant work).
- Solid foundation in Linux, networking, and cloud security.
- Excellent written and verbal communication in a remote environment.
Depth in one or more of the following (Nice-to-Have)
- Kubernetes ecosystem (EKS, ingress, CNI, service mesh).
- Observability tooling (OpenTelemetry, Prometheus, Grafana).
- CI/CD & release automation (GitHub Actions, Argo CD).
- Cost optimisation at scale (FinOps, capacity modelling).
- Distributed systems, containers, and Go-based platform tooling.
Demonstrated expertise in at least one of these areas is welcome; we don’t expect candidates to be experts in all.
What to Expect
First 30 Days
- Complete Docker onboarding and meet teammates across Engineering, Security, and Product.
- Ship your first change to a Terraform module or internal service and shadow on-call.
- Gain a deep understanding of our platform architecture, SLOs, and current reliability initiatives.
First 90 Days
- Take ownership of a critical service or infrastructure component and lead a performance-oriented project from design to production.
- Rotate fully into the on-call schedule, leading incident response when needed.
- Contribute to refining our platform roadmap and advocate for improvements that reduce toil and accelerate delivery.
First Year
- Lead the design and rollout of a major infrastructure initiative.
- Become a go-to subject matter expert.