Senior Site Reliability Engineer
Chainlink LabsFull Time
Senior (5 to 8 years)
Senior Site Reliability Engineer with experience automating and operating hundreds of tenant environments at scale using Terraform, Ansible, and Kubernetes. Proficient in building multi-tenant infrastructure, managing complex state and workspace configurations, and delivering reliable, secure automation across cloud providers (GCP, AWS). Strong knowledge of observability stacks (Prometheus, ELK, Grafana), incident response, and cloud security best practices. Familiarity with deployment packaging (e.g., Helm charts, omnibus-gitlab) and experience deploying and managing microservices on Kubernetes. Ability to design scalable automation, perform root-cause analysis, and influence architectural decisions across teams.
Build and scale multi-tenant infrastructure by designing and implementing automation that provisions and manages hundreds of isolated GitLab environments with Terraform, Ansible, and Kubernetes. Debug and resolve production issues across Kubernetes clusters, cloud services, and GitLab applications, identifying root causes of failed deployments and stability problems. Automate operations at scale by replacing manual workflows with infrastructure-as-code solutions, including automated upgrades, configuration rollouts, and provisioning pipelines for all tenants. Monitor and predict capacity by building observability systems with Prometheus, ELK, and Grafana to detect bottlenecks and optimize resource usage. Lead incident response and postmortems, establishing operational standards to reduce future risk. Architect and collaborate on infrastructure designs, influencing architectural decisions to support scalable, secure, and reliable environments.
Unified DevOps platform for software development
GitLab offers a DevOps platform that simplifies the software development process by providing a single application for collaboration, visibility, and speed. The platform integrates various tools needed for software development, which helps teams manage their projects more efficiently without juggling multiple tools. This allows companies to concentrate on enhancing their products instead of spending too much time on builds. GitLab serves a wide range of clients, including large corporations from different industries, demonstrating its versatility. The company operates on a subscription-based model, where clients pay for access to the platform, which includes features for continuous integration and deployment. GitLab also provides free trials and regularly updates its platform to deliver ongoing value to its users. By customizing its offerings and partnering with other technology providers, GitLab aims to enhance its ecosystem and drive revenue.