Remote Site Reliability Engineer Jobs

Browse a wide range of remote Site Reliability Engineer positions available globally. New jobs added frequently.
Share on:
Illustration of a computer and code
United States
Remote iconRemote

Senior Site Reliability Engineer (SRE)

Cribl

- Extensive experience with enterprise scale continuous delivery environments - Development with JavaScript/Node.js/TypeScript in a Linux/Mac environment - Experience with sustainable incident response in a blameless environment - Experience with Configuration Management Tools like Terraform (preferred) or Puppet, Chef, Ansible - Knowledge of cloud platforms (prefer AWS) and container + orchestration technologies - Experience with APM and Observability and related tools such as, New Relic, Splun…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
Apply Now External link icon
United States
Remote iconRemote

Lead Site Reliability Engineer (SRE)

Canary Technologies

- 7+ years in SRE, platform, systems, or infrastructure engineering - Strong background in AWS and Kubernetes operations - Experience establishing SLOs/SLA frameworks and driving org-wide adoption - Strong track record leading incident response and postmortem culture - Experience with observability ecosystems - Programming/scripting skills in Python, Go, or similar (automation-first mindset) - Strong cross-functional leadership partnering with product, platform, and app teams

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years), Expert & Leadership (9+ years)
Apply Now External link icon
New York
Remote iconRemote

Site Reliability Engineer

Superblocks

Candidates must have 3+ years of experience managing cloud-based production applications with deep knowledge of containers, VMs, caches, task queues, networking, and OS. They should have experience designing and deploying infrastructure in production at scale using containerized solutions like Docker, Kubernetes, ECS/EKS, or Firecracker. A strong product sense focused on great user experiences and strategic thinking to meet market and customer needs is also required. Experience building and oper…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconMid-level (3 to 4 years)
Apply Now External link icon
United States
Remote iconRemote

Site Reliability Engineer

Close

Candidates should have 5+ years of experience building modern infrastructure systems for Senior 1 & 2 level roles, and 8+ years for Staff level roles, with experience as the final point of escalation in the support of mission critical production systems. Familiarity with AWS, Terraform, CircleCI, GitHub Actions, ArgoCD, Ansible, Elasticsearch, MongoDB, PostgreSQL, ClickHouse, Kubernetes, Loki, Tempo, Grafana, Mimir/Prometheus, and Argo Workflow is required.

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconJunior (1 to 2 years)
Apply Now External link icon
San Francisco +7 more
Remote iconRemote

Senior Site Reliability Engineer

Chainlink Labs

Candidates should possess at least 8 years of relevant professional experience, ideally with a background in DevOps, infrastructure, SRE, or platform teams. A strong DevOps mentality, experience building and maturing a GitOps environment, and proficiency in software development beyond typical infrastructure configurations are essential. Demonstrable skills in shell scripting and at least one higher-level programming language, excellent Linux understanding, and expertise in designing, deploying, …

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
Apply Now External link icon
United States
Remote iconRemote

Lead Site Reliability Engineer

Kraken

Candidates must possess excellent communication skills for effective collaboration with developers, product managers, and business stakeholders. A proven record of successfully delivering critical path projects on time and at scale is essential, along with meticulous organization and planning skills. Experience in mentoring and coaching a team to achieve high-quality performance is required, as is experience managing and supporting large-scale, internet-facing distributed systems serving million…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconExpert & Leadership (9+ years)
Apply Now External link icon
United States
Remote iconRemote

Senior Site Reliability Engineer, Devices

Flock Safety

Candidates should possess strong coding skills in languages such as Python, R, JS, Java, or Groovy, and have a solid understanding of common algorithms. Experience with software development workflows including continuous integration and test automation, along with tools like Git, Jenkins, and GitHub Actions, is essential. Proficiency in SQL databases (e.g., PostgreSQL), NoSQL, and Time Series databases (e.g., Prometheus, DataDog) is required, as is experience with volume data processing, data vi…

  • Compensation icon$150,000 - $190,000/year
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
Apply Now External link icon
United States
Remote iconRemote

Senior Engineering Manager, Site Reliability

Ditto

Candidates should have experience leading and scaling a globally distributed SRE organization, including managers and individual contributors. Proven ability to develop engineering leaders and senior talent through coaching is essential. Experience in driving adoption of SRE best practices, establishing incident management practices, and leading the architecture and execution of observability systems is required. Familiarity with defining and implementing SLIs, SLOs, and SLAs, along with experie…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconExpert & Leadership (9+ years)
Apply Now External link icon
United States
Remote iconRemote

Staff Site Reliability Engineer

Arta Finance

Candidates should have 5-8+ years of experience in SRE/DevOps/Infrastructure roles supporting production systems, with a proven ability to own critical infrastructure and drive change end-to-end. Experience with cloud technologies and tools, including Infrastructure-as-Code (IaC) with Terraform for cloud automation (preferably Microsoft Azure), is essential. Proficiency in leveraging AI-driven tooling for development acceleration and automation, strong software development skills (preferably C#/…

  • Compensation icon$182,000 - $239,000/year
  • Employment type iconFull Time
  • Experience level iconExpert & Leadership (9+ years)
Apply Now External link icon
San Antonio
Remote iconRemote

Senior Tech Lead – SRE

Humana

- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience) - 7+ years of relevant experience in SRE, DevOps, or software engineering, including 2+ years in a technical leadership role - Proficiency with cloud platforms (AWS, Azure, GCP), container orchestration, and automation tools - Strong scripting and programming skills (e.g., Python, Go, Bash) - Deep understanding of distributed systems, networking, and security principles - Proven experience leading l…

Apply Now External link icon
United States +2 more
Remote iconRemote

Site Reliability Engineer (.Net)

Virtuous

Candidates must have 5+ years of software engineering experience with strong expertise in .NET (C#, ASP.NET Core). Experience in SaaS environments and a deep understanding of multi-tenant systems, APIs, and distributed application behavior are required. The ideal candidate enjoys debugging complex edge cases and possesses strong communication skills for effective partnership.

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconMid-level (3 to 4 years), Senior (5 to 8 years)
Apply Now External link icon
Remote +2 more
Remote iconRemote

Senior Site Reliability Engineer

Patreon

- Bachelor’s degree in Computer Science, Computer Engineering, or a related field, or equivalent work experience - Experience in DevOps, Site Reliability, or backend/infrastructure engineering for a company experiencing fast-paced growth - Proficiency in a programming language like Python and shell scripting - Hands-on experience implementing Site Reliability Engineering practices (SLIs, SLOs, SLAs) - Knowledge of configuration management with a framework such as Terraform, Ansible, Chef, or Pup…

  • Compensation icon$200,000 - $300,000/year
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
Apply Now External link icon
United States
Remote iconRemote

Senior Site Reliability Engineer

Branch

- Bachelor's degree in an appropriate engineering discipline or equivalent experience - 3+ years experience in site reliability engineering - Experience with Terraform, Go, Java, Gradle, Docker, OpenTelemetry and Kubernetes

  • Compensation icon$150,000 - $220,000/year
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
Apply Now External link icon
Reston +1 more
Remote iconRemote

Senior Site Reliability Engineer

ScienceLogic

- 8-12 years of site reliability engineering, cloud operations or equivalent experience - Proven experience in managing complex Kubernetes environments in multiple Production systems - Working with Cloud Automation tools like CloudFormation, Terraform, aws-cli/CDK, Cloudformation - Scripting languages like Python, Bash, Perl - Exposure to Linux administration skills - Proven track record of operating production SaaS environments within security standards like FedRAMP, SOC2, ISO, PCI - Skilled at…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
Apply Now External link icon
Santa Clara
Remote iconRemote

Site Reliability Engineer - Remote

PayNearMe

- Proficiency in Terraform for infrastructure as code - Experience with Kubernetes and Docker for container orchestration and management - Expertise in Datadog for monitoring and observability - Knowledge of defining, monitoring, and maintaining SLOs and SLAs - Skills in incident response, root cause analysis, and blameless postmortems - Ability to ensure reliability and stability of production environments - Proficiency in automation and scripting using Python, Bash, or Go - Experience with CI/…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconMid-level (3 to 4 years), Senior (5 to 8 years)
Apply Now External link icon

Get Started Today

Land your dream remote job 3x faster with AI