Illustration of a computer and code

Remote Site Reliability Engineer Jobs

Browse a wide range of remote Site Reliability Engineer positions available globally. New jobs added frequently.

Share on:
United States
Remote iconRemote

Staff Software Engineer - SRE/DevEx (Remote)

Rula

Candidates must have 8+ years of experience in Site Reliability Engineering and/or DevOps, with a proven ability to work independently and collaborate across engineering and product teams. A strong understanding of SRE fundamentals in mission-critical environments, experience with on-call rotations, 5+ years of Kubernetes experience, and 5+ years of AWS experience are required. Proficiency in monitoring, alerting, metrics, logging, and application performance monitoring is essential.

  • Compensation icon$184,100 - $216,600/year
  • Employment type iconFull Time
  • Experience level iconExpert & Leadership (9+ years)
United States
Remote iconRemote

Senior Site Reliability Engineer

Red Cell Partners

Candidates should have proven Site Reliability Engineering and DevOps experience, with demonstrated experience in managing complex, large-scale production environments. Experience with cloud platforms such as GCP, AWS, or Azure is required, along with expertise in automation tools and frameworks, monitoring solutions like Prometheus, Grafana, and the ELK stack, and CI/CD pipelines for machine learning models.

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
Boston
Remote iconRemote

Site Reliability Engineer

Hometap

The Site Reliability Engineer should have 3+ years of experience in Site Reliability Engineering (SRE), DevOps, or a similar role, 1+ years of hands-on experience with AWS services including ECS, EKS, CloudWatch, Lambda, CloudFront, S3, and DynamoDB, 1+ years of experience with observability and monitoring tools such as CloudWatch, Sentry, Grafana, or Prometheus, and basic proficiency in Terraform for infrastructure-as-code implementation. Strong troubleshooting and problem-solving skills, along…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconJunior (1 to 2 years)
United States
Remote iconRemote

Staff Software Engineer - SRE, Backend (Reliability Engineering)

Affirm

Candidates should have 7+ years of experience designing, developing, and launching backend systems at scale using languages like Python or Kotlin, 7+ years of experience in a Site Reliability or Production Engineering team, and a Bachelor’s degree in a related field or equivalent practical experience. They should demonstrate curiosity with empathy and strong opinions loosely held, and experience delivering major features, system components, or deprecating existing functionality through the defin…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
United States
Remote iconRemote

Software Engineer, Site Reliability Engineering

Thumbtack

Candidates must have extensive fluency in AWS and Linux, and expertise in designing, analyzing, and troubleshooting large-scale distributed systems across web technologies such as DNS, TLS, HTTP/S, and TCP/IP. They should possess demonstrable knowledge of instrumenting, operating, and observing distributed microservices in a production cloud environment, and the ability to effectively read, write, and debug code in languages like Python, Go, PHP, and Javascript. Strong communication skills and a…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years), Expert & Leadership (9+ years)
New York
Remote iconRemote

Site Reliability Engineer

Superblocks

Candidates must have 3+ years of experience managing cloud-based production applications with deep knowledge of containers, VMs, caches, task queues, networking, and OS. They should have experience designing and deploying infrastructure in production at scale using containerized solutions like Docker, Kubernetes, ECS/EKS, or Firecracker. A strong product sense focused on great user experiences and strategic thinking to meet market and customer needs is also required. Experience building and oper…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconMid-level (3 to 4 years)
San Francisco +7 more
Remote iconRemote

Senior Site Reliability Engineer

Chainlink Labs

Candidates should possess at least 8 years of relevant professional experience, ideally with a background in DevOps, infrastructure, SRE, or platform teams. A strong DevOps mentality, experience building and maturing a GitOps environment, and proficiency in software development beyond typical infrastructure configurations are essential. Demonstrable skills in shell scripting and at least one higher-level programming language, excellent Linux understanding, and expertise in designing, deploying, …

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
New York
Remote iconRemote

SRE Tech Lead Manager

Oura

Candidates should have over 7 years of backend development experience, with at least 2 years managing or leading infrastructure-focused teams. A passion for building inclusive, high-performing teams, excellent communication and decision-making skills, and experience designing and building data-intensive distributed systems in production are essential. Experience designing for scale and growth, particularly fault-tolerant and secure systems, along with strong experience running, monitoring, and d…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconExpert & Leadership (9+ years)
United States
Remote iconRemote

Senior Site Reliability Engineer

Calendly

Candidates must have a strong understanding of the Linux operating system and possess strong technical knowledge of cloud infrastructure, particularly GCP, distributed systems, and reliability practices. Deep experience is required in designing, building, and running highly-available production infrastructure, along with strong Golang or Python development experience, especially writing APIs for cloud infrastructure management. Solid working knowledge of patterns and principles for designing and…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
United States
Remote iconRemote

Staff Systems Reliability Engineer

iRhythm Technologies

The Staff Systems Reliability Engineer V requires a minimum of 8 years of related experience with a Bachelor’s degree; or 6 years and a Master’s degree; or equivalent work experience. Candidates should possess expert-level knowledge of AWS services such as EC2, Lambda, VPC, IAM, RDS, ECS/EKS, and familiarity with regulatory requirements like FDA 21 CFR Part 11, HIPAA, ISO 13485, and EU MDR. Strong proficiency in Python and/or Go for automation and tooling, as well as experience with Helm, Argo C…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconExpert & Leadership (9+ years)
United States
Remote iconRemote

Staff Site Reliability Engineer

Addepar

Candidates should possess extensive hands-on development experience in AWS/cloud, Linux/Unix, networking, advanced scripting, containerization, Kubernetes, Terraform, Information Security, deep debugging, and comprehensive monitoring/observability skills. A leading role in implementing, maintaining, and strategically evolving production infrastructure is expected.

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconExpert & Leadership (9+ years)
United States +2 more
Remote iconRemote

Site Reliability Engineer (.Net)

Virtuous

Candidates must have 5+ years of software engineering experience with strong expertise in .NET (C#, ASP.NET Core). Experience in SaaS environments and a deep understanding of multi-tenant systems, APIs, and distributed application behavior are required. The ideal candidate enjoys debugging complex edge cases and possesses strong communication skills for effective partnership.

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconMid-level (3 to 4 years), Senior (5 to 8 years)
United States
Remote iconRemote

Site Reliability Engineer - Data Platform

Kraken

Candidates should have proficiency in cloud technologies, infrastructure as code, automation, monitoring, logging, user and machine AuthNZ, and certificate management. Experience with Infrastructure as Code (IaC) principles using tools like Terraform, bash/shell scripting, CI/CD pipelines, Kubernetes, Kafka, and Debezium Change Data Capture (CDC) is required. Familiarity with data governance, data ingestion, storage, cataloging, lineage, and BI tools is also necessary.

  • Compensation icon$110,000 - $176,000/year
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
South America +5 more
Remote iconRemote

Principal Site Reliability Engineer - Americas

Ashby

Candidates should possess a Bachelor’s degree in Computer Science or a related field, along with at least 7 years of experience in Site Reliability Engineering, demonstrating a strong understanding of distributed systems, cloud computing, and automation. Experience with infrastructure-as-code tools, containerization technologies (like Docker and Kubernetes), and monitoring/logging systems is essential. Strong coding skills in languages such as Go, Python, or Java are required, and familiarity wi…

  • Compensation icon$200,000 - $260,000/year
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)
California
Remote iconRemote

Senior Site Reliability Engineer, DGX Cloud

NVIDIA

Candidates must have a BS in Computer Science or equivalent experience, with at least 12 years of experience operating production services at scale. Expert-level knowledge of Kubernetes administration, containerization, microservices architecture, Kubernetes operators, and distributed systems is required. Experience with infrastructure automation tools like Terraform, Ansible, Chef, or Puppet, proficiency in Python or Go, and in-depth knowledge of Linux, TCP/IP networking, and cloud security sta…

  • Compensation iconSalary not specified
  • Employment type iconFull Time
  • Experience level iconSenior (5 to 8 years)

Get Started Today

Land your dream remote job 3x faster with AI