Replit

Site Reliability Engineer

Foster City, California, United States

Compensation: $160,000 – $190,000
Experience Level: Mid-level (3 to 4 years), Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: Enterprise Software, Cybersecurity

Position Overview

  • Employment Type: Full-Time
  • Salary: $160K - $190K

Replit is seeking a Site Reliability Engineer to join our team and ensure the reliability, scalability, and performance of our infrastructure. This role bridges the gap between development and operations, focusing on automation and best practices to enable efficient scaling and high availability for our platform serving millions of developers.

Requirements

  • Experience: 3+ years in Site Reliability Engineering, DevOps, Systems Engineering, or Infrastructure Engineering.
  • Strong Programming Skills: Proficiency in languages commonly used for automation (e.g., Python, Go, or similar).
  • Understanding of Infrastructure: A deep understanding of infrastructure concepts and technologies.
  • Experience with Observability Tools: Familiarity with modern observability tools.
  • Experience with Automation Tools: Hands-on use of infrastructure automation tools (e.g., Terraform, Ansible, Pulumi).

Responsibilities

  • Design and Implement Observability Solutions:
    • Develop comprehensive monitoring and alerting systems.
    • Create dashboards and metrics for system health and performance.
    • Implement logging strategies for efficient problem identification.
  • Drive Automation and Infrastructure as Code:
    • Architect and implement infrastructure automation solutions.
    • Design and maintain CI/CD pipelines.
    • Create self-healing systems.
  • Establish SLOs and SLIs:
    • Work with product and engineering teams to define SLOs and SLIs.
    • Build systems to track and report on these metrics.
  • Incident Management and Response:
    • Lead incident response efforts.
    • Conduct thorough post-mortems.
    • Develop and maintain runbooks for critical services.
    • Reduce Mean Time To Recovery (MTTR).
  • Performance Optimization:
    • Identify and resolve performance bottlenecks.
    • Implement capacity planning strategies.
    • Optimize resource utilization.
    • Work on reducing latency and improving system efficiency across global regions.

Company Information

  • Company: Replit
  • Mission: To empower the next generation of builders.
  • Description: Replit is the fastest way to turn ideas into software. It provides a platform for building and deploying full-stack applications directly from a browser, accessible to anyone, regardless of coding experience.

Skills

Python
Go
Terraform
Ansible
Pulumi
Monitoring
Alerting
Logging
CI/CD
Infrastructure as Code
Incident Management
Post-Mortems
Runbooks
MTTR

Replit

Cloud-based platform for coding collaboration

About Replit

Replit provides a cloud-based platform for software development and deployment, allowing users to write, run, and share code directly from their web browser. This eliminates the need for complicated local setups, making it easier for a variety of users, including enterprises, freelancers, and students, to engage in coding. The platform features an online code editor, an integrated development environment (IDE), and AI-powered coding assistance, supporting multiple programming languages. Replit stands out from its competitors by offering real-time collaboration tools and project management features, which enhance teamwork among developers. The company operates on a subscription-based model, providing different pricing tiers that unlock additional features, and also generates revenue through enterprise solutions and educational partnerships. The goal of Replit is to make coding accessible and enjoyable for everyone, regardless of their experience level.

Headquarters: San Francisco, California
Year Founded: 2016
Total Funding: $216M
Company Stage: Late-stage VC
Industries: Enterprise Software, AI & Machine Learning, Education
Employees: 51-200

Benefits

Competitive salary & equity
Your choice of new equipment & software
Health, dental, & vision insurance
Autonomy at work
Flexible work hours
Learning & development stipend
Monthly health & wellness stipend
Generous parental leave
Unlimited PTO (2 weeks minimum required)
401k matching
Commuter benefits
Expensed lunch
Yearly off-sites

Risks

Replit faces competition from GitHub Codespaces with similar features.
Market saturation in online coding environments may challenge Replit's differentiation.
Significant investment in AI development could strain Replit's financial resources.

Differentiation

Replit offers a browser-based IDE supporting over 50 programming languages.
The platform enables real-time collaboration and code sharing across multiple devices.
Replit's AI-powered coding assistance enhances developer productivity and efficiency.

Upsides

Replit raised $97.4M to expand cloud services and lead in AI development.
The platform benefits from increased demand for remote and collaborative coding tools.
Educational institutions are adopting Replit for remote learning, boosting its user base.
