Ditto

Senior Engineering Manager, Site Reliability

United States

Not SpecifiedCompensation
Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Biotechnology, Software, Data Management, ConnectivityIndustries

Senior Engineering Manager, Site Reliability Engineering

About Ditto

Ditto is redefining how data moves at the edge. Our mission is to make it seamless for developers to build resilient, real-time applications, regardless of network conditions. Whether you're in a stadium, airplane, or remote military base, Ditto's peer-to-peer sync engine ensures devices stay connected and data stays consistent, even without internet. With more than $145 million in funding and trusted by organizations like Chick-fil-A, Delta Airlines, and the U.S. military, Ditto powers mission-critical experiences across aviation, retail, travel, hospitality, defense, and more. As a globally distributed, fast-growing startup, we’re committed to building a diverse and inclusive team that reflects the wide range of perspectives needed to solve the world’s hardest connectivity problems.

About the Role

Ditto is at an inflection point. As we scale to meet the growing demands of our enterprise customers, we need experienced SRE Leads to drive and mature our Site Reliability Engineering practice.

This is a unique opportunity to play a leading role in shaping enterprise-grade reliability, observability, and incident management to ensure Ditto's systems meet the high standards our customers expect.

As a Senior Engineering Manager of Site Reliability Engineering, you will lead a multi-layered team of SREs, including other SRE managers, to shape and scale reliability practices across our platform. You will drive strategy, execution, and people development across regions while embedding a culture that values high availability, resiliency, and operational excellence.

Responsibilities

As a Senior Engineering Manager, you will:

  • Lead and Scale SRE Organization:
    • Lead and scale a globally distributed SRE organization, including managers and individual contributors, setting the long-term vision and execution plan for reliability at scale.
    • Design talent acquisition strategies, hiring criteria, and interview modules to build a team of exceptional talent.
    • Design and implement a highly effective SRE org structure, including geo-located teams, internal leadership and management lines, and integration/partnership points with other teams.
  • Develop Talent and Culture:
    • Develop engineering leaders and senior talent, coaching on both technical depth and leadership maturity to create a high-trust, high-performance organization.
    • Play a central role in the transformation of Ditto’s engineering culture towards reliability.
    • Lead strategic programs to transform engineering culture toward reliability, such as:
      • Annual "Reliability Weeks," engineering health reviews.
      • Incentivizing reliability work, such as inclusion in promotion criteria and roadmap planning.
      • Designing systems to hold engineering teams accountable for the reliability of their respective systems.
  • Drive SRE Best Practices:
    • Drive adoption of SRE best practices, including:
      • Embedding SREs in product teams to influence design and early detection of failure modes.
      • Defining production-readiness checklists and launch gates tied to SLOs.
      • Championing error budgets as a shared accountability mechanism between product and reliability.
  • Establish Incident Management:
    • Establish and evolve an incident management practice, including:
      • Clear roles (Incident Commander, Scribe, Subject Matter Experts, CX & affected customer communication).
      • Blameless postmortems with systemic and meaningful remediations.
      • Active tracking of incident themes and reliability KPIs, and reporting to senior leadership.
  • Enhance Observability and Tooling:
    • Lead the architecture and execution of observability systems that offer real-time visibility into system health and customer experience.
    • Partner with platform, infrastructure, and security teams to build scalable, self-service reliability tooling (e.g., circuit breakers, automated rollback, chaos testing frameworks).
  • Define and Implement Reliability Metrics:
    • Guide teams to define, implement, and iterate on SLIs, SLOs, and SLAs that are meaningful to end-user experience.
  • Improve Operational Hygiene:
    • Establish best-in-class documentation and operational hygiene, including runbooks, architectural decision records (ADRs), and deep operational reviews.
  • Promote On-Call Excellence:
    • Model on-call excellence, including burnout prevention, clear handoffs, and leveraging automation and toil elimination.

Skills

Site Reliability Engineering
Reliability
Observability
Incident Management
High Availability
Resiliency
Operational Excellence
Team Leadership
People Development
Strategy
Execution

Ditto

Simplifies multi-platform app development and synchronization

About Ditto

Ditto.live simplifies the development of native applications for various platforms, including iOS, macOS, Android, and web. Its main product, the Edge Sync Platform, addresses the challenge of data synchronization by allowing developers to manage data that is distributed across multiple devices and cloud infrastructures. This platform enables developers to write their code once and deploy it across different platforms, which saves time and reduces effort in the app development process. Unlike many competitors, Ditto focuses on providing a seamless experience for developers by offering features like peer-to-peer authentication and offline syncing. The company's goal is to enhance the efficiency of app development and improve user experiences by enabling the creation of interconnected applications.

San Francisco, CaliforniaHeadquarters
2018Year Founded
$52.5MTotal Funding
SERIES_ACompany Stage
Data & Analytics, Consumer Software, Enterprise SoftwareIndustries
51-200Employees

Benefits

Health Insurance
Dental Insurance
Vision Insurance
Life Insurance
Disability Insurance
Flexible Spending Account/Flexible Spending Account
Unlimited Paid Time Off
401(k) Retirement Plan
Stock Options

Risks

Emerging startups may dilute Ditto's market share with similar solutions.
Rapid app framework evolution could outpace Ditto's integration capabilities.
Economic downturns may challenge Ditto's subscription-based revenue model.

Differentiation

Ditto offers real-time data sync without internet, unlike many competitors.
Their Edge Sync Platform supports both iOS and Android, reducing development time.
Ditto's peer-to-peer authentication enhances data privacy and security.

Upsides

Growing demand for edge computing boosts Ditto's market potential.
Offline-first app development trend aligns with Ditto's core capabilities.
5G expansion enhances Ditto's real-time data synchronization benefits.

Land your dream remote job 3x faster with AI