Founding Site Reliability Engineer at Relyance AI

San Francisco, California, United States

Relyance AI Logo
Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
AI, TechnologyIndustries

Requirements

  • 5+ years in SRE/DevOps/Infrastructure roles, with experience in enterprise SaaS environments
  • Deep AWS expertise (EC2, ECS/EKS, Lambda, RDS, VPC, IAM)
  • Proven track record with Infrastructure as Code (Terraform, Kubernetes/EKS, CDK, or CloudFormation)
  • Hands-on with observability stacks (CloudWatch, Grafana, Prometheus, Datadog)
  • Incident management experience in production SaaS systems, including on-call, postmortems, and reliability improvements
  • Bonus: Prior exposure to AI/ML platforms, data-heavy systems, or multi-agent workloads

Responsibilities

  • Own SRE establishing best practices, tooling, and culture
  • Tackle reliability challenges unique to multi-agent orchestration at enterprise scale
  • Guarantee >99.9% uptime of production systems, ensuring reliability at global scale
  • Architect and automate AWS infrastructure with Terraform and CI/CD pipelines
  • Design observability systems across microservices, APIs, and vector infrastructure (metrics, tracing, logging)
  • Drive down incidents and MTTR through runbooks, alerting, and incident response excellence
  • Help scale infra to support hundreds of thousands of agents and billions of API calls
  • Partner with engineering teams to embed SRE principles into the SDLC and shape org-wide reliability strategy
  • Act as a founding voice in our SF office, influencing product direction and engineering culture

Skills

Key technologies and capabilities for this role

TerraformAWSCI/CDObservabilityMicroservicesAPIsMetricsTracingLoggingRunbooksAlerting

Questions & Answers

Common questions about this position

What is the work location and arrangement for this role?

The role is hybrid, requiring 3 days per week in the San Francisco office.

What salary or compensation is offered for this position?

This information is not specified in the job description.

What are the key skills and experience required for this Founding SRE role?

Candidates need 5+ years in SRE/DevOps/Infrastructure roles, deep AWS expertise (EC2, ECS/EKS, Lambda, RDS, VPC, IAM), proven track record with IaC like Terraform and Kubernetes/EKS, hands-on with observability stacks (CloudWatch, Grafana, Prometheus, Datadog), and incident management experience.

What is the company culture like for this role?

As the first SRE hire in San Francisco at a fast-scaling AI company, you'll act as a founding voice, partnering closely with founders, engineering leads, and product teams to establish SRE best practices, shape reliability culture, long-term strategy, and influence product direction and engineering culture.

What makes a strong candidate for this Founding SRE position?

Ideal candidates have Senior, Lead, or Principal level experience, are ready to establish and scale the SRE discipline from the ground up, with expertise in enterprise SaaS, AWS, IaC, observability, and incident management; bonus for AI/ML or multi-agent workloads.

Relyance AI

Data protection and privacy compliance platform

About Relyance AI

Relyance AI specializes in data protection and privacy compliance by using machine learning to create a real-time inventory and map of personal data flows within organizations. Their software-as-a-service (SaaS) platform allows clients to monitor data processing activities, ensuring compliance with privacy regulations. What makes Relyance AI different is its focus on automating data tracking, which reduces manual workflows and compliance risks. The company's goal is to help organizations maintain user trust and meet regulatory requirements effectively.

Mountain View, CaliforniaHeadquarters
2020Year Founded
$60.3MTotal Funding
SERIES_BCompany Stage
Enterprise Software, Cybersecurity, AI & Machine LearningIndustries
51-200Employees

Risks

Emerging competition from platforms like OneTrust and TrustArc.
Frequent updates needed due to evolving AI regulations increase costs.
Complex data privacy laws across jurisdictions challenge universal compliance.

Differentiation

Relyance AI uses machine learning for real-time data flow mapping.
The platform offers asset-level visibility and lineage for sensitive data.
Relyance AI integrates privacy compliance directly into code bases.

Upsides

Raised $32M in Series B funding, enhancing growth and innovation.
Named a 2023 Gartner Cool Vendor in Privacy, boosting credibility.
Growing demand for AI-driven compliance solutions supports market expansion.

Land your dream remote job 3x faster with AI