Endor Labs

Member of Technical Staff - Site Reliability Engineer

Palo Alto, California, United States

Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Cybersecurity, Software DevelopmentIndustries

Site Reliability Engineer

Position Overview

Endor Labs is seeking a passionate Site Reliability Engineer (SRE) to play a pivotal role in shaping the reliability, performance, and scalability of our systems. You will partner with engineering teams to define and implement best practices, improve operational excellence, reduce incidents, and foster a culture of accountability and continuous improvement.

About Us

Endor Labs is building the Application Security platform for the software development revolution. Our platform addresses the complexity and dependency-rich nature of modern software, amplified by AI code generation. By building a call graph of your entire software estate, Endor Labs enables teams to clearly identify, prioritize, and fix critical risks faster. We secure code written by humans or AI, from legacy C++ to cutting-edge Bazel monorepos. Founded by serial entrepreneurs Varun Badhwar and Dimitri Stiliadis, Endor Labs is backed by leading VC firms including Dell Technologies Capital, Lightspeed, and Sierra Ventures.

Responsibilities

  • Lead the definition and rollout of SRE practices across engineering, including SLAs, SLOs, and error budgets.
  • Design and build monitoring, alerting, and observability frameworks to empower teams to own service reliability.
  • Establish incident response protocols and lead post-incident reviews to drive learning and remediation.
  • Collaborate with product and platform teams to improve system architecture with reliability and performance as key considerations.
  • Advocate for automation of deployments, scaling, and failover procedures across services.
  • Create tooling and dashboards to provide teams with visibility into system health, latency, and error rates.
  • Foster a blameless culture and partner with engineering leadership to drive a proactive approach to reliability.
  • Champion operational readiness for new services before they are deployed to production.
  • Mentor engineers and help scale reliability thinking across the organization.

Requirements

  • 8+ years of software engineering or infrastructure experience, with at least 3 years in an SRE or DevOps capacity.
  • Strong experience designing and scaling production systems in cloud-native environments.
  • Proficiency with observability tooling such as Prometheus, Grafana, Datadog, OpenTelemetry, etc.
  • Experience setting and managing SLAs/SLOs and driving improvements in reliability metrics.
  • Proficient in programming/scripting languages such as Go and Python.
  • Experience with container orchestration (Kubernetes, Helm) and infrastructure-as-code (Terraform, Pulumi, etc.).
  • Familiarity with CI/CD pipelines and deployment strategies.
  • Exceptional communication skills and a collaborative mindset, with the ability to influence and educate across teams.
  • A mindset of ownership, humility, and continuous learning.

What We Offer

  • The opportunity to work with deeply kind, mission-driven people.
  • A focus on quality over speed, and speed over scope.
  • A commitment to making the complex simple.
  • A culture that uses first principles to debate ideas, test assumptions, and make decisions.
  • A drive to seek the truth by putting data above opinions.
  • An environment that assumes good intent and provides tactical feedback for mutual growth.
  • A culture with no ego, where collective customer success is paramount.

Compensation

The compensation range for this position is expected to be between $170,000 - $220,000.

Employment Type

  • [Employment Type Not Specified]

Location Type

  • [Location Type Not Specified]

Skills

SRE practices
SLAs
SLOs
Error budgets
Monitoring
Alerting
Observability
Incident response
Automation
Deployment
Scaling
Failover
Tooling
Dashboards
System architecture
Reliability
Performance

Endor Labs

Cybersecurity software vulnerability analysis services

About Endor Labs

Endor Labs specializes in cybersecurity by focusing on reachability-based dependency analysis to identify vulnerabilities in software that hackers could exploit. Their team, composed of PhDs, analyzes software to provide a comprehensive risk score that evaluates security, quality, popularity, and activity. This analysis helps reduce alert noise by 80%, allowing clients to concentrate on the most critical issues. They offer a flexible policy engine for clients to create tailored risk profiles, minimizing disruptions in the software development process. Additionally, Endor Labs assists businesses in managing Software Bill of Materials (SBOM) and Vulnerability Exploitability Exchange (VEX) to understand the risks and costs associated with software ownership. Their goal is to enhance the security and quality of software for businesses of all sizes while generating revenue through their analysis and monitoring services.

Palo Alto, CaliforniaHeadquarters
2021Year Founded
$92.4MTotal Funding
SERIES_ACompany Stage
Data & Analytics, CybersecurityIndustries
51-200Employees

Benefits

Health Insurance
Dental Insurance
Vision Insurance
Mental Health Support
Unlimited Paid Time Off
401(k) Retirement Plan
Remote Work Options

Risks

Integration with Microsoft Cloud Defender may strain resources to maintain high performance.
New AI model evaluation tool could expose Endor Labs to risks of biases and inaccuracies.
Strategic investment from Citi Ventures may pressure the company for rapid financial growth.

Differentiation

Endor Labs specializes in reachability-based dependency analysis for software vulnerability detection.
The company offers a comprehensive risk score for software packages, reducing alert noise by 80%.
Endor Labs' flexible policy engine allows clients to create specific risk-based policies.

Upsides

Endor Labs' SCA tool is integrated with Microsoft Cloud Defender, expanding its market reach.
The company received strategic investment from Citi Ventures, boosting financial resources.
Endor Labs won 'Most Innovative Technology' award, enhancing its industry credibility.

Land your dream remote job 3x faster with AI