SailPoint

Senior SRE (Site Reliability Engineer) - Remote

Mexico

Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Cybersecurity, Consumer Software, Identity SecurityIndustries

Senior Site Reliability Engineer (SRE)

Position Overview

SailPoint is the leader in identity security for the cloud enterprise. Our identity security solutions secure and enable thousands of companies worldwide, giving our customers unmatched visibility into the entirety of their digital workforce, ensuring workers have the right access to do their job – no more, no less.

We are seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to join an Identity Security Cloud software development team. This is an embedded role, meaning you will be a full member of the development team, working closely with software engineers, infrastructure platform services, engineering managers, and other stakeholders to ensure the reliability, scalability, and performance of teams’ services. You will be responsible for leveraging the infrastructure, tooling, and processes that support our applications in dev and production. This role offers a unique opportunity to directly influence the design and architecture of our systems from a reliability and performance perspective.

Responsibilities

  • Reliability Engineering: Design, develop, and implement solutions to improve the reliability, availability, performance, and scalability of our systems. Work with technical leaders and infrastructure platform services to develop alerts and dashboards.
  • Operational Excellence: Own and improve key operational metrics (SLIs, SLOs, Error Budgets, monitoring and alerting) for team-related services and drive continuous improvement through post-incident reviews and blameless postmortems of non-functional issues. Develop and maintain comprehensive monitoring, alerting to proactively identify and resolve issues. Create and maintain dashboards, conducting ongoing reviews to address and optimize gaps. Improve operational processes and team practices by working with technical leaders and NOC teams.
  • Capacity Planning: Collaborate with technical leads, DevOps/SRE, and infra teams to forecast capacity needs and ensure sufficient resources are available to support growth.
  • Performance Optimization: Collaborate with performance SMEs to identify and address production performance bottlenecks through profiling, tuning, and optimization of services and infrastructure.
  • Automation: Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
  • Collaboration: Work closely with Software, Performance, and Test Engineers to influence system design and architecture for operability and reliability.
  • Documentation: Review and contribute to clear and concise documentation for systems, processes, runbooks, and procedures.
  • On-Call: Participate in a 24/7 on-call rotation to gain subject matter expertise in the domain.
  • Incident Management: Lead the incident postmortem efforts, working with the SMEs to ensure timely compilation of reports to help drive completion of post-incident actions.
  • Troubleshooting skills: Excellent diagnostic and problem-solving skills, with the ability to analyze complex systems and data.

Qualifications

  • Bachelor’s degree in computer science, a related field, or equivalent practical experience.
  • Proven 5+ years of SRE experience.
  • Strong understanding of SRE principles and practices.
  • Experience with cloud platforms (AWS, GCP, or Azure).
  • Proficiency in at least one scripting language (e.g., Python, Bash, Go).
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Honeycomb, OpenSearch).
  • Level of coding experience beyond simple scripts with one of the programming languages such as Go, Java, or Python to help build reliability engineering; to evaluate and identify where service code can be optimized for enhanced reliability practices.
  • Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes).
  • Understanding of network protocols.

Employment Type

  • Full time

Company Information

SailPoint is the leader in identity security for the cloud enterprise. Our identity security solutions secure and enable thousands of companies worldwide, giving our customers unmatched visibility into the entirety of their digital workforce, ensuring workers have the right access to do their job – no more, no less.

Skills

Reliability Engineering
System Scalability
Performance Optimization
Monitoring and Alerting
Dashboard Development
Operational Metrics
SLIs
SLOs
Error Budgets
Post-Incident Reviews
Blameless Postmortems
Infrastructure and Tooling

SailPoint

Provides identity security solutions for enterprises

About SailPoint

SailPoint provides identity security solutions that help organizations manage and protect digital identities. Its main products, including IdentityIQ, IdentityNow, and File Access Manager, assist businesses in ensuring compliance with regulations, reducing risks, and controlling access to sensitive information. These products work by giving organizations visibility into who has access to what data, allowing them to manage permissions effectively. SailPoint stands out from competitors by utilizing advanced technologies like artificial intelligence and machine learning to enhance its identity governance capabilities. The company's goal is to be a trusted partner for enterprises in navigating the complexities of identity security, ensuring that they can securely manage access to their critical information.

Austin, TexasHeadquarters
2004Year Founded
$20.7MTotal Funding
IPOCompany Stage
Cybersecurity, AI & Machine LearningIndustries
1,001-5,000Employees

Risks

Emerging identity management startups increase competition, potentially eroding market share.
Rapid technological changes may outpace SailPoint's innovation, risking solution obsolescence.
Integration challenges with acquisitions like SecZetta may disrupt services or misalign strategies.

Differentiation

SailPoint specializes in managing and securing digital identities for enterprises.
The company leverages AI and machine learning to enhance identity security solutions.
SailPoint's IdentityIQ provides visibility and control over user access.

Upsides

Growing demand for remote work security boosts SailPoint's remote access management features.
Rising adoption of AI-driven identity analytics aligns with SailPoint's AI capabilities.
Increased regulatory requirements drive demand for SailPoint's identity governance solutions.

Land your dream remote job 3x faster with AI