PagerDuty

Senior Site Reliability Engineer 3

Portugal

Compensation: Not Specified
Experience Level: Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: Information Technology & Services

Requirements

Candidates should have 5+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering roles, with deep expertise in Kubernetes administration and architecture, a strong track record of leading CI/CD and platform engineering initiatives, and experience working on cloud-native infrastructure on a provider such as AWS, GCP, or Azure. Advanced experience with monitoring, observability, and logging platforms such as Datadog, New Relic, Sumo Logic, or Splunk is required, along with proficiency in at least one programming language such as Python, Ruby, or Go.

Responsibilities

The Senior Site Reliability Engineer 3 will:

Lead the design and implementation of complex platform engineering solutions
Drive architectural decisions for the CI/CD infrastructure and Kubernetes platform
Mentor junior team members and provide technical leadership
Develop and implement strategic initiatives to improve developer experience and platform reliability
Design and implement scalable solutions for infrastructure automation using Terraform and other IaC tools
Lead post-incident reviews and drive systematic improvements
Collaborate with other engineering teams globally
Champion observability and monitoring best practices
Participate in a 24/7 on-call rotation using PagerDuty

Skills

Kubernetes
CI/CD
Terraform
Platform Engineering
Infrastructure Automation
Observability
Monitoring
Cloud-native

PagerDuty

Incident management and response platform

About PagerDuty

PagerDuty specializes in incident management and response, providing a platform that helps organizations address IT issues quickly and minimize operational disruptions. The platform integrates with various monitoring tools to detect incidents in real time and alert the right personnel for swift action. This helps reduce downtime and maintain service quality across sectors such as technology, finance, healthcare, and retail. PagerDuty operates on a subscription-based model, with pricing tiers based on user count and feature level, which provides a steady revenue stream. The company also offers premium support and professional services. Overall, PagerDuty aims to help organizations manage and resolve IT incidents efficiently so that their digital services stay reliable.

Key Metrics

Headquarters: San Francisco, California
Year Founded: 2009
Total Funding: $168.9M
Company Stage: IPO
Industries: Consulting, Enterprise Software
Employees: 1,001-5,000

Benefits

Health, AD&D, Disability, Vision, Life, and Dental Insurance
Paternity and Maternity Leave
Employee Assistance Program
PTO (Vacation / Personal Days)
Sick Time
Remote Work
Adoption Assistance
401(k)
Employee Stock Purchase Program
Flexible Spending Account
Student Loan Repayment Plan

Risks

Emerging AIOps platforms may erode PagerDuty's market share.
Economic downturns could affect subscription renewals and acquisitions.
Reliance on third-party integrations poses risks if partners change APIs.

Differentiation

PagerDuty integrates seamlessly with popular tools like Microsoft Teams and Slack.
Recognized as a leader in GigaOm's 2024 Radar for AIOps.
Subscription-based model ensures steady recurring revenue from diverse industries.

Upsides

Enhanced chat collaboration attracts more enterprise clients relying on Microsoft Teams and Slack.
Strategic focus on public sector and Americas sales expands market reach.
Investments by Intech and Quantbot indicate confidence in growth potential.
