Senior Staff Design Automation Engineer
GroqFull Time
Senior (5 to 8 years)
Candidates must possess a minimum of 12 years of professional software development experience, with significant experience in a technical leadership role driving large-scale, cross-team projects. Expertise is required in designing, implementing, and operating scalable SaaS solutions using Java, Node.js, TypeScript, RESTful APIs, microservices architecture, and AWS cloud services (ECS/EKS, Lambda, DynamoDB, API Gateway). Expert-level knowledge of DevOps methodologies including infrastructure-as-code (Terraform/CloudFormation), CI/CD tooling (GitHub Actions/Jenkins), observability monitoring (Datadog/Prometheus), and containerization/orchestration (Docker/Kubernetes) is essential. A proven track record of delivering strategic cross-domain technical initiatives that align with business objectives, extensive experience influencing engineering direction across teams, setting technical roadmaps, and mentoring senior engineers are also required. The ability and willingness to participate in PagerDuty's on-call rotation for mission-critical systems is necessary.
The Staff Software Engineer V will define and lead the technical vision, architecture, and strategic direction for PagerDuty Workflow Automation, driving cross-domain initiatives and aligning them with company-wide strategic goals. They will architect and deliver complex technical solutions spanning multiple engineering teams, ensuring scalability, reliability, maintainability, and performance. Responsibilities include collaborating closely with engineering management, product leadership, and cross-functional stakeholders to identify critical technology investments for long-term growth. The engineer will act as a recognized technical leader and subject matter expert in SaaS development, workflow automation systems, and cloud-native architectures (AWS), mentoring senior engineers and fostering technical excellence. They will own the end-to-end lifecycle of technology initiatives, from inception and prototyping to implementation and operational excellence, providing technical oversight and strategic guidance. Additionally, they will drive continuous improvement in engineering practices, processes, and tools to enhance team productivity, system reliability, and customer experience.
Incident management and response platform
PagerDuty specializes in incident management and response, providing a platform that helps organizations quickly address IT issues to minimize operational disruptions. The platform integrates with various monitoring tools to detect incidents in real-time, alerting the right personnel for swift action. This process aids in reducing downtime and maintaining service quality across sectors like technology, finance, healthcare, and retail. PagerDuty operates on a subscription-based model, offering different pricing tiers based on user count and feature levels, which ensures a steady revenue stream. The company also provides premium support and professional services, enhancing its offerings. Overall, PagerDuty aims to help organizations efficiently manage and resolve IT incidents, ensuring the reliability of their digital services.