Consumer Data Platform - Senior Site Reliability Engineer at Procter & Gamble Company

Manila, National Capital Region, Philippines

Procter & Gamble Company Logo
Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Consumer Goods, Data PlatformIndustries

Requirements

  • 3+ years of experience as a Site Reliability Engineer (SRE), Operations Engineer, or in a similar role supporting business-critical, high-traffic services, with proficiency in using observability tools
  • Background in Software Engineering with strong programming skills in modern languages (e.g., Java, Go, or C#) and a deep understanding of object-oriented programming and design principles
  • Familiarity with various production failures, enabling early identification of risks and potential failure points
  • Ability to troubleshoot large-scale production systems by integrating insights from diverse domains and signals from monitoring tools
  • Experience designing distributed systems in cloud environments
  • Proficient in modern cloud infrastructure, with hands-on experience in GCP, Infrastructure as Code (Terraform), GKE administration, and virtualization technologies
  • Experience with CI/CD pipelines, container orchestration (e.g., Kubernetes), and infrastructure-as-code practices (e.g., Terraform)
  • Strong understanding of networking, security, and distributed systems architecture
  • Excellent critical thinking and communication skills, with strong leadership and teamwork capabilities, along with a keen enthusiasm for quickly learning new technologies
  • Self-motivated and detail-oriented, demonstrating strong analytical skills, a passion for excellence, and a commitment to continuous learning

Responsibilities

  • Collaborate with Product and Engineering teams to establish and maintain standards and best practices for system reliability, effectively managing these through Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets
  • Develop and implement long-term SRE strategies that align with the overall business objectives of the Consumer Data Platform
  • Lead and mentor junior engineers, fostering a culture of continuous learning, improvement and technical mastery within the team
  • Design and implement comprehensive observability frameworks that integrate metrics, logging, and tracing to enhance visibility across our systems
  • Lead automation initiatives aimed at reducing engineering and operational toil while developing self-healing mechanisms for enhanced system resilience
  • Design, build, and deploy product features with a strong emphasis on reliability to achieve business objectives
  • Support incident resolution for critical issues through effective troubleshooting and facilitate blameless post-mortem reviews to drive learning and improvement
  • Analyze existing system designs and configurations, providing actionable recommendations to enhance system reliability
  • Collaborate with the broader engineering team to propose improvements, recommend new standards, and share valuable insights from learnings and experiences

Skills

Key technologies and capabilities for this role

SRESLIsSLOsError BudgetsObservabilityMetricsLoggingTracingAutomationSelf-HealingIncident ResolutionPost-MortemTroubleshootingMentoring

Questions & Answers

Common questions about this position

What experience is required for this Senior SRE role?

Candidates need 3+ years of experience as a Site Reliability Engineer (SRE), Operations Engineer, or similar role supporting business-critical, high-traffic services, along with a background in Software Engineering with strong programming skills in modern languages like Java, Go, or C#.

Where is this job located?

The job is located at the Manila Net Park Office.

What technical skills are essential for this position?

Key skills include proficiency in modern cloud infrastructure with hands-on experience in GCP, Infrastructure as Code (Terraform), GKE administration, CI/CD pipelines, container orchestration like Kubernetes, and designing distributed systems in cloud environments.

Is this a remote position?

This information is not specified in the job description.

What is the salary or compensation for this role?

This information is not specified in the job description.

Procter & Gamble Company

About Procter & Gamble Company

N/AHeadquarters
N/AYear Founded
N/ACompany Stage

Land your dream remote job 3x faster with AI