Team Lead, Site Reliability Engineering at Pythian

San Francisco, California, United States

Pythian Logo
Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Technology, Cloud ServicesIndustries

Requirements

  • A minimum of 3 years previous experience leading a team
  • Experience with Google Cloud, plus IaC tools (Terraform)
  • Strong knowledge of microservices, containers (Kubernetes, Docker), and networking
  • Hands-on experience with PKI, service mesh, and Linux systems administration
  • SRE mindset

Responsibilities

  • Lead and mentor a team of Site Reliability Engineers to ensure technical excellence, timely resolution of incidents, and professional growth of team members
  • Oversee queue management, ticket prioritization, and workload distribution to meet SLA and utilization targets
  • Act as the primary point of contact for critical escalations and severity-1 incidents, providing guidance and technical direction
  • Conduct performance reviews, and knowledge-sharing sessions to strengthen the team’s capabilities
  • Collaborate with management on performance metrics, process adherence, and resource planning
  • Set specific goals and objectives for team members as part of Pythian’s goal planning program
  • Provide guidance to team members in regards to training opportunities as part of Pythian’s self-directed training program
  • Meet regularly with team members for one-on-one sessions to disseminate information and gain feedback on opportunities for improvement
  • Operate and optimize Kubernetes clusters, Istio service mesh, and Linux-based systems
  • Automate workflows using Go, Python, and Shell scripting
  • Build monitoring and observability solutions with Prometheus, Grafana, and Loki
  • Troubleshoot complex networking, storage, and system performance issues
  • Partner with AI/ML teams to ensure infrastructure readiness for model training and data pipelines

Skills

SRE
Distributed Systems
Cloud Infrastructure
Automation
Monitoring
AI/ML
AWS
Google Cloud
Oracle
Snowflake
SLAs

Pythian

Cloud migration and data management services

About Pythian

Pythian assists businesses in managing and optimizing their data and IT infrastructure through services like cloud migration, managed services, and advanced analytics. They help companies transfer their data to cloud platforms such as Google Cloud, AWS, and Microsoft Azure, while providing ongoing support for smooth operations. Pythian differentiates itself by offering specialized services in machine learning and data science, enabling businesses to turn their data into valuable insights. Their goal is to empower organizations to leverage cloud computing and advanced analytics to improve operations and drive growth.

Ottawa, CanadaHeadquarters
1997Year Founded
$20.4MTotal Funding
EARLY_VCCompany Stage
Consulting, Enterprise Software, AI & Machine LearningIndustries
501-1,000Employees

Benefits

Remote Work Options
Flexible Work Hours
Paid Vacation
Paid Sick Leave
Wellness Program
Professional Development Budget
401(k) Company Match

Risks

Emerging cloud providers offering lower-cost services increase competition.
Rapid AI advancements may outpace Pythian's current capabilities.
Economic downturns in key industries could reduce IT service spending.

Differentiation

Pythian offers specialized services in machine learning and AI for data insights.
Their EDP QuickStart provides rapid deployment of enterprise data platforms.
Pythian's global presence with experts in 22 countries enhances their service delivery.

Upsides

Growing demand for cloud migration boosts Pythian's service offerings.
Expansion in database management market offers growth opportunities for Pythian.
Increased focus on cybersecurity drives demand for Pythian's Adminiscope.

Land your dream remote job 3x faster with AI