Senior Site Reliability Engineer (SRE)
CriblFull Time
Senior (5 to 8 years)
Key technologies and capabilities for this role
Common questions about this position
Responsibilities include managing AWS and EKS infrastructure with IaC and GitOps, participating in capacity planning and release management with GitLab CI, handling 24/7 on-call (~2 days/month), conducting blameless post-mortems, developing disaster recovery plans, and implementing security best practices.
Key requirements include familiarity with AWS cloud-native services, hands-on Kubernetes and Terraform IaC experience, proficiency in GitLab CI/CD pipelines, expertise in monitoring/APM/logging tools, strong problem-solving for distributed systems, performance optimization skills, and knowledge of incident management processes.
This information is not specified in the job description.
This information is not specified in the job description.
The role involves 24/7 on-call responsibilities approximately 2 days per month based on the rotation schedule to ensure continuous availability and quick issue response.
Phone and video system