SRE / DevOps Engineer
Kraken, Full Time
Senior (5 to 8 years), Expert & Leadership (9+ years)
Candidates should hold a BS in Computer Science, Information Technology, Business / Management Information Systems, or a related field, typically with at least 2 years of relevant experience. Experience with public and private clouds, Jenkins, Terraform, Ansible, OpenShift, Kubernetes, or AWS EKS is desired.
The Site Reliability Engineer is responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. The role bridges development and operations by applying a software engineering mindset to system administration. Responsibilities include performing chaos engineering, pushing systems to their limits to improve performance, and using DevOps and GitOps practices for automation. The engineer ensures service reliability and resilience, conducts game days, and reviews designs for stability and to identify risks. The role also covers building systems that monitor infrastructure health, improving monitoring and alerting, troubleshooting system and network issues, evolving the SDLC and tooling for Site Reliability, and developing runbooks and documentation.
Payment technologies and software solutions