Site Reliability Engineer
Stitch Fix
Full Time
Mid-level (3 to 4 years)
Candidates should have 5+ years of experience with Linux systems administration, a strong understanding of networking fundamentals, and hands-on experience with automation tools such as Ansible and GitLab CI/CD, along with scripting in Python or a similar language. Working knowledge of Kubernetes and container orchestration principles is essential, as is familiarity with 24/7 production support environments, change management, and operational best practices. Experience with AWS is a plus. Strong problem-solving skills and a collaborative mindset are required.
The DevOps Engineer will deploy, support, and maintain RingCentral's Kubernetes clusters, both on-premises and in AWS EKS. They will participate in incident response, troubleshooting, and root cause analysis for production issues, and join an on-call rotation covering services in the India and U.S. regions. The role also involves driving continuous improvements in platform stability, performance, and automation.
Phone and video system