Site Reliability Engineer
Goodleap - Full Time
- Junior (1 to 2 years)
Candidates should have a strong background in Site Reliability Engineering, including experience designing and implementing infrastructure automation and deployment pipelines. Proficiency in tools such as Terraform, Ansible, and Jenkins is required, along with expertise in cloud platforms such as AWS, GCP, or Azure. A solid understanding of monitoring and logging systems, as well as security and compliance policies, is essential. Strong collaboration skills for working with cross-functional teams and prior mentoring experience are preferred.
The Senior Site Reliability Engineer will design and implement infrastructure automation and deployment pipelines, and will maintain monitoring and logging systems to ensure the reliability and performance of the healthcare AI platform. The role involves working closely with software engineers to deploy scalable and secure production systems, developing security and compliance policies, and collaborating with teams to troubleshoot complex infrastructure issues. The engineer will also implement disaster recovery plans and maintain operational documentation.