Principal Site Reliability Engineer
Devoted Health- Full Time
- Senior (5 to 8 years)
Candidates should possess a Bachelor’s degree in Computer Science, Engineering, or a related field, and have at least 7 years of experience in Site Reliability Engineering, with a strong focus on cloud infrastructure and automation. Experience with AWS GovCloud, FedRAMP, NIST, and DoD IL 4/5 frameworks is essential, along with expertise in Infrastructure as Code (IaC) tools like Terraform or CloudFormation. Strong knowledge of observability tools such as Prometheus, Grafana, Datadog, Splunk, and ELK is required, as well as experience with Virtual Desktop Infrastructure (VDI) solutions and Identity & Access Management (IAM) best practices.
As a Senior Site Reliability Engineer, you will architect, deploy, and maintain highly available, scalable, and secure systems in AWS GovCloud environments, automating infrastructure provisioning, scaling, and failover. You will implement SLOs, SLIs, and error budgets to drive reliability improvements, manage and optimize VDI solutions for seamless user experience, deploy and manage monitoring, logging, and alerting tools, lead incident response and postmortems, and ensure audit readiness by maintaining accurate security configurations and compliance reports.
Cloud-native endpoint security solutions provider
CrowdStrike specializes in cybersecurity, focusing on protecting businesses from cyber threats through cloud-native endpoint security solutions. Their main product, the Falcon platform, includes services like Falcon Pro, which replaces traditional antivirus with next-generation antivirus that integrates threat intelligence, Falcon Insight for endpoint detection and response, and Falcon Device Control to manage connected devices. Unlike many competitors, CrowdStrike's services are subscription-based, allowing clients to choose different levels of protection based on their needs. The company serves a diverse clientele, including many Fortune 100 companies, and is recognized as a leader in the cybersecurity field, known for its effectiveness in threat detection and response.