Senior Engineering Manager, Site Reliability
Ditto | Full Time
Expert & Leadership (9+ years)
Candidates must have a proven track record of building, developing, and retaining high-performing engineering teams, with over 10 years of software engineering experience and at least 5 years in leadership roles. Essential qualities include demonstrated success in driving complex technical initiatives across organizational boundaries, solid design and problem-solving skills, and a passion for engineering excellence, quality, security, and performance. Also required are strong executive presence and communication skills for effective partnership with senior leadership, experience leading distributed teams in a remote-first environment, and proficiency in cloud environments such as AWS, Azure, GCP, or OCI. Candidates need a deep understanding of distributed systems and reliability engineering principles, along with a Bachelor's degree in Computer Science or a related field, or equivalent experience. Bonus points are awarded for strategic thinking, excellence in stakeholder management, successful cross-functional collaboration, strong change management skills, and the ability to build consensus.
The Senior SRE Engineering Manager will build and develop high-performing engineering teams through mentorship, coaching, and career development, while driving cross-functional collaboration between SRE, Development, Security, and Operations teams to enhance system resilience. They will partner with executive leadership to define and execute strategic reliability initiatives, establish and maintain strong relationships with peer leaders to ensure alignment on technical direction and operational priorities, and lead organizational change management efforts to improve processes and drive engineering excellence. Key technical responsibilities include guiding architectural decisions that impact system reliability and scalability, driving the adoption of SRE best practices across multiple teams, overseeing the implementation of reliability metrics and SLOs, and championing automation and infrastructure-as-code initiatives.
Cloud-native endpoint security solutions provider
CrowdStrike specializes in cybersecurity, focusing on protecting businesses from cyber threats through cloud-native endpoint security solutions. Its main product, the Falcon platform, includes Falcon Pro, which replaces traditional antivirus with next-generation antivirus that integrates threat intelligence; Falcon Insight, for endpoint detection and response; and Falcon Device Control, for managing connected devices. Unlike many competitors, CrowdStrike offers its services by subscription, allowing clients to choose different levels of protection based on their needs. The company serves a diverse clientele, including many Fortune 100 companies, and is recognized as a leader in the cybersecurity field, known for its effectiveness in threat detection and response.