Crowdstrike

Sr. Problem Management Engineer – Engineering Service Management (Remote)

California, United States

Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
CybersecurityIndustries

Position Overview

  • Location Type: Not specified
  • Job Type: Full time
  • Salary: Not specified

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day. We have 3.44 PB of RAM deployed across our fleet of C* servers - and this traffic is growing daily. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

We are seeking a Senior Engineering Problem Manager to lead the transformation of our Problem Management Engineering function. This strategic role will focus on embedding resilient, automated, and intelligent problem management practices into our engineering, operations, and platform ecosystems. You will be responsible for building technical integrations, leveraging AI/ML for advanced root cause analysis, and driving a culture of continuous learning and operational excellence. You’ll lead end-to-end delivery of initiatives that reduce incident recurrence, improve service stability, and create measurable business value — with a strong focus on automation, governance, and DevOps alignment.

Responsibilities

  • Design and implement modern problem management workflows, tightly integrated into engineering and operations toolchains.
  • Lead the governance of key problem management deliverables including post-incident action tracking, known error records, and systemic remediation.
  • Drive continuous evolution of a structured retrospective process that promotes learning and resilience engineering.
  • Partner with platform, SRE, and observability teams to automate known error workarounds, temporary fixes, and proactive health checks.
  • Utilize AIOps and ML-driven tooling to correlate events, detect patterns, and identify root causes more effectively.
  • Work closely with business units and product teams to perform business impact analysis and prioritize problem resolution based on value and risk.
  • Integrate post-incident review outcomes into continuous improvement loops, product backlogs, and technical roadmaps.
  • Maintain and evolve the tooling ecosystem supporting problem management, including dashboards, knowledge repositories, and workflows.
  • Act as a coach and change agent to promote a culture of accountability, proactive risk reduction, and shared ownership of reliability.

Key Focus Areas

  • Retrospective Process Management: Facilitate structured reviews and systemic RCA that drive long-term improvements.
  • Automation of Known Errors & Workarounds: Reduce manual overhead through scripts, workflows, and proactive detection.
  • AI-Augmented Root Cause Analysis: Integrate ML models and historical telemetry to improve diagnostic speed and accuracy.
  • Post-Incident Governance: Ensure action items are documented, assigned, and driven to closure with cross-functional visibility.
  • Business Impact Analysis: Collaborate with stakeholders to prioritize recurring problems based on cost, customer experience, and risk.
  • Toolchain Integration: Seamlessly embed problem management into DevOps tools (e.g., Jira, ServiceNow, PagerDuty, GitHub).

Requirements

  • 8+ years of experience in Engineering Operations, DevOps, Service Management, Platform/SRE Engineering.
  • Strong understanding of ITSM, particularly Problem, Incident, and Change Management.
  • Experience managing or building po

Skills

Problem Management
AI/ML
Root Cause Analysis
Automation
DevOps
Incident Reduction
Operational Excellence
Technical Integrations

Crowdstrike

Cloud-native endpoint security solutions provider

About Crowdstrike

CrowdStrike specializes in cybersecurity, focusing on protecting businesses from cyber threats through cloud-native endpoint security solutions. Their main product, the Falcon platform, includes services like Falcon Pro, which replaces traditional antivirus with next-generation antivirus that integrates threat intelligence, Falcon Insight for endpoint detection and response, and Falcon Device Control to manage connected devices. Unlike many competitors, CrowdStrike's services are subscription-based, allowing clients to choose different levels of protection based on their needs. The company serves a diverse clientele, including many Fortune 100 companies, and is recognized as a leader in the cybersecurity field, known for its effectiveness in threat detection and response.

Austin, TexasHeadquarters
2011Year Founded
$468MTotal Funding
IPOCompany Stage
Enterprise Software, CybersecurityIndustries
5,001-10,000Employees

Benefits

Competitive Employee Stock Purchase Plan
Remote-friendly culture
Market leader in compensation and equity awards
Competitive vacation and flexible working arrangements
Comprehensive health benefits + 401k plan
Paid Parental Leave, including adoption
Wellness programs
Professional development and mentorship opportunities
Open offices have stocked kitchens, coffee, soda and treats

Risks

Increased competition from companies like Lumos could challenge CrowdStrike's market share.
Recovery from last year's outage may still affect customer trust and future sales.
Pressure to demonstrate ROI by 2025 could challenge CrowdStrike's financial transparency.

Differentiation

CrowdStrike's Falcon platform offers cloud-native endpoint security solutions, a key differentiator.
The company serves 44 of the Fortune 100, showcasing its strong market presence.
CrowdStrike's proactive threat hunting sets it apart in cybersecurity threat detection.

Upsides

Partnership with SonicWall opens new SMB market segment for CrowdStrike.
Recognition as a leader in ransomware prevention boosts CrowdStrike's market credibility.
Gamified learning initiatives help address cybersecurity skills gap, benefiting future talent pipeline.

Land your dream remote job 3x faster with AI