Site Reliability Engineering Manager
Wikimedia FoundationFull Time
Senior (5 to 8 years), Expert & Leadership (9+ years)
Key technologies and capabilities for this role
Common questions about this position
This information is not specified in the job description.
The position is hybrid.
Key skills include experience with Terraform, Kubernetes, and Infrastructure-as-Code, along with expertise in observability, monitoring, alerting, incident management, and automation practices.
CoreWeave thrives in a dynamic environment that values adaptability, resilience, operational excellence, automation, ownership, collaboration, and continuous learning.
A strong candidate is a thoughtful leader with technical depth, strategic vision, and the ability to thrive in fast-moving, high-growth environments, valuing clarity over complexity, mentorship over management, and resilience over rigidity.
Cloud service for GPU-accelerated workloads
CoreWeave provides cloud computing services that focus on GPU-accelerated workloads, which are essential for tasks requiring high computational power. Their services cater to industries such as artificial intelligence, machine learning, visual effects rendering, and data processing. Clients can access powerful computing resources on a pay-as-you-go basis, allowing them to avoid the costs of purchasing expensive hardware. CoreWeave's infrastructure utilizes a bare metal serverless Kubernetes platform, which enhances performance while minimizing operational complexity for users. This setup enables clients to optimize their computing needs with a variety of NVIDIA GPUs, ensuring they can balance performance and cost effectively. The company's goal is to offer flexible and scalable computing solutions that meet the demands of diverse clients, from tech companies to film studios.