Director, Cloud Site Operations at Crusoe

San Francisco, California, United States

Compensation: Not Specified
Experience Level: Expert & Leadership (9+ years)
Job Type: Full Time
Visa: Unknown
Industries: AI, Cloud Computing, High-Performance Computing, Sustainable Technology

Requirements

  • 10+ years of experience in data center or cloud infrastructure operations, including 5+ years in senior leadership
  • Proven success managing global, multi-site operations for cloud or hyperscale environments
  • Deep knowledge of critical power and cooling systems, including liquid cooling for high-density GPU clusters
  • Experience building and scaling global teams in high-growth, mission-critical environments
  • Strong executive communication and cross-functional leadership skills
  • Willingness to travel internationally (25–40%)

Preferred Experience

  • Background in cloud service providers, hyperscalers, or large-scale colocation environments
  • Experience with GPU/AI workloads and HPC-optimized facilities
  • Familiarity with clean energy integration (geothermal, hydro, solar + storage) in data center operations
  • Expertise in incident management, root cause analysis, and building resilient systems at scale

Responsibilities

  • Lead 24/7 operations across Crusoe’s global fleet of GPU-focused cloud data centers
  • Ensure world-class uptime, performance, and resiliency while maintaining sustainability goals
  • Standardize operational playbooks and enforce best practices for safety, security, and compliance
  • Drive continuous improvement in efficiency (MTTR, PUE, MW utilization, NRC/OpEx per MW)
  • Manage hardware uptime and operational readiness for large-scale GPU clusters (H200, B200, GB200, MI300X, MI355X, GB300, etc.)
  • Ensure observability into performance and readiness across diverse geographies (U.S., Europe, Asia)
  • Lead and develop a distributed global team of site operations managers, engineers, and technicians
  • Build a safety-first culture focused on reliability, execution, and accountability
  • Implement scalable staffing and shift models to support rapid growth and international operations
  • Manage strategic relationships with colocation partners, OEMs, and service providers
  • Ensure SLAs are exceeded while balancing cost, quality, and sustainability
  • Partner closely with engineering, capacity planning, and product teams to align operational readiness with business growth
  • Ensure global adherence to compliance frameworks (ISO, SOC, Uptime Institute, ASHRAE, etc.)
  • Oversee physical and operational security, incident response, and root cause analysis
  • Maintain operational excellence in high-density, liquid-cooled GPU environments
  • Provide leadership updates on global site performance, capacity growth, and incident management
  • Contribute to long-term site strategy, expansion roadmaps, and scaling models to support 300k+ GPU growth
  • Serve as a thought leader for sustainable AI infrastructure, ensuring Crusoe remains at the forefront of clean compute

Skills

Data Center Operations
GPU Clusters
Operational Leadership
Site Management
Team Leadership
Safety Protocols
Compliance
Observability
MTTR Optimization
PUE Optimization
Sustainability
Global Operations
Hardware Uptime
24/7 Operations

Crusoe

Utilizes wasted energy for computing power

About Crusoe

Crusoe Energy Systems Inc. provides digital infrastructure that uses wasted, stranded, or clean energy sources to power high-performance computing and artificial intelligence. The company serves clients in the technology and energy sectors with scalable computing solutions that aim to reduce greenhouse gas emissions and support the transition to cleaner energy. Crusoe converts excess natural gas and renewable energy into computing power, which lets it maximize resource efficiency while minimizing environmental impact. Unlike many competitors, Crusoe specifically targets the intersection of energy and technology: it generates revenue by supplying computing resources, along with technical support, to enterprises that need significant computational power for applications such as AI and machine learning.

Headquarters: Denver, Colorado
Year Founded: 2018
Total Funding: $1,082.2M
Company Stage: Series D
Industries: Energy, AI & Machine Learning
Employees: 201-500

Benefits

Industry-competitive pay
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Paid life insurance, short-term and long-term disability
Parental leave
Stock options in a fast-growing, well-funded technology company
Pet-friendly offices
Teladoc
401(k) with a 4% match
Unlimited time off
Cell phone reimbursement
Tuition reimbursement
Company-paid commuter benefit: $100 per month
Calm

Risks

Increased competition in AI infrastructure could threaten Crusoe's market share.
Environmental concerns about Bitcoin mining may invite regulatory scrutiny.
Rapid expansion into AI infrastructure may lead to operational challenges.

Differentiation

Crusoe converts wasted energy into computing power, reducing environmental impact.
The company offers scalable solutions for AI and high-performance computing needs.
Crusoe's Digital Flare Mitigation technology utilizes natural gas for eco-friendly Bitcoin mining.

Upsides

Crusoe secured $600M in Series D funding, boosting AI infrastructure expansion.
Partnerships with tech firms enhance Crusoe's AI capabilities and market reach.
AI-driven energy optimization can significantly reduce operational costs in data centers.
