AI Agent
Resume AI
Interview Prep
Remote Jobs
Login
Sign up
Sr Systems Reliability Engineer
at
The Walt Disney Company
Nicasio, California, United States
Apply Now
Not Specified
Compensation
Senior (5 to 8 years)
Experience Level
Full Time
Job Type
Unknown
Visa
Entertainment, Media
Industries
Requirements
BS Degree in Computer Science
5+ years of experience in DevOps, Site Reliability Engineering, or a related field
Extensive AWS knowledge: EC2, ECS/EKS, Lambda, ELB, ASGs, Route53, KMS, SSM, IAM, S3, ACM, VPC, RDS, Elasticache
Proficiency with modern observability practices: application monitoring, tracing, and profiling tools (e.g. Datadog, New Relic, OpenTelemetry, Splunk)
Proficiency with GitLab CI, Terraform, Helm, and Packer
Demonstrated experience designing and managing CI/CD pipelines for complex software platforms
In-depth knowledge of Containers and Container Orchestration technologies: Docker, Kubernetes
Experience with Terraform or other infrastructure as code tooling
Strong scripting skills in Python, Bash, or similar languages
Familiarity with modern security practices for protecting sensitive assets in distributed systems
Exceptional problem-solving skills, with a proactive and collaborative mindset
Preferred Qualifications
Experience working with media and entertainment pipelines or pre-release content workflows
Proficiency with Golang, Python, or C++
Experience with modern AI/ML frameworks (e.g., TensorFlow, PyTorch, Hugging Face) and their integration into operational workflows
Knowledge of container security tools and systems, such as Falco or Aqua Security
Experience with emerging deployment systems like ArgoCD or Flux for GitOps workflows
Familiarity with serverless computing paradigms and technologies such as AWS Lambda or Google Cloud Run/Functions
Understanding of high-performance computing systems in cloud environments
Experience with administering VMWare vSphere clusters
Responsibilities
Design, manage and maintain critical infrastructure for both software development and deployed global production resources
Collaborate on the provisioning of cloud infrastructure in AWS using Terraform to ensure consistency and scalability
Maintain and manage multiple Kubernetes clusters across both cloud and on-premise environments
Implement and enforce best practices for secure software development and deployment in alignment with industry standards
Monitor, troubleshoot, and optimize build and deployment processes to maximize efficiency and minimize downtime
Collaborate with cross-functional teams, including developers and security experts, to ensure systems meet operational requirements
Develop, maintain, and enhance CI/CD pipelines using GitLab to support build automation, unit testing, and integration testing
Continuously evaluate and implement tools and technologies to improve workflows and platform reliability
Skills
AWS
Terraform
Kubernetes
Cloud Infrastructure
Security
Monitoring
Troubleshooting
CI/CD
Infrastructure as Code
The Walt Disney Company
Leading producers & providers of entertainment and information
Website
About The Walt Disney Company
N/A
Headquarters
1923
Year Founded
N/A
Company Stage
10,001+
Employees
Related Jobs
United States
Remote
Site Reliability Engineer
Close
Salary not specified
Full Time
Junior (1 to 2 years)
Remote
Remote
Staff SRE Engineer (Platform)
Phantom
Salary not specified
Full Time
Senior (5 to 8 years), Expert & Leadership (9+ years)
San Francisco
Remote
DevOps Engineer
Swish Analytics
Salary not specified
Full Time
Senior (5 to 8 years)
Remote
Remote
Principal Site Reliability Engineer
iSpot.tv
Salary not specified
Full Time
Senior (5 to 8 years)
Remote
Remote
Cloud Platform Engineer
PayPal
Salary not specified
Full Time
Junior (1 to 2 years)
United States
Remote
Senior Site Reliability Engineer (AWS, AI/ML, & APM)
Granicus
Salary not specified
Full Time
Senior (5 to 8 years)
Chicago +4 more
Remote
Senior Customer Reliability Engineer (US)
Replicated
Salary not specified
Full Time
Senior (5 to 8 years)
Palo Alto
Remote
Senior Infrastructure Engineer
Groq
Salary not specified
Full Time
Senior (5 to 8 years)
Remote
Remote
Senior Systems Engineer (AWS)
Dev Technology Group
Salary not specified
Full Time
Senior (5 to 8 years)
United States
Remote
Senior Cloud Architect - Remote US
Altera Digital Health
Salary not specified
Full Time
Senior (5 to 8 years), Expert & Leadership (9+ years)
San Antonio
Remote
Senior Tech Lead – SRE
Humana
Salary not specified
United States
Remote
Senior/Staff Software Engineer (SRE)
Chainguard
Salary not specified
Land your dream remote job 3x faster with AI
Try Jobo Free