Site Reliability Engineer 2 - Chicago, IL OR Reston, VA - Onsite at Comcast

Chicago, Illinois, United States

Compensation: Not Specified
Experience Level: Junior (1 to 2 years)
Job Type: Full Time
Visa: Unknown
Industries: Technology, Advertising, Media

Requirements

  • 1-3 years of experience as an SRE, DevOps or Operations Engineer
  • Experience with cloud platforms (e.g. AWS, OCI, GCP, Azure)
  • Hands-on experience with Terraform and infrastructure as code (IaC) principles
  • Proficiency in automation tools and frameworks (e.g. Ansible, Terraform, Kubernetes, Docker) for automating system deployment and maintenance
  • Familiarity with modern data architectures and technologies, including big data platforms (e.g., Kafka, Hadoop, Spark) and distributed storage (e.g., Cassandra, HDFS, AWS S3)
  • Extensive experience in database management (e.g. NoSQL databases, MySQL, PostgreSQL)
  • Programming Skills: Proficient in at least one programming language, such as Python, Go, Java, or Scala, with the ability to write efficient scripts and automation tools
  • System Monitoring and Log Management: Familiar with using monitoring and log management tools such as Prometheus, Grafana, ELK Stack, or other similar tools
  • Troubleshooting and Debugging: Strong debugging and troubleshooting skills, with the ability to quickly identify and resolve production issues
  • Team Collaboration and Communication: Excellent communication skills with the ability to convey technical information clearly and concisely to both technical and non-technical stakeholders
  • Proactive learner eager to grow in operations and governance
  • Education: Bachelor’s degree

Responsibilities

  • System Monitoring and Optimization: Design and implement monitoring and alerting systems to ensure the stability, reliability, and performance of data platforms. Participate in on-call shifts to quickly respond to and resolve issues
  • Automation and Tool Development: Develop and maintain automation tools and scripts for deployment, monitoring, backup and disaster recovery
  • Performance Optimization: Analyze and optimize the performance of data storage, query performance, and data flows to ensure efficient processing of large-scale datasets, reduce latency, and improve processing speed
  • Incident Response and Troubleshooting: Respond quickly to platform failures, perform troubleshooting, and coordinate cross-team efforts to resolve issues and ensure high availability and reliability of data
  • Capacity Planning and Scaling: Work with engineering teams to analyze and forecast capacity requirements, ensuring the system can handle traffic growth and scale infrastructure accordingly. Support FreeWheel-powered live events
  • Cloud Access Management and Governance: Maintain consistent cloud standards and support enforcement of governance and compliance practices across cloud environments
  • Documentation and Knowledge Sharing: Document the architecture, configurations, and operational procedures for platforms, ensuring knowledge is shared across the team and providing relevant training
  • Security and Compliance: Ensure platforms meet security standards and compliance requirements to prevent breaches or misuse
  • Cross-Team Collaboration: Collaborate with engineering, product, and project management teams to support product design and implementation and to solve reliability-related issues

Skills

SRE
System Monitoring
Alerting
Automation
Scripting
Deployment
Performance Optimization
Incident Response
Troubleshooting
Capacity Planning
Cloud Infrastructure
Disaster Recovery
On-call

Comcast

Comcast Corporation is a global media and technology company.

About Comcast

Headquarters: Philadelphia, Pennsylvania
Year Founded: 1963
Total Funding: $42.3M
Company Stage: IPO
Employees: 10,001+

Benefits

Health Insurance
Dental Insurance
Vision Insurance
401(k) Company Match
Paid Vacation
Paid Parental Leave
Tuition Reimbursement
Unlimited Paid Time Off

Risks

Competition from streaming services impacts Comcast's traditional cable TV business.
5G technology enables new competitors in the broadband market, threatening Comcast's market share.
Consumer scrutiny of data caps and pricing could lead to reputational damage.

Differentiation

Comcast's acquisition of Nitel enhances its managed services offerings in the enterprise sector.
Comcast's digital equity grants highlight its commitment to corporate social responsibility.
Comcast's involvement in rural broadband initiatives opens new markets and customer bases.

Upsides

Comcast's $150M investment in Rio Rancho boosts internet speed and connectivity.
Transform Wealth LLC's investment indicates confidence in Comcast's financial health.
Comcast's expansion efforts could lead to increased customer satisfaction and retention.
