Site Reliability Engineer at TSMG

New Hope, Pennsylvania, United States

Compensation: Not specified
Experience Level: Mid-level (3 to 4 years), Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: Social Dating, Video Streaming

Requirements

  • Several years of experience running latency-sensitive, high-traffic production systems – ideally with a video or streaming focus
  • Solid experience with AWS infrastructure (especially ECS, CloudFormation/CDK, Terraform)
  • Proficiency in Bash, Python, or TypeScript for automation and scripting
  • Hands-on experience with Docker and CI/CD tooling (GitHub Actions, Jenkins)
  • Good understanding of RabbitMQ, Redis, MySQL, PostgreSQL
  • Familiarity with Java and Node.js-based environments
  • Deep knowledge of GitHub Actions, TypeScript, and CDK (preferred)
  • Experience with observability stacks (Prometheus, Grafana, ELK, Sentry) (preferred)
  • Understanding of CDN and edge delivery strategies (preferred)
  • Familiarity with video streaming protocols (e.g. RTMP, HLS, WebRTC) (preferred)
  • Experience working in agile product teams with Scrum and DevOps principles (preferred)

Responsibilities

  • Responsibility for the reliability and performance of our live video infrastructure
  • Real-time monitoring, troubleshooting, and continuous improvements to streaming stability
  • Building automation tools that simplify deployments, monitoring, and failover processes
  • Collaboration with backend and video engineers to design fault-tolerant, scalable systems
  • Involvement in postmortems and service level planning (SLAs/SLOs)
  • Participation in an on-call rotation

Skills

Key technologies and capabilities for this role

AWS, ECS, CloudFormation, CDK, Terraform, Bash, Python, TypeScript, Docker, GitHub Actions, Jenkins, RabbitMQ, Redis, MySQL, PostgreSQL, Java, Node.js, Prometheus, Grafana, ELK, Sentry, RTMP, HLS, WebRTC

Questions & Answers

Common questions about this position

What are the minimum qualifications for this Site Reliability Engineer role?

Minimum qualifications include several years of experience running latency-sensitive, high-traffic production systems (ideally video or streaming), solid AWS experience (especially ECS, CloudFormation/CDK, Terraform), proficiency in Bash, Python, or TypeScript, hands-on Docker and CI/CD tooling experience, good understanding of RabbitMQ, Redis, MySQL, PostgreSQL, and familiarity with Java and Node.js.

What does the daily work involve as a Site Reliability Engineer?

You'll be responsible for the reliability and performance of the live video infrastructure, handle real-time monitoring and troubleshooting, build automation tools, collaborate with backend and video engineers, take part in postmortems and service level planning (SLAs/SLOs), and participate in an on-call rotation.

Is this a remote position or does it require office work?

This information is not specified in the job description.

What is the salary or compensation for this role?

This information is not specified in the job description.

What preferred skills will make me stand out for this position?

Preferred qualifications include deep knowledge of GitHub Actions, TypeScript, and CDK; experience with observability stacks like Prometheus, Grafana, ELK, Sentry; understanding of CDN and edge delivery; familiarity with video streaming protocols (RTMP, HLS, WebRTC); and experience in agile Scrum and DevOps teams.

TSMG

Specialized data collection for tech companies

About TSMG

TSMG specializes in data collection services, primarily supporting tech companies in Europe and North America. The company manages complex data collection projects, providing comprehensive support that includes recruiting participants and handling logistics. TSMG works with Fortune 500 clients to gather large volumes of accurate and diverse datasets, which are essential for improving AI systems. What sets TSMG apart from its competitors is its ability to assemble and manage high-performing teams, ensuring that projects are executed efficiently and tailored to meet specific client needs. The goal of TSMG is to empower its clients by delivering high-quality data that enhances their operational intelligence.

Headquarters: Warsaw, Poland
Year Founded: 2018
Company Stage: Venture (stage unknown)
Industries: Data & Analytics, Consulting, AI & Machine Learning
Employees: 51-200

Risks

Emerging startups offer innovative data solutions at lower costs, threatening TSMG's market share.
Rapid AI advancements may require TSMG to invest in new tools and training.
Stringent data privacy regulations in Europe could increase TSMG's compliance costs.

Differentiation

TSMG offers end-to-end data collection services for Fortune 500 companies.
The company excels in recruiting and managing high-performing teams for complex projects.
TSMG provides tailored solutions by understanding and addressing specific client needs.

Upsides

Growing demand for AI training data boosts TSMG's data collection services.
Remote work expands the talent pool for TSMG's recruitment outsourcing services.
AI integration in travel management enhances service efficiency and personalization.
