Zefr

Site Reliability Engineer

London, England, United Kingdom

Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Biotechnology, Marketing TechnologyIndustries

About Zefr

Zefr is the leading global technology company enabling responsible marketing in walled garden social environments. Zefr’s solutions empower brands to manage their content adjacency on scaled platforms such as YouTube, Meta, TikTok, and Snap, in accordance with industry standard frameworks. Through its patented AI technology, Zefr offers brands and agencies more accurate and transparent solutions for social walled gardens. The company is headquartered in Los Angeles, California, with additional locations across the globe.

Location Type: Hybrid Employment Type: FullTime

What You'll Do

As a Site Reliability Engineer at Zefr, you’ll apply your expertise in cloud infrastructure, CI/CD, Observability, and core SRE concepts, to deliver high-quality, reliable, and scalable solutions. A significant aspect of this role involves working closely with Zefr's Engineering and Data Science teams ensuring the infrastructure required for our services is robust, efficient, and scalable.

We’re looking for someone to combine their technical expertise with strong leadership and a passion for continuous improvement and innovation. By ensuring the continuous health and efficiency of our infrastructure, you will directly contribute to Zefr’s commitment to providing a consistently high-quality user experience. This is a role where we both expect to learn from you and have you learn from us!

  • Support and build systems and tools that enable other engineers to generate, deploy, and manage product features.
  • Deploy and support a multi-cloud, micro-service architecture deployed via Github Actions, ArgoCD & Kubernetes.
  • Collaborate with other engineers to architect secure, resilient, scalable, and cost-efficient applications and systems/pipelines in AWS and GCP.
  • Foster and push our DevOps culture and philosophy by encouraging continuous improvement across all engineering teams.
  • Proactively maintain the health of production environments, including monitoring application performance and resource utilization.
  • Participate in 24/7 on-call rotation, respond to system performance issues and outages.
  • Debug code at the application and infrastructure level.
  • Mature our CI/CD workflows and release process.
  • Maintains a forward-thinking approach, actively researching and proposing new solutions.
  • Propose and review Engineering Request for Comments (RFC) to drive Engineering architecture and practices.

Technology Stack at Zefr

Core Infrastructure & Cloud Platforms

  • Cloud Providers: Google Cloud Platform (GCP), Amazon Web Services (AWS)
  • Infrastructure as Code (IaC): Terraform
  • Containerization & Orchestration: Docker, Kubernetes (experience with GKE and/or EKS expected), Helm, Kustomize
  • Service Mesh: Istio

CI/CD & Automation

  • CI/CD Pipelines: GitHub Actions
  • GitOps / Continuous Delivery: Argo CD
  • Primary Scripting/Automation Language: Python

Observability & Monitoring

  • Monitoring & Alerting: Prometheus, Datadog, Pagerduty
  • Telemetry Standards: OpenTelemetry

Application & Data Ecosystem (Supporting)

  • Application Languages/Frameworks: Python, FastAPI, Flask, Node.js, React
  • Data Streaming: Apache Kafka
  • Data Processing/Transformation: Pandas, DBT
  • Workflow Orchestration: Apache Airflow, Ray

Machine Learning Stack

  • Serving: Triton Inference Server
  • MLOps/Experiment Tracking: Weights and Biases, DVC
  • Libraries/Frameworks: Transformers, HuggingFace
  • Model Optimization/Formats: Onnx, TensorRT

Data Stores & Databases

  • Relational Databases: PostgreSQL (including managed versions like AWS Aurora, GCP Cloud SQL)
  • NoSQL Databases: DynamoDB
  • Search Databases: OpenSearch, Elasticsearch
  • Vector Databases: Qdrant
  • Caching: Redis
  • Data Warehousing: Snowflake

What We're Looking For

  • 4+ year job history designing, managing, deploying, and supporting Cloud Infrastructure in a production environment using major public cloud providers. (One of GCP or AWS required)
  • Production experience designing, managing, deploying, and maintaining container based workloads into Kubernetes clusters
  • Knowledge of GitOps

Skills

Cloud Infrastructure
CI/CD
Observability
SRE concepts
AWS
GCP
Kubernetes
ArgoCD
Github Actions
Microservices
DevOps

Zefr

Contextual advertising technology for brands

About Zefr

Zefr focuses on contextual advertising, helping brands place their ads alongside relevant content without using personal information. Their technology is especially effective for video ads on platforms like YouTube and Facebook, allowing brands to reach specific audiences while maintaining user privacy. Zefr's Contextual Data Management Platform organizes brand preferences to deliver targeted ad campaigns with high engagement rates. The company differentiates itself by prioritizing privacy-compliant advertising, making it a valuable partner for brands looking to optimize their digital marketing efforts.

Los Angeles, CaliforniaHeadquarters
2009Year Founded
$63.1MTotal Funding
SERIES_ECompany Stage
Data & Analytics, EntertainmentIndustries
201-500Employees

Risks

Emerging AI technologies may increase competition in contextual advertising.
Reliance on platforms like YouTube and Facebook exposes Zefr to algorithm changes.
Expansion into new platforms like Snapchat may pose integration challenges.

Differentiation

Zefr specializes in privacy-compliant contextual advertising, avoiding personal information usage.
Their Contextual Data Management Platform offers impression-level transparency for ad campaigns.
Zefr's Atrium suite provides transparency in walled gardens like Meta and TikTok.

Upsides

Growing demand for privacy-compliant ads aligns with Zefr's contextual advertising approach.
Expansion into Snapchat's ad ecosystem enhances Zefr's brand safety measurement capabilities.
Promotion of Jon Morra to Chief AI Officer boosts AI-driven content analysis.

Land your dream remote job 3x faster with AI