Principal Site Reliability Engineer, ML Platform at Zscaler

Short Hills, New Jersey, United States

Compensation: Not Specified
Experience Level: Expert & Leadership (9+ years)
Job Type: Full Time
Visa: Unknown
Industries: Cybersecurity, Cloud Security, Artificial Intelligence, Technology

Requirements

  • 10+ years of experience in Site Reliability Engineering, cloud infrastructure, and/or applications architecture, with a strong foundation in Kubernetes and Docker
  • Proven programming expertise in Python, SQL, and distributed processing technologies such as Spark, BigQuery, or Apache Beam
  • Hands-on experience building and maintaining CI/CD pipelines, leveraging infrastructure-as-code tools like ArgoCD, Terraform, or similar
  • Strong knowledge of cloud platforms (AWS preferred, GCP acceptable), including certification or equivalent skills specific to cloud-native system management
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • Working knowledge of AI/ML pipelines and frameworks (preferred)

Responsibilities

  • Architect, build, and maintain large-scale distributed systems to support end-to-end AI pipelines, including data collection, feature engineering, model training, evaluation, deployment, and real-time serving
  • Act as the owner of Site Reliability Engineering (SRE) for AI-driven applications deployed on AWS, ensuring performance, availability, observability, and scalability
  • Collaborate with the engineering team to design and implement CI/CD pipelines, infrastructure provisioning, and scripting automation for deployment and customer-facing services, as well as robust monitoring frameworks that use tools and techniques for real-time statistics and performance tracking across production systems
  • Drive innovation and best practices in integrating Kubernetes, ArgoCD, and similar tools into cloud environments, with a focus on AI/ML pipelines and GPU-based cloud structures (e.g., SkyPilot)
  • Serve as the group's FinOps expert and AWS admin, taking ownership of hosting cost optimization and all administrative aspects of the AWS account for ZAIRe

Skills

SRE
Site Reliability Engineering
ML Platform
Cloud Infrastructure
AI/ML
DevOps
Scalability
Monitoring
Zero Trust
SASE
SSE

Zscaler

Cloud-based cybersecurity and secure gateway services

About Zscaler

Zscaler provides cloud-based information security services, focusing on internet, web, and cloud security. Its platform functions as a secure gateway that inspects all internet traffic between users and applications, ensuring that threats are identified and stopped before they can access a client's network. This service is offered through a subscription model, allowing large enterprises and government organizations to select the level of security that meets their needs. Zscaler differentiates itself from competitors by offering a strong partner program that enhances market reach and provides partners with training and resources. The company's goal is to support secure digital transformation for its clients by delivering reliable security solutions.

Headquarters: San Jose, California
Year Founded: 2008
Total Funding: $148.8M
Company Stage: IPO
Industries: Enterprise Software, Cybersecurity
Employees: 5,001-10,000

Benefits

Comprehensive health plans
Supportive parental & family leave
On-demand learning & development
Company-sponsored volunteering
Global tuition assistance program
Guilt-free paid time off

Risks

Emerging cybersecurity firms may erode Zscaler's market share.
Economic downturns could impact Zscaler's subscription-based revenue model.
The retirement of CFO Mr. Canessa may lead to financial instability.

Differentiation

Zscaler offers a 100% cloud-based security platform, eliminating on-premise hardware needs.
The company is a Gartner Magic Quadrant Leader for secure web gateways.
Zscaler's platform inspects all internet traffic, ensuring threats are neutralized before they reach a client's network.

Upsides

Zscaler's FY25 guidance was revised upward, indicating strong financial performance.
The partnership with Bharti Airtel enhances Zscaler's zero-trust architecture offerings.
Zscaler's hiring of government experts strengthens its position in the public sector.
