Lead Site Reliability Engineer (SRE) at Mattermost

United States

Compensation: Not Specified
Experience Level: Senior (5 to 8 years), Expert & Leadership (9+ years)
Job Type: Full Time
Visa: Unknown
Industries: Defense, Government, Critical Infrastructure, Technology

Requirements

  • BS in Computer Science, Cybersecurity, Software Engineering, or a related technical field, or equivalent experience, with 5+ years of relevant experience in site reliability engineering, DevOps, or cloud infrastructure roles
  • Proven expertise in container orchestration platforms, ideally Kubernetes
  • Extensive experience with infrastructure-as-code, ideally Terraform
  • Strong background in cloud platforms, ideally AWS
  • Demonstrated experience designing and implementing monitoring, alerting, and performance optimization strategies
  • Exceptional troubleshooting and incident management skills for distributed systems
  • Proficiency in at least one scripting or programming language for automation
  • Excellent communication skills with a track record of influencing cross-functional teams
  • Experience leading globally distributed teams in a remote-first environment
  • For candidates residing in the U.S.: must be a U.S. citizen, be eligible under applicable U.S. government security clearance requirements, and meet eligibility requirements for access to export-controlled information

Responsibilities

  • Define the strategy, architecture, and roadmap for Mattermost’s site reliability engineering function, aligning infrastructure initiatives with product and business goals
  • Lead the design, deployment, and optimization of production-grade containerized workloads, infrastructure-as-code, and compliant cloud environments for regulated domains (e.g., FedRAMP, DoD)
  • Establish and evolve observability, monitoring, and alerting frameworks to ensure performance, reliability, and capacity planning at scale
  • Drive incident management processes, including on-call rotations, root cause analysis, and systemic reliability improvements
  • Partner with security and compliance teams to meet data sovereignty, security, and regulatory requirements
  • Champion automation and operational excellence to improve efficiency, reduce risk, and scale operations
  • Oversee cloud cost management and capacity planning to optimize infrastructure spending while meeting performance targets
  • Build and maintain a developer platform that enables fast, secure software delivery and improves application stability in production
  • Mentor and coach SRE team members, fostering a culture of learning, collaboration, and technical excellence

Skills

SRE
Site Reliability Engineering
Infrastructure as Code
Containerized Workloads
Cloud Environments
FedRAMP
DoD Compliance
Observability
Monitoring
Alerting
Incident Management
Scalability
Automation
Kubernetes
Hybrid Environments

Mattermost

Secure collaboration platform for technical teams

About Mattermost

Mattermost provides a secure collaboration hub specifically designed for technical teams. Its platform allows users to communicate in real time, share files and code snippets, and automate workflows, all while maintaining a high level of security and control over data. Users can customize the platform to fit their needs and deploy it in various environments. Mattermost integrates with essential tools like GitHub and GitLab, enabling teams to streamline their workflows within a single interface. Unlike many competitors, Mattermost offers both premium features, such as advanced compliance and admin controls, and an open-source version, allowing for greater flexibility and customization. The company's goal is to enhance productivity and security for teams working in complex operational settings.

Headquarters: Palo Alto, California
Year Founded: 2016
Total Funding: $68.1M
Company Stage: Series B
Industries: Enterprise Software, Cybersecurity
Employees: 51-200

Benefits

Fully remote work
Office setup fund
Coworking space stipend
Internet and mobile phone reimbursement
401k
Unlimited vacation
Family & friends days
Async weeks
Health benefits
Global and regional team meetups
Open source Fridays
Community hackathons and events

Risks

Emerging open-source platforms may offer similar features at lower costs.
AI-driven tools could outpace Mattermost's current feature set.
Security vulnerabilities in open-source platforms may increase scrutiny on Mattermost.

Differentiation

Mattermost offers a secure, open-source platform for technical team collaboration.
The platform supports real-time communication and workflow automation for agile development.
Mattermost provides flexible integrations with tools like GitHub and ServiceNow.

Upsides

Rising demand for remote collaboration tools boosts Mattermost's market potential.
Emphasis on cybersecurity increases the appeal of Mattermost's secure platform.
Growth in the open-source software market expands Mattermost's community and contributor base.
