[Remote] Site Reliability Engineer II at Atlan

India

Atlan Logo
Not SpecifiedCompensation
Junior (1 to 2 years)Experience Level
Full TimeJob Type
UnknownVisa
Software, DataIndustries

Requirements

  • Proven experience managing alerts, incidents, and root cause analyses in production environments
  • Hands-on knowledge of cloud platforms (AWS, GCP, or Azure) and Kubernetes — including networking, deployments, and troubleshooting
  • Familiarity with monitoring and observability tools such as Prometheus, Grafana, ELK/EFK, or Datadog
  • Ability to automate repetitive operational tasks using scripting (Python, Bash, or Shell)
  • Strong communication and collaboration skills — especially in distributed or remote-first teams
  • A mindset of ownership, curiosity, and calm under pressure — you thrive in incident response and turn challenges into learning opportunities

Responsibilities

  • Own and operate end-to-end reliability for critical systems — from alert triage and incident resolution to long-term preventive improvements
  • Proactively manage incidents within defined SLAs (60 mins for Critical, 180 mins for High) and ensure smooth collaboration across teams during resolution
  • Enhance observability by improving monitoring systems, refining alerts, and reducing noise to focus on what truly matters
  • Automate operations and incident workflows to eliminate manual toil, improving speed, consistency, and reliability
  • Collaborate across teams — work with Platform, Observability, and Product Engineering teams to strengthen uptime and service stability
  • Contribute to documentation and playbooks, ensuring that every incident drives learning, process improvement, and team efficiency

Skills

Alert Management
Incident Response
Automation
Observability
Monitoring
Systems Thinking
Data
Reliability Engineering

Atlan

Unified platform for data management and collaboration

About Atlan

Atlan offers a platform for data management that helps teams access and understand their data while promoting collaboration. Its unique data catalog organizes metadata, reducing data silos and integrating with various industry tools to create a comprehensive data stack. The company uses a pay-as-you-go revenue model, making it flexible for businesses of all sizes, and is known for its strong customer service with quick response times. Atlan's goal is to streamline data management processes, enhancing data accessibility and collaboration for organizations.

Singapore, SingaporeHeadquarters
2019Year Founded
$195.5MTotal Funding
SERIES_CCompany Stage
Data & Analytics, Enterprise SoftwareIndustries
201-500Employees

Benefits

Remote Work Options

Risks

Emerging startups with similar capabilities could dilute Atlan's market share.
Rapid AI evolution may require Atlan to invest heavily in R&D.
Over-reliance on key partnerships poses risks if terms change or dissolve.

Differentiation

Atlan offers a unique data catalog that organizes and activates metadata effectively.
The platform integrates seamlessly with tools like Slack, Snowflake, and Tableau.
Atlan provides a personalized, collaboration-first experience for data management.

Upsides

Atlan raised $105M in Series C funding, boosting its valuation to $750 million.
The demand for data democratization tools is increasing, benefiting Atlan's offerings.
Growing interest in AI and machine learning enhances Atlan's data processing capabilities.

Land your dream remote job 3x faster with AI