Site Reliability Engineer (SRE) at Fireblocks

United States

Compensation: Not Specified
Experience Level: Mid-level (3 to 4 years)
Job Type: Full Time
Visa: Unknown
Industries: Blockchain, Fintech, Digital Assets

Requirements

  • At least 3 years of experience as an SRE or infrastructure backend engineer in a SaaS environment
  • Curious, self-motivated, easy to work with, responsible, and production-aware; a fast learner able to take a project from POC to production, including decision-making and communication
  • Coding experience in Python/JavaScript/Bash (must-have)
  • At least 3 years of experience with alerting and monitoring systems such as Datadog / Coralogix / Splunk / New Relic / Prometheus
  • Experience working with Linux systems from kernel to shell and beyond
  • Experience with Cloud systems such as AWS / Google Cloud / Azure
  • Experience with Configuration management, such as Ansible/Chef/Puppet/ArgoCD
  • Experience with Docker, Kubernetes, and Helm
  • SCM experience: Git/Bitbucket/GitLab/Phabricator/Gerrit
  • Strong analytical and troubleshooting skills, with the ability to solve complex problems
  • Strong verbal and written communication skills and a collaborative mindset

Responsibilities

  • Improve and establish new monitoring, alerting, and observability of services using a wide range of tools
  • Handle critical alerts and incidents and work directly with R&D to improve and optimize availability
  • Research Fireblocks blockchain workflows, identify issues and optimization opportunities, and improve monitoring
  • Help identify root causes for incidents and prevent them from happening again; coordinate and resolve outages by working with multiple teams
  • Improve and establish alerting for infrastructure, services, and business logic
  • Work closely with R&D and Support, offering education and guidance on integration, support, and monitoring across the toolset
  • Communicate and escalate issues to senior management in R&D and Support, write RCAs, and define next steps
  • Document actions in runbooks, then turn them into automation using Python, Lambda, shell scripts, ArgoCD, and Ansible
  • Focus on the system's observability, availability, reliability, performance/latency, and monitoring
  • Conduct periodic on-call duties and emergency response

Skills

Key technologies and capabilities for this role

Python, Lambda, Shell Scripts, ArgoCD, Ansible, Monitoring, Alerting, Observability, Incident Response, Root Cause Analysis, Runbooks, Infrastructure, Blockchain

Questions & Answers

Common questions about this position

What experience level is required for this SRE role?

At least 3 years of experience as an SRE or infrastructure backend engineer in a SaaS environment is required.

What programming languages are must-haves for this position?

Coding experience in Python, JavaScript, and Bash is required.

What is the team structure and location setup for the SRE team?

The SRE team was recently formed, follows a 'follow the sun' model with members in several international locations for 24x7 availability, and consists of unique, experienced, independent individuals who get things done.

Is this a remote position, and are there office requirements?

This information is not specified in the job description.

What makes a candidate stand out for this SRE role?

Candidates with previous experience in cryptocurrencies or blockchains have a big advantage.

Fireblocks

Full-stack NFT building service

About Fireblocks

Fireblocks is a platform that protects digital assets in transit, securing the transfer of customers' digital assets between exchanges, counter brokers, hot wallets, and cold stores. Fireblocks enables banks, fintechs, exchanges, liquidity providers, OTCs, and hedge funds to securely manage digital assets across a wide range of products and services.

Headquarters: N/A
Year Founded: 2018
Company Stage: N/A
Employees: 201-500
