Site Reliability Engineer
Stitch FixFull Time
Mid-level (3 to 4 years)
Candidates should possess a Bachelor's or Master's degree in computer science, Information Technology, or a related field, with over 4 years of professional experience in software engineering, preferably in backend or platform teams. Proficiency in programming languages like Java, Go, or Python is essential, along with experience in automation scripting, cloud platforms, container orchestration (e.g., Kubernetes), and observability stacks (e.g., Prometheus, Grafana, ELK, OpenTelemetry). Strong incident management, leadership, technical triage, and troubleshooting abilities are required, as are excellent interpersonal and communication skills.
The SRE Support and Automation Engineer will proactively monitor the health of critical services to identify and address potential issues, and collaborate with various teams to develop solutions ensuring high site availability, reliability, and performance. Responsibilities include resolving recurring technical issues, onboarding new alerts, developing high-quality Standard Operating Procedures (SOPs), and building/improving monitoring tools. The role also involves conducting reliability audits and tests, acting as Incident Commander to manage major incidents and alarms, and ensuring effective communication with leadership and partner teams.
Provides launch services for small satellites
Astra provides launch services specifically for small satellites, catering to commercial businesses, government agencies, and research institutions that need reliable access to space. The company operates small, agile rockets designed to transport these satellites into low Earth orbit (LEO). Astra's approach focuses on making space more accessible by reducing the costs and complexities associated with satellite launches, which allows a wider range of customers to utilize their services. Unlike many competitors, Astra emphasizes efficiency and cost-effectiveness in its operations, aiming to meet the growing demand for satellite-based services such as Earth observation and telecommunications. The company's goal is to facilitate more frequent and affordable satellite launches, thereby expanding opportunities for various applications in the space industry.