Site Reliability Engineer
Stitch FixFull Time
Mid-level (3 to 4 years)
Candidates must possess proficiency in at least one programming language such as Python, Go, or Java, with familiarity across multiple language ecosystems. A solid understanding of operating systems, networking, distributed systems, databases, and storage architectures is essential, alongside a deep comprehension of how code executes on hardware, including operating systems, algorithms, and data structures. Experience in handling production incidents, performing root cause analysis, and mitigating complex system failures is required. Strong communication skills to articulate technical concepts to both engineering and business stakeholders are necessary, as is proven experience in automating manual processes, building deployment pipelines, or managing configuration systems. A Bachelor's degree in computer science or a related field, or equivalent work experience, is mandatory. Preferred qualifications include experience with cloud platforms like AWS, GCP, or Azure, familiarity with container orchestration systems such as Kubernetes, a track record of managing or developing distributed systems, prior experience with Java in production, and over 8 years of relevant experience.
The Software Engineering Site Reliability Engineer (SRE) is responsible for ensuring the reliability, scalability, and performance of software systems. This includes monitoring system performance and availability, identifying and resolving issues, and implementing proactive measures to prevent future incidents. The role involves developing and maintaining automation tools and infrastructure for software deployment, configuration management, and system monitoring. Responsibilities also include analyzing system performance, identifying bottlenecks, and implementing optimizations to enhance efficiency and scalability. The SRE will respond to incidents, conduct root cause analysis, and implement corrective actions. Collaboration with software development teams to integrate reliability and scalability considerations into software design and implementation is a key duty. The engineer will also identify opportunities for process improvement, implement best practices, and drive initiatives to enhance the overall reliability and performance of software systems.
Designs, manufactures, and sells vehicles
General Motors designs, manufactures, and sells vehicles and vehicle parts, catering to individual consumers, businesses, and government entities. The company operates in both traditional internal combustion engine vehicles and the growing electric vehicle (EV) market, generating revenue through vehicle sales and financing services. GM stands out from competitors with its commitment to community service, sustainability, and diversity, as evidenced by a majority female Board of Directors. The company's goal is to balance traditional automotive manufacturing with technological advancements in electric and autonomous vehicles.