Staff Software Engineer, Platform
LuminaiFull Time
Expert & Leadership (9+ years)
Candidates must have strong experience in Non-Abstract Systems design and implementation, with proficiency in Python, Golang, and in-depth experience with Kubernetes (CKA or equivalent or greater). Experience with observability principles and technologies, including SLI/SLO definition and tracking, is required. Strong communication skills, both written and verbal, with experience in working with a globally distributed team, are essential. A passion for reliability and operational excellence, a low tolerance for toil, and the ability to accurately estimate work scope and coordinate with stakeholders are also necessary. Experience with software development best practices such as code review, testing, CI/CD, version control, automation, and debugging is expected, along with a proactive approach to identifying and addressing issues with a focus on ownership and accountability. Bonus points for experience working on a SaaS/PaaS product across multiple cloud providers, experience with CircleCI, Chronosphere (Prometheus), Splunk, Bazel, Istio, Playwright, Karpenter, Github [Actions], and knowledge of AWS, GCP, and Azure internals, as well as participation in an on-call rotation.
The Senior Software Engineer will make high-quality, data-driven, and experience-driven decisions on how to build and evolve the production platform. They will own and build the processes for testing, building, and deploying code in a high-scale PaaS environment. Responsibilities include collaborating across the company on production system design, setting standards, and making technology choices for new and existing products. They will deliver results by changing the production infrastructure in a predictable, safe, and reliable way. This role involves being at the forefront of team collaboration within Platform Engineering, building out the Platform/Reliability practice, and participating directly in decision-making regarding work scope and methods. The engineer will be involved in determining platform functionality, participating in incident management, and establishing sensible practices as the platform evolves. They will also create and maintain comprehensive internal documentation for systems and processes.
Data orchestration platform for pipeline management
Astronomer.io provides a data orchestration platform that utilizes Apache Airflow to simplify the deployment of data pipelines. Its main product, Astro, helps businesses manage and monitor their data flows, allowing them to focus on delivering essential data pipelines. The platform supports data unification across various clouds and offers over 1500 integrations, making it suitable for data and machine learning teams in industries like finance and e-commerce. Astronomer.io distinguishes itself from competitors by offering enterprise-grade security, zero-downtime upgrades for Airflow, and tools for monitoring pipeline health, which enhance compute efficiency and reduce delays in task scheduling. The company's goal is to empower organizations to optimize their data strategies and achieve a significant return on investment by ensuring their applications operate with maximum reliability.