Senior Software Engineer - Observability
TetraScienceFull Time
Senior (5 to 8 years)
Candidates should possess 5+ years of Site Reliability Engineering or DevOps experience, deep expertise in Kubernetes administration and troubleshooting, hands-on experience deploying and maintaining observability tools such as Prometheus, Grafana, and Mimir/Cortex, a strong understanding of Helm charts, GitOps practices, and CNCF tooling, and experience with service mesh technologies like Istio. They should also have proven ability to debug complex distributed systems and networking issues, an understanding of authentication systems and security in regulated environments, and the ability to work independently and collaboratively in a remote setting. Preferred qualifications include active security clearance or the ability to obtain a Secret-level security clearance, previous experience with DoD software deployments and Platform One, and familiarity with BigBang charts and Iron Bank containers.
The Senior Site Reliability Engineer will be responsible for deploying and maintaining the observability stack across multiple customer clusters and DoD networks, building Helm chart abstractions and automation to streamline monitoring deployments, troubleshooting and debugging complex Kubernetes issues, networking problems, and monitoring stack failures, configuring and maintaining BigBang charts and DoD Platform One integrations, designing and implementing infrastructure automation using tools like Pulumi, ArgoCD, and Flux, working with Istio service mesh and Keycloak for authentication, monitoring and optimizing the performance of monitoring infrastructure, collaborating with security teams to ensure compliance with NIST requirements and DoD standards, and participating in on-call rotation and incident response for production environments.
DevSecOps platform for government software deployment
Second Front Systems connects the commercial software industry with U.S. government defense and national security sectors. Its main product, Game Warden, is a managed DevSecOps platform that simplifies the process of getting commercial software approved for government use. By integrating security practices into the software development lifecycle, Game Warden helps speed up the Authorization to Operate (ATO) process, ensuring that software meets government security standards for faster deployment. Unlike competitors, Second Front Systems focuses specifically on the needs of defense and national security professionals, providing a subscription-based service that includes ongoing updates and compliance management. The goal is to enable government agencies and defense contractors to deploy secure software solutions quickly, allowing them to concentrate on their primary missions.