\u003C/p>\u003Cp style=\"min-height:1.5em\">\u003C/p>\u003Cp style=\"min-height:1.5em\">\u003Cstrong>This position includes a requirement to work from 9AM-3PM Eastern US, Monday to Friday. Your remaining work time is flexible.\u003C/strong>\u003C/p>\u003Cp style=\"min-height:1.5em\">\u003C/p>\u003Ch2>\u003Cstrong>What you get to do:\u003C/strong>\u003C/h2>\u003Cul style=\"min-height:1.5em\">\u003Cli>\u003Cp style=\"min-height:1.5em\">Learn and build expertise across several software engineering disciplines, including:\u003C/p>\u003Cul style=\"min-height:1.5em\">\u003Cli>\u003Cp style=\"min-height:1.5em\">Kubernetes\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Cloud engineering\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Cloud networking\u003C/p>\u003C/li>\u003C/ul>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Gain exposure to the big picture; learn about product, engineering, customer relationship management, and more.\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Spend up to 20% of your time on side projects that contribute to Astronomer’s overall success, such as contributing to the open-source Airflow repository or developing Astronomer’s internal monitoring and alerting systems built on Airflow.\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Work on a modern, sophisticated, cloud-native product that customers use to connect to dozens of other systems. Gain depth and breadth of learning!\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Work directly with our customers’ data engineers, system admins, DevOps teams, and management.\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Provide feedback from your experience that can shape the direction of Astronomer’s products\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Own the customer experience, working directly with customers to prioritize and solve issues and meet SLAs.\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Participate remotely within a fully distributed team. Approximately 2-4 in-person events per year.\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Help maintain 24x7 coverage through a specified 6-hour pager period during your work day.\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Participate in paid on-call rotation for weekend coverage.\u003C/p>\u003C/li>\u003C/ul>\u003Cp style=\"min-height:1.5em\">\u003C/p>\u003Ch2>\u003Cstrong>What you bring to the role:\u003C/strong>\u003C/h2>\u003Cul style=\"min-height:1.5em\">\u003Cli>\u003Cp style=\"min-height:1.5em\">Motivation to learn\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Commitment to excellence\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Problem-solving and troubleshooting abilities\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Willingness to identify and own problems through the full lifecycle, from vague problem to delivered solution\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Excellent written and verbal communication for connecting with our customers over our ticketing system and through Zoom\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Demonstrable Linux familiarity\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">4 years of professional experience\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Experience with Kubernetes/Docker/Containers\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Experience with any major cloud provider (AWS, GCP, Azure)\u003C/p>\u003C/li>\u003C/ul>\u003Cp style=\"min-height:1.5em\">\u003C/p>\u003Ch2>\u003Cstrong>Bonus points if you have:\u003C/strong>\u003C/h2>\u003Cul style=\"min-height:1.5em\">\u003Cli>\u003Cp style=\"min-height:1.5em\">Previous experience working directly with customers (internal or external)\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Experience with DevOps\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Contributions to open-source projects\u003C/p>\u003C/li>\u003Cli>\u003Cp style=\"min-height:1.5em\">Experience with Splunk or Prometheus\u003C/p>\u003C/li>\u003C/ul>\u003Cp style=\"min-height:1.5em\">\u003C/p>\u003Cp style=\"min-height:1.5em\">\u003C/p>\u003Cp style=\"min-height:1.5em\">\u003Cem>\u003Cstrong>The salary for this role is $140,000-$150,000, depending on experience level, along with an equity component.\u003C/strong>\u003C/em>\u003C/p>\u003Cp style=\"min-height:1.5em\">\u003C/p>\u003Cp style=\"min-height:1.5em\">\u003Cem>#LI-Remote\u003C/em>\u003C/p>\u003Cp style=\"min-height:1.5em\">\u003C/p>\u003Cp style=\"min-height:1.5em\">At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Astronomer is a remote-first company.\u003C/p>","https://jobs.ashbyhq.com/astronomer/01afeadf-ae91-40ec-84d4-f1082a084c2d","Astronomer",{"id":408,"name":406,"urlSafeSlug":406,"logo":409},"79eb3d67-3f97-46aa-beca-e75a4bc9e760","lix4t9xeotpsufw7mosi",[411],{"city":17,"region":17,"country":16},"2025-09-05T07:16:32.428Z","Candidates should have 4 years of professional experience, demonstrable Linux familiarity, and experience with Kubernetes/Docker/Containers. Experience with any major cloud provider (AWS, GCP, Azure) is required, along with motivation to learn, commitment to excellence, problem-solving and troubleshooting abilities, and excellent written and verbal communication skills. Previous experience working directly with customers, experience with DevOps, contributions to open-source projects, and experience with Splunk or Prometheus are considered bonus points.","The Customer Reliability Engineer will operate, monitor, and maintain the platform to ensure availability, predictability, and reliable operations. They will learn and build expertise in Kubernetes, cloud engineering, and cloud networking, and work on a modern, cloud-native product. Responsibilities include creating strong relationships with customers, helping them achieve their reliability goals, providing feedback to shape product direction, owning the customer experience, prioritizing and solving issues, meeting SLAs, and participating in a pager rotation for 24x7 coverage. Up to 20% of time can be spent on side projects such as contributing to the open-source Airflow repository or developing internal monitoring and alerting systems.",{"employment":416,"compensation":418,"experience":419,"visaSponsorship":426,"location":427,"skills":428,"industries":433},{"type":417},{"id":65,"name":66,"description":67},{"minAnnualSalary":17,"maxAnnualSalary":17,"currency":17,"details":17},{"experienceLevels":420},[421,425],{"id":422,"name":423,"description":424},"d9dd41a2-3551-412f-981e-de2bc1e7bb34","Mid-level (3 to 4 years)","Build upon established skills and take on more responsibility.",{"id":136,"name":137,"description":245},{"type":79},{"type":79},[85,86,429,430,431,432,376,144,202,328],"Azure","GCP","Apache Airflow","DataOps",[434,436,438],{"id":91,"name":435},"Data Management",{"id":91,"name":437},"Cloud Computing",{"id":91,"name":439},"Software Development",{"id":441,"title":442,"alternativeTitles":443,"slug":452,"jobPostId":441,"description":453,"isReformated":15,"applyUrl":454,"company":455,"companyOption":456,"locations":459,"listingDate":461,"listingSite":59,"isRemote":15,"requirements":462,"responsibilities":463,"status":18,"expiryDate":17,"isGoogleIndexed":50,"summary":464},"187978e8-1c8d-403c-b702-17636e1bb594","Platform Engineer",[444,445,216,446,11,159,270,447,220,448,449,450,162,339,451],"Cloud Platform Engineer","AWS Platform Engineer","Infrastructure Engineer","Lead Platform Engineer","Platform Operations Engineer","SRE Platform Engineer","Cloud Automation Engineer","Application Platform Engineer","platform-engineer-187978e8-1c8d-403c-b702-17636e1bb594","# Comulate - Platform Engineer\n\n**Salary:** $160K - $240K\n**Location Type:** Remote\n**Employment Type:** FullTime\n\n## Position Overview\n\nComulate is revolutionizing the insurance back office with AI, streamlining accounting processes that are traditionally expensive and time-consuming. Our platform is the first step in our vision to optimize the hundreds of billions of dollars spent on manual insurance operations. Following our Series B funding round led by BOND & Workday in early 2025, we are experiencing record growth and accelerating our expansion plans.\n\n## Why Join Comulate?\n\n* **Record-Setting Growth:** Achieved 8-figure ARR within three years, placing us in the 95th+ percentile for company growth among startups.\n* **Strong Product-Market Fit:** Our platform is described by users as \"the best thing since sliced bread\" and \"life-changing.\"\n* **Impactful Work:** Our lean, talented team builds category-defining products for large enterprises, driving significant ROI and cash-flow operations.\n* **Employee Ownership:** Benefit from outsized employee ownership with low company risk.\n* **Ambitious Vision:** We are making bold, first-to-market bets for a committed customer base and an exciting pipeline.\n* **Early Innings:** We are at the beginning of our journey to deploy AI into core insurance industry workflows.\n\n## About the Role\n\nAs our first Platform Engineer, you will be instrumental in shaping our cloud infrastructure, ensuring our systems are reliable, scalable, and secure. We need an experienced team member to implement best practices from day one to support our rapid growth.\n\n**Our Stack:** TypeScript, Node, Postgres, React, Next.js, AWS (ECS, RDS, ElastiCache)\n\nLearn more about our engineering culture and learnings on our [engineering blog](link_to_blog_if_available).\n\n## Responsibilities\n\n* Shape and manage our cloud infrastructure on AWS.\n* Ensure the reliability, scalability, and security of our systems.\n* Implement and maintain best practices for cloud infrastructure and platform engineering.\n* Execute on projects such as blue-green deployments, VPN for employees (e.g., Tailscale), and Postgres observability tooling (e.g., pganalyze or pghero).\n* Write application layer code.\n* Drive engineering best practices around application performance, monitoring, and alerting.\n* Improve CI/CD pipelines and local development tooling (e.g., hot-reloading, unit testing speed).\n\n## Requirements\n\n* 3+ years of cloud infrastructure, platform engineering, or SRE experience.\n* 1+ years of experience architecting AWS infrastructure.\n* 1+ years of experience writing application layer code.\n* Familiarity with Infrastructure as Code (IaC) tools like Terraform or CloudFormation.\n* Understanding of AWS networking, including VPCs and VPNs.\n* Proven ability to drive engineering best practices for application performance, monitoring, and alerting.\n* Experience improving developer experience, CI/CD pipelines, and local development tooling.\n\n**Nice to Have:**\n\n* Deep understanding of Postgres performance and advising on query optimization.\n* Familiarity with compliance standards such as SOC 2.\n\n## Team & Philosophy\n\nWe are backed by leading investors including BOND, Spark Capital, Neo, and Workday, as well as founders and executives from prominent tech companies like Brex, Asana, Plaid, Applied Intuition, and Coalition. Our team members come from companies such as Airbnb, Google, Brex, and LiveRamp. We maintain a low-profile approach, focusing our energy on delivering value to customers and building a category-defining company.\n\n## Benefits\n\n* Competitive base salary and generous equity.\n* Generous medical, dental, and vision benefits.\n* 401K plan enrollment.\n* Flexible time-off policy.\n* Daily lunch & dinner.\n* Paid parental leave.\n* Company outings and offsites.\n* (And more benefits as we grow!)\n\n## Company Information\n\nComulate is committed to building a diverse and inclusive culture that celebrates authenticity. We are proud to be an equal opportunity employer and do not discriminate on the basis of race, religion, color, national origin, gender, gender identity, sexual orientation, age, marital status, disability, protected veteran status, or any other legally protected characteristic.","https://jobs.ashbyhq.com/comulate/b553bfae-6722-4d3f-8585-372c98708996","Collate",{"id":457,"name":455,"urlSafeSlug":455,"logo":458},"d1d21e97-31c9-47ba-9734-acbf82743413","company_logo",[460],{"city":286,"region":287,"country":16},"2025-03-12T17:16:24.372Z","Candidates should have 3+ years of experience in cloud infrastructure, platform engineering, or SRE, with at least 1 year of experience architecting AWS infrastructure. A minimum of 1 year of experience writing application layer code and familiarity with Infrastructure as Code tools like Terraform or CloudFormation are required. Understanding of AWS networking, including VPCs and VPNs, is necessary, along with a proven ability to drive engineering best practices for application performance, monitoring, and alerting. Experience improving CI/CD pipelines and local development tooling is also expected. Nice-to-have qualifications include a deep understanding of Postgres performance and familiarity with compliance standards like SOC 2.","The Platform Engineer will be responsible for shaping the company's cloud infrastructure, ensuring systems are reliable, scalable, and secure, and executing with best practices to support explosive growth. This includes implementing solutions for blue-green deployments, VPNs for employees, and Postgres observability tooling. The role involves improving CI/CD pipelines and local development tooling to enhance developer experience. Additionally, the engineer will advise on best practices for application performance, monitoring, and alerting, and potentially contribute to Postgres performance optimization and compliance standards.",{"employment":465,"compensation":467,"experience":470,"visaSponsorship":474,"location":475,"skills":476,"industries":487},{"type":466},{"id":65,"name":66,"description":67},{"minAnnualSalary":468,"maxAnnualSalary":469,"currency":71,"details":17},160000,240000,{"experienceLevels":471},[472,473],{"id":422,"name":423,"description":424},{"id":136,"name":137,"description":245},{"type":79},{"type":79},[328,477,478,479,480,481,482,483,86,484,485,486],"Platform Engineering","SRE","TypeScript","Node","Postgres","React","Next.js","ECS","RDS","ElastiCache",[488,490],{"id":91,"name":489},"Insurance",{"id":91,"name":491},"Artificial Intelligence",{"id":493,"title":271,"alternativeTitles":494,"slug":510,"jobPostId":493,"description":511,"isReformated":15,"applyUrl":512,"company":513,"companyOption":514,"locations":517,"listingDate":519,"listingSite":520,"isRemote":15,"requirements":521,"responsibilities":522,"status":18,"expiryDate":17,"isGoogleIndexed":50,"summary":523},"7ae115c9-9fc5-485d-839d-6b94fc642091",[495,496,497,498,499,500,501,502,503,504,505,506,507,508,509],"Senior Site Reliability Engineer (SRE)","Senior DevOps Engineer (AWS)","Senior Cloud Infrastructure Engineer (AWS)","Senior SRE Engineer (CI/CD, Terraform)","Senior Platform Engineer (AWS, Ansible)","Senior Infrastructure Automation Engineer","Senior Systems Reliability Engineer","Senior Cloud Automation Engineer","Senior DevOps Infrastructure Specialist","Senior SRE Lead (AWS)","Senior Site Reliability Architect","Senior Cloud Operations Engineer","Senior Infrastructure Engineer (IaC, SRE)","Senior CI/CD Engineer (AWS)","Senior Systems Automation Engineer","senior-systems-engineer-7ae115c9-9fc5-485d-839d-6b94fc642091","## About Us\nSpreedly is the world's leading Open Payments Platform, sitting at the center of a network processing more than $50b of GMV annually. Spreedly's Payments Orchestration platform enables and optimizes digital transactions with the world’s most complete payment services marketplace. Built on Spreedly’s PCI-compliant architecture, our Advanced Vault solution combines a modern feature-set with rule-based configurations to optimize the vaulting experience for all stored payment methods. Global enterprises and hyper-growth companies grow their digital business faster by relying on our payments platform. Hundreds of customers worldwide secure card data in our PCI-compliant vault and use tokenized card data to enable and optimize over $45 billion of annual transaction volumes with any payment service.\n\nOur vision is that the world is better with a diversified, inclusive payment ecosystem. Our mission is to accelerate commerce with an open, secure, and flexible payment platform that welcomes all payment participants. Our employees help us execute our vision by building a culture focused on autonomy, transparency, and collaboration in a dynamic, high-growth organization.\n\n## Product Offering\nSpreedly provides an open payments platform. The platform’s connectivity provides payments performance. Key products and services include:\n\n- **Payment Gateway Integration**: Connects merchants, platforms, and marketplaces to multiple payment gateways and payment services.\n- **Tokenization**: Securely stores and manages payment data with a universal tokenization service.\n- **Transaction Routing**: Enables intelligent routing of transactions to optimize success rates and costs.\n- **Payment Vault**: A secure storage solution for sensitive payment information.\n- **Fraud Tools Integration**: Integrates with various fraud prevention tools to enhance transaction security.\n\n## About the Role\nAs a **Senior Systems Engineer** at Spreedly, you'll play a key role in scaling and strengthening a platform that demands resilience at every layer. This position is grounded in practical execution, as you'll be central to initiatives that advance reliability and operational maturity across our infrastructure. You'll collaborate with teams throughout the organization to simplify systems, enhance stability, and ensure that changes are safe, observable, and repeatable.\n\n## Responsibilities\n- **Automation & CI/CD**: Support and evolve our build and deployment pipelines (AWS Developer Tools, GitHub Actions) to improve reliability, speed, and developer autonomy.\n- **Infrastructure as Code**: Implement, improve, and maintain IaC (Terraform, Ansible, Packer) to provision and manage infrastructure in a repeatable, auditable, and scalable manner.\n- **Reliability & Observability**: Apply SRE principles to proactively monitor system health and meet strict availability targets. Use tools like Datadog, AWS native tooling, and OpenTelemetry to create actionable dashboards and alerts, enabling adherence to crucial SLOs.\n- **System Resiliency**: Stabilize critical infrastructure by designing for fault tolerance at every layer through early failure detection, graceful degradation, and automated recovery mechanisms. Continuously reduce MTTD and MTTR through continuously improved alerting, runbooks designed for rapid execution, and streamlined recovery workflows.\n- **Platform Modernization**: Improve infrastructure maturity through clear, incremental changes that promote simplicity, reduce legacy complexity, and strengthen the integrity and standardization of the platform as it evolves.\n- **Security & Compliance**: Contribute to infrastructure that meets compliance standards by ensuring controls around access, data protection, and deployment integrity are built into the platform.\n- **Simplicity & Maintainability**: Build and maintain a clean, well-documented, and consistent platform. Favor clarity, shared ownership, and design choices that minimize operational overhead.\n- **Operational Support**:","https://jobs.lever.co/spreedly/de42232a-faca-4149-addc-d0fe66cd2acb","Spreedly",{"id":515,"name":513,"urlSafeSlug":513,"logo":516},"146d9566-34e7-4368-8027-2a45c63a73d7","e18f3q4lgxbmpgpdpd2c",[518],{"city":17,"region":17,"country":16},"2025-08-01T00:00:00Z",15,"The ideal candidate will have experience with AWS Developer Tools, GitHub Actions, Terraform, Ansible, Packer, Datadog, AWS native tooling, and OpenTelemetry. A background in Site Reliability Engineering (SRE) principles and a focus on building secure, reliable, and maintainable infrastructure are essential.","The Senior Systems Engineer will support and evolve CI/CD pipelines, implement and maintain infrastructure as code using Terraform, Ansible, and Packer, and apply SRE principles to monitor system health and meet availability targets. Responsibilities also include designing for fault tolerance, stabilizing critical infrastructure, modernizing the platform through incremental changes, contributing to compliance standards, and building a simple, well-documented, and maintainable platform.",{"employment":524,"compensation":526,"experience":527,"visaSponsorship":531,"location":532,"skills":533,"industries":537},{"type":525},{"id":65,"name":66,"description":67},{"minAnnualSalary":17,"maxAnnualSalary":17,"currency":17,"details":17},{"experienceLevels":528},[529,530],{"id":136,"name":137,"description":245},{"id":75,"name":76,"description":77},{"type":79},{"type":79},[534,535,144,437,194,196,536,378,197,202,382,379,85,86],"Systems Engineering","Platform Scaling","Security",[538,539,541],{"id":153,"name":154},{"id":91,"name":540},"Payments",{"id":91,"name":542},"SaaS",{"id":544,"title":442,"alternativeTitles":545,"slug":551,"jobPostId":544,"description":552,"isReformated":15,"applyUrl":553,"company":554,"companyOption":555,"locations":558,"listingDate":563,"listingSite":59,"isRemote":15,"requirements":564,"responsibilities":565,"status":18,"expiryDate":17,"isGoogleIndexed":50,"summary":566},"c54bb151-44f8-4948-a45f-ed90fec740f6",[444,216,446,11,159,270,546,445,547,548,549,550,220,449],"GCP Platform Engineer","Kubernetes Platform Engineer","Terraform Engineer","Backend Platform Engineer","Systems Engineer","platform-engineer-c54bb151-44f8-4948-a45f-ed90fec740f6","### Position Overview\n- **Location Type:** Remote\n- **Job Type:** Full-Time\n- **Salary:** Not Specified\n\nKnock is on a mission to help products communicate with their users in a more thoughtful way. Building product notifications in-house takes months, often leading to poor user experiences. We believe that—when done right—product notifications help users find value in the products they use every day. That’s why we built Knock.\n\nWe're a remote-first (with a NYC base) Series A startup of 25 employees that believe in the power of great software. We're APIs all the way down at Knock—Stripe for payments, Algolia for search, WorkOS for SSO. We're excited to add Knock to that list and to push forward the API-first movement. If you are, too, come join us and let's build something great together.\n\nWe’re backed by top investors and operators including Craft Ventures, Afore Capital, Preface Ventures, Worklife Capital, Guillermo Rauch (CEO/Founder @ Vercel), Scott Belsky (CPO @ Adobe), Adam Gross (CEO @ Heroku), and John Kodumal (CTO @ LaunchDarkly), to name a few.\n\n### About the Role\nWe’re looking for a software engineer to join our small but growing platform team. Platform engineering at Knock is the foundation for everything else we do. Because Knock is built by and for engineers, there’s a blurry line between “platform” and “product.” The product is the platform.\n\nThe platform team is a specialized engineering team at Knock that designs, builds, and maintains the infrastructure and services needed in order to run our product. They are responsible for the availability and reliability of our service, from API to notification engine execution. They support the product engineering team in their role to build and ship customer-facing features.\n\nKnock’s platform engineers orient around measurable goals set by our CTO, and work with a high degree of autonomy to achieve greater scale, resilience, and performance for our customers & partners. This is a high-agency role. As a team member, you’ll craft and own initiatives to help us hit our goals. The team collaborates heavily, but each individual has decision-making authority over their initiatives.\n\nWe care deeply about building a team and culture that is inclusive and equitable for people of all backgrounds and experiences, and believe firmly that the best teams are diverse. We particularly encourage people from underrepresented communities to apply.\n\nLast thing: you can be a great fit even if you don't perfectly match the requirements below. We know there's a lot we don't know and haven't thought of yet, and we're looking for teammates who can tell us what those things are. If that's you, don't hesitate to apply and tell us about yourself!\n\n### What You'll Be Doing in this Role\nAs a platform engineer you’ll contribute across a range of scaling, product and DX initiatives. By way of example, here are some platform team highlights from the last year:\n- Facilitated a 8x YoY increase in monthly messages sent\n- Significantly improved latency and margins of our observability product features by adopting ClickHouse for event data\n- Zero-downtime Postgres upgrade from 11.9 → 15.3\n- Instrumented our services with distributed tracing\n\nOver the next year you may find yourself working on:\n- Scaling our service for the next several growth multiples (billions of txns/month)\n- Large-scale dynamic user segmentation\n- Multi-region support\n- Canary deploys\n\n### What We're Looking For in this Role\n- We’re looking for a senior engineer (5+ years of experience).\n- While experience with our stack is not required, you should be very comfortable with multiple levels of the modern backend stack. For example, you might have experience with GCP or AWS, a modern language, container orchestration, and a tool such as Chef or Ansible. We work primarily with Elixir, Terraform, Kubernetes, and AWS.\n- We’d love to see expertise in an area that complements our team’s skillset, be it databases, event-driven architectures, security, dev tooling, etc.\n- The ideal candidate will have experience building and operating large-scale production systems.","https://jobs.ashbyhq.com/knock/beee9279-e40a-4944-ab20-dd473998b429","Knock",{"id":556,"name":554,"urlSafeSlug":554,"logo":557},"9ee8d8eb-7c6b-4523-a8a4-2bf5a162a1d0","bropegpbvpqs2ju3a360",[559,561],{"city":560,"region":560,"country":16},"New York",{"city":17,"region":17,"country":562},"Remote","2025-06-04T08:05:49.578Z","Candidates should have 5+ years of experience as a software engineer, with a strong understanding of the modern backend stack, including experience with GCP or AWS, a modern language, container orchestration, and tools such as Chef or Ansible. Familiarity with Elixir, Terraform, Kubernetes, and AWS is preferred.","As a platform engineer, you will contribute to scaling initiatives, product features, and developer experience improvements, such as facilitating a 8x YoY increase in monthly messages sent, improving latency and margins of observability features, performing zero-downtime database upgrades, and instrumenting services with distributed tracing. You may also work on scaling the service for future growth, implementing large-scale dynamic user segmentation, and enabling canary deployments.",{"employment":567,"compensation":569,"experience":570,"visaSponsorship":573,"location":574,"skills":575,"industries":581},{"type":568},{"id":65,"name":66,"description":17},{"minAnnualSalary":17,"maxAnnualSalary":17,"currency":17,"details":17},{"experienceLevels":571},[572],{"id":370,"name":371,"description":17},{"type":79},{"type":79},[576,144,577,578,579,580],"API development","Services","Reliability engineering","Performance optimization","Platform engineering",[582,583,585],{"id":91,"name":439},{"id":91,"name":584},"API Services",{"id":91,"name":328},{"id":587,"title":588,"alternativeTitles":589,"slug":600,"jobPostId":587,"description":601,"isReformated":15,"applyUrl":602,"company":603,"companyOption":604,"locations":608,"listingDate":610,"listingSite":180,"isRemote":15,"requirements":611,"responsibilities":612,"status":18,"expiryDate":17,"isGoogleIndexed":50,"summary":613},"53455143-0ca8-4179-9fd4-041dccb8ce7d","Staff Software Engineer, Platform",[270,590,591,495,592,31,447,593,594,595,596,500,597,598,599],"Staff Cloud Engineer","Principal Platform Engineer","Staff Infrastructure Engineer","Staff Kubernetes Engineer","Senior AWS Engineer","Staff SRE","Principal Cloud Engineer","Staff DevOps Architect","Senior Platform Architect","Staff Systems Engineer","staff-software-engineer-platform-53455143-0ca8-4179-9fd4-041dccb8ce7d","## Job Overview\nOmada Health is a remote-first healthcare and technology company on a mission to inspire and engage people in lifelong health, one step at a time. We are seeking an experienced engineer to join our Platform Engineering team. This team is responsible for the operational integrity of our infrastructure and DevOps processes, helping other engineers work more efficiently.\n\n## About You\nYou are a thoughtful engineer with empathy for teammates, partners, customers, and Omada members. You excel at cross-functional collaboration, taking initiative to meet business needs. You work directly with stakeholders to design solutions and drive technical decisions. You are motivated to learn new technologies and can be trusted to plan, develop, and deliver exemplary technical work. You are humble, flexible in your thinking, and committed to providing the best possible solutions. You have years of hands-on experience building scalable cloud infrastructure and collaborating with talented individuals to achieve significant goals.\n\n## Your Impact\n- Contribute to the development and operation of Omada’s digital health program infrastructure.\n- Play a key role in building and scaling our platform to support personalized experiences for program participants and empower health coaches.\n- Collaborate closely (including pair programming) with the Platform team and internal customers from Product Engineering, InfoSec, IT, and other departments.\n- Build cross-functional relationships with software engineers, AI/ML engineers, data scientists, and technical program managers to understand their needs and deliver solutions.\n- Manage high availability services, with a focus on designing redundancy and disaster recovery readiness.\n- Work with developers to design and improve developer tooling and capabilities, such as secrets management with Vault or cloud development environments.\n- Maintain and enhance our Gitlab CI/CD infrastructure.\n- Mentor and guide the professional and technical development of team members.\n\n## You Will Love This Job If You:\n- Want to make a difference in how people manage real-life health problems.\n- Prefer to work in an agile, test-driven, pair programming, continuous delivery environment.\n- Are eager to leverage AI tools across all aspects of the software development lifecycle.\n- Seek to collaborate with AI experts to integrate AI into existing systems, expanding your knowledge and skills.\n- Are interested in working with modern technologies including AWS EKS, Terraform, Datadog, and GitLab.\n- Enjoy mentoring and learning from teammates, collaborating with developers to write great code and solve problems together.\n\n## Requirements for this Role\n- **8+ years** of working experience designing and building diverse AWS cloud architectures.\n- **Expert experience** building and operating Kubernetes clusters.\n- **Deep experience** with Terraform and Ansible.\n- Experience working with Datadog application performance monitoring, custom metrics, and logs.\n- Comfortable managing, configuring, and troubleshooting persistent virtual Linux hosts and volumes.\n- **Skilled** at building, deploying, and orchestrating Docker images and containers.\n- **Great communication skills** – both written and verbal. Skilled at writing documentation for others.\n- Readiness to participate in an on-call rotation.\n\n## Bonus Points For:\n- Experience managing self-hosted Gitlab.\n- Experience writing Golang, Ruby, or Python.\n- Experience working with PostgreSQL, Redis.","https://job-boards.greenhouse.io/omadahealth/jobs/7170806","Omada Health",{"id":605,"name":603,"urlSafeSlug":606,"logo":607},"32a4ce12-d92c-4636-a4cd-b7c1c0459ef0","Omada-Health","wxdvxnvmgfynvcvm4zpo",[609],{"city":17,"region":17,"country":16},"2025-08-24T03:44:17.517Z","The ideal candidate has 8+ years of experience designing and building AWS cloud architectures, expert experience with Kubernetes clusters, and deep experience with Terraform and Ansible. They should also have experience with Datadog application performance monitoring, managing Linux hosts and volumes, and building/deploying Docker containers. Strong communication and documentation skills are essential, as is a willingness to participate in an on-call rotation. Bonus points for experience managing self-hosted GitLab and proficiency in Golang, Ruby, or Python.","The Staff Software Engineer will be responsible for the development and operation of Omada’s digital health program infrastructure, building and scaling the platform to support personalized experiences and empower health coaches. This includes collaborating closely with the Platform team and internal customers, managing high availability services with redundancy and disaster recovery, and improving developer tooling like secrets management and cloud development environments. The role also involves maintaining and enhancing GitLab CI/CD infrastructure and mentoring team members on professional and technical development.",{"employment":614,"compensation":616,"experience":617,"visaSponsorship":621,"location":622,"skills":623,"industries":626},{"type":615},{"id":65,"name":66,"description":67},{"minAnnualSalary":17,"maxAnnualSalary":17,"currency":17,"details":17},{"experienceLevels":618},[619,620],{"id":136,"name":137,"description":245},{"id":75,"name":76,"description":77},{"type":79},{"type":79},[477,328,83,145,624,625],"Pair Programming","Cross-functional Collaboration",[627,628],{"id":94,"name":95},{"id":205,"name":206},{"id":630,"title":631,"alternativeTitles":632,"slug":647,"jobPostId":630,"description":648,"isReformated":15,"applyUrl":649,"company":650,"companyOption":651,"locations":654,"listingDate":656,"listingSite":180,"isRemote":15,"requirements":657,"responsibilities":658,"status":18,"expiryDate":17,"isGoogleIndexed":50,"summary":659},"095fdc32-6692-4440-a5be-8ce57efd1840","Staff Software Engineer - SRE, Backend (Reliability Engineering)",[28,29,37,633,634,30,635,636,637,274,638,639,640,641,642,643,644,645,46,47,646],"Senior SRE Engineer","Staff SRE Engineer","Senior Backend Reliability Engineer","Staff Backend Reliability Engineer","Principal Backend Reliability Engineer","Staff Production Engineer","Principal Production Engineer","Senior SRE Backend Developer","Staff SRE Backend Developer","Principal SRE Backend Developer","Senior SRE Systems Engineer","Staff SRE Systems Engineer","Principal SRE Systems Engineer","Principal SRE Platform Engineer","staff-software-engineer-sre-backend-reliability-engineering-095fdc32-6692-4440-a5be-8ce57efd1840","### Position Overview\n\n* **Location Type:** Remote\n* **Employment Type:** Full-time\n* **Salary:** Not specified (Refer to Affirm's pay structure for details)\n* **Base Pay Grade:** P\n* **Equity Grade:** 13\n\nAffirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. Site Reliability Engineering at Affirm is a small, yet crucial, team that helps our Engineering partners to “Operate What They Own” with excellence to protect their customers’ experience. SRE accomplishes this through defining frameworks and best practices for operating applications, building tooling, and providing training and consulting.\n\n### Requirements\n\n* 7+ years of experience designing, developing, and launching backend systems at scale using languages like Python or Kotlin.\n* Extensive track record of developing highly available distributed systems using technologies like AWS, MySQL, Spark, and Kubernetes.\n* 7+ years experience in a Site Reliability or Production Engineering team.\n* Demonstrates curiosity with empathy, and strong opinions loosely held.\n* Experience delivering major features, system components, or deprecating existing functionality in a system through the definition of a technical and execution plan.\n* Writes high-quality code that is easily understood and used by others.\n* Strong verbal and written communication skills for effective collaboration with a global engineering team.\n* Either equivalent practical experience or a Bachelor’s degree in a related field.\n\n### Responsibilities\n\n* Providing data and visibility to teams and leadership on application performance.\n* Guiding the development of SLOs (Service Level Objectives).\n* Driving the Incident Management and Analysis process.\n* Steering the implementation of Change Management and Deployment practices.\n* Engaging in service and architectural conversations.\n* Recommending observability and alerting configurations.\n* Setting technical strategy for the team on a year-long time scale.\n* Collaborating across teams in the product development lifecycle with product management, design, and analytics to ensure technical sustainability, risks, and trade-offs are well understood and managed.\n* Acting as a force-multiplier for the team through definition and advocacy of technical solutions and operational processes.\n* Taking ownership of the team’s operations and availability by ensuring the right monitoring, triage rotations, playbooks, policies, testing, and alerting are in place to support “keep the lights on” & on-call efforts.\n* Fostering a culture of quality and ownership on the team by setting code review and design standards, and advocating for them beyond the team through writing and tech talks.\n* Helping develop talent on the team by providing feedback and guidance, and leading by example.\n\n### Company Information\n\nAffirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest.","https://job-boards.greenhouse.io/affirm/jobs/6356178003","Affirm",{"id":652,"name":650,"urlSafeSlug":650,"logo":653},"2e061730-c007-4f76-a90f-13172035968a","xjv7yomvmvgq2zcdvckv",[655],{"city":17,"region":17,"country":16},"2025-05-21T08:02:30.274Z","Candidates should have 7+ years of experience designing, developing, and launching backend systems at scale using languages like Python or Kotlin, 7+ years of experience in a Site Reliability or Production Engineering team, and a Bachelor’s degree in a related field or equivalent practical experience. They should demonstrate curiosity with empathy and strong opinions loosely held, and experience delivering major features, system components, or deprecating existing functionality through the definition of a technical and execution plan.","The Staff Software Engineer - SRE will set technical strategy for their team on a year-long time scale, collaborate across teams in the product development lifecycle, act as a force-multiplier for their team by defining technical solutions and operational processes, foster a culture of quality and ownership on their team, and help develop talent on their team by providing feedback and guidance. They will also be responsible for providing data and visibility to teams and leadership on application performance, guiding the development of SLOs, driving the Incident Management and Analysis process, steering the implementation of Change Management and Deployment practices, engaging in service and architectural conversations, and recommending observability and alerting configurations.",{"employment":660,"compensation":662,"experience":663,"visaSponsorship":666,"location":667,"skills":668,"industries":675},{"type":661},{"id":65,"name":66,"description":17},{"minAnnualSalary":17,"maxAnnualSalary":17,"currency":17,"details":17},{"experienceLevels":664},[665],{"id":136,"name":137,"description":17},{"type":79},{"type":374},[669,376,670,378,84,671,380,672,673,674],"Software Engineering","Distributed Systems","Configuration Management","Capacity Management","Load Testing","Chaos Testing",[676,678],{"id":91,"name":677},"Financial Technology",{"id":153,"name":154},{"id":680,"title":28,"alternativeTitles":681,"slug":690,"jobPostId":680,"description":691,"isReformated":15,"applyUrl":692,"company":693,"companyOption":694,"locations":698,"listingDate":700,"listingSite":180,"isRemote":15,"requirements":701,"responsibilities":702,"status":18,"expiryDate":17,"isGoogleIndexed":50,"summary":703},"cc1854ff-8c48-4ea8-b712-22b5e81e16c0",[633,158,37,682,683,684,501,685,38,686,505,42,687,688,689],"Senior DevOps Engineer (SRE Focus)","Senior Cloud Reliability Engineer","Senior Infrastructure Engineer (SRE)","Senior ML Operations Engineer","Senior Production Engineer (SRE)","Senior Cloud Infrastructure SRE","Senior Reliability and Operations Engineer","Senior SRE Manager","senior-site-reliability-engineer-cc1854ff-8c48-4ea8-b712-22b5e81e16c0","## About Us\nRed Cell Partners is an incubation firm building and investing in rapidly scalable technology-led companies that are bringing revolutionary advancements to market in three distinct practice areas: healthcare, cyber, and national security. United by a shared sense of duty and deep belief in the power of innovation, Red Cell is developing powerful tools and solutions to address our Nation’s most pressing problems.\n\n## About Trase\nCo-founded in 2023 by Joe Laws and Grant Verstandig, Trase Systems is AI, Uncomplicated. Trase empowers enterprise leaders to harness the full potential of AI without the associated complexity and risks. We are an end-to-end solution for deploying, managing, and optimizing AI in the enterprise. Our platform specializes in bridging the “last mile” of AI adoption, unlocking AI's full potential while driving efficiency and significant cost savings. Trase is at the forefront of AI Agent innovation, topping the Hugging Face GAIA Leaderboard for Generalized AI Assistants, ahead of industry giants such as Google, Meta, Microsoft, and OpenAI. We are leveraging our cutting-edge technologies to develop mission-critical agentic applications in complex industries such as Healthcare, Oil & Gas, and National Security.\n\n## About the Role\nAre you passionate about building and maintaining the resilient, scalable infrastructure that powers cutting-edge AI? Do you excel at ensuring the reliability and performance of complex, distributed systems? If you thrive on automating, monitoring, and optimizing the platforms that enable machine learning innovation, we have an exciting opportunity for you as a Site Reliability Engineer at Trase Systems.\n\nAs a Site Reliability Engineer, you will be a cornerstone of our engineering team, responsible for the availability, latency, performance, and capacity of Trase's AI platform. You will work closely with our ML engineers, software engineers, and product teams to build and operate the infrastructure that runs our advanced AI agents and machine learning models. If you are a proactive problem-solver with a passion for building highly reliable systems in a fast-paced, innovative environment, we invite you to join our mission.\n\n## Responsibilities\n- **Design, Build, and Maintain Core Infrastructure**: Architect and implement scalable, highly available, and secure infrastructure on cloud platforms (GCP, AWS, Azure) to support our AI-driven applications and services.\n- **Automate Everything**: Develop and maintain automation tools and frameworks to eliminate manual effort in deployment, configuration, and management of our production environment.\n- **Ensure System Reliability and Performance**: Establish and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for our production systems. Proactively identify and resolve performance bottlenecks and availability issues.\n- **Manage ML Infrastructure and Pipelines**: Collaborate with ML engineers to build and maintain robust CI/CD pipelines for machine learning models, ensuring seamless training, deployment, and monitoring.\n- **Incident Response and Post-Mortems**: Lead incident response efforts to minimize downtime and conduct thorough post-incident reviews to identify root causes and implement preventative measures.\n- **Implement and Enhance Observability**: Deploy and manage comprehensive monitoring, logging, and tracing solutions (e.g., Prometheus, Grafana, ELK stack) to provide deep visibility into system health.\n- **Capacity Planning and Cost Optimization**: Forecast infrastructure needs and optimize resource utilization to ensure our platform can scale efficiently and cost-effectively.\n- **Foster a Culture of Reliability**: Champion SRE best practices across the engineering organization and mentor team members on reliability, performance, and scalability.\n\n## Requirements\n- **Proven SRE and DevOps Experience**: Demonstrated experience in a Site Reliability Engineering or DevOps role, managing complex, large-scale production environments.","https://job-boards.greenhouse.io/redcellpartners/jobs/4793863007","Red Cell Partners",{"id":695,"name":693,"urlSafeSlug":696,"logo":697},"2c601b3b-00b0-48a9-bc40-c69bb6ec5270","RedCellPartners","zznxo2xrsb1vzlwmdok0",[699],{"city":17,"region":17,"country":16},"2025-08-15T07:16:42.224Z","Candidates should have proven Site Reliability Engineering and DevOps experience, with demonstrated experience in managing complex, large-scale production environments. Experience with cloud platforms such as GCP, AWS, or Azure is required, along with expertise in automation tools and frameworks, monitoring solutions like Prometheus, Grafana, and the ELK stack, and CI/CD pipelines for machine learning models.","The Senior Site Reliability Engineer will design, build, and maintain scalable and highly available cloud infrastructure, automate deployment and management processes, and ensure system reliability and performance by establishing and monitoring SLOs and SLIs. They will also manage ML infrastructure and pipelines, lead incident response efforts, implement observability solutions, conduct capacity planning, and foster a culture of reliability across the engineering team.",{"employment":704,"compensation":706,"experience":707,"visaSponsorship":710,"location":711,"skills":712,"industries":716},{"type":705},{"id":65,"name":66,"description":67},{"minAnnualSalary":17,"maxAnnualSalary":17,"currency":17,"details":17},{"experienceLevels":708},[709],{"id":136,"name":137,"description":245},{"type":79},{"type":79},[82,144,713,670,378,202,714,715,437,85,379],"AI Platform","Optimization","Machine Learning",[717,718,721,722,725],{"id":91,"name":491},{"id":719,"name":720},"2f8d3a99-57b5-4365-b0e6-328b91298787","AI & Machine Learning",{"id":94,"name":95},{"id":723,"name":724},"efd54195-3c42-4e0e-b49f-7fa970ca772f","Cybersecurity",{"id":91,"name":726},"National Security",{"id":728,"title":729,"alternativeTitles":730,"slug":734,"jobPostId":728,"description":735,"isReformated":15,"applyUrl":736,"company":737,"companyOption":738,"locations":741,"listingDate":744,"listingSite":59,"isRemote":15,"requirements":745,"responsibilities":746,"status":18,"expiryDate":17,"isGoogleIndexed":50,"summary":747},"6d7861a5-21b5-42ee-82c5-79bd83c7c0f9","Software Engineer, Infrastructure (5+ years of experience)",[731,159,216,11,442,269,222,220,550,162,444,31,732,733,271],"Infrastructure Software Engineer","Reliability Engineering Lead","Cloud Infrastructure Specialist","software-engineer-infrastructure-5-years-of-experience-6d7861a5-21b5-42ee-82c5-79bd83c7c0f9","### Position Overview\n- **Location Type:** Remote\n- **Employment Type:** Full-Time\n- **Salary:** Not specified\n\nAnrok is building the tools behind the scenes that make compliant digital commerce a reality for companies big and small. We connect with billing and payment systems to automate sales tax compliance end-to-end. We have raised over $50M from leading investors like Sequoia, Index, and Khosla Ventures. This role focuses on designing, building, and operating the systems that support our product and the engineers who build it.\n\n### Requirements\n- **Experience:** 5+ years of experience in software engineering.\n- **Cloud Operations:** 3+ years of experience operating cloud-deployed software at scale using config-as-code.\n- **Database Experience:** Experience managing relational databases at scale is a plus, but not required.\n- **Security Mindset:** A strong security mindset and the ability to think carefully about edge cases.\n- **Problem-Solving:** Ability to think through problems thoroughly and articulate tradeoffs.\n\n### Responsibilities\n- Take responsibility for Anrok’s reliability, security, scalability, and performance.\n- Drive technical decisions about our infrastructure.\n- Identify and fix bugs and technical debt.\n- Collaborate with other engineers and management to build high-impact solutions.\n- Example project areas:\n - Improve system observability and provide tools to enable product engineers to own application observability.\n - Scale our database layer through sharding, replication, and splitting out services when it makes sense.\n - Refactor existing systems to take into account present and evolving needs.\n\n### Company Information\n- **Company:** Anrok\n- **Funding:** Over $50M from Sequoia, Index, and Khosla Ventures.\n- **Technologies:** GCP, Pulumi, Postgres, and TypeScript.\n- **Benefits:**\n - Equity upside of an early-stage startup with product-market fit.\n - Daily lunch and snacks (for those working out of the San Francisco office).\n - Medical, dental, and vision insurance (100% covered).\n - One Medical membership covered.\n - Flexible sick benefits.\n - Annual learning and development stipend.","https://jobs.ashbyhq.com/anrok/006ac43b-0a0a-4ccc-9170-0482abed125e","Anrok",{"id":739,"name":737,"urlSafeSlug":737,"logo":740},"580db3e4-d2cf-4bed-89f5-b40652052c2b","dz0fuxtfz4kfvh9s3ssa",[742,743],{"city":286,"region":287,"country":16},{"city":17,"region":17,"country":16},"2025-03-12T17:15:50.478Z","Candidates should have a strong technical background with 5+ years of experience in software engineering and 3+ years of experience operating cloud-deployed software at scale using config-as-code. A security mindset and the ability to think through problems thoroughly are essential, along with the capacity to articulate design tradeoffs and share knowledge with peers. Experience managing relational databases at scale is a plus, but not required.","The Software Engineer, Infrastructure will be responsible for Anrok's reliability, security, scalability, and performance. They will drive technical decisions about the infrastructure, identify and fix bugs and technical debt, and collaborate with other engineers and management to build high-impact solutions. Key project areas include improving system observability, scaling the database layer, and refactoring existing systems to meet evolving needs.",{"employment":748,"compensation":750,"experience":752,"visaSponsorship":755,"location":756,"skills":757,"industries":763},{"type":749},{"id":65,"name":66,"description":17},{"minAnnualSalary":17,"maxAnnualSalary":17,"currency":17,"details":751},"Equity upside of an early-stage startup with product-market fit.",{"experienceLevels":753},[754],{"id":136,"name":137,"description":17},{"type":79},{"type":79},[430,758,481,479,759,760,536,761,762],"Pulumi","Cloud Operations","Config-as-code","Problem-Solving","Database Management",[764,765],{"id":153,"name":154},{"id":766,"name":767},"2c10e840-ed58-446f-8e1c-3880568d941f","Enterprise Software",{"id":769,"title":770,"alternativeTitles":771,"slug":778,"jobPostId":769,"description":779,"isReformated":15,"applyUrl":780,"company":118,"companyOption":781,"locations":782,"listingDate":784,"listingSite":59,"isRemote":15,"requirements":785,"responsibilities":786,"status":18,"expiryDate":17,"isGoogleIndexed":50,"summary":787},"d1a92ce2-1528-4ecb-bf99-d951f9156cf4","SRE / DevOps Engineer",[335,11,159,342,772,773,442,220,774,160,775,776,164,777,347],"Infrastructure as Code Engineer","CI/CD Engineer","Automation Engineer","Observability Engineer","Reliability Engineer","DevOps Automation Engineer","sre-devops-engineer-d1a92ce2-1528-4ecb-bf99-d951f9156cf4","## About Kraken\nKrakenites are a world-class team with crypto conviction, united by our desire to discover and unlock the potential of crypto and blockchain technology.\n\nKraken is a mission-focused company rooted in crypto values. As a Krakenite, you’ll join us on our mission to accelerate the global adoption of crypto, so that everyone can achieve financial freedom and inclusion. For over a decade, Kraken’s focus on our mission and crypto ethos has attracted many of the most talented crypto experts in the world.\n\nAs a fully remote company, we have Krakenites in 70+ countries who speak over 50 languages. Krakenites are industry pioneers who develop premium crypto products for experienced traders, institutions, and newcomers to the space. Kraken is committed to industry-leading security, crypto education, and world-class client support through our products like Kraken Pro, Desktop, Wallet, and Kraken Futures.\n\nBecome a Krakenite and build the future of crypto!\n\n## The Developer-Experience (DX) Team\nKraken’s Developer-Experience (DX) team exists to make writing, shipping, and running software effortless and efficient. We work closely with product-engineering to identify friction in the development lifecycle, and eliminate it through purpose-built tooling, streamlined processes, clear documentation, and data-driven insights.\n\n## The Opportunity\n- Build and support infrastructure and tools, on-prem and in the cloud\n- Drive standardization: Author RFCs and internal guides covering process improvements, reliability patterns, and best practices\n- Support, and guide engineers on SRE related topics\n- Partner with product-development teams to identify, and eliminate friction\n\n## Skills You Should HODL\n- **3+ years** in a DevOps role (DevOps, SRE, etc)\n- **3+ years** experience with a systems programming language (e.g. Rust)\n- Proficient in **Git** source version-control\n- Thorough knowledge of **Docker**, experience with **Terraform**\n- Passion for improving process and products\n- Experience configuring **Continuous Integration (CI)**\n- Ability to thrive while working independently and remotely in a team-based environment\n- Experience with **monitoring / alerting** (primarily with Prometheus / Grafana) and knowledge of best practices in the area\n- **Self-starter**, ability to context-switch between various projects, codebases and concepts\n- Ability to independently debug problems involving the network and operating system\n- Well-versed in **scripting languages** (e.g. Bash and/or Python), building and administration of **Linux**\n- Interest in security and a thoughtful and thorough consideration of the security implications of development decisions\n\n## Job Details\n- **Salary:** $110K - $176K\n- **Location Type:** Remote\n- **Employment Type:** FullTime\n\n## Important Information\n- This job is accepting ongoing applications and there is no application deadline.\n- Applicants are permitted to redact or remove information on their resume that identifies age, date of birth, or dates of attendance at or graduation from an educational institution.\n- We consider qualified applicants with criminal histories for employment on our team, assessing candidates in a manner consistent with the requirements of the San Francisco Fair Chance Ordinance.\n- Kraken is powered by people from around the world and we celebrate all Krakenites for their diverse talents, backgrounds, contributions and unique perspectives. We hire strictly based on merit, meaning we seek out the candidates with the right abilities, knowledge, and skills considered the most suitable for the job. We encourage you to apply for roles where you don't fully meet the listed requirements, especially if you're passionate or knowledgeable about crypto!\n- As an equal opportunity employer, we don’t tolerate discrimination or harassment of any kind. Whether that’s based on race, ethnicity, age, gender identity, etc.","https://jobs.ashbyhq.com/kraken.com/308e2a7c-197b-4ba0-a485-f7a6b00079d3",{"id":120,"name":118,"urlSafeSlug":118,"logo":121},[783],{"city":17,"region":17,"country":16},"2025-08-08T07:16:28.071Z","Candidates should possess at least 3 years of experience in a DevOps or SRE role, with 3+ years of experience in a systems programming language such as Rust. Proficiency in Git, thorough knowledge of Docker, and experience with Terraform are required. Experience configuring Continuous Integration, monitoring/alerting with Prometheus/Grafana, and best practices in these areas are also necessary. Familiarity with scripting languages like Bash and/or Python, Linux administration, and an interest in security are expected. The ability to work independently and remotely in a team environment is crucial.","The SRE/DevOps Engineer will build and support infrastructure and tools, both on-premises and in the cloud. They will drive standardization by authoring RFCs and internal guides for process improvements, reliability patterns, and best practices. The role involves supporting and guiding engineers on SRE-related topics and partnering with product development teams to identify and eliminate friction in the development lifecycle.",{"employment":788,"compensation":790,"experience":791,"visaSponsorship":795,"location":796,"skills":797,"industries":801},{"type":789},{"id":65,"name":66,"description":67},{"minAnnualSalary":131,"maxAnnualSalary":132,"currency":71,"details":17},{"experienceLevels":792},[793,794],{"id":136,"name":137,"description":245},{"id":75,"name":76,"description":77},{"type":79},{"type":79},[478,83,144,193,798,799,146,800,148],"On-prem","Tooling","Best practices",[802,803,804],{"id":91,"name":151},{"id":91,"name":327},{"id":153,"name":154},["Reactive",806],{"$ssite-config":807},{"env":808,"name":809,"url":810},"production","nuxt-app","https://jobo.world/",["Set"],["ShallowReactive",813],{"landing-page-remote-devops-us":-1,"jobs-remote-devops-us-1":-1},"/jobs/remote-devops-us",{}]