Senior Site Reliability Engineer
Chainlink Labs
Full Time
Senior (5 to 8 years)
Candidates should have at least 6 years of experience in the software development industry, with a focus on building and operating scalable, fault-tolerant distributed systems. Proficiency in Go, C/C++, Java, or a similar programming language is required, along with experience in concurrency, multithreading, and deploying distributed system architectures. Experience developing cloud infrastructure services, preferably with Kubernetes, and expertise with a public cloud provider (AWS, GCP, or Azure) and its IaaS offerings are essential. Strong problem-solving abilities, solid production debugging skills, excellent communication, and the capacity to lead large technical projects collaboratively are also necessary. Key attributes include a passion for efficiency, availability, scalability, and data governance, and the ability to thrive in a fast-paced environment with a high degree of ownership and accountability.
The role involves benchmarking and analyzing system and database performance and optimizing capacity. Responsibilities include troubleshooting and debugging application and server errors, recommending configuration tuning to resolve performance bottlenecks, and collaborating with core development, cloud, and security teams to enhance ClickHouse Cloud performance. The engineer will also plan, enable, and drive chaos engineering initiatives, develop and manage tools for chaos experiments, study software resilience and operational challenges, and extend the backend to support chaos engineering techniques.
High-speed column-oriented database management system
ClickHouse provides a high-speed, column-oriented database management system designed for developers and businesses that manage large-scale data. Its primary product processes analytical queries quickly by storing values from the same column together, which makes it significantly faster than traditional row-oriented databases, especially in Online Analytical Processing (OLAP) scenarios. ClickHouse stands out from competitors by offering a free, open-source database that can be deployed on local machines or in the cloud, along with a fully managed service on platforms like AWS, GCP, and Microsoft Azure. The company's goal is to deliver a cost-effective solution that simplifies data management for its clients, as evidenced by user feedback highlighting substantial cost savings.