Data Engineer
SweedFull Time
Mid-level (3 to 4 years), Senior (5 to 8 years)
Candidates should possess a Bachelor's or Master's degree in Computer Science or a related field, with at least 5 years of experience in Reliability Engineering, QA, or customer-facing engineering roles. Prior experience operating ClickHouse or other SQL databases in production is required, along with a strong understanding of distributed database internals and SQL, with ClickHouse experience being a significant advantage. Proficiency in scripting languages such as Shell or Python, the ability to read C++ code, and knowledge of cloud computing platforms like AWS, Azure, or Google Cloud Platform are essential. The ideal candidate is a strong problem-solver with excellent production debugging skills, thrives in a fast-paced global team environment, and demonstrates a high degree of responsibility, ownership, and accountability.
The Database Reliability Engineer will be responsible for building and leading processes to enhance the reliability, availability, scalability, and performance of ClickHouse core. This includes continuously improving ClickHouse's reliability and performance, creating and refining metrics and alerts to proactively identify and prevent production issues, and investigating common customer-encountered problems to identify root causes and submit bug fixes or suggest improvements. The role involves enhancing incident response processes and post-mortem analysis, planning and driving Chaos initiatives, managing on-call processes for performance and reliability issues, and establishing best practices for escalation to minimize customer impact. Additionally, the engineer will collaborate with various teams, including Control Plane, Dataplane, Security, Support, and Operations, to guide them in implementing ClickHouse effectively for customers.
High-speed column-oriented database management system
ClickHouse provides a high-speed, column-oriented database management system designed for developers and businesses that manage large-scale data. Its primary product processes analytical queries quickly by storing data from the same columns together, making it significantly faster than traditional row-oriented databases, especially in Online Analytical Processing (OLAP) scenarios. ClickHouse stands out from competitors by offering a free, open-source database that can be deployed on local machines or in the cloud, along with a fully managed service on platforms like AWS, GCP, and Microsoft Azure. The company's goal is to deliver a cost-effective solution that simplifies data management for its clients, as evidenced by user feedback highlighting substantial cost savings.