Lead Java and Data Integration Engineer
Research InnovationsFull Time
Expert & Leadership (9+ years)
Candidates should possess 5+ years of software development experience with a focus on building and delivering high-quality, data-intensive solutions, strong proficiency in Java and the JVM ecosystem including knowledge of memory management, garbage collection tuning, and performance profiling, solid experience with concurrent programming in Java, and experience developing, extending, or working with connectors, sinks, or sources for at least one big data processing framework such as Apache Spark, Flink, Beam, or Kafka Connect. A strong understanding of database fundamentals, including SQL, data modeling, query optimization, and familiarity with OLAP/analytical databases is required, along with outstanding written and verbal communication skills and a passion for open-source development.
As a Senior Java Engineer, you will serve as a core contributor, owning and maintaining critical parts of ClickHouse’s Data engineering ecosystem, crafting tools that enable Data Engineers to harness ClickHouse’s incredible speed and scale, and owning the full lifecycle of data framework integrations—from the core database driver to SDKs and connectors. You will collaborate closely with the open-source community, internal teams, and enterprise users to ensure JVM integrations set the standard for performance, reliability, and developer experience, directly impacting how companies process massive datasets and contributing to the evolution of the core system through open-source contributions.
High-speed column-oriented database management system
ClickHouse provides a high-speed, column-oriented database management system designed for developers and businesses that manage large-scale data. Its primary product processes analytical queries quickly by storing data from the same columns together, making it significantly faster than traditional row-oriented databases, especially in Online Analytical Processing (OLAP) scenarios. ClickHouse stands out from competitors by offering a free, open-source database that can be deployed on local machines or in the cloud, along with a fully managed service on platforms like AWS, GCP, and Microsoft Azure. The company's goal is to deliver a cost-effective solution that simplifies data management for its clients, as evidenced by user feedback highlighting substantial cost savings.