ClickHouse

Data Connector Engineer - Integrations

United States

Compensation: Not Specified
Experience Level: Junior (1 to 2 years)
Job Type: Full Time
Visa: Unknown
Industries: Database & Data Management, Data Integration & ETL, Business Intelligence & Analytics

Software Engineer - Data Integrations

Position Overview

ClickHouse is seeking a Software Engineer specializing in data connectors and transformation frameworks to join the Integrations team. This role is responsible for the technical ownership of ClickHouse's critical data pipeline integrations, bridging high-performance database engineering with modern data orchestration. You will build and maintain connectors and adapters that enable data teams to seamlessly incorporate ClickHouse into their existing workflows.

About the Team

The Integrations team is dedicated to connecting ClickHouse with the broader data ecosystem. We develop and maintain connections, ranging from low-level database drivers to high-level data visualization plugins, ensuring ClickHouse integrates with popular tools. Our responsibilities include official language clients (Python, JavaScript, Java, Go, Rust, C++), major data connectors (Kafka, dbt, Spark, Fivetran), and integrations with visualization platforms (Grafana, Tableau, PowerBI, Metabase).

Responsibilities

  • Technical Ownership: Serve as the technical owner for ClickHouse's most critical data pipeline integrations.
  • dbt Adapter Development: Own the development and maintenance of the dbt-clickhouse adapter, enabling data analysts and engineers to leverage dbt's transformation capabilities with ClickHouse.
  • Data Ingestion Connectors: Maintain and enhance data ingestion connectors in the JVM and Go ecosystems, ensuring reliable and performant automated data replication from diverse sources into ClickHouse.
  • Architecture: Architect data infrastructure that supports companies in building real-time analytics platforms and modern data warehouses at scale.
  • Collaboration: Collaborate with commercial partners and the broader data community to ensure integrations represent best practices in the modern data stack.

Requirements

  • Experience: 5+ years of software development experience with a focus on data engineering, ETL/ELT pipelines, or database integrations.
  • Python: Strong Python proficiency with experience in data-focused libraries (Pandas, SQLAlchemy) and familiarity with the Python data ecosystem.
  • dbt: Hands-on experience with dbt, ideally including custom adapter development, advanced dbt modeling patterns, or contributions to dbt packages.
  • Modern Data Stack: Experience with modern data stack tools such as Fivetran, Airbyte, or similar ELT platforms.
  • Java: Solid Java development skills, including understanding of JVM performance tuning and concurrent programming patterns.
  • Database Fundamentals: Strong understanding of database fundamentals, including SQL, data modeling, query optimization, and familiarity with OLAP/analytical databases.
  • Communication: Outstanding written and verbal communication skills for effective collaboration.
  • Open-Source Passion: A passion for open-source development, including community engagement and contributing to the evolution of core systems.

Bonus Points For

  • Familiarity with ClickHouse or similar high-performance data platforms.
  • Prior contributions to open-source projects.
  • Expertise in building sinks or source connectors for big data frameworks.
  • Golang knowledge for high-performance data processing components.

Company Information

About ClickHouse: Established in 2009, ClickHouse is a leader in open-source column-oriented database systems, aiming to be the fastest OLAP database globally. We empower users to generate real-time analytical reports via SQL queries, with a focus on keeping queries fast as data volumes grow. ClickHouse Cloud is used by global enterprises including Lyft, Sony, IBM, GitLab, Twilio, and HubSpot. ClickHouse is available as open-source software or in the cloud on AWS, GCP, Azure, and Alibaba Cloud.

Salary Ranges

  • New York Area / San Francisco Area: $157,000 - $232,000 USD
  • Washington State: $141,300 - $197,200 USD
  • General US Remote: $125,600 - $185,500 USD
  • Los Angeles, CA / Washington, DC: $141,300 - $208,800 USD

Skills

Database Drivers
Data Connectors
Data Visualization
Python
JavaScript
Java
Go
Rust
C++
Kafka
dbt
Spark
Fivetran
Grafana
Tableau
PowerBI
Metabase
High-Performance Database Engineering
Data Orchestration

ClickHouse

High-speed column-oriented database management system

About ClickHouse

ClickHouse provides a high-speed, column-oriented database management system designed for developers and businesses that manage large-scale data. Its primary product processes analytical queries quickly by storing data from the same columns together, making it significantly faster than traditional row-oriented databases, especially in Online Analytical Processing (OLAP) scenarios. ClickHouse stands out from competitors by offering a free, open-source database that can be deployed on local machines or in the cloud, along with a fully managed service on platforms like AWS, GCP, and Microsoft Azure. The company's goal is to deliver a cost-effective solution that simplifies data management for its clients, as evidenced by user feedback highlighting substantial cost savings.

Headquarters: San Francisco, California
Year Founded: 2021
Total Funding: $291.8M
Company Stage: Series B
Industries: Data & Analytics, Enterprise Software
Employees: 201-500

Benefits

Health Insurance
Unlimited Paid Time Off
Flexible Work Hours
Remote Work Options
Stock Options
Home Office Stipend

Risks

Redpanda Serverless poses a competitive threat in real-time data processing.
Integration challenges with PeerDB may delay expected benefits.
Dependency on Supabase could pose operational risks.

Differentiation

ClickHouse's column-oriented design offers superior speed for analytical queries.
The open-source model allows flexible deployment across various environments.
Integration with Grafana enhances data visualization capabilities.

Upsides

Partnership with Alibaba Cloud boosts presence in the Chinese market.
Acquisition of PeerDB enhances real-time analytics capabilities.
Launch of ClickPipes improves data processing efficiency for real-time updates.