Software Engineer, Data Ingestion & Transformation at Starburst

Boston, Massachusetts, United States

Starburst Logo
Not SpecifiedCompensation
Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Technology, Data Analytics, AIIndustries

Requirements

  • 3+ years of experience developing software
  • Prior experience developing distributed systems
  • Extensive software development experience with Java (experience with other systems programming languages like Rust, C++, Go, etc., can be considered)
  • Demonstrated experience with software engineering and design best practices
  • Prior experience with software development using Trino, Apache Iceberg, Apache Kafka, or cloud object storage (huge plus)
  • Demonstration of ownership, grit, and bias for action
  • Ability to travel occasionally for onboarding, offsites, customer engagements, and company events
  • Based in Boston office with hybrid model (onsite 2-3 days per week)

Responsibilities

  • Design, develop, and operate systems and features relating to data ingestion and transformation (building on systems with proven ingestion up to 100GB/second)
  • Work cross-functionally to ensure the best experience for customers
  • Build and implement features for creating and operating data lakes based on Apache Iceberg, including streaming ingestion from Apache Kafka and Kafka-compatible systems, file ingestion from cloud object storage (e.g., Amazon S3), data transformations, and automated scalable data maintenance
  • Provide considerate and timely review of peers' design proposals and pull requests
  • Help build a highly effective culture across Starburst and the team

Skills

Trino
Apache Iceberg
Apache Kafka
Amazon S3
Data Ingestion
Data Transformation
Distributed Systems
Cloud Object Storage
Streaming Ingestion
Scalable Systems

Starburst

Data analytics and SQL engine distribution

About Starburst

Starburst specializes in data analytics by providing a distribution and support for the Trino SQL engine, which is designed for efficient and scalable analytics on data lakes and various data sources. Their products, Starburst Galaxy and Starburst Enterprise, allow clients to access and analyze data quickly, whether in the cloud or on-premises. Starburst connects seamlessly with popular data visualization tools like Tableau, Power BI, and Looker, making it easier for users to integrate and access their data. What sets Starburst apart from competitors is its enhancement of the open-source Trino engine with additional connectors, security features, and dedicated enterprise support. The company's goal is to help organizations achieve faster data insights and better decision-making through improved analytics capabilities.

Boston, MassachusettsHeadquarters
2017Year Founded
$402.7MTotal Funding
SERIES_DCompany Stage
Data & Analytics, Enterprise SoftwareIndustries
501-1,000Employees

Benefits

Competitive salary & attractive stock grants
Remote-friendly work options
Quality & affordable insurance
Flexible & generous paid time off
Environment of transparency, honesty & respect

Risks

Increased competition from companies like Dell could impact Starburst's market share.
The rapid growth of unique data vendors may lead to increased market complexity.
Enterprises moving towards single-cloud strategies could challenge Starburst's multi-cloud offerings.

Differentiation

Starburst offers both cloud-based and on-premises solutions, catering to diverse client needs.
The company enhances the open-source Trino engine with additional connectors and security features.
Starburst's platform integrates with popular data tools like Tableau, Power BI, and Looker.

Upsides

Starburst Galaxy achieved 3x year-over-year growth in active customers and usage volume.
The platform enables 10X faster data processing and 66% cost reduction for clients like Arity.
Starburst's Icehouse platform leverages open-source Trino and Apache Iceberg for scalability.

Land your dream remote job 3x faster with AI