Staff Data Engineer at Starburst

Boston, Massachusetts, United States

Starburst Logo
Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Technology, Analytics, AIIndustries

Requirements

  • At least 7 years of data engineering experience, and a clear passion for data and analytics
  • Enthusiasm for working both independently and collaboratively with strong, diverse, high-performing teams to get value and insights from data
  • Experience building and optimizing data pipelines using Trino, Spark, dbt, and related frameworks
  • Experience managing data infrastructure in public clouds
  • Experience using and managing orchestration frameworks such as Apache Airflow or Dagster
  • Knowledge of RAG and other design patterns for AI applications
  • Fluency in SQL
  • Experience building API integrations for extracting data from third party sources
  • Excellent coding ability in Java, Python or Scala
  • Knowledge of data modelling techniques which are appropriate for modern data lakes
  • Experience with a variety of AWS services such as EMR, EC2, S3, and IAM. Multi-cloud experience (GCP/Azure) is also nice to have
  • Able to use Configuration-as-Code and Infrastructure-as-Code tools such as Pulumi, Terraform, and/or Ansible
  • Demonstrable experience in delivering value and hitting deadlines consistently
  • Has disciplined software engineering practices, including high code quality, extensive automated testing, and rigorous code review
  • Highly proficient in both written and verbal communication, coupled with strong organizational abilities

Responsibilities

  • Build and manage a high quality data lake to support various aspects of Starburst’s business, including product management, finance, customer support, and engineering
  • Find innovative ways to use Trino and Starburst to solve data management challenges
  • Collaborate with technical leads, product managers and data analysts to build robust data products and analytics
  • Leverage AI to democratize access to datasets for users throughout Starburst
  • Enable dataset preparation and model evaluation for Starburst’s AI projects
  • Define and adapt data engineering processes and best practices to focus on execution and getting reliable answers to important business questions
  • Work closely with leaders from other teams and departments to iterate on both data architecture and design of data solutions, focusing on high-quality results accessible at several levels
  • Envision innovative approaches to data management and work with Starburst’s product teams to bring those innovations to market

Skills

Trino
Starburst
Data Lake
AI
Data Analytics
Telemetry
Data Engineering
Analytics

Starburst

Data analytics and SQL engine distribution

About Starburst

Starburst specializes in data analytics by providing a distribution and support for the Trino SQL engine, which is designed for efficient and scalable analytics on data lakes and various data sources. Their products, Starburst Galaxy and Starburst Enterprise, allow clients to access and analyze data quickly, whether in the cloud or on-premises. Starburst connects seamlessly with popular data visualization tools like Tableau, Power BI, and Looker, making it easier for users to integrate and access their data. What sets Starburst apart from competitors is its enhancement of the open-source Trino engine with additional connectors, security features, and dedicated enterprise support. The company's goal is to help organizations achieve faster data insights and better decision-making through improved analytics capabilities.

Boston, MassachusettsHeadquarters
2017Year Founded
$402.7MTotal Funding
SERIES_DCompany Stage
Data & Analytics, Enterprise SoftwareIndustries
501-1,000Employees

Benefits

Competitive salary & attractive stock grants
Remote-friendly work options
Quality & affordable insurance
Flexible & generous paid time off
Environment of transparency, honesty & respect

Risks

Increased competition from companies like Dell could impact Starburst's market share.
The rapid growth of unique data vendors may lead to increased market complexity.
Enterprises moving towards single-cloud strategies could challenge Starburst's multi-cloud offerings.

Differentiation

Starburst offers both cloud-based and on-premises solutions, catering to diverse client needs.
The company enhances the open-source Trino engine with additional connectors and security features.
Starburst's platform integrates with popular data tools like Tableau, Power BI, and Looker.

Upsides

Starburst Galaxy achieved 3x year-over-year growth in active customers and usage volume.
The platform enables 10X faster data processing and 66% cost reduction for clients like Arity.
Starburst's Icehouse platform leverages open-source Trino and Apache Iceberg for scalability.

Land your dream remote job 3x faster with AI