Sr Data Engineer at Illumina

Bengaluru, Karnataka, India

Illumina Logo
Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Biotechnology, HealthcareIndustries

Requirements

  • 5+ years of experience as a Data Engineer Developer
  • Python
  • Spark
  • SQL
  • Snowflake
  • dbt (Data Build Tool)
  • Strong understanding of data warehousing concepts
  • Strong domain expertise in operations organizations, particularly in supply chain and manufacturing functions

Responsibilities

  • Lead the design, development, and optimization of data pipelines, ETL processes, and data integration solutions using Python, Spark, SQL, Snowflake, dbt, and other relevant technologies
  • Apply strong domain expertise in operations organizations, particularly in functions like supply chain and manufacturing, to understand data requirements and deliver tailored solutions
  • Utilize big data processing frameworks such as Apache Spark to process and analyze large volumes of operational data efficiently
  • Implement data transformations, aggregations, and business logic to support analytics, reporting, and operational decision-making
  • Leverage cloud-based data platforms such as Snowflake to store and manage structured and semi-structured operational data at scale
  • Utilize dbt (Data Build Tool) for data modeling, transformation, and documentation to ensure data consistency, quality, and integrity
  • Monitor and optimize data pipelines and ETL processes for performance, scalability, and reliability in operations contexts
  • Conduct data profiling, cleansing, and validation to ensure data quality and integrity across different operational data sets
  • Collaborate closely with cross-functional teams, including operations stakeholders, data scientists, and business analysts, to understand operational challenges and deliver actionable insights
  • Stay updated on emerging technologies and best practices in data engineering and operations management, contributing to continuous improvement and innovation within the organization

Skills

Python
Spark
SQL
Snowflake
dbt
data warehousing

Illumina

Supports genomics startups through funding and resources

About Illumina

Illumina focuses on fostering innovation in the genomics industry by supporting startups through its Illumina Accelerator program. This program helps entrepreneurs create, launch, and grow genomics-focused companies by providing funding and resources. The accelerator operates in two main locations: the San Francisco Bay Area and Cambridge, UK. Illumina Accelerator has successfully invested in 68 genomics startups, which have collectively raised over $1 billion in venture capital. What sets Illumina apart from its competitors is its strong partnership with leading venture capital investors and its dedicated focus on the genomics sector. The goal of Illumina is to build a thriving ecosystem for genomics innovation, enabling new companies to emerge and advance the field.

San Diego, CaliforniaHeadquarters
1998Year Founded
$27.2MTotal Funding
IPOCompany Stage
Venture Capital, BiotechnologyIndustries
5,001-10,000Employees

Risks

Over-reliance on NVIDIA's AI technology may limit flexibility in AI solution adoption.
Standardizing proteomics data across platforms could challenge Illumina's data reliability.
Single-flow-cell NovaSeq X System might cannibalize sales of higher-end models.

Differentiation

Illumina leads in genomic sequencing with advanced AI integration and multiomic data analysis.
The company offers innovative array-based solutions for DNA, RNA, and protein analysis.
Illumina's global expansion includes a new Global Capability Center in Bengaluru.

Upsides

Collaboration with NVIDIA enhances drug discovery and clinical development through AI integration.
Pilot proteomics program with UK Biobank aims to generate crucial reference datasets.
Single-cell sequencing kits make high-throughput sequencing accessible to smaller labs.

Land your dream remote job 3x faster with AI