Staff Data Engineer at Illumina

Bengaluru, Karnataka, India

Illumina Logo
Not SpecifiedCompensation
Senior (5 to 8 years), Expert & Leadership (9+ years)Experience Level
Full TimeJob Type
UnknownVisa
Biotechnology, ManufacturingIndustries

Requirements

  • 10+ years of experience as a Data Engineer
  • Python
  • Spark
  • SQL
  • Snowflake
  • dbt (Data Build Tool)
  • Machine Learning
  • Deep understanding of operations organizations, particularly in supply chain, product lifecycle management, and manufacturing functions
  • Experience with SAP Hana (a plus)
  • Experience with Teamcenter applications (a plus)

Responsibilities

  • Lead the design, development, and optimization of complex data solutions
  • Utilize big data processing frameworks like Apache Spark to efficiently process large volumes of operational data
  • Implement advanced data modeling, machine learning algorithms, and predictive analytics to derive actionable insights
  • Leverage cloud-based data platforms such as Snowflake for data storage, management, and analysis
  • Utilize dbt for data modeling, transformation, and documentation to ensure data quality and integrity
  • Monitor and optimize complex data pipelines, ETL processes, and machine learning models
  • Conduct data profiling, cleansing, and validation to ensure data quality
  • Collaborate with cross-functional teams (operations stakeholders, data scientists, business analysts, and IT teams) to understand operational challenges and deliver insights

Skills

Python
Spark
SQL
Snowflake
dbt
Machine Learning

Illumina

Supports genomics startups through funding and resources

About Illumina

Illumina focuses on fostering innovation in the genomics industry by supporting startups through its Illumina Accelerator program. This program helps entrepreneurs create, launch, and grow genomics-focused companies by providing funding and resources. The accelerator operates in two main locations: the San Francisco Bay Area and Cambridge, UK. Illumina Accelerator has successfully invested in 68 genomics startups, which have collectively raised over $1 billion in venture capital. What sets Illumina apart from its competitors is its strong partnership with leading venture capital investors and its dedicated focus on the genomics sector. The goal of Illumina is to build a thriving ecosystem for genomics innovation, enabling new companies to emerge and advance the field.

San Diego, CaliforniaHeadquarters
1998Year Founded
$27.2MTotal Funding
IPOCompany Stage
Venture Capital, BiotechnologyIndustries
5,001-10,000Employees

Risks

Over-reliance on NVIDIA's AI technology may limit flexibility in AI solution adoption.
Standardizing proteomics data across platforms could challenge Illumina's data reliability.
Single-flow-cell NovaSeq X System might cannibalize sales of higher-end models.

Differentiation

Illumina leads in genomic sequencing with advanced AI integration and multiomic data analysis.
The company offers innovative array-based solutions for DNA, RNA, and protein analysis.
Illumina's global expansion includes a new Global Capability Center in Bengaluru.

Upsides

Collaboration with NVIDIA enhances drug discovery and clinical development through AI integration.
Pilot proteomics program with UK Biobank aims to generate crucial reference datasets.
Single-cell sequencing kits make high-throughput sequencing accessible to smaller labs.

Land your dream remote job 3x faster with AI