H1

Sr. Data Engineer

India

Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Healthcare, HealthTech, Data & AnalyticsIndustries

Position Overview

  • Location Type:
  • Job Type: Full time
  • Salary:

At H1, we believe access to the best healthcare information is a basic human right. Our mission is to provide a platform that can optimally inform every doctor interaction globally. This promotes health equity and builds needed trust in healthcare systems. To accomplish this, our teams harness the power of data and AI technology to unlock groundbreaking medical insights and convert those insights into action that result in optimal patient outcomes and accelerates an equitable and inclusive drug development lifecycle. Visit h1.co to learn more about us.

Data Engineering is responsible for the development and delivery of our most important asset—our data. With thousands of data sources from around the world, the team ensures that data is accurate, normalized, and delivered at a velocity that keeps up with real-world changes. As we expand our markets and the scope of data we provide to our customers, our team must scale to meet that demand.

What You'll Do at H1

We’re looking for a seasoned Senior Data Engineer who is operating at a high level and is either ready or nearly ready to step into a Staff-level individual contributor role. You will take ownership of designing and scaling the systems and pipelines that power H1’s data platform. You will work cross-functionally with other engineers, product managers, and stakeholders to deliver high-performance, reliable, and maintainable data solutions. This is an opportunity to play a key role in shaping the future of our data infrastructure while mentoring others and driving best practices.

  • Design, develop, and maintain scalable data extraction frameworks that ingest structured and unstructured data from diverse sources.
  • Build and optimize robust ETL/ELT pipelines using big data technologies, especially Apache Spark on cloud platforms (preferably AWS EMR).
  • Improve the efficiency, reliability, and performance of data processing systems through thoughtful design and continuous optimization.
  • Transform, clean, and normalize complex datasets for downstream use, ensuring high standards of data quality and consistency.
  • Partner with senior engineers to evolve H1’s data architecture and infrastructure in support of product and platform scalability.
  • Lead data integration efforts across multiple systems, ensuring accuracy and seamless collaboration across teams.
  • Monitor and troubleshoot data flows and pipelines, proactively identifying and resolving performance issues.
  • Maintain clear documentation of systems, workflows, and processes to promote transparency and operational excellence.
  • Participate in code reviews and promote a culture of engineering excellence, mentorship, and continuous improvement.
  • Collaborate closely with cross-functional teams to align technical execution with business goals.

About You

You are a seasoned data engineer with a track record of building and maintaining large-scale data systems. You’re excited by the opportunity to work on complex problems, enjoy collaborative work, and are passionate about building high-quality, performant solutions that impact real-world healthcare outcomes.

  • You have an understanding of Large Language Models (LLMs) and their applications.
  • It’s a bonus if you’re familiar with model training and fine-tuning, particularly in NLP (Natural Language Processing) contexts.
  • You possess a basic knowledge of network, security, and encryption protocols such as HTTP/HTTPS/TLS.
  • You’re able to work collaboratively across teams and communicate effectively with both technical and non-technical stakeholders.
  • You have strong analytical and problem-solving skills with a focus on data quality and performance optimization.
  • You have a passion for writing clean, efficient code and following best practices.

Requirements

  • 6+ years of experience in data engineering, working with large-scale data systems and pipelines.
  • Proficiency in programming languages like Python, Java, or similar.

Skills

Data Engineering
Data Pipelines
ETL/ELT
Apache Spark
Cloud Platforms
AWS EMR
Data Modeling
Big Data Technologies
Data Infrastructure
Data Normalization
Data Extraction

H1

Healthcare data analytics and research solutions

About H1

H1.co operates in the healthcare technology sector, focusing on connecting healthcare professionals with research and insights. The company provides data-driven solutions that help clients, including healthcare professionals, life sciences companies, payors, and patients, make informed decisions. H1.co's products utilize healthcare data to offer insights for various applications such as clinical trial design and market access. Unlike many competitors, H1.co emphasizes the democratization of healthcare data and offers a free platform, H1 Connect, for accessing expert insights. The company's goal is to enhance the efficiency of healthcare delivery and research by providing accurate and comprehensive data.

New York City, New YorkHeadquarters
2017Year Founded
$181.9MTotal Funding
SERIES_CCompany Stage
Data & Analytics, Biotechnology, HealthcareIndustries
201-500Employees

Risks

Integration of Ribbon Health may lead to data inconsistencies.
AI-driven platforms like GenosAI Pro risk biases in trial designs.
Global expansion through H1 Connect may face regulatory hurdles.

Differentiation

H1 offers a comprehensive healthcare platform connecting diverse stakeholders in real-time.
The company provides AI-enabled data analytics for strategic insights in life sciences.
H1's Trial Landscape platform enhances clinical trial diversity and efficiency.

Upsides

H1's acquisition of Ribbon Health strengthens its data management capabilities.
The partnership with CTI advances diversity and efficiency in clinical trials.
H1's inclusion in the New York Digital Health 100 highlights its innovation.

Land your dream remote job 3x faster with AI