Imbue

Data Engineer

San Francisco, California, United States

Not SpecifiedCompensation
Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Artificial Intelligence, AI & Machine Learning, BiotechnologyIndustries

Data Engineer

Employment Type: Full-Time


Position Overview

We are a small, cross-functional team focused on building AI systems that reason and code. We care deeply about understanding how people interact with these systems and how we can use data to make them safer, smarter, and more useful.

We're looking for a Data Engineer to build and own the pipelines and data infrastructure that power our product and research efforts. Your work will directly support model training, evaluation, product analytics, and safety systems. You’ll partner closely with team members building our coding agents to make sure we’re capturing the right signals and using them well.

If you’re excited about turning messy product data into actionable insights, and building systems that can scale with our research, we’d love to get connected!


Example Projects

  • Combine synthetic data generation with human annotation platforms to produce high-quality data that advances our product and research roadmap.
  • Design and build resilient, scalable pipelines (ETL and ELT) for batch and streaming data.
  • Develop and maintain infrastructure to support self-serve analytics, experimentation, and dataset generation. Prototype, evaluate, and make “build vs buy” decisions.
  • Help define and improve data modeling practices across the company, including instrumentation standards, dimensional modeling for analytics and feature stores for machine learning (ML).
  • Build integrations with ML infrastructure to support training pipelines, inference logging, and model monitoring (MLOps).
  • Debug pipeline failures, automate deployment processes, and improve data quality and reusability.

Requirements

  • A strong software engineer with 5+ years of experience, ideally working with large-scale data systems.
  • Experienced in designing and maintaining data pipelines and infrastructure, especially for analytics, experimentation, and ML.
  • Comfortable with tools for data orchestration (Airflow, Prefect), batch or streaming processing (Spark, Ray, Flink), and event tracking and analytics (Amplitude, PostHog).
  • Experienced with cloud-based infrastructure and storage (e.g., S3, GCP, Snowflake, or Redshift), and thoughtful about cost-performance tradeoffs.
  • Exposure to MLOps, model serving infrastructure, or ML workflows.
  • Pragmatic and principled! You know when to optimize and when to ship.

Compensation and Benefits

  • Work directly on creating software with human-like intelligence.
  • Generous compensation, equity, and benefits.
  • Budget for self-improvement: coaching, courses, conferences, etc.
  • Actively co-create and participate in a positive, intentional team culture.
  • Spend time learning, reading papers, and deeply understanding prior work.
  • Frequent team events, dinners, off-sites, and hanging out.

Compensation Range:

  • Cash: $170,000–$350,000
  • Equity: $10,000–$2,000,000

Note: Compensation packages are highly variable based on a variety of factors. If your salary requirements fall outside of the stated range, we still encourage you to apply.


How to Apply

All submissions are reviewed by a person, so we encourage you to include notes on why you're interested in working with us. If you have any other work that you can showcase (open source code, side projects, etc.), certainly include it! We know that talent comes from many backgrounds, and we aim to build a team with diverse skillsets that spike strongly in different areas.


About Us

Imbue builds AI systems that reason and code, enabling AI agents to accomplish larger goals and safely work in the real world. We train our own foundation models optimized for reasoning and prototype agents on top of these models. By using these agents extensively, we gain insights into improving both the capabilities of the underlying models and the interaction design for agents.

We aim to rekindle the dream of the personal computer, where computers become truly intelligent tools that empower us, giving us freedom, dignity, and agency to pursue the things we love.

Skills

Data Engineering
ETL
ELT
Data Pipelines
Data Infrastructure
Analytics
Experimentation
Machine Learning (ML)
MLOps
Airflow
Prefect
Spark
Ray
Data Orchestration
Batch Processing
Streaming Processing
Data Modeling
Instrumentation
Dimensional Modeling
Feature Stores
Model Training
Model Evaluation
Product Analytics
Safety Systems
Software Engineering

Imbue

Digital fitness platform with influencer-led workouts

About Imbue

Imbue Fitness operates in the digital fitness market by connecting users with their favorite fitness influencers through a unique platform. Users can access exclusive workout classes anytime and anywhere, which allows for a personalized and engaging fitness experience. The platform primarily targets fitness enthusiasts who want a more intimate and interactive way to follow their workout routines. Imbue Fitness offers a flexible payment model, allowing users to choose between a pay-per-class system or a subscription for weekly classes. This approach generates revenue through class fees and subscriptions, benefiting both the company and the influencers. By utilizing the popularity and expertise of well-known fitness influencers, Imbue Fitness provides high-quality, influencer-led fitness content that stands out in the crowded digital fitness space.

Sunnyvale, CaliforniaHeadquarters
1989Year Founded
$237.1MTotal Funding
SERIES_BCompany Stage
Consumer Software, Healthcare, Consumer GoodsIndustries
1-10Employees

Benefits

Work directly on creating software with human-like intelligence
Very generous compensation
Flexible working hours
Work remotely
Time and budget for learning and self improvement

Risks

Increased competition from AI recruitment platforms threatens Sourceress's market share.
Free or lower-cost fitness platforms challenge Imbue Fitness's revenue model.
Fitness influencers launching own apps may reduce collaboration with Imbue Fitness.

Differentiation

Imbue Fitness connects users with favorite fitness influencers for personalized experiences.
Sourceress uses AI to create highly personalized candidate introductions for recruitment.
Imbue Fitness offers a pay-per-class or subscription model for flexible user payments.

Upsides

AI-driven personalized fitness recommendations are gaining traction, benefiting Imbue Fitness.
Collaborations with wearable tech companies enhance real-time tracking for Imbue Fitness users.
Rise of micro-influencers offers niche content opportunities for Imbue Fitness.

Land your dream remote job 3x faster with AI