Plaid

Data Engineer - Data Engineering

San Francisco, California, United States

Compensation: $130,000 – $180,000
Experience Level: Mid-level (3 to 4 years), Senior (5 to 8 years)
Job Type: Internship
Visa: Unknown
Industries: Data Engineering, Software Development

Data Engineer

Position Overview

  • Location Type: Not specified
  • Employment Type: Full-time
  • Salary: Not specified

The main goal of the DE team in 2024-25 is to build robust golden datasets that power our business goal of creating more insights-based products. Making data-driven decisions is key to Plaid's culture. To support that, we need to scale our data systems while keeping our data correct and complete. We provide tooling and guidance to teams across engineering, product, and business, helping them explore our data quickly and safely to get the insights they need, which ultimately helps Plaid serve our customers more effectively.

Data Engineers heavily leverage SQL and Python to build data workflows. We use tools like DBT, Airflow, Redshift, ElasticSearch, Atlanta, and Retool to orchestrate data pipelines and define workflows. We work with engineers, product managers, business intelligence, data analysts, and many other teams to build Plaid's data strategy and a data-first mindset.

Our engineering culture is IC-driven -- we favor bottom-up ideation and empowerment of our incredibly talented team. We are looking for engineers who are motivated by creating impact for our consumers and customers, growing together as a team, shipping the MVP, and leaving things better than we found them.

You will be in a high-impact role that directly enables business leaders to make faster and more informed business judgements based on the datasets you build. You will have the opportunity to carve out the ownership and scope of internal datasets and visualizations across Plaid, a currently unowned area that we intend to take over and build SLAs on. You will have the opportunity to learn best practices and up-level your technical skills from our strong DE team and from the broader Data Platform team. You will build strong cross-functional partnerships with teams across Plaid, from Engineering to Product to Marketing and Finance.

Requirements

  • 2+ years of dedicated data engineering experience, solving complex data pipeline issues at scale.
  • Experience building data models and data pipelines on top of large datasets (on the order of 500TB to petabytes).
  • Values SQL as a flexible and extensible tool and is comfortable with modern SQL data orchestration tools like DBT, Mode, and Airflow.

Responsibilities

  • Understanding different aspects of the Plaid product and strategy to inform golden dataset choices, design and data usage principles.
  • Having data quality and performance top of mind while designing datasets.
  • Advocating for adopting industry tools and practices at the right time.
  • Owning core SQL and Python data pipelines that power our data lake and data warehouse.
  • Maintaining well-documented data with defined dataset quality, uptime, and usefulness.

Nice to Have

  • Experience working with different performant warehouses and data lakes, such as Redshift, Snowflake, and Databricks.
  • Experience building and maintaining batch and real-time pipelines using technologies.

Skills

SQL
Python
DBT
Mode
Airflow
Redshift
Snowflake
Databricks
Data Modeling
Data Pipelines
Data Warehousing
Data Lakes
Golden Dataset
Data Quality

Plaid

Connects financial accounts to apps securely

About Plaid

Plaid simplifies financial data management for individuals and businesses by connecting various financial accounts to apps and services. Its main product is a set of APIs that allow developers to integrate financial data into their applications, enabling users to track spending, initiate payments, and access financial services all in one place. Plaid serves a wide range of clients, including app developers and financial institutions, and is used by popular apps like LendingTree and Square. Unlike many competitors, Plaid focuses on providing a comprehensive and scalable platform that supports various financial use cases, such as transactions and identity verification. The company's goal is to enhance the way users interact with their financial data, making it easier and more secure.

Headquarters: San Francisco, California
Year Founded: 2013
Total Funding: $714.3M
Company Stage: Series D
Industries: Fintech, Financial Services
Employees: 1,001-5,000

Benefits

We've got you covered: From medical and life insurance to 401(k) plans, we’re here to support your physical, mental, and financial wellbeing.
Everyone is an owner: We want everyone to feel ownership over their work - literally, which is why we offer equity to full-time Plaids.
Vacation your way: We want to make sure you have time to meet your personal needs with unlimited PTO and two weeks of synchronous, company-wide vacation.
Grow your skills: Every Plaid is in control of their career development with our learning stipends, tools, and trainings.

Risks

Increased competition from API-based banking solutions like FIS's Code Connect platform.
Potential legal challenges, such as PNC's lawsuit over trademark issues.
Demand for enhanced transparency and security in financial data sharing.

Differentiation

Plaid offers seamless financial data integration through robust APIs for diverse clients.
Plaid's Pay by Bank for Bill Pay provides a cost-effective recurring payment solution.
Plaid's strategic partnerships enhance its value proposition in payroll and payment sectors.

Upsides

Plaid's expansion into the Triangle area indicates growth and increased hiring potential.
Partnership with Dwolla enhances Plaid's presence in the secure payments sector.
Collaboration with Ansa expands market reach through pay-by-bank capabilities for merchants.
