AI Evaluation PM at Plaid

Singapore

Compensation: Not Specified
Experience Level: Mid-level (3 to 4 years), Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: AI, Technology, Productivity Tools

Requirements

  Minimum Qualifications:
  • Master's degree or above, preferably in Computer Science, Data Science, AI, or related fields
  • 3+ years of product experience, with at least 1 year working on AI or LLM-related products or features
  • Solid analytical skills and familiarity with data processing (SQL/Python preferred)
  • Ability to communicate clearly in English and collaborate across product, engineering, and operations
  • Strong ownership, structured problem solving, and willingness to learn quickly
  • Understanding of English-speaking user scenarios, product expectations, or cultural differences

  Preferred Qualifications:
  • Hands-on experience designing test plans, evaluation standards, or datasets for LLM or AI features
  • Familiarity with evaluation of memory-related behaviors (extraction, retrieval, contextual usage), or prior work on summary/Ask AI products
  • Experience reviewing real user data and translating findings into actionable product improvements

Responsibilities

  • Define how 'AI memory quality' should be measured for the industry. Turn product goals and real user scenarios into measurable evaluation criteria, evaluation frameworks and reproducible test cases
  • Build the global memory evaluation system from the ground up. Create evaluation standards, test pipelines, and datasets across markets (US, EU, Japan, China), working with real multilingual and multimodal data
  • Run continuous quality evaluations across Summary, Ask Plaud and other memory-enabled experiences, combining scalable automated workflows with targeted human checks when precision matters
  • Become the source of truth for 'is memory working?' Your analysis will directly influence product decisions, launch readiness, and go-to-market
  • Benchmark against global AI meeting and knowledge products (Otter, Notion AI, NotebookLM, etc.) to extract best practices in memory accuracy, retrieval reliability, and user trust — and convert these insights into scientific evaluation methodology

Skills

Key technologies and capabilities for this role

AI Evaluation, Product Management, AI-native Productivity, Memory Evaluation, Human-AI Interaction

Questions & Answers

Common questions about this position

Is this role remote or onsite?

The role is onsite in Singapore.

What is the compensation for this position?

The job offers market-competitive compensation.

What are the key responsibilities for this role?

You will define AI memory quality measurements, build global evaluation systems with standards, pipelines, and datasets across markets, run continuous quality evaluations, influence product decisions, and benchmark against competitors.

What is the company culture like at Plaud?

Plaud offers a culture that champions continuous learning and fast career development, with passionate teammates who value innovation, collaboration, and customer success in a vibrant, creativity-fueled work atmosphere.

What makes a strong candidate for this AI Evaluation PM role?

Strong candidates should have the ability to define evaluation frameworks, build test systems from scratch, work with multilingual and multimodal data, and benchmark AI products, as this is the first dedicated owner shaping AI memory evaluation.

Plaid

Connects financial accounts to apps securely

About Plaid

Plaid simplifies financial data management for individuals and businesses by connecting various financial accounts to apps and services. Its main product is a set of APIs that allow developers to integrate financial data into their applications, enabling users to track spending, initiate payments, and access financial services all in one place. Plaid serves a wide range of clients, including app developers and financial institutions, and is used by popular apps like LendingTree and Square. Unlike many competitors, Plaid focuses on providing a comprehensive and scalable platform that supports various financial use cases, such as transactions and identity verification. The company's goal is to enhance the way users interact with their financial data, making it easier and more secure.

Headquarters: San Francisco, California
Year Founded: 2013
Total Funding: $714.3M
Company Stage: Series D
Industries: Fintech, Financial Services
Employees: 1,001-5,000

Benefits

We've got you covered: From medical, life, and 401ks, we’re here to support your physical, mental, and financial wellbeing.
Everyone is an owner: We want everyone to feel ownership over their work - literally, which is why we offer equity to full-time Plaids.
Vacation your way: We want to make sure you have time to meet your personal needs with unlimited PTO and two weeks of synchronous, company-wide vacation.
Grow your skills: Every Plaid is in control of their career development with our learning stipends, tools, and trainings.

Risks

Increased competition from API-based banking solutions like FIS's Code Connect platform.
Potential legal challenges, such as PNC's lawsuit over trademark issues.
Demand for enhanced transparency and security in financial data sharing.

Differentiation

Plaid offers seamless financial data integration through robust APIs for diverse clients.
Plaid's Pay by Bank for Bill Pay provides a cost-effective recurring payment solution.
Plaid's strategic partnerships enhance its value proposition in payroll and payment sectors.

Upsides

Plaid's expansion into the Triangle area indicates growth and increased hiring potential.
Partnership with Dwolla enhances Plaid's presence in the secure payments sector.
Collaboration with Ansa expands market reach through pay-by-bank capabilities for merchants.
