Deepgram

Research Program Manager

Remote

$150,000 – $220,000Compensation
Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
AI & Machine Learning, Data & AnalyticsIndustries

Position Overview

  • Location Type: Remote
  • Job Type: FullTime
  • Salary: $150K - $220K

Deepgram is the leading voice AI platform for developers building speech-to-text (STT), text-to-speech (TTS) and full speech-to-speech (STS) offerings. 200,000+ developers build with Deepgram’s voice-native foundational models – accessed through APIs or as self-managed software – due to our unmatched accuracy, latency and pricing. Customers include software companies building voice products, co-sell partners working with large enterprises, and enterprises solving internal voice AI use cases. The company ended 2024 cash-flow positive with 400+ enterprise customers, 3.3x annual usage growth across the past 4 years, over 50,000 years of audio processed and over 1 trillion words transcribed. There is no organization in the world that understands voice better than Deepgram.

The Opportunity

Voice AI stands at the cusp of a paradigm shift. Current approaches to voice interaction are fundamentally limited by the scarcity and diversity of audio data, combined with the prohibitive computational costs of processing high-dimensional audio at scale. These challenges have created a gap between the promise of universal voice interaction and today's reality. While our research scientists pioneer new approaches in Latent Space Models (LSMs) to solve these fundamental challenges, we need exceptional program leadership to transform these breakthroughs into real-world impact. The complexity of our mission—spanning audio compression, generative modeling, and multimodal systems—demands strategic orchestration of multiple research workstreams, careful resource allocation, and relentless focus on outcomes that advance our vision of making voice interaction universally accessible.

The Role

You will lead the strategic execution of our Voice AI Foundations research program, working at the intersection of cutting-edge research and practical implementation. Your mission is to accelerate our path to breakthrough voice AI capabilities by:

  • Architecting and managing a portfolio of research initiatives across neural audio codecs, generative models, and multimodal speech systems
  • Developing strategic roadmaps that align research objectives with company goals, identifying dependencies and critical paths while maintaining flexibility for scientific exploration
  • Orchestrating collaboration between research teams, ensuring efficient knowledge sharing and preventing duplicate efforts across related projects
  • Building and maintaining research infrastructure that enables rapid experimentation, including data pipelines, evaluation frameworks, and deployment systems
  • Creating processes that balance our researchers' need for creative freedom with the imperative to validate ideas quickly and scale successful approaches
  • Establishing clear success metrics and experimental frameworks that help teams rapidly iterate while maintaining scientific rigor
  • Managing relationships with external partners, from academic collaborators

Requirements

  • No specific requirements listed in the provided text.

Responsibilities

  • No specific responsibilities listed in the provided text.

Skills

Research Program Management
Strategic Execution
Resource Allocation
Audio Compression
Generative Modeling
Multimodal Systems
Latent Space Models (LSMs)
Voice AI

Deepgram

Speech recognition APIs for audio transcription

About Deepgram

Deepgram specializes in artificial intelligence for speech recognition, offering a set of APIs that developers can use to transcribe and understand audio content. Their technology allows clients, ranging from startups to large organizations like NASA, to process millions of audio minutes daily. Deepgram's APIs are designed to be fast, accurate, scalable, and cost-effective, making them suitable for businesses needing to handle large volumes of audio data. The company operates on a pay-per-use model, where clients are charged based on the amount of audio they transcribe, allowing Deepgram to grow its revenue alongside client usage. With a focus on the high-growth market of speech recognition, Deepgram is positioned for future success.

San Francisco, CaliforniaHeadquarters
2015Year Founded
$100.5MTotal Funding
SERIES_BCompany Stage
Data & Analytics, AI & Machine LearningIndustries
51-200Employees

Benefits

Comprehensive Health Plans
FSA Health Matching up to $1,000
Work from Home Ergonomic Stipend
Healthy Food & Snacks in offices
Community Groups
Unlimited Vacation

Risks

Increased competition from open-source solutions like OpenAI's Whisper threatens market share.
Recent layoffs suggest potential financial instability or strategic restructuring challenges.
Integration of Poised may cause disruptions in service or product development.

Differentiation

Deepgram's APIs offer fast, accurate, and scalable speech recognition solutions.
The acquisition of Poised enhances Deepgram's real-time feedback capabilities in virtual meetings.
Aura API provides low-latency, human-like voice models for conversational AI agents.

Upsides

Strategic partnership with Clarifai accelerates AI application development and market expansion.
Aura API positions Deepgram to capitalize on real-time conversational voice AI trends.
Deepgram's technology is used by large enterprises like NASA, indicating strong market trust.

Land your dream remote job 3x faster with AI