Audio Data Engineer – Speech Cleaning & Pipeline Automation (TTS) at Hippocratic AI

Palo Alto, California, United States

Hippocratic AI Logo
Not SpecifiedCompensation
Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Healthcare, AIIndustries

Requirements

  • Strong experience with speech/audio cleaning using tools such as iZotope RX, Audacity, Adobe Audition, or SoX
  • Proficiency in Python and audio-related scripting for automation and batch processing
  • Familiarity with digital audio principles, including sample rates, bit depth, frequency bands, and compression artifacts
  • Experience designing or operating scalable, automated workflows for handling audio at volume
  • Meticulous attention to detail in audio quality control and error spotting

Responsibilities

  • Clean, denoise, and enhance large volumes of recorded speech data for use in TTS and voice synthesis pipelines
  • Build and maintain automated audio preprocessing pipelines using scripting tools and open-source libraries
  • Apply techniques such as background noise removal, silence trimming, gain normalization, and sample rate conversion
  • Integrate tools like ffmpeg, sox, or Python-based scripts (pydub, torchaudio, librosa) into scalable workflows
  • Collaborate with ML researchers and speech scientists to deliver high-quality, ready-to-train datasets
  • Evaluate audio quality using perceptual and quantitative metrics, and maintain audio QA checklists

Skills

Key technologies and capabilities for this role

PythonffmpegSoXpydubtorchaudiolibrosaiZotope RXAudacityAdobe Audition

Questions & Answers

Common questions about this position

What is the salary for this Audio Data Engineer position?

This information is not specified in the job description.

Is this role remote or onsite?

The position is onsite.

What skills are required for this Audio Data Engineer role?

Required skills include strong experience with speech/audio cleaning using tools like iZotope RX, Audacity, Adobe Audition, or SoX; proficiency in Python and audio-related scripting for automation; familiarity with digital audio principles; experience with scalable automated workflows; and meticulous attention to detail in audio quality control.

What is the company culture like at Hippocratic AI?

Hippocratic AI focuses on an innovative mission to develop a safety-focused LLM for healthcare to improve global health outcomes, with visionary leadership from top institutions and strong backing from strategic investors.

What makes a strong candidate for this role?

A strong candidate has strong experience in speech/audio cleaning with specific tools, Python proficiency for automation, knowledge of audio principles, experience with scalable workflows, and meticulous attention to detail, with nice-to-haves like TTS pipeline experience enhancing fit.

Hippocratic AI

About Hippocratic AI

N/AHeadquarters
N/AYear Founded
N/ACompany Stage

Land your dream remote job 3x faster with AI