Audio Data Engineer – Speech Cleaning & Pipeline Automation (TTS) at Hippocratic AI

Palo Alto, California, United States

Compensation: Not specified
Experience Level: Mid-level (3 to 4 years), Senior (5 to 8 years)
Job Type: Full time
Visa: Unknown
Industries: Healthcare, AI

Requirements

  • Strong experience with speech/audio cleaning using tools such as iZotope RX, Audacity, Adobe Audition, or SoX
  • Proficiency in Python and audio-related scripting for automation and batch processing
  • Familiarity with digital audio principles, including sample rates, bit depth, frequency bands, and compression artifacts
  • Experience designing or operating scalable, automated workflows for handling audio at volume
  • Meticulous attention to detail in audio quality control and error spotting

Responsibilities

  • Clean, denoise, and enhance large volumes of recorded speech data for use in TTS and voice synthesis pipelines
  • Build and maintain automated audio preprocessing pipelines using scripting tools and open-source libraries
  • Apply techniques such as background noise removal, silence trimming, gain normalization, and sample rate conversion
  • Integrate tools like ffmpeg, sox, or Python-based scripts (pydub, torchaudio, librosa) into scalable workflows
  • Collaborate with ML researchers and speech scientists to deliver high-quality, ready-to-train datasets
  • Evaluate audio quality using perceptual and quantitative metrics, and maintain audio QA checklists
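
To make a few of the techniques named above concrete, here is a minimal sketch of silence trimming, gain (peak) normalization, and one simple quantitative level check. It is an illustration only, not Hippocratic AI's pipeline: the function names, the 0.01 amplitude threshold, and the -1 dBFS target are all assumptions, and samples are assumed to be floats in the range [-1.0, 1.0]. A production workflow would more likely rely on ffmpeg, SoX, or librosa than on pure Python.

```python
import math

def trim_silence(samples, threshold=0.01):
    """Drop leading and trailing samples quieter than `threshold` (hypothetical default)."""
    loud = [i for i, s in enumerate(samples) if abs(s) >= threshold]
    if not loud:
        return []  # the whole clip is below the threshold
    return samples[loud[0]:loud[-1] + 1]

def peak_normalize(samples, target_dbfs=-1.0):
    """Scale samples so the largest absolute value lands at `target_dbfs`."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)  # nothing to scale
    gain = 10 ** (target_dbfs / 20) / peak
    return [s * gain for s in samples]

def rms_dbfs(samples):
    """RMS level in dB relative to full scale (1.0); -inf for empty or silent input."""
    if not samples:
        return float("-inf")
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20 * math.log10(rms) if rms > 0 else float("-inf")

def preprocess(samples, threshold=0.01, target_dbfs=-1.0):
    """Trim edge silence, then normalize the peak level."""
    return peak_normalize(trim_silence(samples, threshold), target_dbfs)

# Toy clip: near-silent edges around three audible samples.
clip = [0.0, 0.001, 0.2, -0.5, 0.25, 0.002, 0.0]
cleaned = preprocess(clip)
```

Trimming runs before normalization so that near-silent edges do not survive as amplified noise; a level check such as rms_dbfs is the kind of quantitative metric that could feed an audio QA checklist.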

Skills

Python
ffmpeg
SoX
pydub
torchaudio
librosa
iZotope RX
Audacity
Adobe Audition

About Hippocratic AI

Headquarters: N/A
Year Founded: N/A
Company Stage: N/A