Demo

Audio Data Engineer – Speech Cleaning & Pipeline Automation (TTS)

Hippocratic AI
Palo Alto, CA Full Time
POSTED ON 11/30/2025 CLOSED ON 5/14/2026

What are the responsibilities and job description for the Audio Data Engineer – Speech Cleaning & Pipeline Automation (TTS) position at Hippocratic AI?

About Us

Hippocratic AI is developing the first safety-focused Large Language Model (LLM) for healthcare. Our mission is to dramatically improve healthcare accessibility and outcomes by bringing deep healthcare expertise to every person. No other technology has the potential for this level of global impact on health.

Why Join Our Team

  • Innovative mission: We are creating a safe, healthcare-focused LLM that can transform health outcomes on a global scale.
  • Visionary leadership: Hippocratic AI was co-founded by CEO Munjal Shah alongside physicians, hospital administrators, healthcare professionals, and AI researchers from top institutions including El Camino Health, Johns Hopkins, Washington University in St. Louis, Stanford, Google, Meta, Microsoft and NVIDIA.
  • Strategic investors: We have raised a total of $278 million in funding, backed by top investors such as Andreessen Horowitz, General Catalyst, Kleiner Perkins, NVIDIA’s NVentures, Premji Invest, SV Angel, and six health systems.
  • Team and expertise: We are working with top experts in healthcare and artificial intelligence to ensure the safety and efficacy of our technology.

For more information, visit www.HippocraticAI.com.

We value in-person teamwork and believe the best ideas happen together. Our team is expected to be in the office five days a week in Palo Alto, CA unless explicitly noted otherwise in the job description.

About The Role

Hippocratic AI is seeking a skilled Audio Data Engineer to help us scale and improve our speech datasets for use in Text-to-Speech (TTS) and speech synthesis systems. In this role, you will clean and enhance real-world audio data, build automation pipelines for processing, and ensure our voice models are trained on the highest quality inputs. This work will directly shape the clarity and expressiveness of the voices used in healthcare AI applications.

Responsibilities

  • Clean, denoise, and enhance large volumes of recorded speech data for use in TTS and voice synthesis pipelines.
  • Build and maintain automated audio preprocessing pipelines using scripting tools and open-source libraries.
  • Apply techniques such as background noise removal, silence trimming, gain normalization, and sample rate conversion.
  • Integrate tools like ffmpeg, sox, or Python-based scripts (pydub, torchaudio, librosa) into scalable workflows.
  • Collaborate with ML researchers and speech scientists to deliver high-quality, ready-to-train datasets.
  • Evaluate audio quality using perceptual and quantitative metrics, and maintain audio QA checklists.

Required Qualifications

  • Strong experience with speech/audio cleaning using tools such as iZotope RX, Audacity, Adobe Audition, or SoX.
  • Proficiency in Python and audio-related scripting for automation and batch processing.
  • Familiarity with digital audio principles, including sample rates, bit depth, frequency bands, and compression artifacts.
  • Experience designing or operating scalable, automated workflows for handling audio at volume.
  • Meticulous attention to detail in audio quality control and error spotting.

Nice to Have

  • Experience working on TTS model pipelines (e.g., Tacotron, VITS, FastSpeech) or speech synthesis datasets.
  • Background in audio engineering, phonetics, or signal processing.
  • Familiarity with real-time or low-latency audio processing constraints.
  • Experience with cloud platforms and tools for automation (e.g., AWS, Airflow, or containerized audio workflows).
  • Be aware of recruitment scams impersonating Hippocratic AI. All recruiting communication will come from @hippocraticai.com email addresses. We will never request payment or sensitive personal information during the hiring process. If anything appears suspicious, stop engaging immediately and report the incident.

Salary.com Estimation for Audio Data Engineer – Speech Cleaning & Pipeline Automation (TTS) in Palo Alto, CA
$100,373 to $121,774
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets
This job has expired.
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Hippocratic AI

  • Hippocratic AI Cincinnati, OH
  • About Us Hippocratic AI is the leading generative AI company in healthcare. We have the only system that can have safe, autonomous, clinical conversations ... more
  • 1 Day Ago

  • Hippocratic AI Palo Alto, CA
  • About Us Hippocratic AI is the leading generative AI company in healthcare. We have the only system that can have safe, autonomous, clinical conversations ... more
  • 2 Days Ago

  • Hippocratic AI Palo Alto, CA
  • About Us Hippocratic AI is the leading generative AI company in healthcare. We have the only system that can have safe, autonomous, clinical conversations ... more
  • 5 Days Ago

  • Hippocratic AI Cincinnati, OH
  • About Us Hippocratic AI is the leading generative AI company in healthcare. We have the only system that can have safe, autonomous, clinical conversations ... more
  • 5 Days Ago


Not the job you're looking for? Here are some other Audio Data Engineer – Speech Cleaning & Pipeline Automation (TTS) jobs in the Palo Alto, CA area that may be a better fit.

  • VAST Data Campbell, CA
  • Description We are seeking a talented Full Stack Software Engineer to join our Manufacturing Test Automation team. In this role, you will design, develop, ... more
  • 1 Day Ago

  • VAST Data Campbell, CA
  • We are seeking a talented Full Stack Software Engineer to join our Manufacturing Test Automation team. In this role, you will design, develop, and scale th... more
  • 28 Days Ago

AI Assistant is available now!

Feel free to start your new journey!