Senior Applied Researcher, Audio Understanding

Cartesia
San Francisco, CA Full Time
POSTED ON 2/22/2026
AVAILABLE BEFORE 5/12/2026
About Cartesia

Our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are. Today, not even the best models can continuously process and reason over a year-long stream of audio, video and text—1B text tokens, 10B audio tokens and 1T video tokens—let alone do this on-device.

We're pioneering the model architectures that will make this possible. Our founding team met as PhDs at the Stanford AI Lab, where we invented State Space Models (SSMs), a new primitive for training efficient, large-scale foundation models. Our team pairs deep expertise in model innovation and systems engineering with a design-minded product engineering team to build and ship cutting-edge models and experiences.

We're funded by leading investors at Index Ventures and Lightspeed Venture Partners, along with Factory, Conviction, A Star, General Catalyst, SV Angel, Databricks and others. We're fortunate to have the support of many amazing advisors, and 90 angels across many industries, including the world's foremost experts in AI.

The Role

As a Senior Applied Researcher in Audio Understanding, you will be responsible for tackling the most challenging problems in audio perception. Your work will go beyond traditional speech recognition to encompass the full spectrum of audio perception, from identifying speakers and interpreting emotion to understanding complex acoustic environments. You will lead high-impact projects that are critical to our mission of building truly aware AI.

What You’ll Do

  • Architect and develop novel, large-scale models for complex audio understanding tasks, including multi-speaker ASR, diarization, and non-speech audio classification, and deploy them to production at scale.
  • Pioneer research in areas like self-supervised learning for audio, few-shot learning, and robust audio-visual perception.
  • Set new standards for how we evaluate and benchmark our audio understanding systems.
  • Build large-scale pre-training and fine-tuning datasets for audio understanding capabilities.

What We’re Looking For

  • Deep expertise in ASR, audio understanding, language modeling, or generative modeling more broadly.
  • Experience with large-scale training, GPU/TPU acceleration, and model optimization.
  • Strong applied mindset—able to balance scientific novelty with product impact.

Our perks

🍽 Lunch, dinner and snacks at the office.

🏥 Fully covered medical, dental, and vision insurance for employees.

🏦 401(k).

✈️ Relocation and immigration support.

🦖 Your own personal Yoshi.

Our Culture

🏢 We’re an in-person team based out of San Francisco. We love being in the office, hanging out together, and learning from each other every day.

🚢 We ship fast. All of our work is novel and cutting-edge, and execution speed is paramount. We have a high bar, and we don’t sacrifice quality or design along the way.

🤝 We support each other. We have an open & inclusive culture that’s focused on giving everyone the resources they need to succeed.

Salary.com Estimation for Senior Applied Researcher, Audio Understanding in San Francisco, CA
$103,971 to $135,513