Demo

Research Scientist - Speech & Audio Understanding (Speech Generation)

Tencent
Bellevue, WA Full Time
POSTED ON 12/16/2025
AVAILABLE BEFORE 1/16/2026
Business Unit

What The Role Entails

Job Responsibilities:

  • Track the latest research in speech generation algorithms, explore next-generation paradigms for speech/audio generation, and push the boundaries of speech generation capabilities.
  • Investigate cutting-edge multimodal voice foundation model technologies to enhance voice interaction experiences by integrating text, speech, and vision.
  • Lead the technical R&D of voice foundation models, driving model performance improvements and innovative applications.

Who We Look For

Job Requirements:

  • Master’s or Ph.D. in Computer Science, Artificial Intelligence, Electronic Engineering, Signal Processing, or related fields.
  • Research or development experience in one or more areas: voice foundation models, speech synthesis, speech recognition, audio generation, voice conversion, or speech codec.
  • Familiarity with mainstream voice-enabled large models (e.g., GPT4o, GLM-4-Voice, Qwen2.5-Omni, Voila). Prior project experience is preferred.
  • Proficient in deep learning frameworks (e.g., PyTorch). Experience with large-scale model training frameworks (Megatron/Deepspeed) is a plus.
  • Solid understanding of large model architectures and principles. Experience in large-scale pretraining or post-training is preferred.

Location State(s)

US-Washington-Bellevue

The expected base pay range for this position in the location(s) listed above is $122,500.00 to $229,700.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. Employees hired for this position may be eligible for a sign on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis. Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company’s 401(k) plan. The Employee is also eligible for up to 15 to 25 days of vacation per year (depending on the employee’s tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year. Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Salary : $122,500 - $229,700

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Research Scientist - Speech & Audio Understanding (Speech Generation)?

Sign up to receive alerts about other jobs on the Research Scientist - Speech & Audio Understanding (Speech Generation) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$108,245 - $136,486
Income Estimation: 
$136,683 - $171,343
Income Estimation: 
$113,077 - $147,784
Income Estimation: 
$135,356 - $164,911
Income Estimation: 
$153,902 - $198,246
Income Estimation: 
$68,606 - $89,684
Income Estimation: 
$88,975 - $120,741
Income Estimation: 
$68,121 - $81,836
Income Estimation: 
$71,928 - $87,026
Income Estimation: 
$125,958 - $157,570
Income Estimation: 
$82,813 - $108,410
Income Estimation: 
$120,989 - $162,093
Income Estimation: 
$74,806 - $91,633
Income Estimation: 
$71,928 - $87,026
Income Estimation: 
$145,337 - $174,569
Income Estimation: 
$102,775 - $137,396
Income Estimation: 
$153,127 - $203,425
Income Estimation: 
$139,626 - $193,276
Income Estimation: 
$164,650 - $211,440
Income Estimation: 
$130,030 - $173,363
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Tencent

  • Tencent Bellevue, WA
  • Business Unit What The Role Entails Conduct research and development of Omni multimodal large models, including the design and construction of training dat... more
  • 13 Days Ago

  • Tencent Palo Alto, CA
  • Responsibilities Operations and maintenance of Tencent Cloud's security related products including Firewall, WAF etc in North American region to ensure ser... more
  • 13 Days Ago

  • Tencent Palo Alto, CA
  • About The Hiring Team Tencent Overseas IT has the mission to empower Tencent’s rapid global growth with future ready, global IT platforms, applications and... more
  • 13 Days Ago

  • Tencent Palo Alto, CA
  • Business Unit What The Role Entails Conduct research and development of Omni multimodal large models, including the design and construction of training dat... more
  • 13 Days Ago


Not the job you're looking for? Here are some other Research Scientist - Speech & Audio Understanding (Speech Generation) jobs in the Bellevue, WA area that may be a better fit.

  • Tencent Bellevue, WA
  • Business Unit What The Role Entails Job Responsibilities: We are building large-scale, native multimodal model systems that jointly support vision, audio, ... more
  • 12 Days Ago

  • Meta Seattle, WA
  • Meta AI is currently seeking Research Scientist interns. Our team creates spoken language technology to make it faster and easier for people to build commu... more
  • 16 Days Ago

AI Assistant is available now!

Feel free to start your new journey!