Research Internship – Reinforcement Learning for Large Foundation Models

tencent
range, AL Full Time
POSTED ON 12/30/2025
AVAILABLE BEFORE 2/28/2026
What the Role Entails

About Tencent AI Lab at Seattle Area

Tencent is a leading internet company in China. Tencent AI Lab at Seattle Area was established in May 2017. The lab strives to continuously improve AI's capability in perception, cognition, and creativity. Researchers there aim to solve challenging real-world problems with advanced technologies and publish extensively at top conferences and in journals.

Research Internship – Reinforcement Learning for Large Foundation Models

Tencent AI Lab is dedicated to advancing cutting-edge AI technologies, with a particular focus on innovative breakthroughs in large foundation models. The lab's long-term ambition is to drive the development of Artificial General Intelligence (AGI) and, ultimately, Artificial Superintelligence (ASI).

We are currently seeking research interns for 2026 in the area of reinforcement learning (RL) for large foundation models, with an emphasis on developing stable and efficient RL algorithms. The goal is to empower large foundation models in complex reasoning and agent tasks and to enhance their capabilities in autonomous exploration and continuous learning. Our Seattle area office is located in Bellevue, WA.

Every research intern will work with researchers on a project aimed at one of the core problems in the design and optimization of RL algorithms for large foundation models. Research areas include but are not limited to Reinforcement Learning Algorithms, Reward Modeling, and World Models. We will conduct large-scale experiments with RL algorithms in scenarios such as complex reasoning and autonomous agents, deliver impactful algorithms for real-world applications, and publish influential research papers.

Who We Look For

Requirements & Qualifications

The ideal intern candidates are those who:

  • are pursuing a Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields at a top university,
  • are self-motivated and excited about developing novel techniques,
  • have research experience in natural language processing or machine learning,
  • are proficient in Python programming and experienced in developing with deep learning frameworks such as PyTorch,
  • have a good publication track record and a history of creativity and intellectual flexibility,
  • have excellent communication and teamwork skills and are capable of collaborating with cross-functional teams to drive project success and innovation.

Intern duration: 3 months (with the possibility of extension). Can start any time in 2026.

Location State(s)

US-Washington-Bellevue

The expected base pay range for this position in the location(s) listed above is $80,169.00 to $120,000.14 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Who we are

Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world.

Salary: $80,169 - $120,000



Job openings at tencent

  • tencent Bellevue, WA
  • Business Unit What The Role Entails Conduct research and development of Omni multimodal large models, including the design and construction of training dat... more
  • 12 Days Ago

  • tencent Palo Alto, CA
  • Responsibilities Operations and maintenance of Tencent Cloud's security related products including Firewall, WAF etc in North American region to ensure ser... more
  • 12 Days Ago

  • tencent Palo Alto, CA
  • About The Hiring Team Tencent Overseas IT has the mission to empower Tencent’s rapid global growth with future ready, global IT platforms, applications and... more
  • 12 Days Ago

  • tencent Palo Alto, CA
  • Business Unit What The Role Entails Conduct research and development of Omni multimodal large models, including the design and construction of training dat... more
  • 12 Days Ago


Not the job you're looking for? Here are some other Research Internship – Reinforcement Learning for Large Foundation Models jobs in the range, AL area that may be a better fit.

  • Alfa Internship Mobile, AL
  • Company Overview Alfa Insurance® is an A-rated insurance carrier that offers an excellent array of auto, home, life, farm and business insurance products. ... more
  • 27 Days Ago

  • franklintempleton range, AL
  • At Franklin Templeton, we’re advancing our industry forward by developing new and innovative ways to help our clients achieve their investment goals. Our d... more
  • 6 Days Ago
