Demo

Research Internship – Reinforcement Learning for Large Foundation Models

tencent
range, AL Full Time
POSTED ON 6/20/2026
AVAILABLE BEFORE 8/20/2026
Business Unit What the Role Entails About Tencent AI Lab at Seattle Area Tencent is a leading internet company in China. Tencent AI Lab at Seattle Area was established in May 2017. The lab strives to continuously improve AI's capability in perception, cognition, and creativity. Researchers there aim at solving challenging real-world problems with advanced technologies and publish extensively at top conferences and journals. Research Internship – Reinforcement Learning for Large Foundation Models Tencent AI Lab is dedicated to advancing cutting-edge AI technologies, with a particular focus on innovative breakthroughs in large foundation models. The lab's long-term ambition is to drive the development of Artificial General Intelligence (AGI), and ultimately, Artificial Superintelligence (ASI). We are currently seeking research interns for the year of 2026, in the area of reinforcement learning (RL) for large foundation models, with an emphasis on developing stable and efficient RL algorithms. The goal is to empower large foundation models in complex reasoning ang agent tasks and enhance their capabilities in autonomous exploration and continuous learning. Our Seattle area office is located in Bellevue WA. Every research intern will work with researchers on a research project aimed at attacking one of the core problems on the design and optimization of RL algorithms for large foundation models. Research areas include but are not limited to Reinforcement Learning Algorithms, Reward Modeling, and World Models. We will conduct large-scale experiments of RL algorithms in scenarios such as complex reasoning and autonomous agents, deliver impactful algorithms for real world applications, and publish influential research papers. Who We Look For Requirements & Qualifications The ideal intern candidates are those who Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university, are self-motivated and excited about developing novel techniques, have research experiences in natural language processing or machine learning, are proficient in Python programming and experienced in developing with deep learning frameworks such as PyTorch. have good publication track records and history of creativity and intellectual flexibility, have excellent communication and teamwork skills, capable of collaborating with cross-functional teams to drive project success and innovation. Intern duration: 3 months (with the possibility of extension). Can start any time in the year 2026. Location State(s) US-Washington-Bellevue The expected base pay range for this position in the location(s) listed above is $80,168.40 to $124,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan. Equal Employment Opportunity at Tencent As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals. Who we are Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world. Equal Employment Opportunity at Tencent As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Salary : $80,168 - $124,800

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Research Internship – Reinforcement Learning for Large Foundation Models?

Sign up to receive alerts about other jobs on the Research Internship – Reinforcement Learning for Large Foundation Models career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$56,065 - $75,749
Income Estimation: 
$98,735 - $185,128
Income Estimation: 
$302,228 - $379,575
Income Estimation: 
$68,596 - $101,765
Income Estimation: 
$58,530 - $79,170
Income Estimation: 
$72,001 - $91,803
Income Estimation: 
$88,975 - $120,741
Income Estimation: 
$68,121 - $81,836
Income Estimation: 
$71,928 - $87,026
Income Estimation: 
$125,958 - $157,570
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at tencent

  • tencent Los Angeles, CA
  • This position offers flexible location options and is open to candidates based in Palo Alto or Los Angeles. About the Company: Tencent is a leading global ... more
  • 3 Days Ago

  • tencent range, AL
  • Business Unit LIGHTSPEED STUDIOS is made up of passionate players who advance the art & science of game development through great stories, great gameplay, ... more
  • 4 Days Ago

  • tencent Bellevue, WA
  • Business Unit LIGHTSPEED STUDIOS is made up of passionate players who advance the art & science of game development through great stories, great gameplay, ... more
  • 4 Days Ago

  • tencent Palo Alto, CA
  • Business Unit Cloud & Smart Industries Group (CSIG) is responsible for promoting the company's cloud and industry Internet strategy. CSIG explores the inte... more
  • 9 Days Ago


Not the job you're looking for? Here are some other Research Internship – Reinforcement Learning for Large Foundation Models jobs in the range, AL area that may be a better fit.

  • Large Family Practice Fairhope, AL
  • About us We are an 8 physician family practice office in Fairhope, AL. We are professional, friendly, and fast-paced. Our goal is to provide exceptional pa... more
  • 13 Days Ago

  • MDH Foundation Repair Daphne, AL
  • Join MDH Foundation Repair – Build Your Career in Construction! MDH Foundation Repair is a customer-first, regional leader in the foundation repair industr... more
  • 14 Days Ago

AI Assistant is available now!

Feel free to start your new journey!