Demo

Hunyuan Multimodal Reinforcement Learning Research Intern

Tencent
Palo Alto, CA Intern
POSTED ON 5/15/2026
AVAILABLE BEFORE 6/13/2026
Business Unit

What The Role Entails

Responsibilities:

  • Conduct research on RL algorithms for multimodal models, including diffusion models for image, video, and 3D generation, autoregressive models for multimodal understanding, and potentially unified multimodal frameworks.
  • Design and develop RL infrastructure and reward modeling strategies to enable efficient large-scale training, improve training stability, and mitigate reward hacking and related failure modes.
  • Explore next-generation RL paradigms that more directly and effectively learn from environment feedback.

Who We Look For

Requirements:

  • Currently enrolled as a PhD student in Computer Science or a closely related field.
  • Demonstrated strong research capability, with publications in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV, SIGGRAPH.
  • Strong hands-on programming skills, with solid experience in deep learning system implementation, model training and inference optimization, CPU/GPU acceleration, and distributed training and inference.
  • Prior experience with diffusion models, autoregressive models, and/or text-to-image or text-to-video generation is highly preferred.
  • Participation in ACM/NOIP is a strong plus.

Location State(s)

US-California-Palo Alto

The expected base pay range for this position in the location(s) listed above is $80,168.40 to $124,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Salary : $80,168 - $124,800

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Hunyuan Multimodal Reinforcement Learning Research Intern?

Sign up to receive alerts about other jobs on the Hunyuan Multimodal Reinforcement Learning Research Intern career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$208,966 - $334,311
Income Estimation: 
$323,592 - $466,778
Income Estimation: 
$70,310 - $88,223
Income Estimation: 
$88,950 - $110,401
Income Estimation: 
$84,958 - $111,603
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Tencent

  • Tencent Palo Alto, CA
  • Job Description Own end-to-end operations of Tencent's overseas Elastic IP and load balancing gateway platform, covering user support ticket resolution, so... more
  • 3 Days Ago

  • Tencent Palo Alto, CA
  • About The Hiring Team Tencent Overseas IT has the mission to empower Tencent’s rapid global growth with future ready, global IT platforms, applications and... more
  • 3 Days Ago

  • Tencent range, AL
  • Business Unit What the Role Entails With over 20 years of research and experience in audio and video technology, Tencent Cloud launched Tencent Cloud Media... more
  • 4 Days Ago

  • Tencent Palo Alto, CA
  • Business Unit What The Role Entails Research and development on video technologies and applications Who We Look For Bachelor’s degree or above in computer ... more
  • 4 Days Ago


Not the job you're looking for? Here are some other Hunyuan Multimodal Reinforcement Learning Research Intern jobs in the Palo Alto, CA area that may be a better fit.

  • PlusAI Santa Clara, CA
  • PlusAI is a Physical AI company pioneering AI-based virtual driver software for factory-built autonomous trucks. Headquartered in Silicon Valley with opera... more
  • 11 Days Ago

  • Tencent Palo Alto, CA
  • Business Unit What The Role Entails Conduct research and development of Omni multimodal large models, including the design and construction of training dat... more
  • 24 Days Ago

AI Assistant is available now!

Feel free to start your new journey!