Demo

Machine Learning Engineer Intern, ML Runtime & Optimization (Spring 2026, Master/PhD)

Pony.ai
Fremont, CA Intern
POSTED ON 1/7/2026
AVAILABLE BEFORE 2/16/2026

Description

Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services at a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai’s leading position has been recognized, with CNBC ranking Pony.ai #10 on its CNBC Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai has accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public at NASDAQ in Nov. 2024.



Responsibility

The ML Infrastructure team at Pony.ai provides a set of tools to support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment and monitoring.

As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to advance the training and inferences of the AI models in autonomous driving systems.



This includes:

  • Performing in-depth analysis and optimization to model training and deployment to achieve the state of art in performance and efficiency in autonomous driving.
  • Work across the entire AI framework/compiler stack (e.g. Torch, CUDA and TensorRT), support model development and prototype key deep learning algorithms.
  • Analyze the tradeoffs between performance, cost and energy for autonomous driving.
  • Collaborating closely with diverse groups in Pony.ai to influence the next-generation compute platform HW and SW design.
  • Research the latest model architectures, programming models and hardware.


Requirements

  • Currently pursuing a Masters or PhD program or a related discipline.
  • Strong programming skills in C/C or Python.
  • Solid understanding of CPU or GPU execution model, e.g. threads, registers, cache, memory, cost and performance trade-off, etc.
  • Experience in benchmarking, profiling and validating performance.
  • Strong communication skills and ability to work cross-functionally between software and hardware teams


Preferred Qualifications:

One or more of the following fields are preferred

  • Experience with parallel programming: CUDA, ROCm, Triton, Cutlass, etc.
  • Experience in computer vision, image processing, machine learning and deep learning.
  • Experience in model optimization techniques such as quantization, pruning, etc.
  • Experience in optimizing the utilization of compute resources, identifying and resolving compute and data flow bottlenecks.
  • Strong knowledge of software design, programming techniques and algorithms.
  • Strong knowledge of common deep learning frameworks and libraries.
  • Strong knowledge on system performance, GPU optimization or ML compiler.


Note

  • This position is fully onsite in Fremont, at least 3 months


Compensation

  • Master: $7000/month
  • PhD: $10,000/month

Salary : $7,000 - $10,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Machine Learning Engineer Intern, ML Runtime & Optimization (Spring 2026, Master/PhD)?

Sign up to receive alerts about other jobs on the Machine Learning Engineer Intern, ML Runtime & Optimization (Spring 2026, Master/PhD) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$97,257 - $120,701
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Pony.ai

  • Pony.ai Fremont, CA
  • Description Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous m... more
  • 5 Days Ago

  • Pony.ai Fremont, CA
  • Description Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous m... more
  • 5 Days Ago

  • Pony.ai Fremont, CA
  • Description Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous m... more
  • 6 Days Ago

  • Pony.ai Fremont, CA
  • Description Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous m... more
  • 6 Days Ago


Not the job you're looking for? Here are some other Machine Learning Engineer Intern, ML Runtime & Optimization (Spring 2026, Master/PhD) jobs in the Fremont, CA area that may be a better fit.

  • Pony.ai Fremont, CA
  • Description Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous m... more
  • 6 Days Ago

  • Waymo Mountain View, CA
  • Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Pr... more
  • 16 Days Ago

AI Assistant is available now!

Feel free to start your new journey!