Demo

AI Research Engineer Intern (PhD), Real-Time Inference for Embodied AI

RoboForce
Milpitas, CA Intern
POSTED ON 4/2/2026
AVAILABLE BEFORE 9/27/2026
We are seeking an AI Research Engineer Intern (PhD) to join us in building the next generation of Embodied AI systems for robotics, with a focus on real-time model inference, systems optimization, and deployment efficiency.

In this role, you will work at the intersection of foundation models, robotics, and high-performance ML systems, helping make advanced robot intelligence practical for real-world deployment. You will collaborate with a world-class team of researchers and engineers to optimize model serving, reduce latency, improve throughput, and enable reliable on-robot inference for embodied decision-making. This is a highly applied research role with opportunities to contribute to impactful systems work and, where appropriate, research publications at top-tier venues.

Responsibilities

  • Research and develop techniques to enable real-time inference for embodied AI models deployed on robotic platforms.
  • Optimize inference performance for models such as:
    • Vision-Language-Action (VLA) models
    • World models
    • Multimodal transformer-based policies
    • Perception and state estimation models used in robot control loops
  • Improve model latency, throughput, memory efficiency, and system reliability through methods such as:
    • model compression
    • quantization
    • distillation
    • batching and scheduling optimization
    • KV-cache / decoding optimization
    • graph compilation and kernel-level acceleration
  • Collaborate with robotics, infrastructure, and hardware teams to integrate optimized models into real robot stacks and edge/on-device systems.
  • Design benchmarking pipelines for evaluating end-to-end performance, including control frequency, action latency, and system robustness under real deployment constraints.
  • Explore tradeoffs between model quality and runtime efficiency to support practical deployment in real-world robotic tasks.
  • Contribute to internal technical reports, system design discussions, and publications where appropriate.
Qualifications

  • Currently pursuing or recently completed a PhD in Computer Science, Electrical Engineering, Robotics, Machine Learning, Systems, or a related field.
  • Strong background in machine learning systems, model inference optimization, or efficient deep learning.
  • Experience optimizing modern ML models for production or low-latency deployment.
  • Hands-on experience with one or more of the following:
    • real-time inference systems
    • efficient transformer inference
    • model compression, pruning, quantization, or distillation
    • GPU performance optimization
    • deployment frameworks such as TensorRT, ONNX Runtime, XLA, TVM, Triton, or similar systems
  • Proficiency with deep learning frameworks such as PyTorch, JAX, or TensorFlow.
  • Strong programming and systems skills, including experience with performance profiling and debugging.
  • Ability to work across the stack, from model architecture to runtime systems and hardware-aware optimization.
  • Requires 5 days/week in-office collaboration with the team.
Preferred Skills

  • Familiarity with Embodied AI, robot learning, or robotics foundation models.
  • Experience optimizing multimodal or autoregressive models for low-latency inference.
  • Understanding of robotics system constraints such as control-loop timing, sensor fusion latency, and edge compute limitations.
  • Experience with deployment on embedded or edge hardware for robotics.
  • Exposure to compiler-based optimization, CUDA programming, custom kernels, or distributed inference systems.
  • Interest in co-design across model architecture, inference runtime, and robotic execution.

Why Join Us

  • Work on high-impact problems at the frontier of AI systems and robotics
  • Help turn cutting-edge embodied AI models into practical real-world robotic capabilities
  • Collaborate with a deeply technical team spanning research, systems, and hardware
  • Gain hands-on experience with challenging deployment problems in real robotic settings
  • Opportunity to contribute to research publications and advance the state of the art in efficient embodied AI

Hourly Wage Estimation for AI Research Engineer Intern (PhD), Real-Time Inference for Embodied AI in Milpitas, CA
$79.00 to $102.00
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a AI Research Engineer Intern (PhD), Real-Time Inference for Embodied AI?

Sign up to receive alerts about other jobs on the AI Research Engineer Intern (PhD), Real-Time Inference for Embodied AI career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$149,493 - $192,976
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at RoboForce

  • RoboForce Milpitas, CA
  • Why RoboForce RoboForce is an AI robotics company developing Physical AI–powered Robo-Labor for dull, dirty, and dangerous work. The company’s robots are e... more
  • 10 Days Ago

  • RoboForce Milpitas, CA
  • Why RoboForce RoboForce is an AI robotics company developing Physical AI–powered Robo-Labor for dull, dirty, and dangerous work. The company’s robots are e... more
  • 10 Days Ago

  • RoboForce Milpitas, CA
  • We are seeking an AI Research Scientist Intern (PhD) to join us in advancing the frontier of Embodied AI for robotics. This role is centered on developing ... more
  • 15 Days Ago

  • RoboForce Milpitas, CA
  • Why RoboForce RoboForce is an AI robotics company developing Physical AI–powered Robo-Labor for dull, dirty, and dangerous work. The company’s robots are e... more
  • 2 Days Ago


Not the job you're looking for? Here are some other AI Research Engineer Intern (PhD), Real-Time Inference for Embodied AI jobs in the Milpitas, CA area that may be a better fit.

  • Feather Robotics Palo Alto, CA
  • About the Job We're seeking an engineer passionate about advancing general-purpose labor through robotics. You'll play a central role in developing the AI ... more
  • 8 Days Ago

  • RoboForce Milpitas, CA
  • Why RoboForce RoboForce is an AI robotics company developing Physical AI–powered Robo-Labor for dull, dirty, and dangerous work. The company's robots are e... more
  • 3 Days Ago

AI Assistant is available now!

Feel free to start your new journey!