Staff Machine Learning Engineer – Autonomous Driving Model Quantization & Deployment

XPENG
Santa Clara, CA · Full Time
POSTED ON 4/17/2026
AVAILABLE BEFORE 5/16/2026

XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and landing (eVTOL) aircraft, and robotics. With a strong focus on intelligent mobility, XPENG is dedicated to reshaping the future of transportation through cutting-edge R&D in AI, machine learning, and smart connectivity.


The Mission: The challenge of Vision-Language-Action (VLA) models and Foundation Models isn't just their intelligence—it's their real-time execution at the edge. We are seeking a high-caliber Staff Machine Learning Engineer to bridge the gap between massive research models and production-ready L4 autonomous driving systems. You will lead the effort to optimize and deploy our VLA models onto vehicle-grade compute platforms for our global fleet.


Key Responsibilities:

  • Lead Optimization Strategy: Own the end-to-end quantization and optimization roadmap for large-scale multimodal models (Transformers, VLMs).
  • Model Compression: Apply and innovate in PTQ (Post-Training Quantization), QAT (Quantization-Aware Training), and pruning techniques to fit VLA models into strict memory and power envelopes.
  • Hardware-Software Co-design: Collaborate directly with model researchers to ensure architectures are "deployment-friendly" and with platform teams to influence future hardware requirements.
  • Production Excellence: Develop and maintain robust, safety-critical deployment stacks in Modern C++, ensuring 24/7 stability and deterministic performance on the road.


Basic Qualifications:

  • Proven Track Record: 5-8 years of experience in model deployment, quantization, or high-performance computing (HPC).
  • Core Technical Skills: Mastery of Modern C++ and deep experience with CUDA or other hardware acceleration libraries.
  • Deep Learning Expertise: Strong familiarity with PyTorch and deep knowledge of inference engines like TensorRT, ONNX Runtime, or TVM.
  • Quantization Depth: Hands-on experience with INT8/FP8/INT4 quantization and knowledge of the unique challenges in quantizing Large Language Models (LLMs) or Transformers.
  • Platform Knowledge: Solid understanding of computer architecture (Cache, Memory Bandwidth, SIMD) and experience with embedded/edge compute constraints.
  • Systems Thinking: Ability to debug complex performance bottlenecks across the entire software stack.


Preferred Qualifications:

  • Experience with VLA/VLM or other Foundation Model deployment.
  • Background in autonomous driving, robotics, or real-time safety-critical systems.
  • Contributions to open-source inference or compiler projects.


What we provide:

  • A fun, supportive and engaging environment
  • Infrastructure and computational resources to support your ML model development and research
  • Opportunity to work on cutting-edge technologies with the top talent in the field
  • Opportunity to make a significant impact on the transportation revolution by advancing autonomous driving
  • Competitive compensation package
  • Snacks, lunches, dinners, and fun activities


The base salary range for this full-time position is $215,280-$364,320, in addition to bonus, equity and benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.


We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status or marital status or any other prescribed category set forth in federal or state regulations.




