Demo

ML Performance Engineer

Gridmatic
Cupertino, CA Full Time
POSTED ON 1/13/2026
AVAILABLE BEFORE 3/12/2026

The Company

Gridmatic Inc. is a high-growth startup with offices in the Bay Area and Houston that is accelerating the clean energy transition by applying our expertise in data, machine learning, and energy to power markets. We are the rare startup that has multiple years of profitability without raising venture capital. At Gridmatic, we foster a collaborative and inclusive culture where learning and growth are constant. We move quickly, solve problems with integrity, and balance environmental responsibility with data-driven excellence.


We are looking for a Machine Learning Infrastructure Engineer to accelerate the decarbonization of the electricity system by building and optimizing the backbone of our ML platform. The ideal candidate will have solid expertise in machine learning, distributed systems and GPU-based training. They will design scalable, high-performance infrastructure for training, inference, and evaluation. They will push the boundaries of throughput and efficiency on large-scale time-series and weather datasets, while shaping the long-term vision of our ML platform. A successful candidate will thrive on continuous learning across engineering, ML systems, and energy markets, while contributing to a collaborative, mission-driven team.The ideal candidate must have strong deep learning fundamentals in addition to strong software engineering skills.

\n


You will:
  • Own a significant piece of our ML platform while rapidly building and iterating scalable, robust distributed infrastructure for ML training, inference, and evaluation on large-scale time-series and weather datasets.
  • Optimize throughput and cost by supporting model training and deployment across multiple clusters and clouds.
  • Improve the efficiency of machine learning models and other workloads by optimizing latency, throughput, and memory consumption. This involves pushing the boundaries of current hardware capabilities through techniques like GPU performance engineering.
  • Help define the long-term vision for Gridmatic’s ML platform.
  • Play a key role in mentoring junior engineers and interns, contributing to a collaborative, innovative, and growth-oriented team culture.


You must be:
  • A strong engineer with 3 years of full-time industry experience working on ML systems.. You possess a deep understanding of the codebases you work in and write readable, scalable code.
  • Experienced in optimizing GPU throughput in deep learning models.
  • Experienced in distributed training and inference of large models on GPU clusters, utilizing core libraries and frameworks such as PyTorch, PyTorch Lightning, and Ray.
  • A self-starter with a strong sense of independence and ownership, and the capability to engineer large, robust systems from the initial design and conceptualization to productionization.
  • Hold a Masters or Doctorate degree in engineering or a related technical field.
  • A mission-driven individual who is enthusiastic about working toward a renewable grid and diving into the intersection of ML and energy. No prior energy experience required, but curiosity and a willingness to learn are must-haves!


Nice to haves:
  • End to end proficiency in building, maintaining, and debugging cluster infrastructure, utilizing Kubernetes and Terraform.
  • Expertise in identifying performance bottlenecks and designing and writing high-performance code for large-scale ML workloads.
  • Experience with at least one of: torch.profiler, TorchDynamo, TorchInductor, Triton, or other deep learning compiler stacks.
  • Understanding of GPU architectures or experience with GPU kernel programming.
  • Knowledge of cluster communication protocols such as nccl or gloo.
  • Experience working with any of the following: weather data, energy systems, time-series forecasting, electricity markets, or financial trading.


\n
$174,000 - $231,000 a year
\n

#LI-DNI


Join our team and make a difference! Click below or email us at careers@gridmatic.com.

Salary : $174,000 - $231,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a ML Performance Engineer?

Sign up to receive alerts about other jobs on the ML Performance Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$77,900 - $95,589
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$77,900 - $95,589
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$149,493 - $192,976
Income Estimation: 
$184,796 - $233,226
Income Estimation: 
$179,606 - $233,815
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Gridmatic

  • Gridmatic Cupertino, CA
  • The Company: Gridmatic is a startup trying to help decarbonize the grid by using deep learning to forecast energy prices. We believe better forecasting can... more
  • 1 Day Ago


Not the job you're looking for? Here are some other ML Performance Engineer jobs in the Cupertino, CA area that may be a better fit.

  • Advanced Micro Devices, Inc San Jose, CA
  • WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. O... more
  • 13 Days Ago

  • Waymo Mountain View, CA
  • Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Pr... more
  • 19 Days Ago

AI Assistant is available now!

Feel free to start your new journey!