Founding Engineer, ML Performance & Systems

Stealth
San Francisco, CA Full Time
POSTED ON 11/4/2025
AVAILABLE BEFORE 12/4/2025
About Us

We’re an early-stage stealth startup building a new kind of platform for generative media. Our mission is to enable the future of real-time generative applications: we’re building the foundational tools and infrastructure that make entirely new categories of generative experiences and applications finally possible.

We’re a small, focused team of ex-YC and unicorn founders and senior engineers with deep experience across 3D, generative video, developer platforms, and creative tools. We're backed by top-tier investors and top angels, and we're building a new technical foundation purpose-built for the next era of generative media.

We’re operating at the edge of what’s technically possible: high-performance inference and real-time orchestration of multimodal models. As one of our founding engineers, you’ll play a key role in architecting the core platform, shaping system design decisions, and owning critical infrastructure from day one.

If you're excited about architecting and building high-performance infrastructure that empowers the next generation of developers and unlocks entirely new product categories, we’d love to talk.

About The Role

We’re looking for a Founding Engineer, ML Performance & Systems with deep expertise in high-performance ML infrastructure. This is a highly technical, high-impact role focused on squeezing every drop of performance from real-time generative media models.

You’ll work across the model-serving stack, designing novel architectures, optimizing inference performance, and shaping Reactor’s competitive edge in ultra-low-latency, high-throughput environments.

What You’ll Do

  • Drive our frontier position on real-time model performance for diffusion models
  • Design and implement a high-performance in-house inference engine
  • Focus on maximizing throughput and minimizing latency and resource usage
  • Develop performance monitoring and profiling tools to identify bottlenecks and optimization opportunities

Requirements

About You

  • Strong foundation in systems programming, with a track record of identifying and resolving bottlenecks
  • Deep expertise in the ML infrastructure stack:
    • PyTorch, TensorRT, TransformerEngine, Nsight
    • Model compilation, quantization, and advanced serving architectures
  • Working knowledge of GPU hardware (NVIDIA) and the ability to dive deep into the stack as needed (e.g., writing custom GEMM kernels with CUTLASS)
  • Proficient in Triton, or willing to learn it and able to show comparable experience in low-level accelerator programming
  • Excited by the frontier of multi-dimensional model parallelism (e.g., combining tensor, context, and sequence parallelism)
  • Familiarity with the internals of cutting-edge techniques such as Ring Attention, FlashAttention-3 (FA3), and FusedMLP implementations

Minimum Qualifications

  • Expertise in systems programming (C++, CUDA)
  • Experience optimizing ML inference on GPUs
  • Proficient with PyTorch and tools like TensorRT
  • Deep understanding of NVIDIA GPU architecture
  • Familiar with model serving, compilation, and quantization

Benefits

  • Competitive SF salary and equity

Salary: $160,000 - $200,000
