Demo

Member of Technical Staff - GPU Performance Engineer

Liquid AI
San Francisco, CA Full Time
POSTED ON 2/17/2026
AVAILABLE BEFORE 6/1/2026
About Liquid AI

Spun out of MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets, from data center accelerators to on-device hardware, ensuring low latency, minimal memory usage, privacy, and reliability. We partner with enterprises across consumer electronics, automotive, life sciences, and financial services. We are scaling rapidly and need exceptional people to help us get there.

The Opportunity

Our models and workflows require performance work that generic frameworks don’t solve. You’ll design and ship custom CUDA kernels, profile at the hardware level, and integrate research ideas into production code that delivers measurable speedups in real pipelines (training, post-training, and inference). Our team is small, fast-moving, and high-ownership. We're looking for someone who finds joy in memory hierarchies, tensor cores, and profiler output.

While San Francisco and Boston are preferred, we are open to other locations.

What We're Looking For

We need someone who:

  • Works profiler-first: You use tools like Nsight Systems / Nsight Compute to find bottlenecks, validate hypotheses, and iterate until improvements show up in end-to-end benchmarks.
  • Bridges theory and practice: You can translate ideas from papers into implementations that are robust, testable, and performant.
  • Executes independently: Given an ambiguous bottleneck, you can drive from profiling to kernel/integration changes to benchmarked results to maintained ownership.
  • Cares about the details: Memory hierarchy, occupancy, launch configs, tensor core utilization, bandwidth vs compute limits.

The Work

  • Write high-performance GPU kernels for our novel model architectures
  • Integrate kernels into PyTorch pipelines (custom ops, extensions, dispatch, benchmarking)
  • Profile and optimize training and inference workflows to eliminate bottlenecks
  • Build correctness tests and numerics checks
  • Build/maintain performance benchmarks and guardrails to prevent regressions
  • Collaborate closely with researchers to turn promising ideas into shipped speedups

Must-have

Desired Experience

  • Authored custom CUDA kernels (not only calling cuDNN/cuBLAS)
  • Strong understanding of GPU architecture and performance: memory hierarchy, warps, shared memory/register pressure, bandwidth vs compute limits
  • Proficiency with low-level profiling (Nsight Systems/Compute) and performance methodology
  • Strong C/C skills

Nice-to-have

  • CUTLASS experience and tensor core utilization strategies
  • Triton kernel experience and/or PyTorch custom op integration
  • Experience building benchmark harnesses and perf regression tests

What Success Looks Like (Year One)

  • Measurable improvement on at least one critical end-to-end pipeline (throughput and/or latency), validated by repeatable benchmarks
  • At least one research-driven technique shipped as a production kernel and maintained over time
  • Performance regressions are detectable early via benchmarks/guardrails, not discovered late

What We Offer

  • Unique challenges: Our architectural innovations and efficiency requirements offer unique optimization challenges. High ownership from day one.
  • Compensation: Competitive base salary with equity in a unicorn-stage company
  • Health: We pay 100% of medical, dental, and vision premiums for employees and dependents
  • Financial: 401(k) matching up to 4% of base pay
  • Time Off: Unlimited PTO plus company-wide Refill Days throughout the year

Salary.com Estimation for Member of Technical Staff - GPU Performance Engineer in San Francisco, CA
$87,192 to $109,340
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Member of Technical Staff - GPU Performance Engineer?

Sign up to receive alerts about other jobs on the Member of Technical Staff - GPU Performance Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$50,145 - $86,059
Income Estimation: 
$77,602 - $107,094
Income Estimation: 
$62,373 - $78,280
Income Estimation: 
$85,996 - $102,718
Income Estimation: 
$111,859 - $131,446
Income Estimation: 
$110,457 - $133,106
Income Estimation: 
$105,809 - $128,724
Income Estimation: 
$122,763 - $145,698
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Liquid AI

  • Liquid AI San Francisco, CA
  • Work With Us At Liquid, we’re not just building AI models—we’re redefining the architecture of intelligence itself. Spun out of MIT, our mission is to buil... more
  • 1 Day Ago

  • Liquid AI San Francisco, CA
  • About Liquid AI Spun out of MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets, from data center accelerators to... more
  • 2 Days Ago

  • Liquid AI Boston, MA
  • Liquid AI Job Description Role: Member Of Technical Staff, Infrastructure Department: Research & Engineering Location: Boston Location Type: Hybrid Employm... more
  • 4 Days Ago

  • Liquid AI San Francisco, CA
  • About Liquid Labs Research has been core to Liquid AI from the beginning. Liquid Labs gives that work a formal home; an internal research accelerator drivi... more
  • 4 Days Ago


Not the job you're looking for? Here are some other Member of Technical Staff - GPU Performance Engineer jobs in the San Francisco, CA area that may be a better fit.

  • Reflection AI San Francisco, CA
  • Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all . We’re developing open weight models for individuals, ag... more
  • 2 Months Ago

  • fireworksai San Mateo, CA
  • About Us: At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and mo... more
  • 20 Days Ago

AI Assistant is available now!

Feel free to start your new journey!