Demo

Member of Technical Staff - Kernels & GPU Performance

Gimlet Labs, Inc.
San Francisco, CA Full Time
POSTED ON 5/11/2026
AVAILABLE BEFORE 6/7/2026
About Us

Gimlet Labs is building the first heterogeneous neocloud for AI workloads.

As AI systems scale, the industry is hitting fundamental limits in power, capacity, and cost with today’s homogeneous, vertically integrated infrastructure. Gimlet addresses this by decoupling AI workloads from the underlying hardware. Our platform intelligently partitions workloads into components and orchestrates each component to hardware that best fits its performance and efficiency needs. This approach enables heterogeneous systems across multi-vendor and multi-generation hardware, including the latest emerging accelerators. These systems unlock step-function improvements in performance and cost efficiency at scale.

On top of this foundation, Gimlet is building a production-grade neocloud for agentic workloads. Customers use Gimlet to deploy and manage their workloads through stable, production-ready APIs, without having to reason about hardware selection, placement, or low-level performance optimization.

Gimlet works with foundation labs, hyperscalers, and AI native companies to power real production workloads built to scale to gigawatt-class AI datacenters.

Mission

Gimlet Labs is seeking a Member of Technical Staff focused on kernels and GPU performance. In this role, you will work close to accelerators and execution hardware to extract maximum performance from AI workloads across diverse and rapidly evolving platforms. You will analyze low-level execution behavior, design and optimize kernels, and ensure performance is reliable across both established and emerging hardware.

This role is ideal for engineers who enjoy deep performance work, reasoning about hardware tradeoffs, and turning theoretical peak performance into real-world results.

Responsibilities

  • Design, implement, and optimize GPU and accelerator kernels for AI workloads
  • Analyze and tune performance across the GPU execution stack, including memory access patterns, synchronization, and instruction scheduling
  • Work with compilers and runtimes to ensure kernels integrate cleanly and perform well in end-to-end systems
  • Bring up and optimize execution on new or emerging accelerators
  • Profile, benchmark, and debug performance issues across kernels, runtimes, and hardware
  • Ensure performance optimizations are robust, correct, and production-ready at scale

Qualifications

  • Strong software engineering fundamentals
  • Experience working on performance-critical systems close to hardware
  • Comfort reasoning about low-level execution behavior, memory hierarchies, and performance tradeoffs

Preferred Qualifications

  • Experience with CUDA, Triton, CUTLASS, or other accelerator programming models
  • Deep understanding of GPU execution models (warps/wavefronts, blocks, grids)
  • Experience optimizing memory access patterns (coalescing, shared memory, cache behavior)
  • Familiarity with occupancy, latency hiding, and instruction-level parallelism
  • Experience using profiling and performance analysis tools
  • Familiarity with multi-GPU or distributed execution is a plus

Compensation Range: $150K - $350K

Salary : $150,000 - $350,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Member of Technical Staff - Kernels & GPU Performance?

Sign up to receive alerts about other jobs on the Member of Technical Staff - Kernels & GPU Performance career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$36,436 - $44,219
Income Estimation: 
$50,145 - $86,059
Income Estimation: 
$48,515 - $60,705
Income Estimation: 
$93,348 - $109,523
Income Estimation: 
$112,230 - $133,397
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Gimlet Labs, Inc.

  • Gimlet Labs, Inc. San Francisco, CA
  • About Us Gimlet Labs is building the first heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental limits in powe... more
  • 14 Days Ago

  • Gimlet Labs, Inc. San Francisco, CA
  • About Us Gimlet Labs is building the first heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental limits in powe... more
  • 14 Days Ago

  • Gimlet Labs, Inc. San Francisco, CA
  • About Us Gimlet Labs is building the first heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental limits in powe... more
  • 14 Days Ago

  • Gimlet Labs, Inc. San Francisco, CA
  • About Us Gimlet Labs is building the first heterogeneous neocloud for AI workloads. As AI systems scale, the industry is hitting fundamental limits in powe... more
  • 14 Days Ago


Not the job you're looking for? Here are some other Member of Technical Staff - Kernels & GPU Performance jobs in the San Francisco, CA area that may be a better fit.

  • Magic San Francisco, CA
  • Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to saf... more
  • 22 Days Ago

  • Liquid AI San Francisco, CA
  • About Liquid AI Spun out of MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets, from data center accelerators to... more
  • 1 Month Ago

AI Assistant is available now!

Feel free to start your new journey!