Demo

Software Engineer

techire.®
San Francisco, CA Full Time
POSTED ON 4/17/2026
AVAILABLE BEFORE 5/16/2026

We’re looking for an Inference Engineer to design and optimize the systems that power our models in production.


This role sits at the intersection of:

  • ML systems
  • distributed systems
  • hardware-aware performance engineering


You’ll take cutting-edge models and make them fast, scalable, and efficient in real-world environments.


What You’ll Work On

Inference Systems & Serving

  • Design and build low-latency inference pipelines for large multimodal models
  • Implement advanced serving techniques such as:
  • continuous batching
  • KV cache optimization
  • Work with modern inference frameworks (e.g. vLLM, SGLang, TensorRT-LLM, Triton)


Performance Optimization

  • Optimize inference across:
  • model level (quantization, architecture-aware tuning)
  • hardware level (GPU / accelerator utilization, kernel optimization)
  • Improve latency, throughput, and cost efficiency for production systems
  • Profile and debug bottlenecks using tools like Nsight, nsys, or similar


Distributed & Real-Time Systems

  • Build high-throughput, distributed inference infrastructure
  • Design systems for real-time workloads with strict latency constraints
  • Optimize multi-GPU / multi-node inference using:
  • tensor parallelism
  • pipeline parallelism
  • distributed scheduling


Infrastructure & Observability

  • Develop robust monitoring, benchmarking, and evaluation systems
  • Track metrics such as:
  • GPU utilization
  • Build tooling to support rapid iteration and production reliability


Research → Production

  • Work closely with research teams to productionize new model architectures
  • Translate experimental ideas into high-performance serving systems
  • Contribute to the design of next-generation inference stacks


Why This Role

  • Work on cutting-edge AI systems that go beyond current model limitations
  • Solve hard systems problems at the core of how modern AI runs
  • Join a team that values:
  • speed
  • ownership
  • technical excellence


Compensation & Benefits

  • Competitive salary equity
  • Full medical, dental, and vision coverage
  • In-office meals and a highly collaborative environment


How to Apply

  • If you’re excited about building high-performance inference systems and pushing the limits of real-time AI, we’d love to hear from you.

Salary : $200,000 - $350,000

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Software Engineer?

Sign up to receive alerts about other jobs on the Software Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$77,657 - $95,021
Income Estimation: 
$97,257 - $120,701
Income Estimation: 
$91,370 - $117,201
Income Estimation: 
$115,390 - $147,559
Income Estimation: 
$106,780 - $140,358
Income Estimation: 
$104,963 - $131,876
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at techire.®

  • techire.® San Francisco, CA
  • Most AI roles build on top of models. This one builds what makes them actually work. We’re hiring ML Infrastructure Engineers to tackle a hard, real-world ... more
  • 1 Day Ago


Not the job you're looking for? Here are some other Software Engineer jobs in the San Francisco, CA area that may be a better fit.

  • Beacon Software San Francisco, CA
  • Beacon Software is a permanent capital holding company which acquires and grows essential businesses. We are a profitable series B firm that combines great... more
  • 2 Days Ago

  • Advent Software, Inc. San Francisco, CA
  • As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000 employees... more
  • 1 Month Ago

AI Assistant is available now!

Feel free to start your new journey!