What are the responsibilities and job description for the Software Engineer position at techire.®?

We’re looking for an Inference Engineer to design and optimize the systems that power our models in production.

This role sits at the intersection of:

ML systems
distributed systems
hardware-aware performance engineering

You’ll take cutting-edge models and make them fast, scalable, and efficient in real-world environments.

What You’ll Work On

Inference Systems & Serving

Design and build low-latency inference pipelines for large multimodal models
Implement advanced serving techniques such as:
continuous batching
KV cache optimization
Work with modern inference frameworks (e.g. vLLM, SGLang, TensorRT-LLM, Triton)

Performance Optimization

Optimize inference across:
model level (quantization, architecture-aware tuning)
hardware level (GPU / accelerator utilization, kernel optimization)
Improve latency, throughput, and cost efficiency for production systems
Profile and debug bottlenecks using tools like Nsight, nsys, or similar

Distributed & Real-Time Systems

Build high-throughput, distributed inference infrastructure
Design systems for real-time workloads with strict latency constraints
Optimize multi-GPU / multi-node inference using:
tensor parallelism
pipeline parallelism
distributed scheduling

Infrastructure & Observability

Develop robust monitoring, benchmarking, and evaluation systems
Track metrics such as:
GPU utilization
Build tooling to support rapid iteration and production reliability

Research → Production

Work closely with research teams to productionize new model architectures
Translate experimental ideas into high-performance serving systems
Contribute to the design of next-generation inference stacks

Why This Role

Work on cutting-edge AI systems that go beyond current model limitations
Solve hard systems problems at the core of how modern AI runs
Join a team that values:
speed
ownership
technical excellence

Compensation & Benefits

Competitive salary equity
Full medical, dental, and vision coverage
In-office meals and a highly collaborative environment

How to Apply

If you’re excited about building high-performance inference systems and pushing the limits of real-time AI, we’d love to hear from you.

Salary : $200,000 - $350,000

Apply for this job

Receive alerts for other Software Engineer job openings

What is the career path for a Software Engineer?

Sign up to receive alerts about other jobs on the Software Engineer career path by checking the boxes next to the positions that interest you.

Software Engineer I

Income Estimation:

$77,657 - $95,021

Software Engineer II

Income Estimation:

$97,257 - $120,701

Software Systems Engineer II

Income Estimation:

$91,370 - $117,201

Software Systems Engineer III

Income Estimation:

$115,390 - $147,559

ERP Configuration Specialist III

Income Estimation:

$106,780 - $140,358

Operating Systems Programmer III

Income Estimation:

$104,963 - $131,876

Job openings at techire.®

Machine Learning Infrastructure Engineer

Apply

techire.® San Francisco, CA
Most AI roles build on top of models. This one builds what makes them actually work. We’re hiring ML Infrastructure Engineers to tackle a hard, real-world ... more
1 Day Ago

Not the job you're looking for? Here are some other Software Engineer jobs in the San Francisco, CA area that may be a better fit.

Senior Software Engineer, Platform

Apply

Beacon Software San Francisco, CA
Beacon Software is a permanent capital holding company which acquires and grows essential businesses. We are a profitable series B firm that combines great... more
2 Days Ago

Software Engineer

Apply

Advent Software, Inc. San Francisco, CA
As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000 employees... more
1 Month Ago

Software Engineer

What are the responsibilities and job description for the Software Engineer position at techire.®?

What is the career path for a Software Engineer?

Job openings at techire.®

Not the job you're looking for? Here are some other Software Engineer jobs in the San Francisco, CA area that may be a better fit.

We don't have any other Software Engineer jobs in the San Francisco, CA area right now.

AI Assistant is available now!