Demo

Member of Technical Staff - ML Infrastructure & Performance

Embedding VC
San Mateo, CA Full Time
POSTED ON 1/9/2026
AVAILABLE BEFORE 2/7/2026
Introducing Moonlake, AI for creating real-time interactive content

Mission: Improve Throughput, Latency, & Cost - deploying our models 2–10× faster & cheaper without quality regressions.

Scope of Work:

  • GPU performance: CUDA/Triton kernels, FlashAttention family, paged attention, CUDA Graphs.
  • Serving stack: TensorRT-LLM/Triton Inference Server, vLLM/TGI; continuous batching; on-GPU KV reuse; speculative decoding/medusa; mixture-of-agents routing.
  • Parallelism: FSDP/ZeRO, TP/PP/expert parallel; NCCL tuning.
  • Quantization/PEFT: AWQ/GPTQ/FP8; LoRA/DoRA serving.
  • Systems: Ray/k8s/Argo, observability (Prom/Grafana/OpenTelemetry), autoscaling, A/B infra, canary rollback.

Tech signals:

Previous experience at Infra-heavy startups such as Databricks, Roblox

We are committed to being an on-site, in-person team currently based in San Mateo

Salary.com Estimation for Member of Technical Staff - ML Infrastructure & Performance in San Mateo, CA
$45,140 to $54,771
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Member of Technical Staff - ML Infrastructure & Performance?

Sign up to receive alerts about other jobs on the Member of Technical Staff - ML Infrastructure & Performance career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$36,436 - $44,219
Income Estimation: 
$50,145 - $86,059
Income Estimation: 
$48,515 - $60,705
Income Estimation: 
$89,966 - $112,616
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Embedding VC

  • Embedding VC San Mateo, CA
  • Introducing Moonlake, AI for creating real-time interactive content Mission: Product-level UX Full-stack, turn research into 'magical', shippable experienc... more
  • 3 Days Ago


Not the job you're looking for? Here are some other Member of Technical Staff - ML Infrastructure & Performance jobs in the San Mateo, CA area that may be a better fit.

  • Essential AI San Francisco, CA
  • About Us Essential AI is building an open platform to fuel and accelerate AI breakthroughs globally. Our open models, robust tooling, reproducible pipeline... more
  • 16 Days Ago

  • Fireworks AI San Mateo, CA
  • About Us: At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and mo... more
  • 5 Days Ago

AI Assistant is available now!

Feel free to start your new journey!