AI Infrastructure & Experience Engineer

FocusKPI Inc.
Mountain View, CA
POSTED ON 6/7/2026
AVAILABLE BEFORE 9/3/2026

FocusKPI is seeking an AI Infrastructure & Experience Engineer to join one of our clients, a high-tech SaaS company. 

Work Location: Mountain View, CA (Onsite role, 5 days/week onsite)
Duration: 4-month contract 
Pay Range: $70 - $79/hr

**No C2C resumes are considered**
 

Position Responsibilities:

  • Inference Optimization: Deploy and tune multiple LLMs and generative multimodal models on local inference hardware. Optimize performance metrics (TTFT, tokens/sec) via model quantization, caching strategies, and architecture-specific adjustments.
  • Systems Engineering & CUDA: Leverage deep knowledge of the CUDA environment to build custom kernels, ensuring maximum utilization of low-cost GPU compute.
  • Orchestration & Integration: Seamlessly bridge inference backends with orchestration layers (LiteLLM, Ollama, etc.) and frontends like OpenWebUI.
  • Rapid Prototyping: Build functional, high-fidelity demos showcasing model memory capabilities, agentic workflows, and context-aware web search.
  • Peripheral Connectivity: Implement communication protocols to bridge local AI compute with peripheral devices, including smart TVs, household appliances, and XR hardware.

Requirements/Technical Qualifications:

  • Recent experience in model optimization is required
  • Hardware & Compute: Proven experience with NVIDIA ecosystems and ARM64 architecture.
  • Systems Programming: Advanced proficiency in C++, Python, and Rust. Deep familiarity with CUDA and the ability to author/debug custom CUDA kernels for compute-intensive tasks.
  • AI/ML Frameworks: Extensive experience with modern inference engines (llama.cpp, TensorRT-LLM, Ollama) and orchestration frameworks (LiteLLM).
  • Software Engineering: Robust understanding of asynchronous programming (FastAPI), containerization (Docker/Kubernetes), sandbox environments, and API design for low-latency communication.
  • Full-Stack Prototyping: Ability to quickly spin up modern frontend UIs (React, Next.js, or similar) to present AI-driven intelligence to end users.
  • Communication Protocols: Familiarity with WebSockets, gRPC, and REST for device-to-device communication in a local network environment.
  • Overall mandatory skills: Recent model optimization experience, inference optimization, NVIDIA ecosystems, custom CUDA kernel development, ARM64 architecture, Python

Ideal Candidate Profile:

  • A minimum of 3 years of relevant industry experience is required
  • The "Builder" Mindset: You are energized by the prospect of building proofs-of-concept in days rather than months. You thrive in environments where speed and creativity are paramount.
  • Problem Solver: You approach unsolved, messy engineering challenges with enthusiasm rather than trepidation.
  • Architectural Vision: You see the "big picture" of how AI becomes part of consumers' daily lives, not just how the model generates text.
  • Agile & Adaptable: You are comfortable working in a fast-paced environment where priorities shift based on the results of rapid experimentation.
  • Degree in Computer Science, Machine Learning, or Artificial Intelligence Specialization preferred, but not required

Thank you!

FocusKPI Hiring Team

Founded in 2010, FocusKPI, Inc. (FocusKPI) is a data science and technology firm specializing in predictive analytics practice and methodologies. FocusKPI is a US company headquartered in Silicon Valley, California, with an East Coast office in Boston, Massachusetts.

NOTICE: Please be aware of fraudulent emails regarding job postings, job offers, and fake checks. FocusKPI's recruiting team will reach out only from the @focuskpi.com email domain. If you have received fraudulent emails now or in the past, please report them to https://reportfraud.ftc.gov/ .
The domain @focuskpijobs.com is fraudulent and not related to FocusKPI. Please do not reply to or communicate with anyone using @focuskpijobs.com.
