Demo

Solutions Engineer

GMI Cloud
Mountain View, CA Full Time
POSTED ON 6/16/2026
AVAILABLE BEFORE 7/15/2026

About the Role

We’re looking for a Forward Deployment Engineer (FDE) to work directly with customers and partners to design, deploy, and validate Inference dedicated endpoint & Model-as-a-Service products on GMI’s global infrastructure.

This is a high-impact, hybrid engineering role that sits at the intersection of platform engineering, applied ML, and customer success. You’ll be embedded with customers during early-stage deployments—turning research ideas, datasets, and business requirements into working, performant systems on real GPU clusters.

If you enjoy being close to users, debugging real systems, and shipping results fast (not just writing docs), this role is for you.



What You’ll Do

Own customer POCs end-to-end

  • Deploy and optimize LLM and multi-modal inference workflows on GMI clusters
  • Translate customer requirements into concrete system designs and experiments

Forward-deploy with customers

  • Work hands-on with research teams, startups, and enterprise customers
  • Debug performance, stability, and correctness issues in real environments

Inference deployment

  • Stand up and tune inference stacks (e.g. vLLM / SGLang / Ray Serve–style architectures)
  • Optimize latency, throughput, GPU utilization, and cost efficiency

Model-as-a-Service enablement

  • Help customers test, evaluate, and adopt the most frontier LLM and multi-modal models through GMI's unified API
  • Guide model selection, API integration, and migration across providers; shorten the "idea → production" cycle
  • Validate correctness, compatibility, and performance across the MaaS model catalog

Performance & reliability

  • Diagnose GPU, networking, and distributed system bottlenecks
  • Run benchmarks, profiling, and stress tests on multi-GPU / multi-node setups

Feedback loop to product

  • Feed real-world customer learnings back into GMI's platform, SDKs, and APIs
  • Help shape reference architectures, cookbooks, and best practices



What We’re Looking For

Core Requirements

  • Proficiency in at least one programming language (Python and Golang preferred)
  • Solid understanding of software systems and distributed systems
  • Hands-on experience with ML inference or serving systems
  • Comfort working directly with customers and ambiguous requirements
  • Ability to debug end-to-end systems (code, infra, networking, performance)

Nice to Have

  • Experience with:
  • LLM inference frameworks (vLLM, SGLang, Ray Serve, Triton, etc.)
  • Global, distributed systems
  • Hands-on experience developing and maintaining production services on Kubernetes
  • GPU performance profiling, optimization, and inference benchmarking
  • Prior experience as:
  • Forward Deployed Engineer
  • Solutions Engineer
  • ML Platform Engineer
  • Applied Research Engineer



What Makes This Role Special

  • You’re close to real users and real GPUs—not abstract roadmaps
  • You’ll work on cutting-edge inference and frontier models, not toy demos
  • You’ll influence product direction through direct customer feedback
  • Fast iteration, high ownership, and visible impact



Who Thrives Here

  • Engineers who like shipping over theorizing
  • People who enjoy being the “last mile” problem solver
  • Builders who want exposure to both deep systems and applied ML
  • Those excited by early-stage POCs that turn into real production systems

Salary.com Estimation for Solutions Engineer in Mountain View, CA
$152,844 to $184,144
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Solutions Engineer?

Sign up to receive alerts about other jobs on the Solutions Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$123,167 - $152,295
Income Estimation: 
$146,673 - $180,130
Income Estimation: 
$70,609 - $91,165
Income Estimation: 
$86,680 - $110,316
Income Estimation: 
$117,033 - $148,289
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at GMI Cloud

  • GMI Cloud Mountain View, CA
  • About US GMI Cloud is a fast-growing AI infrastructure company backed by Headline VC and one of only six cloud providers worldwide to earn NVIDIA’s prestig... more
  • 11 Days Ago

  • GMI Cloud Mountain View, CA
  • About the Role We’re looking for a BD manager to drive the identification, execution, and commercialization of customer POC for our AI infrastructure platf... more
  • 12 Days Ago

  • GMI Cloud Mountain View, CA
  • MLE (LLM inference) About US GMI Cloud is a fast-growing AI infrastructure company backed by Headline VC and one of only six cloud providers worldwide to e... more
  • 12 Days Ago

  • GMI Cloud California, CA
  • Job Title: Open Talent — All Functions | GMI Cloud Location: Mountain view (Hybrid-first) · Open to global candidates for select roles Employment Type: Ful... more
  • 16 Days Ago


Not the job you're looking for? Here are some other Solutions Engineer jobs in the Mountain View, CA area that may be a better fit.

  • Cisco San Jose, CA
  • The application window is expected to close on: 05/25/2026 Job posting may be removed earlier if the position is filled or if a sufficient number of applic... more
  • 6 Days Ago

  • Amazon Sunnyvale, CA
  • Description Are you passionate about mobile apps? As a Solutions Engineer for Content Apps and Partner Engagement (CAPE) team, you will develop world class... more
  • 22 Days Ago

AI Assistant is available now!

Feel free to start your new journey!