Demo

LLM Inference / AI Infrastructure Engineer

Apex 2000
Charlotte, NC Contractor
POSTED ON 5/27/2026
AVAILABLE BEFORE 6/26/2026

LLM Inference / AI Infrastructure Engineer
Location: Charlotte, NC
Duration: 9-12 Month

JD:
vLLM TensorRTLLM Triton Inference Server SGLang Inference Optimization Continuous Batching Speculative Decoding KV Cache / Prefix Caching FP8 / AWQ / GPTQ Tensor Parallelism Kubernetes ML Serving KServe OpenShift AI Helm / Operators GPU Orchestration Run:AI Performance Benchmarking CUDA / NCCL / MIG Prometheus / Grafana ML Observability

skills sanity check: HAVE YOU WORKED ON Nvidia H200? If yes, chances are you will know all above skills

Hourly Wage Estimation for LLM Inference / AI Infrastructure Engineer in Charlotte, NC
$51.00 to $66.00
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a LLM Inference / AI Infrastructure Engineer?

Sign up to receive alerts about other jobs on the LLM Inference / AI Infrastructure Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Apex 2000

  • Apex 2000 Minnesota, MN
  • Healthcare Systems Analyst (Technical) Location: Minnesota, MN Duration: 9-12 Month JD: Systems Analyst (Technical) Drug Cost Estimator & Provider Search R... more
  • 4 Days Ago

  • Apex 2000 Collegeville, PA
  • Veeva CTMS Location: Collegeville PA Duration: 9-12 Month JD: 1.Certified would be required 2.Experience with configuration of Vaults more
  • 4 Days Ago

  • Apex 2000 Raleigh, NC
  • Hi , I hope you are doing well Job Title: Java spring boot Duration: 9 - 12 Months position type: Contract Location: Raleigh NC more
  • 5 Days Ago

  • Apex 2000 Pittsburgh, PA
  • This aligns with AMS/support models typically used in Life Sciences programs (Run & Maintain, SLA-driven operations, continuous improvement) Key Responsibi... more
  • 8 Days Ago


Not the job you're looking for? Here are some other LLM Inference / AI Infrastructure Engineer jobs in the Charlotte, NC area that may be a better fit.

  • Outlier AI Charlotte, NC
  • About the Project Outlier helps the world's most innovative companies improve their AI agents by providing human feedback. Do you want to shape the future ... more
  • 16 Days Ago

  • ARK Infotech Spectrum Charlotte, NC
  • Role : LLM Inference & GPU Systems Consultant Location : Charlotte , NC ( Locals only) We are seeking an AI Infrastructure Runtime Engineer to build and ma... more
  • 19 Days Ago

AI Assistant is available now!

Feel free to start your new journey!