Demo

Lead MLOps / AI Platform Engineer

SATCON Inc
Charlotte, NC Contractor
POSTED ON 5/23/2026
AVAILABLE BEFORE 6/22/2026

Job Description: Lead MLOps / AI Platform Engineer

Location: Charlotte, NC

Duration: Long Term

Visa Type: & Candidates

Role Overview

We are seeking a highly skilled Lead MLOps / AI Platform Engineer to design, build, and optimize our next-generation Generative AI and Large Language Model (LLM) infrastructure. This role is pivotal in bridging the gap between cutting-edge AI research and robust production deployment. You will be responsible for orchestrating high-performance GPU environments (specifically leveraging Nvidia H200s), optimizing LLM inference, and maintaining enterprise-grade infrastructure across both Cloud (Google Cloud Platform/Azure) and On-Premise environments.

Key Responsibilities

  1. AI Inference Optimization & Serving
  • Deploy, scale, and manage large-scale language models using advanced inference frameworks such as vLLM, TensorRT-LLM, SGLang, and Triton Inference Server.
  • Implement and fine-tune performance optimization strategies, including Continuous Batching and advanced Parallelism techniques.
  • Conduct load testing, benchmarking, and profiling of LLM deployments using GuideLLM and Locust to ensure optimal latency and throughput.
  1. Cloud & Infrastructure Orchestration
  • Architect and maintain scalable, secure infrastructure on Google Cloud Platform and Azure using Infrastructure as Code (Terraform).
  • Design and execute Cloud Networking, Landing Zones, and Organization Policies/Governance.
  • Manage secrets and secure workloads utilizing HashiCorp Vault.
  • Develop and champion Internal Developer Portals to streamline workflows for data science and product teams.
  1. On-Premise & Kubernetes Engineering
  • Orchestrate ML workloads on Kubernetes, utilizing KServe, OpenShift AI / OpenShift Functions, and Helm charts/Operators.
  • Manage compute clusters with a heavy focus on advanced GPU Orchestration (Nvidia H200 ecosystems).
  • Demonstrate deep hands-on expertise in architecture and "know-how to unfold an LLM" into highly constrained or custom on-premise hardware setups.
  1. Observability & SRE
  • Implement end-to-end ML Observability and monitoring frameworks using Arize AI.
  • Establish Site Reliability Engineering (SRE) best practices, maintaining strict SLOs/SLIs for model deployment pipelines and production APIs.

Required Skills & Qualifications

Core AI / MLOps Stack:

  • Inference Engines: vLLM, TensorRT-LLM, Triton Inference Server, SGLang
  • ML Frameworks/Ops: KServe, OpenShift AI, Arize AI, GenAI Platforms, RAG architecture
  • Performance & Testing: GuideLLM, Locust, Continuous Batching, Parallelism optimization
  • Infrastructure & Cloud Stack:
  • Cloud Providers: Google Cloud Platform (Google Cloud Platform), Microsoft Azure
  • Containerization & Orchestration: Kubernetes, OpenShift, Helm/Operators, GPU Orchestration
  • IaC & Automation: Terraform, Python
  • Security & Networking: HashiCorp Vault, Landing Zones, Org Policy & Governance
  • Hardware Sanity Check:
  • Mandatory Experience: Direct, hands-on experience working with Nvidia H200 GPUs and optimizing workloads specifically for this architecture.

Salary : $60 - $70

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Lead MLOps / AI Platform Engineer?

Sign up to receive alerts about other jobs on the Lead MLOps / AI Platform Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at SATCON Inc

  • SATCON Inc York, NY
  • Title: Scrum Master Address: 1 New York Plaza, New York, NY 10004 Duration: 3 Years Contract Need only local to New york and New jersey Skills 18 -25 Years... more
  • 2 Days Ago

  • SATCON Inc Charlotte, NC
  • Hi, Our Client is looking for Google Cloud Platform(Terraform) Engineer for Charlotte, NC. If you are looking for a job change, Please let me know Google C... more
  • 9 Days Ago


Not the job you're looking for? Here are some other Lead MLOps / AI Platform Engineer jobs in the Charlotte, NC area that may be a better fit.

  • DPR Construction Charlotte, NC
  • Job Description DPR is looking for an experienced Data and MLOps Engineer to join our Data and AI team and work closely with the Data Platform, BI and Ente... more
  • 18 Days Ago

  • VDart, Inc. Charlotte, NC
  • Role: AI Infrastructure Platform Engineer Location: Charlotte, NC [Hybrid] Type: Contract Description: Lead complex infrastructure initiatives supporting G... more
  • 13 Days Ago

AI Assistant is available now!

Feel free to start your new journey!