Demo

Senior MLOps / LLMOps Engineer

ITCAPS LLC
Jersey, NJ Full Time
POSTED ON 5/13/2026
AVAILABLE BEFORE 6/12/2026

Job Title - Senior MLOps / LLMOps Engineer Kubernetes & AI Inference Platforms

Duration - 2 Months

Location: New Jersey

Job Summary

We are seeking a highly skilled Senior MLOps / LLMOps Engineer to design, deploy, and support enterprise-scale AI/LLM platforms in production environments. The ideal candidate will have strong experience with Kubernetes/OpenShift, NVIDIA TensorRT-LLM, Triton Inference Server, and scalable AI infrastructure. This role focuses on building reliable, secure, and high-performance inference platforms for mission-critical AI applications.

Key Responsibilities

  • Deploy, manage, and troubleshoot containerized AI/LLM applications on Kubernetes/OpenShift platforms.
  • Configure, optimize, and support LLM inference workloads using NVIDIA TensorRT-LLM and Triton Inference Server.
  • Design and maintain scalable MLOps/LLMOps and container deployment pipelines.
  • Build CI/CD workflows for AI models, containers, and infrastructure deployments.
  • Package and deploy AI models across UAT, testing, and production environments.
  • Monitor platform performance, GPU utilization, availability, and operational health.
  • Implement logging, alerting, monitoring, and automated operational support processes.
  • Troubleshoot model deployment, scaling, networking, and load balancing issues.
  • Support model optimization techniques including quantization, pruning, and performance tuning.
  • Create operational runbooks, deployment procedures, health checks, and support documentation.
  • Support backup, restore, disaster recovery, failover, and business continuity planning.
  • Ensure platform security, RBAC, compliance, and governance standards are maintained.
  • Collaborate with AI, infrastructure, DevOps, and operations teams to deliver scalable AI solutions.

Required Qualifications

  • 5 years of experience in Kubernetes/OpenShift administration and containerized environments.
  • Strong hands-on experience with NVIDIA TensorRT-LLM and Triton Inference Server.
  • Experience deploying and supporting LLM/AI inference services in production.
  • Strong knowledge of Docker, microservices, and API-based architectures.
  • Experience building and supporting MLOps/LLMOps pipelines and CI/CD workflows.
  • Expertise in monitoring, logging, and troubleshooting distributed systems.
  • Experience with NVIDIA GPU infrastructure and AI workload optimization.
  • Understanding of incident management, change management, and operational best practices.
  • Strong problem-solving, communication, and collaboration skills.

Preferred Qualifications

  • Experience with OpenShift AI and enterprise AI platforms.
  • Knowledge of model optimization and inference acceleration techniques.
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Familiarity with Infrastructure as Code (Terraform, Ansible, Helm, etc.).
  • Kubernetes/OpenShift or cloud certifications are a plus.

Salary : $70 - $80

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Senior MLOps / LLMOps Engineer?

Sign up to receive alerts about other jobs on the Senior MLOps / LLMOps Engineer career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$149,493 - $192,976
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$149,493 - $192,976
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at ITCAPS LLC

  • ITCAPS LLC Springfield, OH
  • Senior Delivery Specialist Introduction: The Senior Delivery Specialist will play a crucial role in supporting day-to-day data center activities, maintaini... more
  • 6 Days Ago

  • ITCAPS LLC York, NY
  • Job Description: We are seeking an experienced AS400 Synon Developer with strong expertise in CA:2E/Synon development and IBM iSeries application support. ... more
  • 11 Days Ago


Not the job you're looking for? Here are some other Senior MLOps / LLMOps Engineer jobs in the Jersey, NJ area that may be a better fit.

  • Goldman Sachs Jersey, NJ
  • Job Description What We Do At Goldman Sachs, our Engineers don’t just make things – we make things possible. Change the world by connecting people and capi... more
  • 27 Days Ago

  • Fidelity Investments Jersey, NJ
  • Job Description:Principal Quant Developer The Role The Quantitative Research & Investing Technology (QRIT) team within Fidelity's Asset Management Technolo... more
  • 7 Days Ago

AI Assistant is available now!

Feel free to start your new journey!