What are the responsibilities and job description for the On-Prem Cloud Engineer position at Success In Cloud, Inc.?

Title: Cloud Engineer

Location: Brevard, Charlotte

Experience: 5 to 8 yrs

W2, C2C

Must Have: Arize AI, Claude Cowork, GCP, Terraform

Technical Skilled Required:

VLLM, TensorRT-LLM-Triton Inference Server, SGLang, Inference, Optimization, Continuous Batching, Speculative Decoding KV, Cache / Prefix Caching, FP8 /AWQ/GPTQ, Tensor, Parallelism, Kubernetes ML Serving, KServe OpenShift Al. Helm /Operators, GPU, Orchestration, Run:AI., Performance, Benchmarking, CUDA/NCCL/MIG, Prometheus /Grafana ML Observability GuideLLM, Locust.

Responsibilities:

Build, configure, and operate on-prem Kubernetes/OpenShift Al platforms for deploying and serving GenAl models and LLM inference workloads.
Design and optimize high-performance inference stacks using vLLM, TensorRT-LLM, Triton Inference Server, SGLang, and advanced techniques (continuous batching, speculative decoding, KV caching).
Manage GPU orchestration and capacity using Run:AI, MIG, CUDA/NCCL, and tensor parallelism to maximize utilization and throughput.
Deploy and operate Kubernetes ML serving frameworks (KServe, Helm, Operators) for scalable, reliable model serving.
Drive inference optimization and benchmarking, leveraging FP8, AWQ, GPTQ, and performance tools such as GuideLLM and Locust.
Implement observability and ML monitoring using Prometheus, Grafana, Arize Al, ensuring SLA/SLO compliance for GenAl services.
Collaborate with ML and research teams to onboard new models, tune inference performance, and productionize GenAI use cases.

Apply for this job

Receive alerts for other On-Prem Cloud Engineer job openings

What is the career path for a On-Prem Cloud Engineer?

Sign up to receive alerts about other jobs on the On-Prem Cloud Engineer career path by checking the boxes next to the positions that interest you.

Cloud Architecture Analyst II

Income Estimation:

$95,407 - $122,738

Systems Architect III

Income Estimation:

$118,163 - $145,996

Cloud Architecture Analyst III

Income Estimation:

$120,777 - $151,022

Enterprise Infrastructure Architect III

Income Estimation:

$129,363 - $167,316

Enterprise Operations Supervisor

Income Estimation:

$86,891 - $130,303

AI Engineer II

Income Estimation:

$101,387 - $124,118

AI Engineer III

Income Estimation:

$119,030 - $151,900

Not the job you're looking for? Here are some other On-Prem Cloud Engineer jobs in the Brevard, NC area that may be a better fit.

Project Engineer

Apply

myGwork - LGBTQ Business Community Asheville, NC
This job is with Jabil, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ business community. Please do not contact... more
17 Days Ago

Civil Engineer

Apply

Career Collective Asheville, NC
Title: Project Manager Division: Land Development Our Client recognizes that our success depends on the quality of the people we hire. We are currently see... more
18 Days Ago

On-Prem Cloud Engineer

What are the responsibilities and job description for the On-Prem Cloud Engineer position at Success In Cloud, Inc.?

What is the career path for a On-Prem Cloud Engineer?

Not the job you're looking for? Here are some other On-Prem Cloud Engineer jobs in the Brevard, NC area that may be a better fit.

We don't have any other On-Prem Cloud Engineer jobs in the Brevard, NC area right now.

AI Assistant is available now!