Backend ML Engineer

Sterling Computers Corporation
North Sioux City, SD | Full Time
POSTED ON 6/22/2026
AVAILABLE BEFORE 8/20/2026

Title: Backend ML Engineer

Reports to: Senior Software Architect

Location: North Sioux City, SD

Job Description: Sterling Computers is a technology company that provides IT solutions to a variety of clients, including the federal government, state and local governments, education, and commercial entities. Sterling's Strategic Technologies Group is responsible for learning and becoming subject matter experts in new and emerging technologies. Our team uses this expertise to broaden the portfolio of products and solutions that the company sells, delivers, and manages. Our engineers work on a range of AI-integrated systems, from production RAG platforms and LLM orchestration layers to digital human solutions and intelligent automation pipelines. We are looking for a Backend ML Engineer who is interested in taking AI/ML systems from prototype to production, designing inference APIs, building retrieval and orchestration pipelines, integrating large language models, and operating ML infrastructure at scale. If you thrive in a collaborative, client-focused environment and enjoy shipping AI features that real users depend on, we'd love to have you on our team.

Required Technical Skills:

  • 3–5 years of experience in backend or ML engineering
  • Strong working knowledge of Python, including FastAPI or Flask
  • Experience with modern ML libraries such as PyTorch, Hugging Face Transformers, and sentence-transformers
  • Proficiency with cloud platforms including AWS, GCP, or Azure
  • Hands-on experience integrating LLMs (OpenAI, Anthropic, Gemini, or open-source models) into production systems
  • Familiarity with vector databases such as Weaviate, pgvector, Pinecone, or similar
  • Experience with retrieval-augmented generation (RAG) patterns
  • Self-motivated with a positive and professional attitude
  • Knowledge of additional languages or runtimes, such as JavaScript (Node.js), is a plus

Required Education/Experience:

  • Bachelor’s degree in Computer Science, Machine Learning, or a related field (minimum requirement), or equivalent practical experience
  • Graduate-level coursework or specialization in ML/AI is a plus
  • Relevant cloud certifications are a plus
  • Demonstrated experience shipping ML systems to production is a plus
  • U.S. DoD clearance preferred, or willingness to obtain one

Qualifications:

  • Strong experience building backend services with Python (FastAPI/Flask); comfort working with async APIs and request/response patterns for ML inference workloads.
  • Hands-on experience integrating LLMs and embedding models into production applications, including prompt engineering, context management, and handling rate limits, retries, and streaming responses.
  • Familiarity with RAG architectures: chunking strategies, embedding pipelines, vector search, reranking, and evaluation metrics (Recall@k, MRR, faithfulness, answer relevance).
  • Experience with vector databases (Weaviate, pgvector, Pinecone, Qdrant, or similar) and traditional databases (PostgreSQL, MariaDB) for hybrid retrieval and metadata filtering.
  • Cloud experience (AWS/GCP/Azure) for deploying ML services — including managed inference endpoints, GPU instances, or serverless model hosting.
  • Strong understanding of API authentication, secure handling of model inputs/outputs, and PII/PHI-aware design where applicable.
  • Experience with ML observability: tracking latency, token usage, cost-per-query, retrieval quality, and model drift in production.
  • Background in data pipelines, document ingestion/parsing, or evaluation frameworks (Ragas, TruLens, Docling, custom harnesses) is needed.
  • Familiarity with fine-tuning, LoRA/PEFT, or model distillation is appreciated.
  • Experience with MLOps tooling (MLflow, Weights & Biases, Kubeflow) or LLM orchestration frameworks (LangChain, LlamaIndex, Haystack, or custom orchestrators) is a plus.

Responsibilities:

  • Build, test, and maintain production ML services — inference APIs, retrieval pipelines, orchestration layers, and guardrail/evaluation components.
  • Design scalable RESTful and streaming APIs that serve ML model outputs reliably under real-world load.
  • Integrate and tune LLMs, embedding models, and rerankers; evaluate trade-offs across hosted (Anthropic, OpenAI, Vertex) and self-hosted (HF, vLLM) options on cost, latency, and quality.
  • Build ingestion and chunking pipelines for unstructured data (PDFs, HTML, transcripts) and maintain vector store schemas for multi-tenant or multi-domain retrieval.
  • Implement evaluation harnesses to measure retrieval quality, generation faithfulness, and end-to-end answer correctness; close the loop from evals back into pipeline improvements.
  • Containerize and deploy ML workloads with Docker and Kubernetes; manage GPU/CPU resource allocation and model versioning.
  • Optimize database queries, vector search performance, and caching strategies (including LLM prompt caching) to reduce latency and cost.
  • Implement CI/CD pipelines for ML services and instrument monitoring for both system metrics (latency, error rate) and ML-specific metrics (retrieval quality, hallucination rate, drift).
  • Collaborate with frontend engineers, ML researchers, and product analysts to translate model capabilities into shipped features.
  • Document backend and ML infrastructure, including model cards, evaluation results, and architectural decisions.
  • Travel: must be willing to travel 25% of the time, and periodically up to 50%.


Sterling Computers Corporation (“Sterling”) is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to age, race, color, creed, religion, disability, medical condition, economic status or status with regard to public assistance, citizenship status, national or social or ethnic origin, past or present membership in the uniformed services, protected veteran status, sex, pregnancy, marital or civil union or domestic partnership status, family or parental status, sexual orientation, gender expression or identity, family medical history or genetic information, HIV status, political belief, or any other status or characteristic protected by applicable law.


Salary.com Estimation for Backend ML Engineer in North, SD
$79,997 to $105,552