LLM / GenAI Engineer

Scale.jobs
Los Angeles, CA | Full Time
POSTED ON 6/4/2026
AVAILABLE BEFORE 7/3/2026
About The Role

The role is responsible for the architecture and deployment of large language model systems, moving past simple API wrappers to build robust, scalable agentic workflows and retrieval-augmented generation (RAG) architectures. The focus is on bridging the gap between cutting-edge research and stable production software that handles high-concurrency enterprise workloads.

The engineer will collaborate with infrastructure and product teams to optimize inference latency, implement sophisticated grounding mechanisms, and establish rigorous automated evaluation pipelines to ensure model safety and accuracy in real-world environments.

Key Responsibilities

  • Architect and maintain production-grade RAG pipelines using LangChain or LlamaIndex, integrating advanced retrieval techniques like hybrid search and reranking.
  • Implement and manage vector database infrastructure (Pinecone, Weaviate, or Milvus) to support high-dimensional similarity search at scale.
  • Develop and deploy systematic evaluation frameworks utilizing 'LLM-as-a-judge' and deterministic benchmarking to quantify model performance and prevent regressions.
  • Execute fine-tuning jobs using PEFT techniques such as LoRA and QLoRA to adapt open-source models (Llama 3, Mistral) to domain-specific tasks.
  • Build and optimize backend services in Python (FastAPI/Pydantic) to serve model outputs with low latency, incorporating streaming and caching strategies.
  • Design observability and monitoring systems to track token usage, cost, and hallucination rates in live production environments.

What We Are Looking For

  • 3–6 years of experience in software engineering or machine learning, with at least 1 year of hands-on experience deploying LLMs in a production capacity.
  • Deep technical proficiency in Python and familiarity with deep learning frameworks such as PyTorch or JAX.
  • Proven experience with orchestration tools and vector databases for semantic search and memory management.
  • Solid understanding of NLP fundamentals, including tokenization, attention mechanisms, and transformer architectures.
  • Strong background in cloud infrastructure (AWS/GCP) and containerization using Docker and Kubernetes.
  • Bonus: Experience with model quantization (GGUF, AWQ), low-level inference optimization (vLLM, TensorRT-LLM), or contributions to major open-source AI projects.

Salary.com Estimation for LLM / GenAI Engineer in Los Angeles, CA
$121,700 to $156,770
