What are the responsibilities and job description for the AI/ML Engineer position at Inherent Technologies?
Position: AI/ML Engineer
Location: Scottsdale, AZ*From Day 1 Onsite
We are seeking an experienced AIML Engineer to design, build, and operate Al/ML infrastructure and agentic systems. This role involves developing MCP servers and agents, integrating LLMs, and implementing RAG pipelines for production environments.
Key Responsibilities
Design and implement RAG
(Retrieval-Augmented Generation) pipelines and integrations with vector stores and retrieval tooling; use LangChain and Langfuse for orchestration, chaining, and observability.
Core Responsibilities
Implement and maintain MCP server and agent code, APls, and SKs for model access and agent orchestration.
Design agent behavior, workflows and
safety guards for agentic Al systems.
Create, test and iterate prompt templates,
evaluation harnesses and
grounding/chain-of-thought strategies.
Integrate LLMs and model providers
(self-hosted and cloud APls) with unified adapters and telemetry.
Required Skills & Experience
2 years of Experience Practical experience implementing RAG: embeddings, vector DBs and retrieval tuning.
2 years of Experience with LangChain patterns and with toolchain telemetry (Langfuse or similar) for prompt/model traceability.
2 years of Experience with Practical experience with Google Cloud Platform services
Location: Scottsdale, AZ*From Day 1 Onsite
We are seeking an experienced AIML Engineer to design, build, and operate Al/ML infrastructure and agentic systems. This role involves developing MCP servers and agents, integrating LLMs, and implementing RAG pipelines for production environments.
Key Responsibilities
- Design, build and operate MCP servers and
- Develop agentic Al, prompt engineering patterns, LLM integrations and developer tooling for production use.
Design and implement RAG
(Retrieval-Augmented Generation) pipelines and integrations with vector stores and retrieval tooling; use LangChain and Langfuse for orchestration, chaining, and observability.
Core Responsibilities
Implement and maintain MCP server and agent code, APls, and SKs for model access and agent orchestration.
Design agent behavior, workflows and
safety guards for agentic Al systems.
Create, test and iterate prompt templates,
evaluation harnesses and
grounding/chain-of-thought strategies.
Integrate LLMs and model providers
(self-hosted and cloud APls) with unified adapters and telemetry.
- Build developer tooling: CLI, local runner,
- Containerize services (Docker), manage
- Ensure observability: logging, metrics, traces, dashboards, alerting and SLOs for model infra and agents.
- Design and maintain RAG workflows:
- Integrate and instrument LangChain for
Required Skills & Experience
- 5 years of Strong Software Engineering (Python/NodeJS), system design and production service experience.
- 2 years of Experience with LLMs, prompt
2 years of Experience Practical experience implementing RAG: embeddings, vector DBs and retrieval tuning.
2 years of Experience with LangChain patterns and with toolchain telemetry (Langfuse or similar) for prompt/model traceability.
- 5 years of Experience with Kubernetes,
2 years of Experience with Practical experience with Google Cloud Platform services
- 2 years of Experience with Observability, testing, and security best practices for distributed systems.
- Familiarity with vendor and open-source
- Familiarity with CI/CD pipelines (Jenkins,