Demo

Gen AI Architect

Intake IT Solutions
Santa Clara, CA Full Time
POSTED ON 10/7/2025
AVAILABLE BEFORE 12/6/2025

Job Details

Job Title: Gen AI Architect
Location: Santa Clara, CA
Duration: Contract
Educational Qualification*
Bachelor's or Master's degree in Computer Science, Data Science, or a related field.
Experience Range
Total IT 15 & 10-12 years of experience in AI/ML-related roles, with a strong focus on LLM's & Agentic AI technology.
Primary (Must have skills)* - To be Screened by TA Team
Generative AI Solution Architecture (2 3 years): Proven experience in designing and architecting GenAI applications, including Retrieval-Augmented Generation (RAG), LLM orchestration (LangChain, LangGraph), and advanced prompt design strategies.
Backend & Integration Expertise (5 years): Strong background in architecting Python-based microservices, APIs, and orchestration layers that enable tool invocation, context management, and task decomposition across cloud-native environments (Azure Functions, Google Cloud Platform Cloud Functions, Kubernetes).
Enterprise LLM Architecture (2 3 years): Hands-on experience in architecting end-to-end LLM solutions using Azure OpenAI, Azure AI Studio, Hugging Face models, and Google Cloud Platform Vertex AI, ensuring scalability, security, and performance.
RAG & Data Pipeline Design (2 3 years): Expertise in designing and optimizing RAG pipelines, including enterprise data ingestion, embedding generation, and vector search using Azure Cognitive Search, Pinecone, Weaviate, FAISS, or Google Cloud Platform Vertex AI Matching Engine.
LLM Optimization & Adaptation (2 3 years): Experience in implementing fine-tuning and parameter-efficient tuning approaches (LoRA, QLoRA, PEFT) and integrating memory modules (long-term, short-term, episodic) to enhance agent intelligence.
Multi-Agent Orchestration (2 3 years): Skilled in designing multi-agent frameworks and orchestration pipelines with LangChain, AutoGen, or DSPy, enabling goal-driven planning, task decomposition, and tool/API invocation.
Performance Engineering (2 3 years): Experience in optimizing Google Cloud Platform Vertex AI models for latency, throughput, and scalability in enterprise-grade deployments.
AI Application Integration (2 3 years): Proven ability to integrate OpenAI and third-party models into enterprise applications via APIs and custom connectors (MuleSoft, Apigee, Azure APIM).
Governance & Guardrails (1 2 years): Hands-on experience in implementing security, compliance, and governance frameworks for LLM-based applications, including content moderation, data protection, and responsible AI guardrails.
Job Description of Role* (RNR) - To be Evaluated by Technical Panel (Define it to give more clarity)
Key technical skills :
As a Technical Architect specializing in LLMs and Agentic AI, you will own the architecture, strategy, and delivery of enterprise-grade AI solutions. You will work with cross-functional teams and customers to define the AI roadmap, design scalable solutions, and ensure responsible deployment of Generative AI across the organization:
Primary Responsibilities:
Architect Scalable GenAI Solutions: Lead the design of enterprise architectures for LLM and multi-agent systems, ensuring scalability, resilience, and security across Azure and Google Cloud Platform platforms.
Technology Strategy & Guidance: Provide strategic technical leadership to customers and internal teams, aligning GenAI projects with business outcomes.
LLM & RAG Applications: Architect and guide development of LLM-powered applications, assistants, and RAG pipelines for structured and unstructured data.
Agentic AI Frameworks: Define and implement agentic AI architectures leveraging frameworks like LangGraph, AutoGen, DSPy, and cloud-native orchestration tools.
Integration & APIs: Oversee integration of OpenAI, Azure OpenAI, and Google Cloud Platform Vertex AI models into enterprise systems, including MuleSoft Apigee connectors.
LLMOps & Governance: Establish LLMOps practices (CI/CD, monitoring, optimization, cost control) and enforce responsible AI guardrails (bias detection, prompt injection protection, hallucination reduction).
Enterprise Governance: Lead architecture reviews, governance boards, and technical design authority for all LLM initiatives.
Collaboration: Partner with data scientists, engineers, and business teams to translate use cases into scalable, secure solutions.
Documentation & Standards: Define and maintain best practices, playbooks, and technical documentation for enterprise adoption.
Monitoring & Observability: Guide implementation of AgentOps dashboards for usage, adoption, ingestion health, and platform performance visibility.

Secondary Responsibilities:
Innovation & Research: Stay ahead of advancements in OpenAI, Azure AI, and Google Cloud Platform Vertex AI, evaluating new features and approaches for enterprise adoption.
Proof of Concepts: Lead or sponsor PoCs to validate feasibility, ROI, and technical fit for new AI capabilities.
Ecosystem Expertise: Remain current on Azure AI services (Cognitive Search, AI Studio, Cognitive Services) and Google Cloud Platform AI stack (Vertex AI, BigQuery, Matching Engine).
Business Alignment: Collaborate with product and business leadership to prioritize high-value AI initiatives with measurable outcomes.
Mentorship: Coach engineering teams on LLM solution design, performance tuning, and evaluation techniques.
Soft skills/other skills - To be Evaluated by Hiring Manager (To define how this will be evaluated)
Communication Skills:
Communicate effectively with internal and customer stakeholders
Communication approach: verbal, emails and instant messages
Interpersonal Skills:
Strong interpersonal skills to build and maintain productive relationships with team members & customer representatives
Provide constructive feedback during code reviews and be open to receiving feedback on your own code.
Problem-Solving and Analytical Thinking:
Capability to troubleshoot and resolve issues efficiently.
Analytical mindset
Ability to bring idea into reality throught technology implementation & adoption
Task/ Work Updates
Prior experience in working on Agile/Scrum projects with exposure to tools like Jira/Azure DevOps
Provides regular updates, proactive and due diligent to carry out responsibilities
Expected Outcome
Secondary Skills to be planned Post Hiring - Training Plan
Knowledge of MCP s and A2A SDK
Version Control: Proficiency with version control tools like Git.
Agile Methodologies: Experience working in Agile development environments.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Gen AI Architect?

Sign up to receive alerts about other jobs on the Gen AI Architect career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$119,030 - $151,900
Income Estimation: 
$149,493 - $192,976
Income Estimation: 
$154,597 - $194,610
Income Estimation: 
$172,688 - $210,712
Income Estimation: 
$170,589 - $211,671
Income Estimation: 
$178,619 - $225,190
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$161,406 - $211,884
Income Estimation: 
$188,022 - $236,092
Income Estimation: 
$205,940 - $255,928
Income Estimation: 
$199,907 - $266,531
Income Estimation: 
$195,700 - $270,403
Income Estimation: 
$103,114 - $138,258
Income Estimation: 
$118,163 - $145,996
Income Estimation: 
$120,777 - $151,022
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$86,891 - $130,303
Income Estimation: 
$129,363 - $167,316
Income Estimation: 
$145,845 - $177,256
Income Estimation: 
$147,836 - $182,130
Income Estimation: 
$154,597 - $194,610
Income Estimation: 
$86,891 - $130,303
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Intake IT Solutions

Intake IT Solutions
Hired Organization Address San Francisco, CA Full Time
Job Details Job Title: DevOps Lead Location: SFO, CA (Onsite) Duration: Long term contract Educational Qualification* Ba...

Not the job you're looking for? Here are some other Gen AI Architect jobs in the Santa Clara, CA area that may be a better fit.

Gen AI Lead / Architect

Jobs via Dice, Sunnyvale, CA

Principal Gen AI Architect

HCLTech, Santa Clara, CA

AI Assistant is available now!

Feel free to start your new journey!