Demo

AI/Ml Architect

Relanto
Fremont, CA Full Time
POSTED ON 3/21/2026
AVAILABLE BEFORE 7/9/2026

We are seeking an experienced and visionary AI/ML Architect to lead the end-to-end design, development, deployment, and operationalization of advanced AI/ML and Generative AI (GenAI) solutions on cloud platforms. The ideal candidate will possess deep technical expertise in ML architecture, GenAI frameworks, Retrieval-Augmented Generation (RAG) pipelines, cloud-native deployment, and MLOps practices. You will work closely with cross-functional teams, clients, and engineering teams to define scalable AI strategies and deliver cutting-edge solutions across various domains.


Key Responsibilities:

Customer Engagement & Solution Architecture

  • Interact with clients and stakeholders to gather business and technical requirements and translate them into scalable AI/ML solutions.
  • Architect and design AI/ML systems across AWS, GCP, or Azure with a strong focus on cloud-native and cost-optimized architecture.
  • Create detailed system design documents, architecture diagrams, and technical roadmaps.
  • Define data architecture, storage, and retrieval strategies tailored to AI/ML workflows.


GenAI & RAG Architecture

  • Lead the design and implementation of Generative AI solutions using LLMs, LangChain, LlamaIndex, Prompt Engineering, and vector databases such as Pinecone, FAISS, Weaviate, or Elasticsearch.
  • Architect RAG (Retrieval-Augmented Generation) pipelines for enterprise use cases including knowledge management, chatbot development, and document summarization.
  • Implement prompt orchestration, retrieval optimization, and grounding techniques to enhance LLM output accuracy and relevance.
  • AI/ML Model Development & MLOps
  • Guide the development of Python-based APIs, data preprocessing workflows, and model training pipelines.
  • Design and implement robust CI/CD pipelines for ML model deployment using tools like SageMaker, Vertex AI, or Azure ML.
  • Define and implement model monitoring, retraining, and performance management strategies for production-grade ML systems.
  • Ensure best practices in versioning, reproducibility, model lineage, and auditability (MLOps/LLMOps).


Technical Leadership & Governance

  • Review and approve system designs, PoCs, and implementation approaches.
  • Provide hands-on leadership and mentorship to data scientists, ML engineers, and software developers.
  • Lead architectural decision-making, code quality reviews, and sprint grooming sessions.
  • Champion best practices in security, compliance, scalability, and performance optimization for AI/ML solutions.
  • Project Management & Collaboration
  • Own end-to-end technical delivery of AI/ML and GenAI projects across multiple domains (e.g., BFSI, Retail, Healthcare, Manufacturing).
  • Coordinate with product owners, business analysts, data engineers, and DevOps teams to ensure seamless delivery.
  • Manage stakeholder expectations, project timelines, and resource allocation efficiently.


Required Qualifications

  • 7 years of overall IT experience, with minimum of 5 years in designing, developing, deploying, and operationalizing AI/ML solutions.
  • Minimum 2–3 years of experience in architecting end-to-end AI/ML solutions, including design, implementation, and production deployment.
  • Proven experience in GenAI, LLMs, RAG architecture, prompt engineering, and orchestration tools like LangChain, LlamaIndex, etc.
  • Hands-on with vector databases (e.g., Pinecone, FAISS, Elasticsearch) and unstructured data retrieval.
  • Deep knowledge of Machine Learning and Deep Learning algorithms: CNNs, RNNs, LSTMs, Transformers, etc.
  • Experience in Natural Language Processing (NLP), including language modeling, summarization, classification, and NER.
  • Strong expertise in Python, with frameworks like PyTorch, TensorFlow, HuggingFace, NumPy, and Pandas.
  • Demonstrated experience in designing cloud-native AI/ML solutions on AWS, GCP, or Azure.
  • Skilled in deploying models via services like SageMaker, Vertex AI, Azure ML, or using containers and Kubernetes.
  • Solid understanding of MLOps/LLMOps lifecycle: pipeline automation, model registry, monitoring, CI/CD.
  • Excellent communication, leadership, and stakeholder management skills.


Preferred Qualifications

  • Certification in AWS/GCP or ML specializations.
  • Experience in leading large-scale AI transformation programs.

Salary.com Estimation for AI/Ml Architect in Fremont, CA
$195,708 to $247,352
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Not the job you're looking for? Here are some other AI/Ml Architect jobs in the Fremont, CA area that may be a better fit.

  • Jobs via Dice Fremont, CA
  • Dice is the leading career destination for tech experts at every stage of their careers. Our client, Relanto, Inc., is seeking the following. Apply via Dic... more
  • 15 Days Ago

  • DeWinter Group Campbell, CA
  • Title: Solutions Architect (AI/ML) Job Type: Contract Contract Length: 12 Months Pay Range: $50/hr – $175/hr Start Date: ASAP Location: Remote About The Op... more
  • 19 Days Ago

AI Assistant is available now!

Feel free to start your new journey!