What are the responsibilities and job description for the AI Platform Technical Architect position at Saxon Global?
Hello,
Greetings of the day!
We have an exciting job opportunity for the position of AI Platform Technical Architect with one of our esteemed clients.
Based on your experience, I believe this could be a great fit for you.
If you’re interested, please share your latest resume with me at Ramesh.s@saxonglobal.com
Also, if you know someone who might be a good fit and is currently exploring opportunities, referrals are highly appreciated!
Job Title: AI Platform Technical Architect
Location : San Jose, CA (Onsite)
Duration: Fulltime role/Long term contract
Job Summary:
Bachelor's or Master's degree in Computer Science, Data Science, or a related field.
10-16 years of experience in AI/ML-related roles, with a strong focus on LLM's & Agentic AI technology.
6-10 years of experience in Designing and implementing large-scale distributed systems, microservices, serverless, and event-driven architectures.
5-8 years of experience n Cloud-native architecture experience in Azure / AWS / GCP including networking, storage, compute scaling, GPU workloads, and managed AI services.
5-8 years of experience with platform components, API design, integration patterns, and high-performance compute architecture.
4-7 years of experience building or integrating AI/ML platforms, pipelines, model lifecycle components, inference gateways, and/or enterprise GenAI frameworks.
3-6 years of experience using AI platform tools such as Databricks, Vertex AI, Azure AI Studio, AWS Bedrock, LangChain, PromptFlow, Ray, Kubeflow, MLflow, Airflow, Kafka, etc.
2-5 years of experience in designing and integrating vector database solutions such as Pinecone, Weaviate, FAISS, Milvus, Qdrant, Elastic, OpenSearch, CosmosDB Vector.
2-3 years of experience in LLM architectures, embeddings, tokenization, prompt engineering, evaluation strategies, hallucination reduction, and RAG patterns.
2-3 years of experience building GenAI applications, agent workflows, or knowledge retrieval systems using frameworks like LangChain, LlamaIndex, GraphRAG, or custom implementations.
Key technical skills :
As a Technical Architect specializing in LLMs and Agentic AI, you will own the architecture, strategy, and delivery of enterprise-grade AI solutions. You will work with cross-functional teams and customers to define the AI roadmap, design scalable solutions, and ensure responsible deployment of Generative AI across the organization:
Primary Responsibilities:
• Architect scalable and secure AI/ML/LLM platform solutions including data, model, and inference pipelines.
• Establish enterprise reference architectures, reusable components, best practices, and governance standards for AI adoption.
• Integrate cloud-native, open-source, and enterprise tools such as vector databases, feature stores, registries, and orchestration frameworks.
• Implement automated MLOps/LLMOps workflows covering deployment, monitoring, observability, compliance, and performance optimization.
• Collaborate with cross-functional teams (engineering, data science, security, and product) to align platform capabilities with business goals and drive adoption.
Secondary Responsibilities:
• Support GenAI and AI application teams by providing platform enablement, solution advisory, and architecture reviews.
• Conduct technology research, PoCs, benchmarking, and evaluate emerging AI tools, frameworks, and deployment patterns.
• Drive knowledge sharing through documentation, workshops, training sessions, and internal community building initiatives.
• Provide guidance on cost estimation, usage monitoring, finops optimization, and capacity planning.
• Partner with security, compliance, and cloud teams to ensure alignment with regulatory, data privacy, and policy frameworks.
Communication Skills:
Communicate effectively with internal and customer stakeholders
Communication approach: verbal, emails and instant messages
Interpersonal Skills:
Strong interpersonal skills to build and maintain productive relationships with team members & customer representatives
Provide constructive feedback during code reviews and be open to receiving feedback on your own code.
Problem-Solving and Analytical Thinking:
Capability to troubleshoot and resolve issues efficiently.
Analytical mindset
Ability to bring idea into reality through technology implementation & adoption
Task/ Work Updates
Prior experience in working on Agile/Scrum projects with exposure to tools like Jira/Azure DevOps
Provides regular updates, proactive and due diligent to carry out responsibilities
• Automation for data pipelines, feature engineering, model training, validation, packaging, deployment, versioning, and rollback.
• Implementing model observability, drift monitoring, logging, tracing, metrics, experiment tracking, and governance.
• Familiarity with end-to-end evaluation workflows for LLMs including latency, throughput, cost optimization, caching, and fallback strategies.
• Experience with containerization, Kubernetes, Istio/Linked, service mesh patterns.
• Familiarity with feature stores, knowledge graphs, ontology and metadata platform
• AI benchmarking, evaluation frameworks (RAGAS, Promptfoo, Langsmith, TruLens).
• Experience working in Agile, product-based delivery.