What are the responsibilities and job description for the Vector Databases & RAG Consultant position at Jobs via Dice?
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Marlabs LLC, is seeking the following. Apply via Dice today!
Position Type: Contract
Location: Philadelphia | Work Mode: Hybrid, minimum 3 days in the office
Core Experience
Consultant Requirements – On-Prem LLM & Vector DB Implementation
Position Type: Contract
Location: Philadelphia | Work Mode: Hybrid, minimum 3 days in the office
Core Experience
Consultant Requirements – On-Prem LLM & Vector DB Implementation
- Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
- Strong proficiency in Python for LLM inference, prompt engineering, and integration
- Experience with CPU-based inference, model quantization, and performance tuning
- Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
- Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
- Experience generating and managing embeddings and metadata filtering
- Understanding of data privacy, air-gapped deployments, and enterprise security requirements
- Experience implementing access controls and audit logging
- Experience with LangChain or LlamaIndex
- Exposure to Rust, Go, or C for high-performance services
- Familiarity with Docker and Kubernetes for on-prem deployments
- Knowledge of inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)
- Prior work in regulated or enterprise environments
- Reference architecture and deployment guidance
- Working prototype (LLM vector DB RAG)
- Documentation and knowledge transfer to internal teams