What are the responsibilities and job description for the Generative AI Engineer - Houston, TX position at Jobs via Dice?
Dice is the leading career destination for tech experts at every stage of their careers. Our client, TechniPros, LLC, is seeking the following. Apply via Dice today!
Job Title: Generative AI Engineer
Location: Houston, TX
Domain: Healthcare
Duration: Long Term Contract
Looking for W2 candidates. No C2C.
Job Summary:
We are seeking a highly skilled and innovative Generative AI Engineer to lead the design and deployment of cutting-edge LLM-powered applications for enterprise use cases. This role requires an individual with a strong foundation in NLP, machine learning, and agentic AI orchestration. You'll collaborate with cross-functional teams to develop systems that automate knowledge retrieval, generate human-like responses, and drive intelligent automation workflows across various business domains.
Key Responsibilities:
- Design, build, and fine-tune LLMs and multimodal generative models using state-of-the-art architectures.
- Lead the development of generative AI applications including chatbots, content generation tools, summarizers, and automated document processors.
- Integrate external tools and APIs using agentic workflows (CrewAI, LangChain agents) to build reasoning-capable AI solutions.
- Optimize model inference using quantization, distillation, and hardware-specific acceleration (ONNX, Triton).
- Ensure secure, ethical, and compliant AI use through robust evaluation pipelines and fairness metrics.
Qualifications:
- 8 years of experience in AI/ML, with 3 years focused on generative AI or large language models.
- Solid understanding of transformer-based architectures and vector similarity search.
- Experience deploying LLMs (e.g., GPT, LLaMA, Falcon, Claude) in production settings.
- Proven record of using embedding models and retrieval-augmented generation (RAG) to improve contextual accuracy.
- Proficient in Python, with strong experience in PyTorch, Transformers, and LangChain.
- Hands-on experience with OpenAI, Azure OpenAI, Anthropic Claude, and other LLM providers.
- Familiarity with vector databases (Pinecone, FAISS), orchestration tools, and scalable inference.
- Strong communication and leadership qualities.
- Proven ability to collaborate with cross-functional engineering and product teams.
- Comfortable working in fast-paced, iterative environments.
Tanuja P
Phone:
Email: