mid-level Generative AI / LLM-focused Software Developer (AI Engineer)

Jobs via Dice
Philadelphia, PA Full Time
POSTED ON 4/14/2026
AVAILABLE BEFORE 5/8/2026
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Cogent IBS, Inc, is seeking the following. Apply via Dice today!

Dear Partner,

Good morning,

Greetings from Nukasani Group Inc! We have an urgent, long-term contract project immediately available for a mid-level Generative AI / LLM-focused Software Developer (AI Engineer) in Philadelphia, PA (hybrid), and we need submissions. Please review the role below. If you are available, please send me your updated resume in Word format, along with the candidate submission details listed below, as soon as possible. If you are not available, any referrals would be greatly appreciated.

Interviews are in progress, so a prompt response is appreciated. I look forward to your reply and to working with you.

**Candidate Submission Format - needed from you**

Full Legal Name

Personal Cell No. (not a Google phone number)

Email Id

Skype Id

Interview Availability

Availability to start, if selected

Current Location

Open to Relocate

Work Authorization

Total Relevant Experience

Education / Year of Graduation

University Name, Location

Last 5 digits of SSN

Country of Birth

Contractor Type

DOB (mm/dd)

Home Zip Code

LinkedIn ID

**Assigned Job Details**

Job Title: Mid-level Generative AI / LLM-focused Software Developer (AI Engineer)

Location: Philadelphia, PA (Hybrid)

Rate: Best competitive rate

**Position Overview**

We are seeking a mid-level Software Developer/Engineer with strong expertise in Generative AI systems, particularly in deploying Large Language Models (LLMs) within secure, enterprise environments.

The ideal candidate will have hands-on experience with on-premise LLM deployments, Retrieval-Augmented Generation (RAG) pipelines, and vector database integration, along with a solid foundation in Python-based backend development.

This role involves working on cutting-edge AI solutions while ensuring performance, scalability, and data security in enterprise-grade systems.

**Key Responsibilities**

Deploy and manage open-source LLMs (e.g., Llama 3, Mistral, Mixtral) in on-premise or private cloud environments

Design, build, and optimize LLM inference pipelines using Python

Develop and implement Retrieval-Augmented Generation (RAG) workflows

Design and integrate vector databases for semantic search and retrieval

Optimize model performance through quantization and CPU-based inference tuning

Ensure data privacy, governance, and security compliance in enterprise environments

Implement access controls, logging, and monitoring for AI systems

Create reference architectures, prototypes, and technical documentation

Collaborate with cross-functional teams to support deployment, adoption, and knowledge transfer

**Required Qualifications**

5–9 years of experience in software development or engineering

Strong proficiency in Python for backend and AI/ML development

Hands-on experience deploying open-source LLMs (e.g., Llama 3, Mistral, Mixtral)

Experience building and optimizing RAG pipelines

Practical knowledge of vector databases (e.g., Qdrant, Chroma, Milvus, pgvector)

Understanding of embeddings, similarity search, and metadata filtering

Experience with CPU-based inference optimization techniques

Familiarity with enterprise security practices, including data privacy and air-gapped environments

**Preferred Qualifications**

Experience with LangChain or LlamaIndex

Familiarity with Docker and Kubernetes

Exposure to Rust, Go, or C for high-performance systems

Experience with LLM inference frameworks (e.g., vLLM, llama.cpp, Hugging Face Transformers)

Prior experience working in regulated or enterprise environments

**Deliverables**

End-to-end reference architecture for LLM and vector database solutions

Fully functional prototype (LLM + RAG + vector database)

Comprehensive technical documentation and knowledge transfer

Best,

Bhavani

Recruiter | IT & Digital Marketing

P:

540 W Galena Blvd, Suite 200

Aurora, IL 60506

Salary.com Estimation for mid-level Generative AI / LLM-focused Software Developer (AI Engineer) in Philadelphia, PA
$136,194 to $174,873