What are the responsibilities and job description for the AI/ML Software Engineer position at Datasoft Technologies, Inc.?
AI/ML Software Engineer
Remote Annapolis, MD
About the Job
The client is seeking proposals from prospective Offerors to provide one (1) AI/ML Software Engineer. The AI/ML Software Engineer will build software tools that incorporate AI/ML techniques to automate narrowly defined tasks with high accuracy, assist internal users with their job functions, and improve the experience external users have when interacting with the Client. This includes, but is not limited to, RPA work, building or refining chatbots, incorporating AI/ML into reporting tools, building llm agents for knowledge retrieval, deep research, translation, transcription, redaction, document analysis, document generation, agentic coding, and data processing.
Responsibilities:
Proposed resource(s) shall be responsible for the following:
1. System Design & Collaboration:
RESOURCE QUALIFICATIONS
DataSoft Technologies is a highly recognized provider of professional IT Consulting services in the US. Founded in 1994, DataSoft Technologies, Inc. provides staff augmentation services for Information Technology and Automotive Services. Our team member benefits include:
Remote Annapolis, MD
About the Job
- Duration: Long Term Contract Position Possibility of extension
- Location: 100% Remote: Annapolis, MD
- Pay rate: Hourly
- Job ID: K23-0094-25L-17
The client is seeking proposals from prospective Offerors to provide one (1) AI/ML Software Engineer. The AI/ML Software Engineer will build software tools that incorporate AI/ML techniques to automate narrowly defined tasks with high accuracy, assist internal users with their job functions, and improve the experience external users have when interacting with the Client. This includes, but is not limited to, RPA work, building or refining chatbots, incorporating AI/ML into reporting tools, building llm agents for knowledge retrieval, deep research, translation, transcription, redaction, document analysis, document generation, agentic coding, and data processing.
Responsibilities:
Proposed resource(s) shall be responsible for the following:
1. System Design & Collaboration:
- a. Work within established constraints regarding infrastructure, programming languages, and model selection
- b. Contribute to technical decision-making related to data processing, retrieval strategies, and system integration
- c. Collaborate with team members to define agent architectures, workflows, and system design decisions
- d. Evaluate and select appropriate approaches for given tasks, including determining when to use LLM-based versus non-LLM techniques
- e. Designing and building software systems that integrate AI/ML techniques to automate tasks, assist internal users, and improve user-facing services.
- a. Assist in the design and implementation of testing and evaluation pipelines for AI/ML systems
- b. Develop unit and integration tests for AI-enabled workflows and data pipelines
- c. Generate and utilize synthetic data to support evaluation and benchmarking efforts
- d. Contribute to improving system performance, including accuracy, latency, and cost efficiency
- a. Support deployment of AI/ML applications within a hybrid cloud environment
- b. Work with containerized applications to ensure reliable deployment and updates.
- c. Optimize systems for environments with limited computational resources, including minimal GPU availability
- a. Deliver production-grade systems aligned with defined requirements, while supporting iterative improvement of evolving tools
- b. Document system designs, workflows, and technical decisions as required
- c. Stay informed on relevant advancements in AI/ML and apply them where appropriate within project constraints
- 5. In addition to the overall responsibilities described in Section III.C.1-4, the Offeror’s proposed resource(s) will complete the deliverables listed below by Purchase Order year. The estimated level of effort for each deliverable may vary based on its complexity and may be adjusted as needed, including extension beyond the applicable Purchase Order year
- a. Year 1
- (1) Internal Chatbot Refinement: UI improvements, user history & feedback – 240 hours.
- i Deliverables: Application code Docker build; user profile & history DB; test cases & privacy/compliance pipeline.
- “UI improvements, enable user history and feedback...”
- (2) External Chatbot Development: Initial conversational bot (non-analytical) – 480 hours.
- i Deliverables: Application code Docker build; conversation DB; test cases & compliance pipeline; UX/agency scoring. “Goal is conversational but not analytical. Must point users to resources...”
- (3) RPA: Local LLM analysis tools with batching – 240 hours.
- i Deliverables: Application code; integration documentation; usage & process reporting.
- (4) Knowledge Retrieval (RAG & Search): Improve vector/hybrid search & case mgmt integration – 520 hours.
- i Deliverables: Comparative RAG results; agent code/prompts; test pipeline; recommendations for knowledge store
- updates.
- (5) Translation: MD-specific terminology & guidelines – 80 hours.
- i Deliverables: Translation agent code/prompts; test cases & pipeline.
- (6) Transcription: Refine deployment based on feedback – 160 hours.
- i Deliverables: Comparative pipeline results; updated code/prompts; test cases & pipeline.
- (7) Redaction (PII & Sensitive Data): Build detection agent – 240 hours.
- i Deliverables: Application code; test cases & pipeline for PII/sensitive data identification.
- b. Year 2, c. Year 3, d. Year 4, e. Year 5 – same as year 1
RESOURCE QUALIFICATIONS
- 1. Proposed resource(s) should meet the following minimum qualifications:
- a. Bachelor of Science in Engineering, Computer Science, Data Science, or Mathematics, or a related field (as determined by the CLIENT).
- 2. The CLIENT prefers Offeror proposed resource(s) to have the following qualifications:
- a. At least three (3) years’ experience in data science, machine learning, or applied AI development.
- b. At least three (3) years’ experience in software engineering, architecture, or web development.
- 1. Offeror shall propose resource(s) possessing the following preferred skills, experience, and capabilities:
- a. Experience with:
- (1) SQL and relational database systems (e.g., PostgreSQL)
- (2) Fine-tuning small language models or embedding models
- (3) Contributing to or maintaining open-source software projects
- (4) Graph databases or graph extensions (e.g., Neo4j, Apache AGE)
- (5) Designing and implementing multi-agent or task-oriented AI systems
- (6) Embedding models, vector similarity, re-ranking, and graph retrieval techniques in RAG systems
- (7) Version control systems (e.g., Git), containerization technologies (e.g., Docker), and service-oriented architectures
- (8) Collaborating with large language models (LLMs), including both API-based integration and local deployment
- (9) Validating AI-generated outputs, mitigating hallucinations, and integrating AI tools into production service pipelines
- b. Ability to:
- (1) Understand data structures, algorithms, and clean coding principles
- (2) Select and apply appropriate techniques (LLM and non-LLM) based on task requirements
- (3) Develop and improve testing and evaluation pipelines for AI systems, including use of synthetic data
- (4) Demonstrate proficiency in Python, including the ability to develop production-grade backend services, APIs, middleware, and data pipelines.
- (5) Design and implement AI/ML systems that operate effectively on complex, inconsistent, or evolving datasets while balancing accuracy, latency, and cost (token consumption)
- (6) Collaborate with team members to define system architecture, agent workflows, and data pipelines while working in constrained environments, including limited GPU availability and predefined infrastructure
- c. Knowledge of:
- (1) Hybrid cloud environments and distributed system considerations
- (2) Threading, asynchronous processing, and queues in backend servers
- (3) React and Microsoft Teams Toolkit for developing chatbot user interfaces
- (4) Non-llm data analysis techniques for structured, semi-structured, and unstructured data
- (5) Classical natural language processing (NLP) techniques in addition to LLM-based approaches
- (6) Data science and LLM-related libraries in Rust or other performance-oriented programming languages.
DataSoft Technologies is a highly recognized provider of professional IT Consulting services in the US. Founded in 1994, DataSoft Technologies, Inc. provides staff augmentation services for Information Technology and Automotive Services. Our team member benefits include:
- Paid Holidays/Paid Time Off (PTO)
- Medical/Dental Insurance
- Vision Insurance
- Short Term/Long Term Disability
- Life Insurance
- 401 (K)