What are the responsibilities and job description for the Sr Python Developer || Plano, TX (Hybrid) position at Verito Solutions?
Sr Python Developer
Plano, TX (Hybrid)
Phone Video
Job description:
Plano, TX (Hybrid)
Phone Video
Job description:
Technical Stack:
Python 3.13 , Crawl4AI (Playwright-based async deep crawling), OpenAI GPT-4o (structured data extraction), Kreuzberg (PDF/DOCX/XLSX extraction), Pydantic v2, PostgreSQL, AWS S3, Prefect 3 (workflow orchestration), Docker, pytest snapshot testing, Ruff, structlog.
What You'll Work On
**New scrapers** — Build modules that discover and crawl grant listings from state government websites, handling SPAs, paginated results, and document downloads
**AI extraction** — Design LLM prompts for structured grant field extraction and document classification; work with cost controls like content fingerprinting and relevance filtering
**Document processing** — Extract content from NOFO attachments (PDFs, DOCX, XLSX) and integrate with the existing multi-document assembly pipeline
**Change detection & sync** — Implement fingerprint-based change detection and status derivation; sync only changed grants downstream
**Testing** — Write unit tests, record HAR fixtures for deterministic crawl replay, and maintain QC regression tests against golden snapshots
**AI extraction** — Design LLM prompts for structured grant field extraction and document classification; work with cost controls like content fingerprinting and relevance filtering
**Document processing** — Extract content from NOFO attachments (PDFs, DOCX, XLSX) and integrate with the existing multi-document assembly pipeline
**Change detection & sync** — Implement fingerprint-based change detection and status derivation; sync only changed grants downstream
**Testing** — Write unit tests, record HAR fixtures for deterministic crawl replay, and maintain QC regression tests against golden snapshots
Required Skills:
Strong Python (async/await, type hints, Pydantic)
Web scraping experience — ideally with Playwright or similar browser automation
Familiarity with LLM APIs and prompt engineering for data extraction
Comfort with immutable data models and clean architecture patterns
Experience with PostgreSQL and Docker
Bonus: Prefect or workflow orchestration tools
Bonus: document extraction (PDF/DOCX parsing)
Web scraping experience — ideally with Playwright or similar browser automation
Familiarity with LLM APIs and prompt engineering for data extraction
Comfort with immutable data models and clean architecture patterns
Experience with PostgreSQL and Docker
Bonus: Prefect or workflow orchestration tools
Bonus: document extraction (PDF/DOCX parsing)
(“Believe you can and you’re halfway there.”)
– Theodore Roosevelt
Yogesh Sharma | Lead Tech Recruiter
An -E Verified Company
E:
P: 1 |