What are the responsibilities and job description for the Sr Python Developer || Plano, TX (Hybrid) position at Jobs via Dice?
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Verito Solutions, is seeking the following. Apply via Dice today!
Sr Python Developer
Plano, TX (Hybrid)
Interview mode: Phone / Video
Job description:
Technical Stack:
Python 3.13, Crawl4AI (Playwright-based async deep crawling), OpenAI GPT-4o (structured data extraction), Kreuzberg (PDF/DOCX/XLSX extraction), Pydantic v2, PostgreSQL, AWS S3, Prefect 3 (workflow orchestration), Docker, pytest snapshot testing, Ruff, structlog.
What You'll Work On
- New scrapers: Build modules that discover and crawl grant listings from state government websites, handling SPAs, paginated results, and document downloads
- AI extraction: Design LLM prompts for structured grant field extraction and document classification; work with cost controls like content fingerprinting and relevance filtering
- Document processing: Extract content from NOFO attachments (PDFs, DOCX, XLSX) and integrate with the existing multi-document assembly pipeline
- Change detection & sync: Implement fingerprint-based change detection and status derivation; sync only changed grants downstream
- Testing: Write unit tests, record HAR fixtures for deterministic crawl replay, and maintain QC regression tests against golden snapshots
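For candidates unfamiliar with the term, fingerprint-based change detection can be sketched roughly as follows. This is an illustrative example only, not the client's actual pipeline: the function names `fingerprint` and `changed_ids`, the record fields, and the choice of SHA-256 over canonical JSON are all assumptions made for the sketch.

```python
import hashlib
import json


def fingerprint(record: dict) -> str:
    """Return a stable SHA-256 hex digest of a grant record (hypothetical helper).

    Keys are sorted so that field order does not affect the digest.
    """
    canonical = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def changed_ids(previous: dict[str, str], current: dict[str, dict]) -> list[str]:
    """Return IDs of grants that are new or whose fingerprint differs.

    `previous` maps grant ID -> stored fingerprint from the last run;
    `current` maps grant ID -> freshly scraped record.
    """
    return sorted(
        grant_id
        for grant_id, record in current.items()
        if previous.get(grant_id) != fingerprint(record)
    )
```

Only the IDs returned by `changed_ids` would need to be re-extracted or synced downstream, which is also how fingerprinting can serve as a cost control for LLM calls.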
Requirements:
Strong Python (async/await, type hints, Pydantic)
Web scraping experience — ideally with Playwright or similar browser automation
Familiarity with LLM APIs and prompt engineering for data extraction
Comfort with immutable data models and clean architecture patterns
Experience with PostgreSQL and Docker
Bonus: Prefect or workflow orchestration tools
Bonus: document extraction (PDF/DOCX parsing)
(“Believe you can and you’re halfway there.”)
– Theodore Roosevelt
Yogesh Sharma | Lead Tech Recruiter
An E-Verified Company