Demo

Python Insfrastructure Engineer - Model Evaluation

Alignerr
York, NY Contractor
POSTED ON 4/20/2026
AVAILABLE BEFORE 5/19/2026
Python Infrastructure Engineer — Model Evaluation (AI Training)

About The Role

What if your Python expertise could directly shape how the world's most advanced AI models are built, evaluated, and improved? We're looking for a Senior Python Infrastructure Engineer to design and build the data pipelines, evaluation harnesses, and annotation tooling that power next-generation AI systems at leading research labs.

This is a fully remote contract role with serious technical depth — the kind of work that ships to production and influences model quality at scale.

  • Organization: Alignerr
  • Type: Hourly Contract
  • Location: Remote
  • Commitment: 20–40 hours/week

What You'll Do

  • Design, build, and optimize high-performance Python systems supporting AI data pipelines and model evaluation workflows
  • Develop full-stack tooling and backend services for large-scale data annotation, validation, and quality control
  • Build and maintain evaluation harnesses that integrate with inference frameworks and benchmarking pipelines
  • Improve reliability, performance, and safety across existing Python codebases
  • Instrument systems with observability tooling and metrics collection to monitor model performance and system health
  • Identify bottlenecks and edge cases in data and system behavior, and implement scalable, maintainable fixes
  • Collaborate with data, research, and engineering teams through synchronous design reviews and async communication

Who You Are

  • Native or fluent English speaker with clear written and verbal communication skills
  • 3–5 years of professional experience writing production-grade Python
  • Full-stack developer with a strong systems programming background
  • Experienced building evaluation harnesses for ML models and integrating with inference frameworks
  • Strong grasp of observability, metrics collection, and system reliability practices
  • Able to commit 20–40 hours per week with consistent availability

Nice to Have

  • Prior experience with data annotation pipelines, data quality systems, or model evaluation infrastructure
  • Familiarity with AI/ML workflows, model training, or benchmarking frameworks
  • Experience with distributed systems or internal developer tooling
  • Background working directly with AI labs or ML research teams

Why Join Us

  • Work on real production systems at the frontier of AI development alongside leading research labs
  • Fully remote and flexible — work from wherever you do your best work
  • Freelance autonomy with the structure of high-impact, technically challenging projects
  • Make a direct, measurable contribution to how next-generation AI models are evaluated and improved
  • Potential for ongoing work and contract extension as new projects launch

Salary : $50 - $75

If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Alignerr

  • Alignerr Charlotte, AR
  • About The Job At Alignerr, we partner with the world’s leading AI research teams and labs to build and train cutting-edge AI models. As a Population Health... more
  • 13 Days Ago

  • Alignerr Denver, CO
  • About The Role We’re looking for entrepreneurship instructors to help evaluate AI systems trained on startups, innovation, and business planning. Organizat... more
  • 13 Days Ago

  • Alignerr Seattle, WA
  • Role Overview The Principal Cloud Security Architect evaluates cloud architectures, identity models, permissions, and security controls across large-scale ... more
  • 13 Days Ago

  • Alignerr Seattle, WA
  • Location: Remote About The Job At Alignerr, we partner with the world’s leading AI research teams and labs to build and train cutting-edge AI models. Remot... more
  • 13 Days Ago


Not the job you're looking for? Here are some other Python Insfrastructure Engineer - Model Evaluation jobs in the York, NY area that may be a better fit.

  • Cohere York, NY
  • Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are bui... more
  • 4 Days Ago

  • Cohere York, NY
  • Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are bui... more
  • 8 Days Ago

AI Assistant is available now!

Feel free to start your new journey!