Demo

ML Engineer (Evaluation and Experimentation)

Cynnovative
Arlington, VA Full Time
POSTED ON 5/2/2026
AVAILABLE BEFORE 10/28/2026

Company Overview

At Cynnovative, we leverage machine learning, computer science, and software

engineering to address high-impact problems in the cyber domain, specifically

those which are critical to U.S. national security. We primarily extend

fundamental research to invent, design, develop, and deploy prototype solutions

that support persistent problems in this domain.


Job Overview

As a Machine Learning Engineer (Evaluation & Experimentation) at Cynnovative, you will build and maintain systems that run large-scale experiments and evaluate LLM outputs. This role is crucial to rapid, experiment-driven iteration on LLM systems in support of U.S. national security efforts.


NOTE: This role requires an active TS/SCI security clearance and is located on-site in Northern Virginia.


Responsibilities \ May Include


Design and implement evaluation pipelines for LLM experimentation

  • Implement and apply metrics over model outputs at scale
  • Build automated evaluation workflows across large experiment sets
  • Execute statistical analysis and testing over experimental results
  • Ensure consistency and comparability of results across runs, configurations, and datasets

Develop experiment tracking and logging specifications

  • Define schemas for capturing prompts, perturbations, outputs, and configurations
  • Specify and validate logging of token-level probabilities, scores, and derived metrics
  • Ensure experiment data is structured, complete, and queryable for downstream analysis

Build and maintain datasets and evaluation inputs

  • Curate prompt sets, perturbation strategies, and test cases provided by the research team
  • Maintain versioned datasets and experiment inputs
  • Enable rapid iteration on experiment configurations and evaluation coverage

Collaborate cross-functionally

  • Work closely with ML systems engineers to ensure correct data capture at scale
  • Provide feedback on experiment execution, data quality, and metric behavior
  • Support interpretation of experimental results through reliable measurement


Requirements \ Must Have


  • B.S. in Computer Science, Data Science, or related field (M.S. or Ph.D. preferred)
  • Strong communication skills and ability to collaborate cross-functionally
  • Proficiency in Python and data processing
  • Experience building experiment, evaluation, or analytics pipelines
  • Familiarity with experiment tracking tools (MLflow or similar)
  • Experience working with large-scale or batch data processing workflows
  • Understanding of statistical methods
  • Experience working with structured and semi-structured data
  • Experience with version control systems, particularly Git
  • U.S. Citizenship and active TS/SCI security clearance


Desired Skills \ Nice To Have


  • Familiarity with prompt sensitivity, perturbation analysis, or robustness testing
  • Prior experience in a research-to-product environment
  • Understanding of A/B testing and large-scale experimentation
  • Familiarity with cyber-related data, tools, and techniques


Salary.com Estimation for ML Engineer (Evaluation and Experimentation) in Arlington, VA
$115,435 to $148,652
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a ML Engineer (Evaluation and Experimentation)?

Sign up to receive alerts about other jobs on the ML Engineer (Evaluation and Experimentation) career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$77,900 - $95,589
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$101,387 - $124,118
Income Estimation: 
$119,030 - $151,900
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Not the job you're looking for? Here are some other ML Engineer (Evaluation and Experimentation) jobs in the Arlington, VA area that may be a better fit.

  • ARSquare Tech LLC Washington, DC
  • Master''''''''s Degree or Ph.D. in STEM with 2 years preferred OR Bachelor-s Degree in STEM with 7 year. * At least 1 years focused on applications Generat... more
  • Just Posted

  • Leidos Alexandria, VA
  • General program information and/or position overview. This Department of War enterprise data and analytics program delivers mission-critical capabilities t... more
  • Just Posted

AI Assistant is available now!

Feel free to start your new journey!