Demo

Measurement Scientist, AI Evaluation Platform

Apple, Inc.
Washington, WA Full Time
POSTED ON 6/29/2026
AVAILABLE BEFORE 7/29/2026
Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or App Store experience we deliver is the result of us making each other's ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It's the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you'll do more than join something - you'll add something.

Our team, part of Apple Services Engineering, is building the scientific foundation for how AI systems are evaluated across Apple. We are seeking a Measurement Scientist to ensure that our evaluation methods are not just sophisticated, but scientifically valid and trustworthy . In this role, you will apply psychometric theory , validity frameworks, and statistical rigor to establish measurement standards for AI evaluation - ensuring that when we claim an evaluator measures \"helpfulness\" or \"safety ,\" it actually does. We are looking for individuals across a range of experience levels. \nThis role uniquely bridges measurement science and cutting-edge AI evaluation. You will develop methods for validating LLM-as-judge evaluators, automated benchmarks, and human evaluations. And you will create statistical tools that help engineers trust their evaluation results. You will work on an interdisciplinary team with ML researchers to solve new problems in AI evaluation. Your work will be both published at top measurement and ML venues and productionized into the evaluation SDK used across Apple. \nThe successful candidate will have deep expertise in psychometrics and measurement theory , with the ability to apply these principles to novel AI evaluation challenges. You will work collaboratively with ML researchers, platform engineers, and evaluation practitioners to translate measurement science into practical tools that scale across the organization.

PhD in Psychometrics, Educational Measurement, Quantitative Psychology , Statistics, or equivalent research/work experience.\nDeep expertise in modeling test data (IRT or related methods) and construct validation.\nStrong statistical foundation including experimental design, power analysis, sampling theory , and uncertaintyquantification.\nTrack record of designing and validating measurement instruments as demonstrated through publications or applied work.\nProficiency in Python (preferred) or R for statistical analysis, psychometric modeling, and method implementation.\nStrong working knowledge of generative AI technology\nExcellent communication skills with the ability to explain complex measurement concepts to engineers, ML researchers, and non-technical stakeholders.

Experience applying measurement science to AI/ML evaluation, automated scoring systems, or computational assessment.\nKnowledge of modern ML evaluation challenges including LLM-as-judge, automated metrics, benchmark design, and agentic systems.\nPublications at measurement venues or top ML conferences (NeurIPS, ICML, ICLR).\nExpertise in computational social or behavioral science using generative AI\nExperience collaborating with engineers to turn research methods into production tools and scalable infrastructure.

Salary.com Estimation for Measurement Scientist, AI Evaluation Platform in Washington, WA
$107,589 to $131,684
If your compensation planning software is too rigid to deploy winning incentive strategies, it’s time to find an adaptable solution. Compensation Planning
Enhance your organization's compensation strategy with salary data sets that HR and team managers can use to pay your staff right. Surveys & Data Sets

What is the career path for a Measurement Scientist, AI Evaluation Platform?

Sign up to receive alerts about other jobs on the Measurement Scientist, AI Evaluation Platform career path by checking the boxes next to the positions that interest you.
Income Estimation: 
$116,765 - $144,626
Income Estimation: 
$142,836 - $179,016
Income Estimation: 
$96,240 - $123,168
Income Estimation: 
$120,579 - $154,482
Income Estimation: 
$115,522 - $153,258
Employees: Get a Salary Increase
View Core, Job Family, and Industry Job Skills and Competency Data for more than 15,000 Job Titles Skills Library

Job openings at Apple, Inc.

  • Apple, Inc. Washington, DC
  • Summary Apple Retail is where the best of Apple comes together. We bring our expertise to help people do what they love, delivering an only-at-Apple experi... more
  • 1 Day Ago

  • Apple, Inc. Beaverton, OR
  • Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring p... more
  • 1 Day Ago

  • Apple, Inc. Boulder, CO
  • Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or... more
  • 1 Day Ago

  • Apple, Inc. Boulder, CO
  • Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or... more
  • 1 Day Ago


Not the job you're looking for? Here are some other Measurement Scientist, AI Evaluation Platform jobs in the Washington, WA area that may be a better fit.

  • Apple, Inc. Washington, WA
  • Join Apple Services Engineering to build the next generation of AI evaluation systems. We are seeking a staff machine learning platform engineer to lead th... more
  • 1 Day Ago

  • Apple, Inc. Washington, WA
  • AI systems are only as trustworthy as the methods used to evaluate them. At Apple, where AI powers experiences for billions of people, getting evaluation r... more
  • 1 Day Ago

AI Assistant is available now!

Feel free to start your new journey!