Staff Machine Learning Platform Engineer, AI Evaluation

Apple, Inc.
Washington, WA Full Time
POSTED ON 5/22/2026
AVAILABLE BEFORE 6/22/2026
Join Apple Services Engineering to build the next generation of AI evaluation systems. We are seeking a staff machine learning platform engineer to lead the architectural design and development of the high-availability services and internal tools that power self-service evaluation at scale. You will partner with researchers to operationalize their innovations, transforming complex workflows into intuitive, developer-first platforms. We are looking for builders who thrive in the ambiguity of new initiatives and are passionate about creating scalable infrastructure.

You will join the engineering team responsible for democratizing AI evaluation across the organization. Your focus will be the developer experience: architecting and implementing the APIs, SDKs, and platform services that turn complex evaluation metrics into simple, self-service calls. You will work hand-in-hand with researchers to operationalize sophisticated measurement techniques, ensuring they scale reliably within our high-availability infrastructure. In this role, you will drive the engineering standards for a new organization, upholding the code quality, automation, and testing rigor required to support the rapid evolution of Generative AI and Agentic systems.

  • 8 years of hands-on software engineering experience, with a track record of owning the technical direction of a platform or infrastructure domain.
  • Strong proficiency in the Python ecosystem (e.g., FastAPI, Pydantic, Pandas). You write production-grade code and lead architectural discussions on day one.
  • Customer Obsession & Product Thinking: You have owned the technical roadmap for an internal platform, presented it to senior stakeholders, and shipped against it. You independently translate vague requirements from other teams into concrete engineering specifications and platform roadmaps.
  • Demonstrated experience leading technical partnerships with Data Scientists or Researchers: You have taken research code and shipped it as a production service, and built the abstractions, testing frameworks, and deployment pipelines that made the next handoff faster than the last.
  • Strong expertise in API Design & Platform Infrastructure: You have designed and owned APIs and SDKs that other developers rely on, with a focus on versioning, backward compatibility, and developer experience at scale.
  • Operational excellence background: You have architected and owned CI/CD pipelines, containerization (Docker/Kubernetes), and monitoring (Datadog/Prometheus) for production services, and have been accountable for their reliability.
  • Bachelor's in Computer Science or a related field; Master's preferred.

  • Deep familiarity with AI Evaluation Frameworks: You have built, extended, or contributed to modern evaluation tools like DeepEval, Ragas, TruLens, or LangSmith. You understand how to implement and scale model-based evaluation workflows across a large organization.
  • Evaluation Service Deployment: Own the deployment, scaling, and operational health of evaluation services in production, including high-throughput evaluation job orchestration (queueing, prioritization, concurrency, auto-scaling) and defining SLAs for evaluation pipeline latency and availability.
  • Observability & Reliability: Experience instrumenting production ML evaluation pipelines, including tracking evaluation job throughput, queue depth, judge model latency SLAs, scoring drift over time, and failure modes specific to non-deterministic LLM-based evaluation workflows.
  • Deep understanding of Generative AI & Agents: You understand the engineering challenges of relying on LLMs and Agents as software components, specifically managing token economics, handling rate limits, and evaluating non-deterministic, multi-step reasoning capabilities. You have built production systems that depend on these components and have solved these problems at scale.
  • Builder Experience: You have thrived in startup-like environments, navigating high ambiguity to deliver complex technical roadmaps from scratch.

Salary.com Estimation for Staff Machine Learning Platform Engineer, AI Evaluation in Washington, WA
$111,764 to $143,925