
Research Engineer - Environments, Data and Post-Training

Mercor
San Francisco, CA Full Time
POSTED ON 5/16/2026
AVAILABLE BEFORE 6/22/2026
About Mercor

Mercor is defining the future of work. We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development. Our vast talent network trains frontier AI models in the same way teachers teach students: by sharing knowledge, experience, and context that can't be captured in code alone. Today, more than 30,000 experts in our network collectively earn over $2 million a day.

Mercor is creating a new category of work where expertise powers AI advancement. Achieving this requires an ambitious, fast-paced and deeply committed team. You’ll work alongside researchers, operators, and AI companies at the forefront of shaping the systems that are redefining society. Mercor is a profitable Series C company valued at $10 billion. We work in-person five days a week in our San Francisco, NYC, or London offices.

About The Role

As a Research Engineer at Mercor, you’ll work at the intersection of engineering and applied AI research. You’ll contribute directly to post-training and RLVR (reinforcement learning with verifiable rewards), synthetic data generation, and large-scale evaluation workflows that meaningfully impact frontier language models.

Your work will be used to train large language models to master tool use, agentic behavior, and real-world reasoning in production environments. You’ll shape rewards, run post-training experiments, and build scalable systems that improve model performance. You’ll help design and evaluate datasets, create data augmentation pipelines, and build rubrics and evaluators that push the boundaries of what LLMs can learn.

What You’ll Do

  • Work on post-training and RLVR pipelines to understand how datasets, rewards, and training strategies impact model performance.
  • Design and run reward-shaping experiments and algorithmic improvements (e.g., GRPO, DAPO) to improve LLM tool-use, agentic behavior, and real-world reasoning.
  • Quantify data usability, quality, and performance uplift on key benchmarks.
  • Build and maintain data generation and augmentation pipelines that scale with training needs.
  • Create and refine rubrics, evaluators, and scoring frameworks that guide training and evaluation decisions.
  • Build and operate LLM evaluation systems, benchmarks, and metrics at scale.
  • Collaborate closely with AI researchers, applied AI teams, and experts producing training data.
  • Operate in a fast-paced, experimental research environment with rapid iteration cycles and high ownership.

What We’re Looking For

  • Strong applied research background, with a focus on post-training and/or model evaluation.
  • Strong coding proficiency and hands-on experience working with machine learning models.
  • Strong understanding of data structures, algorithms, backend systems, and core engineering fundamentals.
  • Familiarity with APIs, SQL/NoSQL databases, and cloud platforms.
  • Ability to reason deeply about model behavior, experimental results, and data quality.
  • Excitement to work in person in San Francisco, five days a week (with optional remote Saturdays), and thrive in a high-intensity, high-ownership environment.

Nice To Have

  • Real-world post-training team experience in industry (highest priority).
  • Publications at top-tier conferences (NeurIPS, ICML, ACL).
  • Experience training models or evaluating model performance.
  • Experience in synthetic data generation, LLM evaluations, or RL-style workflows.
  • Work samples, artifacts, or code repositories demonstrating relevant skills.

Benefits

  • Generous equity grant vesting over 4 years
  • A $10K housing bonus (if you live within 0.5 miles of our office)
  • A $1.5K monthly stipend for meals
  • Free Equinox membership
  • Health insurance

Compensation Range: $130K - $500K






