What are the responsibilities and job description for the Research Scientist, RL Training position at ChatGPT Jobs?

Job Description

Research Scientist - Reinforcement Learning for LLMs

Location: Redwood City, CA

Work Model: On-site, Remote

About The Role

Snorkel is seeking a Research Scientist to focus on reinforcement learning for training and aligning large language models (LLMs). This foundational research role aims to solve the open data problem of generating reliable training data, reward signals, and procedures to steer LLM behavior.

Main Responsibilities

Research and implement reinforcement learning techniques (GRPO, RLHF, RLAIF, DPO, reward modeling) to create data products for LLM training and fine-tuning.
Design and build data pipelines for generating high-quality training signals and improving model generalization.
Prototype and iterate on end-to-end RL training recipes.
Collaborate with research, engineering, and delivery teams to translate RL research into customer-ready data products.
Stay current with advancements in LLM training, alignment research, and scalable RL methods.
Contribute to research publications and internal knowledge base.

Preferred Qualifications

Deep expertise in reinforcement learning from human or AI feedback, reward modeling, and credit attribution.
Experience training or fine-tuning 30B LLMs at scale, including distributed training infrastructure.
Strong proficiency in Python and ML frameworks (PyTorch, HuggingFace) and RL frameworks (Verl, SkyRL).
Solid software engineering fundamentals for building reproducible research prototypes.
Familiarity with ML infrastructure and cloud platforms (AWS, GCP, Kubernetes, Slurm); experience with large-scale RL training pipelines is a plus.
Comfort with high-iteration, open-ended research environments and shifting constraints.
Ph.D. in machine learning, reinforcement learning, or related field, or equivalent industry experience.

Salary Range

$200,000 - $275,000 USD

Salary : $200,000 - $275,000

Apply for this job

Receive alerts for other Research Scientist, RL Training job openings

Research Scientist, RL Training

What are the responsibilities and job description for the Research Scientist, RL Training position at ChatGPT Jobs?

What is the career path for a Research Scientist, RL Training?

Job openings at ChatGPT Jobs

Not the job you're looking for? Here are some other Research Scientist, RL Training jobs in the Redwood, CA area that may be a better fit.

We don't have any other Research Scientist, RL Training jobs in the Redwood, CA area right now.

AI Assistant is available now!