What are the responsibilities and job description for the Research Fellowship - Automated Environment Design position at Vmax?
About Vmax
Vmax is an applied research lab working at the frontier of reinforcement learning (RL). We are building new techniques for leveraging RL with Large Language Models (LLMs). Our research contributes directly to our RL platform, which automates the engineering involved in converting data and evals into RL environments.
About The Role
This position is for a 6-month research project with the Vmax team to make progress on our environment generation techniques.
Our meta-agent designs tasks and environments for other agents to help them learn domain specific skills. In this role you will develop approaches to optimize the construction of tasks to maximize resulting agent performance.
Responsibilities
Vmax is an applied research lab working at the frontier of reinforcement learning (RL). We are building new techniques for leveraging RL with Large Language Models (LLMs). Our research contributes directly to our RL platform, which automates the engineering involved in converting data and evals into RL environments.
About The Role
This position is for a 6-month research project with the Vmax team to make progress on our environment generation techniques.
Our meta-agent designs tasks and environments for other agents to help them learn domain specific skills. In this role you will develop approaches to optimize the construction of tasks to maximize resulting agent performance.
Responsibilities
- develop new approaches to task construction - building on literature in open endedness and unsupervised environment design
- develop new reward functions for environment design
- benchmark the agents that learn in generated environments
- validate your research on industry specific problems
- Currently enrolled in AI PhD or equivalent experience
- track record of research excellence, as demonstrated by publications, open source work or publicly deployed AI systems
- deep understanding of RL and ML
- significant engineering experience - our research feeds directly into environments and agents that need to be deployed for customers
- expertise with Python and a ML framework (PyTorch, JAX) is required for this role as well as experience with post-training frameworks
- experience in post-training LLMs
- experience researching evolutionary algorithms
- experience researching unsupervised environment design
- this role is based in our San Francisco office; for exceptional candidates we are willing to consider a hybrid arrangement