docs(rl): update tutorial with AgenticGRPOLearner for async RL training by AntonyMei · Pull Request #4181 · AI-Hypercomputer/maxtext

AntonyMei · 2026-06-16T21:22:11Z

Description

Updates the rl.md tutorial documentation to include instructions and an example command for using AgenticGRPOLearner for asynchronous RL training.

Tests

Command tested on Qwen3 4B using a v6e-8 machine.

# [Test Agentic Learner (default)]
export MODEL="qwen3-4b"
export BASE_OUTPUT_DIRECTORY="gs://yixuanm-maxtext-logs/AgenticGRPOTesting"
export RUN_NAME="agentic-0610-default"
export CHIPS_PER_VM=8
export MAXTEXT_CKPT_PATH="gs://yixuanm-maxtext-logs/AgenticGRPOTesting/qwen3-4b-ckpt/0/items/"

# GRPO command
python3 -m maxtext.trainers.post_train.rl.train_rl \
  model_name=${MODEL?} \
  load_parameters_path=${MAXTEXT_CKPT_PATH?} \
  run_name=${RUN_NAME?} \
  base_output_directory=${BASE_OUTPUT_DIRECTORY?} \
  chips_per_vm=${CHIPS_PER_VM?} \
  enable_tunix_perf_metrics=True \
  rl.use_agentic_rollout=True

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

AntonyMei requested review from A9isha, RissyRan, SurbhiJainUSC, bvandermoon, darisoy, gagika, gobbleturk, jacoguzo, jiangjy1982, richjames0, shralex and vipannalla as code owners June 16, 2026 21:22

AntonyMei force-pushed the yixuanm-doc-fix branch 2 times, most recently from f51a65d to fbd63d2 Compare June 16, 2026 21:50

docs(rl): update tutorial with AgenticGRPOLearner for async RL training

9a04d58

AntonyMei force-pushed the yixuanm-doc-fix branch from fbd63d2 to 9a04d58 Compare June 16, 2026 21:51

igorts-git approved these changes Jun 16, 2026

View reviewed changes

SurbhiJainUSC approved these changes Jun 16, 2026

View reviewed changes

github-actions Bot added the pull ready label Jun 16, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(rl): update tutorial with AgenticGRPOLearner for async RL training#4181

docs(rl): update tutorial with AgenticGRPOLearner for async RL training#4181
AntonyMei wants to merge 1 commit into
mainfrom
yixuanm-doc-fix

AntonyMei commented Jun 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AntonyMei commented Jun 16, 2026

Description

Tests

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants