Skip to content

Pull requests: allenai/open-instruct

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix GRPO OLMo-core bookkeeping PG deadlock + Qwen3 parity tweaks
#1708 opened May 28, 2026 by finbarrtimbers Collaborator Loading…
2 tasks done
Wire max_checkpoints through SFT, DPO, and GRPO paths
#1701 opened May 27, 2026 by TimDettmers Loading…
4 tasks
Add olmo-eval Beaker launch integration for GRPO
#1698 opened May 22, 2026 by mnoukhov Contributor Draft
2 of 3 tasks
Add Trackio rollout trace logging
#1697 opened May 21, 2026 by abidlabs Loading…
Add grpo difficulty curriculum
#1694 opened May 12, 2026 by undfined Loading…
Add difficulty map builder
#1693 opened May 12, 2026 by undfined Loading…
Add difficulty curriculum sampler
#1692 opened May 12, 2026 by undfined Loading…
Replace submit_eval_jobs.py with thin wrapper around submit_eval_jobs.sh
#1658 opened May 6, 2026 by finbarrtimbers Collaborator Loading…
2 tasks
Add time/per_group_wall_time metric
#1656 opened May 5, 2026 by finbarrtimbers Collaborator Loading…
2 tasks
Make checkpointing better
#1647 opened Apr 29, 2026 by finbarrtimbers Collaborator Draft
3 tasks
Fix submit_eval_jobs.py for olmo-eval-internal runs
#1644 opened Apr 28, 2026 by finbarrtimbers Collaborator Loading…
4 tasks done
Add Delightful Policy Gradient loss and Kondo Gate to GRPO
#1628 opened Apr 20, 2026 by finbarrtimbers Collaborator Loading…
3 tasks done
Warn about checkpoint disk space only on the first checkpoint
#1608 opened Apr 13, 2026 by mnoukhov Contributor Loading…
Fix: deterministic downsampling
#1603 opened Apr 11, 2026 by mnoukhov Contributor Loading…
WIP
#1555 opened Mar 24, 2026 by mnoukhov Contributor Draft
Priority local eval queue for grpo_fast
#1553 opened Mar 23, 2026 by mnoukhov Contributor Loading…
2GPU Olmo-core GRPO
#1551 opened Mar 23, 2026 by mnoukhov Contributor Draft
DELTA benchmark
#1541 opened Mar 19, 2026 by mnoukhov Contributor Draft
ProTip! Updated in the last three days: updated:>2026-05-26.