Skip to content

Pull requests: allenai/open-instruct

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Simplify pending query map increments codex
#1207 opened Nov 18, 2025 by finbarrtimbers Loading…
Make CLAUDE instructions symlink codex
#1206 opened Nov 18, 2025 by finbarrtimbers Loading…
added staleness metrics
#1204 opened Nov 18, 2025 by mnoukhov Loading…
Pad out 32b
#1177 opened Nov 12, 2025 by hamishivi Draft
Changes to make DPO run faster
#1175 opened Nov 11, 2025 by finbarrtimbers Draft
Added NCCL flags from DPO
#1165 opened Nov 10, 2025 by finbarrtimbers Loading…
RL Zero Scripts
#1162 opened Nov 10, 2025 by mnoukhov Draft
Correct loss accumulation for grpo_fast
#1161 opened Nov 10, 2025 by hamishivi Loading…
set to augusta when specified
#1138 opened Nov 3, 2025 by saumyamalik Loading…
Rename grpo_fast.py to grpo.py.
#1133 opened Nov 3, 2025 by finbarrtimbers Loading…
Removes dead code
#1122 opened Oct 30, 2025 by finbarrtimbers Draft
active sampling or whatever we call it
#1111 opened Oct 27, 2025 by mnoukhov Loading…
fix resumption when ref policy loading fails
#1110 opened Oct 24, 2025 by saurabh111233212 Loading…
ProTip! Exclude everything labeled bug with -label:bug.