generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.2k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Replace unittest skipTest from transformers with pytest.skip
#4297
opened Oct 16, 2025 by
albertvillanova
Loading…
switch to sleep level=2 and split wake-ups in GRPO and RLOO trainers
#4296
opened Oct 16, 2025 by
xxrjun
Loading…
1 of 5 tasks
feat: Add Multi-Token Prediction (MTP) support to SFTTrainer
#4290
opened Oct 15, 2025 by
KLGR123
Loading…
[SFT] add support for unified conversion logic for both images and videos
#4264
opened Oct 13, 2025 by
kashif
Loading…
Remove FSDP1 support: use FSDP2 exclusively
#4260
opened Oct 11, 2025 by
behroozazarkhalili
Loading…
Fix DPO Trainer Bug For Qwen2-VL (Issue 2660)
#4257
opened Oct 11, 2025 by
FabianSchuetze
Loading…
1 of 3 tasks
[Activation-checkpointing] add tensor dedup and param offloading
#4247
opened Oct 10, 2025 by
kashif
Loading…
Update
max_length
explanation for VLM trainers
#4220
opened Oct 7, 2025 by
sergiopaniego
Loading…
5 tasks
🧺 [5/N] Refactor
_generate
in GRPO/RLOO: Insert images in the prompt
#4155
opened Sep 26, 2025 by
qgallouedec
Loading…
🧺 [4/N] Refactor
_generate
in GRPO/RLOO: Move forward_kwargs
outside generation method
#4154
opened Sep 26, 2025 by
qgallouedec
Loading…
update guided decoding param to structured outputs
#4117
opened Sep 22, 2025 by
jiqing-feng
Loading…
feat:add support for 'image_grid_thw'(QwenVL) in DPOTrainer
#4091
opened Sep 15, 2025 by
ycma8
Loading…
2 of 5 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.