Pull requests: NVIDIA/Megatron-LM

Pull requests list

add node_rank argument for example scripts
#1604 opened May 30, 2025 by xylllllllll
CLIPViTModel support SP and CP
#1600 opened May 28, 2025 by Thaurun
Support Multiple Input Formats for checkpoint
#1599 opened May 28, 2025 by Thaurun
support for qwen2.5vl window attention
#1591 opened May 22, 2025 by Agoniii
[Draft] FP8 param support for MXFP8
#1581 opened May 14, 2025 by WanZzzzzz Draft
fix: count_zeros protection in chained optimizer
#1569 opened May 8, 2025 by clumsy
Fix incorrect softmax_factor calculation in MLA
#1562 opened May 2, 2025 by HowardZorn
[bugfix] fix the bug that loss: 0 will not be printed
#1555 opened Apr 28, 2025 by leisuzz
Fix: training arguments print format
#1552 opened Apr 24, 2025 by vicoooo26
Fp8 LM-head
#1551 opened Apr 22, 2025 by dhia680
lora offload
#1540 opened Apr 15, 2025 by sanandaraj5597
Lora offload
#1539 opened Apr 15, 2025 by sanandaraj5597
Swiglu fusion
#1538 opened Apr 15, 2025 by rachitgarg91
Add fused swiglu for MLP
#1536 opened Apr 15, 2025 by michal2409
Fix AttributeError in MultiTokenPredictionLayer
#1529 opened Apr 12, 2025 by shenyunhang