-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add DistTrain, Allow Encoder to Have Different DP Size
#1605
opened May 30, 2025 by
zidanehuang001
Loading…
bugfix: cross_entropy inplace operations may cause backward error
#1594
opened May 24, 2025 by
ChangWeiming
Loading…
fix bug: the loss of aux_loss and mtp will be tracked twice
#1585
opened May 18, 2025 by
hyleepp
Loading…
use multiple yaml files to avoid passing annoying model configs from cmd lines
#1579
opened May 14, 2025 by
nrailg
Loading…
The phrase "need to want to" is grammatically incorrect
#1574
opened May 13, 2025 by
A-transformer
Loading…
param_copy_back_gpu_hook should sync to h2d stream
#1543
opened Apr 16, 2025 by
ariverhorse
Loading…
Fix parameter error in text_generation_server.py file
#1542
opened Apr 16, 2025 by
xichengpro
Loading…
[BUGFIX] Save dist_checkpointing metadata on all nodes for multi-node training
#1531
opened Apr 13, 2025 by
Pranaykarvi
Loading…
added fix to avoid overflow with new numpy casting behaviour (Issue: #1519)
#1520
opened Apr 4, 2025 by
Apsod
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.