-
Notifications
You must be signed in to change notification settings - Fork 3.5k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add a logprobs test with real gpt model.
Expert Review
Apply this label to indicate that your PR is ready for expert review.
#2870
opened Jan 8, 2026 by
yobibyte
Loading…
6 tasks
Remove cross-rank synchronization during checkpoint load & deprecate torch.distributed.checkpoint.state_dict_loader.load_state_dict
#2864
opened Jan 8, 2026 by
asolergi-nv
Loading…
Use global user buffer when the bucket size does not fit FixedPoolAllocator
#2857
opened Jan 7, 2026 by
shengf-nv
Loading…
6 tasks
Refactor spec modification/introspection to make references to Submodules typed
community-request
#2834
opened Jan 6, 2026 by
nschank
Loading…
6 tasks
fsdp: avoid double sharding of MoE experts when EP is enabled
community-request
Expert Review
Apply this label to indicate that your PR is ready for expert review.
module: megatron-fsdp
#2833
opened Jan 6, 2026 by
CodersAcademy006
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.