Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Allow GPU work exclusive ownership of asyncio loop complexity: medium
#4295 opened Apr 14, 2026 by tdene Contributor Loading…
5 tasks
Core 0.16
[fix] fix optimizer community-request Final Review PR is in the "final review" stage
#4294 opened Apr 14, 2026 by pavelgein Loading…
1 of 5 tasks
Cuda graph fix
#4285 opened Apr 13, 2026 by i-riyad Contributor Draft
5 tasks
Fix typo in PR4133. Approved All necessary approvals have been made bug Something isn't working complexity: low
#4277 opened Apr 13, 2026 by cspades Member Loading…
5 tasks
Core 0.16
ci: add workflow_dispatch support to cicd-main.yml
#4275 opened Apr 13, 2026 by ko3n1g Contributor Draft
3 tasks
fix mfsdp unwrap stuck at MegatronFSDP complexity: low Final Review PR is in the "final review" stage module: megatron-fsdp
#4274 opened Apr 13, 2026 by wplf Member Loading… Core 0.16
[Dev] Support delayed wgrad compute overlap with P2P backward
#4268 opened Apr 13, 2026 by Wohox Contributor Draft
5 tasks
[Main] Fix TE version check for retain_pinned_cpu_buffers in cpu offload complexity: low Final Review PR is in the "final review" stage
#4267 opened Apr 13, 2026 by BestJuly Contributor Loading… Core 0.16
Get device correctly when module returns a dict instead of individual tensor Approved All necessary approvals have been made complexity: low
#4265 opened Apr 13, 2026 by shifangx Contributor Loading…
5 tasks
Factor RL-specific code out of training.py complexity: high
#4264 opened Apr 12, 2026 by tdene Contributor Loading…
5 tasks
Core 0.16
Add training-side support for Rollout Routing Replay (R3) community-request needs-follow-up Issue needs follow-up
#4256 opened Apr 10, 2026 by meinie0826 Loading…
4 of 5 tasks
get rid of pickle
#4254 opened Apr 10, 2026 by dimapihtar Contributor Draft
5 tasks
Allow fine-grained offloading with MC impl of full-CG.
#4253 opened Apr 10, 2026 by rapatel Contributor Loading…
5 tasks
Core 0.16
[fix] Use MSC for checking checkpoint existence community-request
#4251 opened Apr 10, 2026 by pavelgein Loading…
1 of 5 tasks
ProTip! Add no:assignee to see everything that’s not assigned.