Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix bare except clause in timers.py bug Something isn't working
#1857 opened Oct 12, 2025 by vediyappanm Loading…
7 tasks done
ci: Auto-update copy-pr-bot vetters
#1850 opened Oct 9, 2025 by ko3n1g Loading…
chore: rename cicd nemo to cicd megatron-lm
#1848 opened Oct 8, 2025 by ko3n1g Loading…
checkpointing debug loop
#1846 opened Oct 2, 2025 by francistotle Loading…
COmmit we forked from
#1845 opened Oct 2, 2025 by francistotle Loading…
ci(fix): Install check
#1843 opened Oct 1, 2025 by ko3n1g Loading…
ci: Add install-test
#1840 opened Oct 1, 2025 by ko3n1g Loading…
[bugfix] fix typo
#1833 opened Sep 29, 2025 by 1195343015 Loading…
Update model_parallel_config.py bug Something isn't working
#1832 opened Sep 28, 2025 by skirdey-inflection Loading…
25.09 alpha rope split concat fusion
#1826 opened Sep 24, 2025 by vasunvidia Loading…
Fix _set_wandb_writer serialization issues bug Something isn't working module: debugging
#1806 opened Sep 11, 2025 by gakkiri Loading…
5 of 8 tasks
Add files via upload
#1801 opened Sep 10, 2025 by wenchenqian Loading…
Quant
#1794 opened Sep 6, 2025 by Charles2530 Loading…
Update README.md module: documentation
#1792 opened Sep 4, 2025 by yuyu5333 Loading…
Add falcon h1 2 enhancement New feature or request
#1785 opened Sep 2, 2025 by dhiaEddineRhaiem Loading…
bugfix: raise error if eos_token is not set in tokenizer bug Something isn't working module: data pipeline
#1774 opened Aug 27, 2025 by imomayiz Loading…
Fix Context Parallel NaN Loss bug Something isn't working
#1765 opened Aug 21, 2025 by leoleoasd Loading…
Fix runaway Etpt in straggler detector by resetting FLOPs accumulator bug Something isn't working
#1755 opened Aug 19, 2025 by cms42 Loading… Core 0.15
[main][feature][under updating]zero-overhead activation offload enhancement New feature or request
#1752 opened Aug 18, 2025 by GeYuhong Loading…
fix: Initialize master_weight with params_dtype directly bug Something isn't working
#1748 opened Aug 15, 2025 by Mirza-Samad-Ahmed-Baig Loading…
fix loading dcp OOM bug Something isn't working
#1747 opened Aug 14, 2025 by zjjott Loading…
ProTip! Filter pull requests by the default branch with base:main.