Skip to content

Pull requests: pytorch/torchtitan

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add support for saving HF format tensors with DCP CLA Signed This label is managed by the Meta Open Source bot.
#1351 opened Jun 27, 2025 by ankitageorge Draft
[WIP] Document MX FP8 recipe CLA Signed This label is managed by the Meta Open Source bot.
#1350 opened Jun 27, 2025 by lessw2020 Loading…
Autoparallel support for DP-only, DP+TP, or TP-only CLA Signed This label is managed by the Meta Open Source bot.
#1349 opened Jun 27, 2025 by wconstab Loading…
[WIP] Enable causal block mask for sdpa CLA Signed This label is managed by the Meta Open Source bot.
#1348 opened Jun 26, 2025 by mreso Draft
[WIP][RFC] Always flatten model state_dict CLA Signed This label is managed by the Meta Open Source bot.
#1347 opened Jun 26, 2025 by fegin Loading…
[DSV3] Add PP support for DSV3 CLA Signed This label is managed by the Meta Open Source bot.
#1345 opened Jun 26, 2025 by H-Huang Draft
[SimpleFSDP] Add support for hsdp+tp CLA Signed This label is managed by the Meta Open Source bot.
#1343 opened Jun 26, 2025 by ruisizhang123 Loading…
[DSV3] Apply TP on DSV3 CLA Signed This label is managed by the Meta Open Source bot.
#1341 opened Jun 26, 2025 by wwwjn Loading…
[WIP] Refactor Tokenizer -> BaseTokenizer CLA Signed This label is managed by the Meta Open Source bot.
#1333 opened Jun 24, 2025 by H-Huang Draft
[kernels][blackwell] add cutlass/cute group gemm forward for blackwell CLA Signed This label is managed by the Meta Open Source bot.
#1327 opened Jun 22, 2025 by lessw2020 Loading…
[WIP] expert parallel dp2ep CLA Signed This label is managed by the Meta Open Source bot.
#1324 opened Jun 21, 2025 by tianyu-l Draft
Support finetuning from a pretrained model CLA Signed This label is managed by the Meta Open Source bot.
#1321 opened Jun 20, 2025 by vwxyzjn Loading…
[float8] add _auto_filter_for_recipe for float8 training CLA Signed This label is managed by the Meta Open Source bot.
#1319 opened Jun 18, 2025 by danielvegamyhre Loading…
Support different tokenizers CLA Signed This label is managed by the Meta Open Source bot.
#1318 opened Jun 18, 2025 by H-Huang Loading…
[not for land] testing out float8 128_1_128_128 blockwise scaling CLA Signed This label is managed by the Meta Open Source bot.
#1317 opened Jun 18, 2025 by vkuzo Loading…
Do not submit: Multinode training seems to be working CLA Signed This label is managed by the Meta Open Source bot.
#1314 opened Jun 17, 2025 by ahmadsharif1 Draft
Do not submit: Multinode is working with multiple controllers CLA Signed This label is managed by the Meta Open Source bot.
#1313 opened Jun 17, 2025 by ahmadsharif1 Draft
[llama4][auxiliary-loss-free load balancing] update expert_bias without backward hooks CLA Signed This label is managed by the Meta Open Source bot.
#1304 opened Jun 16, 2025 by hann-wang Loading…
Finetune from pre-trained models CLA Signed This label is managed by the Meta Open Source bot.
#1300 opened Jun 15, 2025 by vwxyzjn Loading…
[not for land] Use new AC CLA Signed This label is managed by the Meta Open Source bot.
#1294 opened Jun 13, 2025 by soulitzer Loading…
WIP: Try to use monarch to run torchtitan. CLA Signed This label is managed by the Meta Open Source bot.
#1288 opened Jun 12, 2025 by ahmadsharif1 Draft
Titan changes to use DCP ZOC instead of titan default Async + Pinned Memory CLA Signed This label is managed by the Meta Open Source bot.
#1287 opened Jun 12, 2025 by Saiteja64 Loading…
DO NOT SUBMIT: WIP: Try to use monarch to run torchtitan. CLA Signed This label is managed by the Meta Open Source bot.
#1286 opened Jun 12, 2025 by ahmadsharif1 Draft
[deepseek][kernels][blackwell] Cutlass blackwell grouped gemm using cute dsl (forward,backward) CLA Signed This label is managed by the Meta Open Source bot.
#1276 opened Jun 8, 2025 by lessw2020 Loading…
[deepseek][blackwell] add Cutlass cute dsl blackwell dense based looping group gemm CLA Signed This label is managed by the Meta Open Source bot.
#1274 opened Jun 8, 2025 by lessw2020 Loading…
ProTip! Filter pull requests by the default branch with base:main.