-
Notifications
You must be signed in to change notification settings - Fork 411
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add support for saving HF format tensors with DCP
CLA Signed
This label is managed by the Meta Open Source bot.
#1351
opened Jun 27, 2025 by
ankitageorge
•
Draft
[WIP] Document MX FP8 recipe
CLA Signed
This label is managed by the Meta Open Source bot.
#1350
opened Jun 27, 2025 by
lessw2020
Loading…
Autoparallel support for DP-only, DP+TP, or TP-only
CLA Signed
This label is managed by the Meta Open Source bot.
#1349
opened Jun 27, 2025 by
wconstab
Loading…
[WIP] Enable causal block mask for sdpa
CLA Signed
This label is managed by the Meta Open Source bot.
[WIP][RFC] Always flatten model state_dict
CLA Signed
This label is managed by the Meta Open Source bot.
#1347
opened Jun 26, 2025 by
fegin
Loading…
[SimpleFSDP] Add support for hsdp+tp
CLA Signed
This label is managed by the Meta Open Source bot.
#1343
opened Jun 26, 2025 by
ruisizhang123
Loading…
[DSV3] Apply TP on DSV3
CLA Signed
This label is managed by the Meta Open Source bot.
#1341
opened Jun 26, 2025 by
wwwjn
Loading…
[WIP] Refactor Tokenizer -> BaseTokenizer
CLA Signed
This label is managed by the Meta Open Source bot.
[kernels][blackwell] add cutlass/cute group gemm forward for blackwell
CLA Signed
This label is managed by the Meta Open Source bot.
#1327
opened Jun 22, 2025 by
lessw2020
Loading…
Support finetuning from a pretrained model
CLA Signed
This label is managed by the Meta Open Source bot.
#1321
opened Jun 20, 2025 by
vwxyzjn
Loading…
[float8] add _auto_filter_for_recipe for float8 training
CLA Signed
This label is managed by the Meta Open Source bot.
#1319
opened Jun 18, 2025 by
danielvegamyhre
Loading…
Support different tokenizers
CLA Signed
This label is managed by the Meta Open Source bot.
#1318
opened Jun 18, 2025 by
H-Huang
Loading…
[not for land] testing out float8 128_1_128_128 blockwise scaling
CLA Signed
This label is managed by the Meta Open Source bot.
#1317
opened Jun 18, 2025 by
vkuzo
Loading…
Do not submit: Multinode training seems to be working
CLA Signed
This label is managed by the Meta Open Source bot.
#1314
opened Jun 17, 2025 by
ahmadsharif1
•
Draft
Do not submit: Multinode is working with multiple controllers
CLA Signed
This label is managed by the Meta Open Source bot.
#1313
opened Jun 17, 2025 by
ahmadsharif1
•
Draft
[llama4][auxiliary-loss-free load balancing] update expert_bias without backward hooks
CLA Signed
This label is managed by the Meta Open Source bot.
#1304
opened Jun 16, 2025 by
hann-wang
Loading…
Finetune from pre-trained models
CLA Signed
This label is managed by the Meta Open Source bot.
#1300
opened Jun 15, 2025 by
vwxyzjn
Loading…
[not for land] Use new AC
CLA Signed
This label is managed by the Meta Open Source bot.
#1294
opened Jun 13, 2025 by
soulitzer
Loading…
WIP: Try to use monarch to run torchtitan.
CLA Signed
This label is managed by the Meta Open Source bot.
#1288
opened Jun 12, 2025 by
ahmadsharif1
•
Draft
Titan changes to use DCP ZOC instead of titan default Async + Pinned Memory
CLA Signed
This label is managed by the Meta Open Source bot.
#1287
opened Jun 12, 2025 by
Saiteja64
Loading…
DO NOT SUBMIT: WIP: Try to use monarch to run torchtitan.
CLA Signed
This label is managed by the Meta Open Source bot.
#1286
opened Jun 12, 2025 by
ahmadsharif1
•
Draft
[deepseek][kernels][blackwell] Cutlass blackwell grouped gemm using cute dsl (forward,backward)
CLA Signed
This label is managed by the Meta Open Source bot.
#1276
opened Jun 8, 2025 by
lessw2020
Loading…
[deepseek][blackwell] add Cutlass cute dsl blackwell dense based looping group gemm
CLA Signed
This label is managed by the Meta Open Source bot.
#1274
opened Jun 8, 2025 by
lessw2020
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.