-
Notifications
You must be signed in to change notification settings - Fork 427
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Validator integration with current metrics processor for logging
CLA Signed
This label is managed by the Meta Open Source bot.
#1395
opened Jul 14, 2025 by
wesleytruong
Loading…
[DSV3] Add CI Integration tests for deepseek-v3
CLA Signed
This label is managed by the Meta Open Source bot.
#1394
opened Jul 14, 2025 by
wwwjn
Loading…
Add Github workflow to build and publish wheel to PyTorch Index nightly
CLA Signed
This label is managed by the Meta Open Source bot.
#1392
opened Jul 14, 2025 by
joecummings
Loading…
add the forge folder
CLA Signed
This label is managed by the Meta Open Source bot.
#1387
opened Jul 13, 2025 by
tianyu-l
Loading…
Add option for selective op AC to filter mm shapes based on fqn
CLA Signed
This label is managed by the Meta Open Source bot.
#1380
opened Jul 11, 2025 by
soulitzer
Loading…
add float8 support
CLA Signed
This label is managed by the Meta Open Source bot.
#1378
opened Jul 10, 2025 by
bdhirsh
Loading…
[llama3] add configurations for Llama 3 1B and 3B models
CLA Signed
This label is managed by the Meta Open Source bot.
#1376
opened Jul 9, 2025 by
idoh
Loading…
[WIP] Document MX FP8 recipe
CLA Signed
This label is managed by the Meta Open Source bot.
#1350
opened Jun 27, 2025 by
lessw2020
Loading…
Autoparallel support for DP-only, DP+TP, or TP-only
CLA Signed
This label is managed by the Meta Open Source bot.
#1349
opened Jun 27, 2025 by
wconstab
Loading…
[WIP] Enable causal block mask for sdpa
CLA Signed
This label is managed by the Meta Open Source bot.
[DSV3] Add PP support for DSV3
CLA Signed
This label is managed by the Meta Open Source bot.
#1345
opened Jun 26, 2025 by
H-Huang
Loading…
[kernels][blackwell] add cutlass/cute group gemm forward for blackwell
CLA Signed
This label is managed by the Meta Open Source bot.
#1327
opened Jun 22, 2025 by
lessw2020
Loading…
Support finetuning from a pretrained model
CLA Signed
This label is managed by the Meta Open Source bot.
#1321
opened Jun 20, 2025 by
vwxyzjn
Loading…
[not for land] testing out float8 128_1_128_128 blockwise scaling
CLA Signed
This label is managed by the Meta Open Source bot.
#1317
opened Jun 18, 2025 by
vkuzo
Loading…
Do not submit: Multinode training seems to be working
CLA Signed
This label is managed by the Meta Open Source bot.
#1314
opened Jun 17, 2025 by
ahmadsharif1
•
Draft
Do not submit: Multinode is working with multiple controllers
CLA Signed
This label is managed by the Meta Open Source bot.
#1313
opened Jun 17, 2025 by
ahmadsharif1
•
Draft
[llama4][auxiliary-loss-free load balancing] update expert_bias without backward hooks
CLA Signed
This label is managed by the Meta Open Source bot.
#1304
opened Jun 16, 2025 by
hann-wang
Loading…
Finetune from pre-trained models
CLA Signed
This label is managed by the Meta Open Source bot.
#1300
opened Jun 15, 2025 by
vwxyzjn
Loading…
[not for land] Use new AC
CLA Signed
This label is managed by the Meta Open Source bot.
#1294
opened Jun 13, 2025 by
soulitzer
Loading…
WIP: Try to use monarch to run torchtitan.
CLA Signed
This label is managed by the Meta Open Source bot.
#1288
opened Jun 12, 2025 by
ahmadsharif1
•
Draft
DO NOT SUBMIT: WIP: Try to use monarch to run torchtitan.
CLA Signed
This label is managed by the Meta Open Source bot.
#1286
opened Jun 12, 2025 by
ahmadsharif1
•
Draft
[deepseek][kernels][blackwell] Cutlass blackwell grouped gemm using cute dsl (forward,backward)
CLA Signed
This label is managed by the Meta Open Source bot.
#1276
opened Jun 8, 2025 by
lessw2020
Loading…
[deepseek][blackwell] add Cutlass cute dsl blackwell dense based looping group gemm
CLA Signed
This label is managed by the Meta Open Source bot.
#1274
opened Jun 8, 2025 by
lessw2020
Loading…
[llama4] enable expert parallel on the same device mesh as tp (tp2ep)
CLA Signed
This label is managed by the Meta Open Source bot.
#1269
opened Jun 6, 2025 by
hann-wang
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.