-
Notifications
You must be signed in to change notification settings - Fork 723
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Distributed Scion/Muon
CLA Signed
This label is managed by the Meta Open Source bot.
#1630
opened Aug 25, 2025 by
rakkit
Loading…
[llama4] enable expert parallel on the same device mesh as tp (tp2ep)
CLA Signed
This label is managed by the Meta Open Source bot.
#1269
opened Jun 6, 2025 by
hann-wang
Loading…
[WIP][Blackwell Kernels] Blackwell group gemm and dense gemms with Python Cutlass
CLA Signed
This label is managed by the Meta Open Source bot.
#1256
opened Jun 3, 2025 by
lessw2020
Loading…
Remove unnecessary token padding for MoE in BF16 mode
CLA Signed
This label is managed by the Meta Open Source bot.
high priority
Make CheckpointManager friendlier to custom StorageWriter/StorageReader
CLA Signed
This label is managed by the Meta Open Source bot.
#789
opened Jan 12, 2025 by
dimdi-y
Loading…
[Evaluation] Adding evaluation feature to TorchTitan
CLA Signed
This label is managed by the Meta Open Source bot.
#1470
opened Jul 28, 2025 by
raymin0223
Loading…
workarounds for all2all autograd issues that Ruisi ran into
CLA Signed
This label is managed by the Meta Open Source bot.
#1604
opened Aug 20, 2025 by
bdhirsh
Loading…
Add Transformer-Engine Fused_Adam Optimizer Support
CLA Signed
This label is managed by the Meta Open Source bot.
Add PP tracer + DP test
CLA Signed
This label is managed by the Meta Open Source bot.
#379
opened Jun 1, 2024 by
kwen2501
Loading…
[DeepSeek] Move seqlen from model config to This label is managed by the Meta Open Source bot.
setup_symm_mem
CLA Signed
#1017
opened Mar 24, 2025 by
kwen2501
Loading…
Add other ops
CLA Signed
This label is managed by the Meta Open Source bot.
#2033
opened Nov 13, 2025 by
wconstab
Loading…
debugging mm backwards shape error
CLA Signed
This label is managed by the Meta Open Source bot.
#2035
opened Nov 13, 2025 by
wconstab
Loading…
WIP
CLA Signed
This label is managed by the Meta Open Source bot.
#2032
opened Nov 13, 2025 by
wconstab
Loading…
Enable PP and EP overlap for MoE
CLA Signed
This label is managed by the Meta Open Source bot.
#2031
opened Nov 13, 2025 by
wconstab
Loading…
fix grad_out passing
CLA Signed
This label is managed by the Meta Open Source bot.
#2036
opened Nov 13, 2025 by
wconstab
Loading…
claude fix errors
CLA Signed
This label is managed by the Meta Open Source bot.
#2034
opened Nov 13, 2025 by
wconstab
Loading…
[RFC][DONT LAND] Support different state_dict for save and load
ciflow/8gpu
CLA Signed
This label is managed by the Meta Open Source bot.
[torchtitan][debug] integrated CommDebugMode into TorchTitan
CLA Signed
This label is managed by the Meta Open Source bot.
#480
opened Jul 24, 2024 by
sinhaanshul
Loading…
WIP change to run a zero-bubble like schedule
CLA Signed
This label is managed by the Meta Open Source bot.
#416
opened Jun 21, 2024 by
wconstab
Loading…
[Not for landing] piggy back on titan for scale init test
CLA Signed
This label is managed by the Meta Open Source bot.
[mxfp8 moe training] temp workaround: don't compile GroupedExperts
ciflow/8gpu
CLA Signed
This label is managed by the Meta Open Source bot.
#2268
opened Jan 21, 2026 by
danielvegamyhre
Loading…
[mxfp8 training] add new configurable params now exposed by torchao
ciflow/8gpu
CLA Signed
This label is managed by the Meta Open Source bot.
#2251
opened Jan 18, 2026 by
danielvegamyhre
Loading…
[WIP] Allow benchmark between multiple configs
CLA Signed
This label is managed by the Meta Open Source bot.
#703
opened Nov 26, 2024 by
H-Huang
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.