Pull requests: pytorch/torchtitan
[Experimental Feature] Huggingface model training
CLA Signed
This label is managed by the Meta Open Source bot.
#919
opened Mar 3, 2025 by
junjzhang
IBM experimental dataloaders
CLA Signed
#376
opened May 31, 2024 by
daviswer
Execute moe.gate in float32
CLA Signed
high priority
[torchtitan][debug] integrated CommDebugMode into TorchTitan
CLA Signed
#480
opened Jul 24, 2024 by
sinhaanshul
[DO NOT REVIEW] debug fsdp2 checkpoint for uneven sharding
CLA Signed
Distributed Scion/Muon
CLA Signed
#1630
opened Aug 25, 2025 by
rakkit
Staging SFT training
CLA Signed
#2148
opened Dec 12, 2025 by
rakkit
Remove device to host synchronizations from repeat_interleave and tail_slack
CLA Signed
#2440
opened Feb 25, 2026 by
rthekini-aws
Add shims for PyTorch stable
CLA Signed
#1862
opened Oct 13, 2025 by
bwasti
[llama4][auxiliary-loss-free load balancing] update expert_bias without backward hooks
CLA Signed
#1304
opened Jun 16, 2025 by
hann-wang
[PoC] Enable flexible different layout for same mesh via a util function
CLA Signed
#1550
opened Aug 11, 2025 by
fduwjj
[WIP] Memory Sharded Tensor (MST)
ciflow/8gpu
CLA Signed
Implement the num_flops_per_token calculation in get_nparams_and_flops() function for Flux
CLA Signed
#2452
opened Feb 27, 2026 by
vivekgoe
Support finetuning from a pretrained model
CLA Signed
#1321
opened Jun 20, 2025 by
vwxyzjn
experiments: add nemotron3 model to experiments folder
CLA Signed
#2187
opened Dec 30, 2025 by
aghilann
4 tasks
feat: add support for DeepEP in Qwen3 parallelization logic
CLA Signed
#2392
opened Feb 18, 2026 by
jordisassoon
[WIP][full DTensor] Use all DTensor for Qwen3 and llama4 at TP region
ciflow/8gpu
CLA Signed
high priority
Add PP tracer + DP test
CLA Signed
#379
opened Jun 1, 2024 by
kwen2501
[DeepSeek] Move seqlen from model config to setup_symm_mem
CLA Signed
#1017
opened Mar 24, 2025 by
kwen2501
[mxfp8 moe training] mxfp8 all to all
ciflow/8gpu
CLA Signed
high priority
#2250
opened Jan 17, 2026 by
danielvegamyhre