-
Notifications
You must be signed in to change notification settings - Fork 725
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] M*G Triton group gemm for MoE training
CLA Signed
This label is managed by the Meta Open Source bot.
#967
by lessw2020
was merged Apr 3, 2025
Loading…
[WIP] Integrate DeepGEMM, add supporting utils and unit testing, to enable blockwise fp8 inference
CLA Signed
This label is managed by the Meta Open Source bot.
#1124
by lessw2020
was merged Apr 24, 2025
Loading…
[WIP] Experimental implementation of gpt-oss (grouped GEMM MoE + FlexAttention sink/sliding)
CLA Signed
This label is managed by the Meta Open Source bot.
#1559
by KhoomeiK
was closed Oct 23, 2025
Loading…
Add deterministic RL training experiment with vLLM
CLA Signed
This label is managed by the Meta Open Source bot.
#1975
by bwasti
was merged Nov 7, 2025
Loading…
[DeepSeek] Let MoEs share Symmetric Memory across layers
CLA Signed
This label is managed by the Meta Open Source bot.
#958
by kwen2501
was closed Mar 20, 2025
Loading…
gpt-oss model enablement
CLA Signed
This label is managed by the Meta Open Source bot.
#1754
by wwwjn
was merged Oct 22, 2025
Loading…
add benchmarks folder and submission guidelines
CLA Signed
This label is managed by the Meta Open Source bot.
#1296
by tianyu-l
was merged Jun 13, 2025
Loading…
Add the option to turn on async-TP
CLA Signed
This label is managed by the Meta Open Source bot.
#429
by yifuwang
was merged Jun 27, 2024
Loading…
[Qwen3] Qwen3 MoE initial support
CLA Signed
This label is managed by the Meta Open Source bot.
#1685
by wwwjn
was merged Sep 11, 2025
Loading…
[405B] Add performance data for 405B model
CLA Signed
This label is managed by the Meta Open Source bot.
#554
by fduwjj
was merged Aug 23, 2024
Loading…
[HF] Model Definition Conversion Support for FLUX
CLA Signed
This label is managed by the Meta Open Source bot.
#1582
by wesleytruong
was merged Aug 20, 2025
Loading…
[Kernels] add triton contiguous groupgemm
CLA Signed
This label is managed by the Meta Open Source bot.
#1154
by lessw2020
was merged May 1, 2025
Loading…
CUDAGraph support for SimpleFSDP and TP
CLA Signed
This label is managed by the Meta Open Source bot.
#2050
by BoyuanFeng
was merged Nov 20, 2025
Loading…
4 tasks done
add llama4 as an experiment
CLA Signed
This label is managed by the Meta Open Source bot.
#1064
by tianyu-l
was merged Apr 7, 2025
Loading…
upgrade generate_permute_indices with faster kernel (parallelized)
CLA Signed
This label is managed by the Meta Open Source bot.
#1098
by lessw2020
was merged Apr 15, 2025
Loading…
[llama4] fall back to for-loop based MoE if not on SM90 or later
CLA Signed
This label is managed by the Meta Open Source bot.
#1096
by tianyu-l
was merged Apr 12, 2025
Loading…
[DeepSeek] Integrate M*G Group Gemm
CLA Signed
This label is managed by the Meta Open Source bot.
#1046
by kwen2501
was merged Apr 3, 2025
Loading…
[DeepSeek] all-to-all-v kernel writes out This label is managed by the Meta Open Source bot.
output_splits
CLA Signed
#941
by kwen2501
was merged Mar 20, 2025
Loading…
Implement async_checkpoint
CLA Signed
This label is managed by the Meta Open Source bot.
#313
by fegin
was merged May 7, 2024
Loading…
enable gc control scheduling to help avoid stragglers
CLA Signed
This label is managed by the Meta Open Source bot.
#148
by lessw2020
was merged Mar 20, 2024
Loading…
[MoE] DeepEP refactor and fix memory leak during training and inference
ciflow/8gpu
CLA Signed
This label is managed by the Meta Open Source bot.
#2296
by shuhuayu
was merged Jan 29, 2026
Loading…
Use cudnn backend on B200
CLA Signed
This label is managed by the Meta Open Source bot.
#1196
by drisspg
was merged May 15, 2025
Loading…
add L40s gpu type for MFU calcs
CLA Signed
This label is managed by the Meta Open Source bot.
#1204
by lessw2020
was merged May 19, 2025
Loading…
[BC Breaking] Config System Refactor: TOML to Python Dataclass Registry
ciflow/8gpu
CLA Signed
This label is managed by the Meta Open Source bot.
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.