Skip to content

Pull requests: pytorch/torchtitan

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP] M*G Triton group gemm for MoE training CLA Signed This label is managed by the Meta Open Source bot.
#967 by lessw2020 was merged Apr 3, 2025 Loading…
[WIP] Integrate DeepGEMM, add supporting utils and unit testing, to enable blockwise fp8 inference CLA Signed This label is managed by the Meta Open Source bot.
#1124 by lessw2020 was merged Apr 24, 2025 Loading…
[WIP] Experimental implementation of gpt-oss (grouped GEMM MoE + FlexAttention sink/sliding) CLA Signed This label is managed by the Meta Open Source bot.
#1559 by KhoomeiK was closed Oct 23, 2025 Loading…
Add deterministic RL training experiment with vLLM CLA Signed This label is managed by the Meta Open Source bot.
#1975 by bwasti was merged Nov 7, 2025 Loading…
[DeepSeek] Let MoEs share Symmetric Memory across layers CLA Signed This label is managed by the Meta Open Source bot.
#958 by kwen2501 was closed Mar 20, 2025 Loading…
gpt-oss model enablement CLA Signed This label is managed by the Meta Open Source bot.
#1754 by wwwjn was merged Oct 22, 2025 Loading…
add benchmarks folder and submission guidelines CLA Signed This label is managed by the Meta Open Source bot.
#1296 by tianyu-l was merged Jun 13, 2025 Loading…
Add the option to turn on async-TP CLA Signed This label is managed by the Meta Open Source bot.
#429 by yifuwang was merged Jun 27, 2024 Loading…
[Qwen3] Qwen3 MoE initial support CLA Signed This label is managed by the Meta Open Source bot.
#1685 by wwwjn was merged Sep 11, 2025 Loading…
[405B] Add performance data for 405B model CLA Signed This label is managed by the Meta Open Source bot.
#554 by fduwjj was merged Aug 23, 2024 Loading…
[HF] Model Definition Conversion Support for FLUX CLA Signed This label is managed by the Meta Open Source bot.
#1582 by wesleytruong was merged Aug 20, 2025 Loading…
[Kernels] add triton contiguous groupgemm CLA Signed This label is managed by the Meta Open Source bot.
#1154 by lessw2020 was merged May 1, 2025 Loading…
CUDAGraph support for SimpleFSDP and TP CLA Signed This label is managed by the Meta Open Source bot.
#2050 by BoyuanFeng was merged Nov 20, 2025 Loading…
4 tasks done
add llama4 as an experiment CLA Signed This label is managed by the Meta Open Source bot.
#1064 by tianyu-l was merged Apr 7, 2025 Loading…
upgrade generate_permute_indices with faster kernel (parallelized) CLA Signed This label is managed by the Meta Open Source bot.
#1098 by lessw2020 was merged Apr 15, 2025 Loading…
[WIP] Used per-parameter FSDP CLA Signed This label is managed by the Meta Open Source bot.
#70 by awgu was closed Mar 13, 2024 Draft
2 of 5 tasks
[llama4] fall back to for-loop based MoE if not on SM90 or later CLA Signed This label is managed by the Meta Open Source bot.
#1096 by tianyu-l was merged Apr 12, 2025 Loading…
[DeepSeek] Integrate M*G Group Gemm CLA Signed This label is managed by the Meta Open Source bot.
#1046 by kwen2501 was merged Apr 3, 2025 Loading…
[DeepSeek] all-to-all-v kernel writes out output_splits CLA Signed This label is managed by the Meta Open Source bot.
#941 by kwen2501 was merged Mar 20, 2025 Loading…
Implement async_checkpoint CLA Signed This label is managed by the Meta Open Source bot.
#313 by fegin was merged May 7, 2024 Loading…
enable gc control scheduling to help avoid stragglers CLA Signed This label is managed by the Meta Open Source bot.
#148 by lessw2020 was merged Mar 20, 2024 Loading…
[MoE] DeepEP refactor and fix memory leak during training and inference ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2296 by shuhuayu was merged Jan 29, 2026 Loading…
Use cudnn backend on B200 CLA Signed This label is managed by the Meta Open Source bot.
#1196 by drisspg was merged May 15, 2025 Loading…
add L40s gpu type for MFU calcs CLA Signed This label is managed by the Meta Open Source bot.
#1204 by lessw2020 was merged May 19, 2025 Loading…
ProTip! Exclude everything labeled bug with -label:bug.