Skip to content

Pull requests: pytorch/torchtitan

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Typing] Fix pyrefly ignores in deepseek model.py ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2305 opened Jan 30, 2026 by fegin Loading…
[rl] Add numerics test against vllm native inference ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2300 opened Jan 29, 2026 by wwwjn Loading…
[rl] GQA attention enablement in torchtitan vllm wrapper ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2299 opened Jan 29, 2026 by wwwjn Loading…
Add Transformer-Engine Fused_Adam Optimizer Support CLA Signed This label is managed by the Meta Open Source bot.
#2293 opened Jan 29, 2026 by vivekgoe Draft
[draft][lora] Apply LoraLinear as a wrapper of Linear ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2288 opened Jan 28, 2026 by mori360 Draft
[WIP][FSDP2] enable per-param mesh FSDP2 for MoE and per-layer compile ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2281 opened Jan 28, 2026 by weifengpy Draft
[DeepEP Integration] Free cache after combine for forward only path. CLA Signed This label is managed by the Meta Open Source bot.
#2274 opened Jan 25, 2026 by elfiegg Draft
Enable graph_pp for autoparallel in torchtitan ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2271 opened Jan 23, 2026 by sanketpurandare Draft
[mxfp8 moe training] temp workaround: don't compile GroupedExperts ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2268 opened Jan 21, 2026 by danielvegamyhre Loading…
[NOT_FOR_LAND] inductor auto_bucketing passes benchmarking for DSv3 ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2266 opened Jan 21, 2026 by IvanKobzarev Loading…
[LoRA] Add LoRA converter for LoRA finetuning ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2263 opened Jan 20, 2026 by mori360 Loading…
Remove unnecessary token padding for MoE in BF16 mode CLA Signed This label is managed by the Meta Open Source bot.
#2255 opened Jan 20, 2026 by rakkit Loading…
[mxfp8 training] add new configurable params now exposed by torchao ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2251 opened Jan 18, 2026 by danielvegamyhre Loading…
[mxfp8 moe training] mxfp8 all to all ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2250 opened Jan 17, 2026 by danielvegamyhre Loading…
[mxfp8 moe training] support wgrad_with_hp recipe ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2249 opened Jan 17, 2026 by danielvegamyhre Loading…
[WIP][rl] refactor grader and trainer generator actor ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2244 opened Jan 16, 2026 by wwwjn Loading…
[lint] ignore all existing pyrefly errors (v0.45.1) ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2241 opened Jan 16, 2026 by xmfan Loading…
[DONT LAND] Implement PrefetchedDataloader for overlapped data loading ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2232 opened Jan 14, 2026 by fegin Loading…
[WIP][rl] refactor save and load model weights using DCP ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2221 opened Jan 13, 2026 by wwwjn Loading…
feat(gpt-oss): add YaRN RoPE extensions with mscale for extended context CLA Signed This label is managed by the Meta Open Source bot.
#2216 opened Jan 8, 2026 by eous Loading…
feat(training): add freeze_router_bias and freeze_expert_bias configs… CLA Signed This label is managed by the Meta Open Source bot.
#2215 opened Jan 8, 2026 by eous Loading…
fix: enable torch.autocast for TP parallelism without FSDP CLA Signed This label is managed by the Meta Open Source bot.
#2213 opened Jan 8, 2026 by eous Loading…
feat: add differential learning rate and weight decay support CLA Signed This label is managed by the Meta Open Source bot.
#2210 opened Jan 8, 2026 by eous Loading…
feat(gpt-oss): Add CPU offload optimizer, differential LR/WD, and more CLA Signed This label is managed by the Meta Open Source bot.
#2205 opened Jan 7, 2026 by eous Loading…
[rl] Use vllm.Attention for trainer. ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#2198 opened Jan 5, 2026 by zhxchen17 Draft
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.