Pull requests: pytorch/torchtitan
fix: enable torch.autocast for TP parallelism without FSDP
CLA Signed
This label is managed by the Meta Open Source bot.
#2213
opened Jan 8, 2026 by
eous
feat(moe): add topk_before_score routing and use_router_bias support
CLA Signed
#2212
opened Jan 8, 2026 by
eous
fix(gpt-oss): correct attention sink from sigmoid to LSE renormalization
CLA Signed
#2211
opened Jan 8, 2026 by
eous
feat: add differential learning rate and weight decay support
CLA Signed
#2210
opened Jan 8, 2026 by
eous
[HybridEP] Support hybridEP for GB200 with NVL72
CLA Signed
#2207
opened Jan 8, 2026 by
elfiegg
feat(gpt-oss): Add CPU offload optimizer, differential LR/WD, and more
CLA Signed
#2205
opened Jan 7, 2026 by
eous
[rl] Use vllm.Attention for trainer.
ciflow/8gpu
CLA Signed
[rl] refactor model registery
ciflow/8gpu
CLA Signed
#2194
opened Jan 2, 2026 by
wwwjn
[rl] Using JobConfig as the centralized config system for inference and simple GRPO
ciflow/8gpu
CLA Signed
#2191
opened Jan 2, 2026 by
wwwjn
use comms in compiler toolkit
ciflow/8gpu
CLA Signed
experiments: add nemotron3 model to experiments folder
CLA Signed
#2187
opened Dec 30, 2025 by
aghilann
4 tasks
auto-chunk unembed & loss
ciflow/8gpu
CLA Signed
#2186
opened Dec 29, 2025 by
shunting314
[rl] Update callsite to init_batch_invariance to pass attention backend.
ciflow/8gpu
CLA Signed
#2176
opened Dec 24, 2025 by
zhxchen17
compiler_toolkit: inputs are not DTensor if TP is not enabled
CLA Signed
#2175
opened Dec 24, 2025 by
yanboliang
Add Flex flash backend to flex attention module
ciflow/8gpu
CLA Signed
[do not land] trying invoke_subgraph on torchtitan
ciflow/8gpu
CLA Signed
[transformers_modeling_backend] Upgrade transformers from 4.57.1 to 5.0.0rc0
CLA Signed
#2154
opened Dec 15, 2025 by
3outeille
[WIP] Use all DTensor for Qwen3 and llama4 at TP region
ciflow/8gpu
CLA Signed
#2149
opened Dec 12, 2025 by
wwwjn
Staging SFT training
CLA Signed
#2148
opened Dec 12, 2025 by
rakkit
Add repeated_subgraphs option in AutoParallel example
CLA Signed
#2138
opened Dec 10, 2025 by
fmassa
[Not Ready] Enable Async TP CI
ciflow/8gpu
CLA Signed
improve throughput of HF dense model (no need actually)
CLA Signed
perf(pipeline): implement auto-partition algorithm
CLA Signed
enhancement
New feature or request
#2113
opened Dec 5, 2025 by
TXacs
[simple_fsdp] Turn on bucketing by default
CLA Signed
#2103
opened Dec 3, 2025 by
IvanKobzarev