[CP] Enable FlexCP for llama3 #2145
Open

fegin wants to merge 8 commits into gh/fegin/54/base from gh/fegin/54/head
Conversation
fegin added a commit that referenced this pull request on Dec 11, 2025
Summary: Continuing from the previous PR, this PR enables FlexAttention + CP for llama3. FlexCP will use PTRRLoadBalancer. Note that this PR requires pytorch/pytorch#170201. ghstack-source-id: c161349 Pull-Request: #2145
This was referenced Dec 11, 2025
fegin added a commit that referenced this pull request on Dec 11, 2025, with the same summary (ghstack-source-id: 4bced25).
fegin added a commit that referenced this pull request on Dec 12, 2025, with the same summary (ghstack-source-id: cf1e8d6).
fegin added a commit that referenced this pull request on Dec 12, 2025, with the same summary (ghstack-source-id: 1143a55).
tianyu-l reviewed on Dec 14, 2025
fegin added a commit that referenced this pull request on Dec 15, 2025, with the same summary ([ghstack-poisoned]). The commit also carried the stack description:

Stack from [ghstack](https://github.com/ezyang/ghstack/tree/0.12.0) (oldest at bottom):
* #2145
* #2144
* __->__ #2143

1. Accept a single "." (meaning the current commit) to simplify the command line.
2. Ignore untracked files.
fegin added a commit that referenced this pull request on Dec 15, 2025, with the same summary (ghstack-source-id: 85c9bff).
fegin added a commit that referenced this pull request on Dec 15, 2025, with the same summary (ghstack-source-id: e3cfb0c).
fegin added a commit that referenced this pull request on Dec 16, 2025, with the same summary (ghstack-source-id: fd039d4).
fegin added a commit that referenced this pull request on Dec 16, 2025, with the same summary (ghstack-source-id: eb4903d).
tianyu-l approved these changes on Dec 16, 2025
```python
    v: torch.Tensor,
    *,
    block_mask: BlockMask,
    score_mod: _score_mod_signature | None = None,
```
Contributor
arg not used anywhere
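For context on the flagged parameter: in PyTorch's FlexAttention API, a `score_mod` is a callable that rewrites each raw attention score before softmax. The sketch below is illustrative only, not this PR's code; it assumes the public `torch.nn.attention.flex_attention` API available in PyTorch 2.5+.

```python
import torch
from torch.nn.attention.flex_attention import flex_attention

B, H, S, D = 1, 2, 128, 16
q = torch.randn(B, H, S, D)
k = torch.randn(B, H, S, D)
v = torch.randn(B, H, S, D)

# A score_mod receives each raw attention score together with its
# (batch, head, query index, key index) coordinates and returns a new score.
def rel_bias(score, b, h, q_idx, kv_idx):
    # Toy relative-position bias; any elementwise rewrite of the score works.
    return score + 0.01 * (q_idx - kv_idx)

out = flex_attention(q, k, v, score_mod=rel_bias)
print(out.shape)  # torch.Size([1, 2, 128, 16])
```

If the reviewed wrapper never forwards `score_mod` to an underlying call like this one, dropping the keyword, as the review suggests, keeps the signature honest.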
Stack from ghstack (oldest at bottom):

Summary:
Continuing from the previous PR, this PR enables FlexAttention + CP for llama3. FlexCP will use PTRRLoadBalancer.
Note that this PR requires pytorch/pytorch#170201.
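Why a load balancer at all: under context parallelism with a causal mask, splitting the sequence into contiguous chunks leaves the rank holding the last chunk with far more attention work than the rank holding the first. A round-robin layout pairs an early chunk with a late chunk on each rank so the causal work evens out. The sketch below illustrates that general idea only; it is an assumption that PTRRLoadBalancer (pytorch/pytorch#170201) uses this exact layout, and all names here are hypothetical.

```python
# Illustrative sketch of round-robin sequence sharding for causal-attention
# context parallelism. NOT the PTRRLoadBalancer implementation; it only
# shows why pairing early and late chunks balances causal-attention work.

def round_robin_assignment(seq_len: int, cp_degree: int) -> list[list[range]]:
    """Split the sequence into 2 * cp_degree chunks and give rank r the
    r-th chunk from the front plus the r-th chunk from the back."""
    assert seq_len % (2 * cp_degree) == 0
    chunk = seq_len // (2 * cp_degree)
    chunks = [range(i * chunk, (i + 1) * chunk) for i in range(2 * cp_degree)]
    return [[chunks[r], chunks[2 * cp_degree - 1 - r]] for r in range(cp_degree)]

def causal_score_count(ranges: list[range]) -> int:
    # Under a causal mask, query position q attends to q + 1 key positions.
    return sum(q + 1 for rng in ranges for q in rng)

if __name__ == "__main__":
    assignment = round_robin_assignment(seq_len=4096, cp_degree=4)
    for rank, ranges in enumerate(assignment):
        spans = [(r.start, r.stop) for r in ranges]
        print(f"rank {rank}: {spans} -> {causal_score_count(ranges)} scores")
    # Every rank reports the same count (2,097,664), whereas contiguous
    # sharding would give the last rank roughly twice the average work.
```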