
Conversation

@SherlockNoMad (Contributor) commented Oct 31, 2025

Needs to run with the fix in pytorch/pytorch#166702:

NGPU=8 CONFIG_FILE=./torchtitan/models/llama3/train_configs/debug_model.toml ./run_train.sh --model.name compiler_toolkit.llama3 --parallelism.data_parallel_shard_degree=2 --parallelism.tensor_parallel_degree=4

Current output: P2016557983

Observations

  • Each TransformerBlock becomes its own subgraph (see subgraph_0, subgraph_2, ...). This is not what we want: there should be a single subgraph_0, with multiple invoke_subgraph nodes all calling that same subgraph_0 but fed different layer weights (a minimal sketch of the intended structure follows this list).
  • Due to activation checkpointing (AC), we also get hop.tag_activation_checkpoint(subgraph_1), where subgraph_1 internally calls invoke_subgraph for the TransformerBlock, so we end up in a nested HOP/subgraph region.
  • dynamo_graph_capture passes; we currently fail on aot_export_joint. Looks like a rough edge in the DTensor x Dynamo interaction.
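
For reference, here is a minimal hedged sketch (not the torchtitan/compiler_toolkit code) of the structure we want Dynamo to capture: a toy model with repeated blocks whose forward is marked with torch.compiler.nested_compile_region, printed via a trivial print-only torch.compile backend. ToyBlock, ToyModel, and print_backend are hypothetical names for illustration, and this assumes a recent PyTorch build where nested_compile_region lowers to invoke_subgraph.

```python
# Hypothetical sketch only: a toy stand-in for repeated TransformerBlocks,
# assuming torch.compiler.nested_compile_region is available and lowers to
# invoke_subgraph HOPs in the Dynamo-captured graph.
import torch
import torch.nn as nn


class ToyBlock(nn.Module):
    """Stand-in for a TransformerBlock; each instance owns its own weights."""

    def __init__(self, dim: int):
        super().__init__()
        self.linear = nn.Linear(dim, dim)

    # Marking the block body as a nested compile region is what should produce a
    # single shared subgraph_0, reused by one invoke_subgraph node per layer.
    @torch.compiler.nested_compile_region
    def forward(self, x):
        return torch.relu(self.linear(x))


class ToyModel(nn.Module):
    def __init__(self, dim: int = 16, n_layers: int = 4):
        super().__init__()
        self.layers = nn.ModuleList(ToyBlock(dim) for _ in range(n_layers))

    def forward(self, x):
        for layer in self.layers:
            x = layer(x)
        return x


def print_backend(gm, example_inputs):
    # Print the Dynamo-captured graph so the subgraph/invoke_subgraph structure
    # is visible, then fall back to running the graph module eagerly.
    gm.print_readable()
    return gm.forward


model = ToyModel()
x = torch.randn(2, 16)
compiled = torch.compile(model, backend=print_backend, fullgraph=True)
compiled(x)

# Desired output shape: one subgraph_0 attribute plus N invoke_subgraph calls,
# each passing a different layer's weights as inputs, rather than a fresh
# subgraph_N per TransformerBlock.
```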

@meta-cla bot added the CLA Signed label (managed by the Meta Open Source bot) on Oct 31, 2025
@miladm commented Nov 5, 2025

cc @williamwen42
