
Conversation

@3outeille (Contributor) commented Dec 15, 2025

This fixes: huggingface#6

Thanks to this, we can now run torch.compile + 4D parallelism on HF models (cf. huggingface#5)

@meta-cla bot added the CLA Signed label Dec 15, 2025
@3outeille changed the title from "Upgrade transformers from 4.57.1 to 5.0.0rc0" to "[transformers_modeling_backend] Upgrade transformers from 4.57.1 to 5.0.0rc0" Dec 15, 2025
@3outeille (Contributor, Author)

Upgrading to transformers v5 fixes it, as v5 no longer uses kwargs for self.attn (#2154)
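
As a toy illustration of the calling-convention change described above (hypothetical module names, not the actual transformers code): forwarding opaque **kwargs into self.attn versus passing explicit arguments, the latter giving torch.compile a stable signature to trace.

    import torch
    import torch.nn as nn

    class Attn(nn.Module):
        def forward(self, x, attention_mask=None):
            return x if attention_mask is None else x * attention_mask

    # v4.x-style block: kwargs are forwarded opaquely into the attention call
    class BlockKwargs(nn.Module):
        def __init__(self):
            super().__init__()
            self.attn = Attn()

        def forward(self, x, **kwargs):
            return self.attn(x, **kwargs)

    # v5-style block: arguments are explicit, so the traced call signature is fixed
    class BlockExplicit(nn.Module):
        def __init__(self):
            super().__init__()
            self.attn = Attn()

        def forward(self, x, attention_mask=None):
            return self.attn(x, attention_mask=attention_mask)

    x, mask = torch.randn(2, 4), torch.ones(2, 4)
    out = torch.compile(BlockExplicit())(x, attention_mask=mask)
    print(out.shape)  # torch.Size([2, 4])

(This toy compiles fine either way; the actual failure in huggingface#6 involved the kwargs pattern interacting with Tensor Parallel.)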

@3outeille closed this Dec 15, 2025
@3outeille reopened this Dec 15, 2025
@3outeille requested a review from tianyu-l December 15, 2025 18:31
@wwwjn (Contributor) left a comment:


LGTM!

@3outeille (Contributor, Author)

fixed linting

@3outeille requested a review from wwwjn December 16, 2025 22:05
     if module.padding_idx is not None:
-        module.weight.data[module.padding_idx].zero_()
+        if isinstance(module.weight.data, DTensor):
+            module.weight.data._local_tensor[module.padding_idx].zero_()
A Contributor commented on this diff:

Sorry, I probably didn't understand what you are doing here.
If the padding is on the "global tensor", we should just do the same thing: module.weight.data[module.padding_idx].zero_()

The code here is doing a local modification, which may or may not be correct depending on whether padding_idx is meant to be local or global.
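
A minimal sketch of that distinction (the function name and sharding layout are assumptions, not from the PR): with an embedding weight sharded on dim 0, a global padding_idx must be translated into a local row offset, and only the rank whose shard owns that row should zero it.

    import torch

    def zero_global_padding_row(local_weight: torch.Tensor, padding_idx: int,
                                rank: int, rows_per_rank: int) -> None:
        """Zero the padding row with *global* semantics on a dim-0 sharded weight.

        Only the rank whose shard contains the global row touches its local tensor;
        blindly indexing every local shard with the global padding_idx would zero a
        different row on every rank (or go out of range).
        """
        start = rank * rows_per_rank  # first global row held by this rank
        if start <= padding_idx < start + rows_per_rank:
            local_weight[padding_idx - start].zero_()

    # Example: vocab of 8 rows split across 2 ranks (4 rows each), padding_idx = 5.
    # Only rank 1 owns global row 5, which is its local row 1.
    for rank in range(2):
        shard = torch.ones(4, 3)
        zero_global_padding_row(shard, padding_idx=5, rank=rank, rows_per_rank=4)
        print(rank, shard[1])  # rank 0 -> ones (untouched); rank 1 -> zeros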


Labels: CLA Signed


Successfully merging this pull request may close this issue: HF modeling with torch.compile doesn't work when used with Tensor Parallel
