[HF] Model Definition Conversion Support for FLUX#1582
Conversation
tianyu-l
left a comment
Picture looks great!
Had some comments.
```python
except FileNotFoundError:
    index_files = [
        "model.safetensors.index.json",
        "diffusion_pytorch_model.safetensors.index.json",
```
I think we shouldn't hardcode a diffusion-model-specific file name here. If this is special treatment, let's do it in Flux's own state_dict_adapter (maybe without inheriting this __init__).
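A minimal sketch of what this suggestion could look like, keeping the diffusion-specific index filename out of the generic adapter by overriding it in Flux's own adapter. Class and attribute names here are illustrative, not the actual torchtitan API:

```python
import os

class BaseStateDictAdapter:
    # generic adapters only know about the standard HF index file
    HF_INDEX_FILE = "model.safetensors.index.json"

    def __init__(self, hf_assets_path):
        self.index_path = os.path.join(hf_assets_path, self.HF_INDEX_FILE)

class FluxStateDictAdapter(BaseStateDictAdapter):
    # Flux checkpoints exported by diffusers use a different index filename,
    # so the special case lives in the Flux subclass, not the base class.
    HF_INDEX_FILE = "diffusion_pytorch_model.safetensors.index.json"
```

This keeps the base `__init__` free of model-specific branching while still letting Flux resolve its own index file.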
```python
import torch
import torch.distributed.checkpoint as dcp
import torchtitan.experiments.flux  # noqa: F401
```
Why do we need this import here, when we didn't need to import llama?
```python
return state_dict
```

```python
def build_flux_state_dict_adapter(
```
Do we need this function? If we wanted the `TrainSpec` field to be a `build_xxxx` callable rather than plugging in the class directly, this function would make sense, but I don't see it used anywhere right now.
```python
self.timestep_cycle = itertools.cycle(val_timesteps)
```

```python
# Disable classifier free guidance for validation
self.job_config.training.classifier_free_guidance_prob = 0.0
```
I see the only appearance of this field at
You already called super().__init__() above; will this still take effect?
It also sounds to me like you might have to pass this in the constructor, unless you have other better ideas?
You're right: in the above implementation this line unintentionally disabled dropout during training as well, since the job_config is changed globally.
I switched this override to happen inside the Flux validator's validate and switch it back at the end.
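The fix described above can be sketched as a save/override/restore pattern inside `validate`, so the global job_config is only mutated for the duration of validation. Class and field names below are illustrative stand-ins, not the actual torchtitan API:

```python
from types import SimpleNamespace

class FluxValidator:
    def __init__(self, job_config):
        self.job_config = job_config

    def validate(self):
        cfg = self.job_config.training
        saved = cfg.classifier_free_guidance_prob
        # Disable CFG dropout only while validating.
        cfg.classifier_free_guidance_prob = 0.0
        try:
            pass  # ... run the validation loop here ...
        finally:
            # Restore the training-time value even if validation raises.
            cfg.classifier_free_guidance_prob = saved

job_config = SimpleNamespace(
    training=SimpleNamespace(classifier_free_guidance_prob=0.1)
)
FluxValidator(job_config).validate()
```

The `try`/`finally` guarantees the training-time dropout probability survives validation, which is exactly what the constructor-time override broke.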
```python
import torch
```

```python
logger = logging.getLogger()
```
Nit: it's better to put this line after the last import (line 21 in this case).
Removed the redundant from_hf_map_combine mapping in favor of reversed_combination_plan. The reasoning: from_hf_map_combine forced combining in only one direction (tt->hf or hf->tt). Now, if a conversion needs both combining and splitting, we can check whether a key is in combination_plan to know we need to split, or check whether it is in reversed_combination_plan to know we need to combine.
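The direction-agnostic lookup described above can be sketched as follows. The weight keys are illustrative, not the actual Flux conversion map entries:

```python
# One plan drives both directions: a single torchtitan key that corresponds
# to several HF keys must be split one way and combined the other way.
combination_plan = {
    "attn.qkv.weight": (
        "attn.to_q.weight",
        "attn.to_k.weight",
        "attn.to_v.weight",
    ),
}
# Reversing the plan gives the combine-direction lookup for free.
reversed_combination_plan = {v: k for k, v in combination_plan.items()}

def needs_split(key):
    # This key expands into several keys in the target format.
    return key in combination_plan

def needs_combine(keys):
    # These keys collapse into a single key in the target format.
    return tuple(keys) in reversed_combination_plan
```

A single source of truth avoids the two maps drifting out of sync, which is the failure mode the redundant from_hf_map_combine invited.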
tianyu-l
left a comment
I didn't read into the conversion map details, but the organization looks good to me.
This PR adds the `FluxStateDictAdapter`, allowing us to convert checkpoints to and from HF.

Additional changes:
- Modifies the `download_hf_assets` script to support downloading diffusion-type safetensor files
- Registers Flux's `TrainSpec` in `convert_from_hf` and `convert_to_hf` so that the conversion script can be reused, e.g. `python ./scripts/checkpoint_conversion/convert_from_hf.py ./assets/hf/FLUX.1-dev/transformer ./outputs/temp --model_name flux --model_flavor flux-dev`

Tests: Performing a KL divergence test on the forward pass of converted weights loaded in `torchtitan` against HF weights loaded with HF's `FluxTransformer2DModel`, we get:
```
Average loss for test from_hf is 7.233546986222528e-13
```
Additionally, we can now run inference with HF weights to verify the changes made in pytorch#1548

### Batched Inference on TorchTitan:
| | prompt0 | prompt1 | prompt2 |
| --- | --- | --- | --- |
| no CFG | <img width="1024" height="1024" alt="prompt0_nocfg" src="https://github.com/user-attachments/assets/421fab49-239a-4ca2-b51a-16823d89acfd" /> | <img width="1024" height="1024" alt="prompt1_nocfg" src="https://github.com/user-attachments/assets/534b557e-7b93-4f2e-b3b3-3a0c7cf57c40" /> | <img width="1024" height="1024" alt="prompt2_nocfg" src="https://github.com/user-attachments/assets/d0f33526-f95d-47db-b5a6-6200bfa151f9" /> |
| CFG | <img width="1024" height="1024" alt="prompt0_cfg" src="https://github.com/user-attachments/assets/83234675-eb47-4785-abe1-0f07dd854f1c" /> | <img width="1024" height="1024" alt="prompt1_cfg" src="https://github.com/user-attachments/assets/5e76f3e7-0ca3-47a4-a0ef-3c7e983e8c2c" /> | <img width="1024" height="1024" alt="prompt2_cfg" src="https://github.com/user-attachments/assets/c8cbe367-d96e-4559-a201-48e8dc3d18ee" /> |
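The KL-divergence equivalence test mentioned above can be sketched as follows. The stand-in distributions below are illustrative; the real test compares forward-pass outputs from torchtitan (converted weights) against HF's `FluxTransformer2DModel` on identical inputs:

```python
import math

def kl_divergence(p, q):
    # KL(p || q) for two discrete probability distributions.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Stand-ins for the two models' output distributions on the same input.
ref_out = [0.2, 0.5, 0.3]        # HF reference forward pass
converted_out = [0.2, 0.5, 0.3]  # torchtitan forward pass with converted weights

loss = kl_divergence(ref_out, converted_out)
# A near-zero value (~1e-13 in the PR's test) indicates the weight
# conversion is numerically faithful.
```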