Extend usage for olora finetune script #2308

BenjaminBossan merged 17 commits into huggingface:main
Conversation
BenjaminBossan left a comment
Thanks for making these additions to the OLoRA fine-tuning script. Overall I think these are good additions, I just have a few small comments.
Also pinging @tokenizer-decode in case you also want to review but it's not necessary for this change, as OLoRA is not directly involved.
examples/olora_finetuning/README.md (outdated)

> Please add `--device_map cpu` if you want to run the finetuning on CPU.
>
> If you want to train a quantized model such as AWQ or GPTQ, which do not support the OLoRA init method, please pass `--init_lora_weights gaussian`.
Would it be possible to use AWQ or GPTQ right now? If the user passes `--quantize`, the usage of bitsandbytes is hard-coded.
Yes, we can just pass a quantized model. I have updated it in the README.
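For illustration, a minimal sketch of what passing an already-quantized model with gaussian init could look like (the GPTQ checkpoint id and target modules below are placeholders, not taken from this PR):

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Load a pre-quantized checkpoint (any AWQ/GPTQ model id works the same way);
# the id below is only an example.
model = AutoModelForCausalLM.from_pretrained(
    "TheBloke/Llama-2-7B-GPTQ",
    device_map="auto",
)

# OLoRA init is not supported for AWQ/GPTQ layers, so fall back to gaussian init.
config = LoraConfig(init_lora_weights="gaussian", target_modules=["q_proj", "v_proj"])
model = get_peft_model(model, config)
```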
Seems like a good addition to me. I haven't run the script, but it looks good at first glance. You can ping me if anything is needed. Thanks btw @jiqing-feng
Hi @BenjaminBossan. I have addressed all your comments; please review the new changes. Thanks!
    output_dir=output_dir,
    save_total_limit=3,
    load_best_model_at_end=True,
    ddp_find_unused_parameters=False if world_size > 1 else True,
Why should this be set to True if world_size is not greater than 1? AFAICT, the default is None.
Without this parameter, the following warning is printed when running DDP:
    [rank1]:[W108 19:06:54.522318540 reducer.cpp:1400] Warning: find_unused_parameters=True was specified in DDP constructor, but did not find any unused parameters in the forward pass. This flag results in an extra traversal of the autograd graph every iteration, which can adversely affect performance. If your model indeed never has any unused parameters in the forward pass, consider turning this flag off. Note that this warning may be a false positive if your model has flow control causing later iterations to have unused parameters. (function operator())

(the same warning is printed by each of the other ranks)
I mean changing the line to: `ddp_find_unused_parameters=False if world_size > 1 else None`. Is that what you tested?
The parameter should be set to True here.
Right, it should be None if no DDP.
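Concretely, the suggested change is something like the following (a sketch of only the relevant argument; `world_size` is assumed to come from the environment, as is typical for DDP launches):

```python
import os

from transformers import TrainingArguments

# WORLD_SIZE > 1 means the script was launched for DDP (e.g. via torchrun).
world_size = int(os.environ.get("WORLD_SIZE", 1))

training_args = TrainingArguments(
    output_dir="olora-output",  # placeholder path for this sketch
    # Disable the unused-parameter search under DDP to avoid the warning above;
    # leave it at the default (None) when not running DDP.
    ddp_find_unused_parameters=False if world_size > 1 else None,
)
```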
Please trigger the tests again. Thanks!

Format fixed.

The failed tests do not seem to be related to my changes.
BenjaminBossan left a comment
The changes LGTM, thanks.
Failing CI is indeed unrelated, I'll merge.
- allow DDP
- make it work on CPU
- set seed and dtype

Related: `dequantize_bnb_weight` is updated not to move to cuda if not available (sketched below).
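For context, a minimal sketch of the kind of device guard meant by the `dequantize_bnb_weight` note (illustrative only; the actual PEFT function does more than this, and the helper name here is made up):

```python
import torch


def _maybe_move_to_cuda(weight: torch.Tensor) -> torch.Tensor:
    # Illustrative helper, not the real PEFT code: move the tensor to CUDA only
    # when a CUDA device is actually available, so dequantization also works on
    # CPU-only machines instead of failing on an unconditional .cuda() call.
    if torch.cuda.is_available() and weight.device.type != "cuda":
        return weight.cuda()
    return weight
```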
I compared a lot of LoRA finetuning scripts and found this one to be the best; it's clear and easy to understand. So I want to extend this script to support more platforms and also DDP, with minimal changes.

I have tested the script on CUDA and on CPU with 4-process DDP, on OPT and Llama-3, and it shows reasonable results.

It would be great if this PR could be merged so we can apply it to more models and devices.