
Conversation

@patrik-bartak (Contributor) commented Apr 25, 2025

#390

Right now the RL and SFT datasets use the tokenizer's chat template to format the prompt. Some models do not have a chat template (for example, deepseek-ai/deepseek-coder-1.3b-base), and sometimes it is useful to override the template anyway. This PR adds an option to override the template when the tokenizer is loaded.

For example, setting the template to "{{ bos_token }}{{ messages }}" and assigning "string" to the dataset prompt key will make prompt_with_chat_template equal to <bos>string.

By default the config value is null, so the model's default template is used.
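
For illustration, here is a minimal sketch of this behavior using the Hugging Face API directly (this is not verl's actual loading code, and the literal BOS text depends on the tokenizer):

```python
from transformers import AutoTokenizer

# deepseek-coder-1.3b-base ships without a chat template; assigning one
# here mimics what the new config option does when the tokenizer is loaded.
tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/deepseek-coder-1.3b-base")
tokenizer.chat_template = "{{ bos_token }}{{ messages }}"

# With the dataset prompt key set to the plain string "string", rendering
# yields "<bos>string". Note that passing a raw string as the conversation
# only works because this template treats messages as an opaque value;
# standard templates expect a list of {"role": ..., "content": ...} dicts.
prompt = tokenizer.apply_chat_template("string", tokenize=False)
print(prompt)
```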

I have added the required code to make this work with the RL dataset, SFT dataset, and main_ppo / main_ppo_split. If there are other parts of the code that should be updated to make this option work, let me know.

More info here: https://huggingface.co/docs/transformers/main/en/chat_templating

@vermouth1992 (Collaborator) commented

Could you help modify one test to use a custom chat template? Maybe pass the chat template from the tokenizer just for testing?

@patrik-bartak (Contributor, Author) commented Apr 30, 2025

@vermouth1992
I don't really see a test that I could modify. This PR just lets you override the chat_template using a key in the config; it is then used automatically whenever the code calls apply_chat_template.

Maybe the function hf_tokenizer could print a warning/info message saying whether the default template is being used or whether it is being overridden? When I first used verl, it was not clear to me why the prompt differed from what I set up in data_preprocess; only later did I find out the chat template was being applied.
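
Roughly something like this sketch (hf_tokenizer is verl's helper, but the signature and log messages here are only illustrative, not the actual implementation):

```python
import logging

from transformers import AutoTokenizer

logger = logging.getLogger(__name__)


def hf_tokenizer(name_or_path, chat_template=None, **kwargs):
    # Hypothetical sketch: load the tokenizer, then report which chat
    # template will apply so users are not surprised by the formatted prompt.
    tokenizer = AutoTokenizer.from_pretrained(name_or_path, **kwargs)
    if chat_template is not None:
        logger.info("Overriding the tokenizer's chat template from the config.")
        tokenizer.chat_template = chat_template
    elif tokenizer.chat_template is not None:
        logger.info("Using the model's default chat template.")
    else:
        logger.warning(
            "Tokenizer has no chat template; apply_chat_template will fail "
            "unless one is provided."
        )
    return tokenizer
```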

