
Conversation

mattdangerw
Member

Draft of weight conversion script for T5 1.1 weights.
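
For reference, a common starting point for such a script is to enumerate the source checkpoint's weights before mapping them onto KerasNLP layers. A minimal sketch, assuming the HuggingFace `google/t5-v1_1-base` checkpoint as the conversion source (the actual script in this PR may differ):

```python
from transformers import T5ForConditionalGeneration

# Load the source T5 1.1 checkpoint (assumption: converting from the
# HuggingFace port rather than the original T5X checkpoint).
hf_model = T5ForConditionalGeneration.from_pretrained("google/t5-v1_1-base")

# List every source weight name and shape; the conversion script's job is
# to map each of these onto the matching KerasNLP variable and assign it.
for name, param in hf_model.named_parameters():
    print(name, tuple(param.shape))
```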

Still need to figure out...

  • A better approach for approximate GELU (a sketch of the standard tanh approximation follows this list).
  • How to store the weights for the language model output layer.
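
For context, T5 1.1's feedforward blocks use a gated GELU, and the open question above is how to express the tanh-based approximation cleanly. A minimal sketch of that approximation (the standard formulation, not necessarily what this PR ends up using):

```python
import numpy as np

def gelu_approx(x):
    # Tanh approximation of GELU:
    # 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3)))
    return 0.5 * x * (
        1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * np.power(x, 3)))
    )
```

In TensorFlow, `tf.keras.activations.gelu(x, approximate=True)` computes the same thing.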

@dathudeptrai

@mattdangerw any update?

@mattdangerw
Member Author

@dathudeptrai thanks for the ping. Should be updates on this shortly!

Basically, we were focused on a 0.5 release (just out a few days ago), with generation utils and simple decoder models.

But that's done, so now we will be full speed ahead on landing T5 and BART, to seed a seq2seq offering.

@abuelnasr0
Contributor

abuelnasr0 commented May 13, 2023

Can I contribute to shipping this model?
There are changes I want to make to the T5Tokenizer. There are no changes in this PR for the tokenizer, so there will be no conflicts.

  1. The tokenizer only checks that the pad_token exists. I will make it check for pad_token, end_token, and unk_token as well.
  2. This is written in the code:
# T5 uses the same start token as end token, i.e., "</s>".

But actually T5 uses the pad_token as the start token for the decoder input; see the HuggingFace documentation for the decoder_input_ids argument. A sketch of this shift-right behavior follows this list.
  3. Add an extra_ids argument that adds extra IDs at the end of the vocabulary, for use as sentinels during T5 training.
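
A minimal sketch of points 2 and 3, assuming integer label arrays and the standard 100 T5 sentinel tokens (illustrative only, not this PR's implementation):

```python
import numpy as np

def shift_right(labels, pad_token_id):
    # Build decoder_input_ids by shifting the labels one step to the right
    # and prepending the pad token, which T5 uses as the decoder start token.
    decoder_input_ids = np.roll(labels, 1, axis=-1)
    decoder_input_ids[..., 0] = pad_token_id
    return decoder_input_ids

# Point 3: sentinel tokens appended at the end of the vocabulary.
extra_ids = 100
sentinels = [f"<extra_id_{i}>" for i in range(extra_ids)]
```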

mattdangerw added a commit to mattdangerw/keras-hub that referenced this pull request Jun 16, 2023
This was noticed on keras-team#900, but we should probably get the fix into the
forward pass without waiting on checkpoints.
@mattdangerw mentioned this pull request Jun 16, 2023
mattdangerw added a commit that referenced this pull request Jun 21, 2023
This was noticed on #900, but we should probably get the fix into the
forward pass without waiting on checkpoints.
@mattdangerw
Member Author

This is finally coming in on #1277
