Skip to content

JetRunner/Sequence_Span_Rewriting

 
 

Repository files navigation

Sequence_Span_Rewriting

Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting

Usage

data_generation.py contains key functions of generating training data for the sequence span rewriting objective.

data_gen.py contains an example of data generation.

run_summarization.py is from huggingface transformers. We use this function to continually per-train with SSR and fine-tune on downstream tasks.

run_generation.py is used for inference (i.e., generation)

Pre-trained models

you can load our pre-trained SSR-base from huggingface's model hub:

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("microsoft/ssr-base")

model = AutoModelForSeq2SeqLM.from_pretrained("microsoft/ssr-base")

About

Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%