Data description

WMT2017~WMT2020

We provide the original sentence pairs, together with their corresponding Pre-Ins and Post-Ins formatted versions, in this folder.

Alpaca

We treat the official Alpaca data as Pre-Ins formatted and provide its Post-Ins conversion here.
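
As a rough illustration of such a conversion, the sketch below reorders one Alpaca-style record. It assumes that Pre-Ins places the instruction before the input text and Post-Ins places it after, and that records use the `instruction`, `input`, and `output` fields of the official Alpaca release. The function name `reorder_record` and the newline-joined prompt layout are hypothetical and may differ from the templates actually used for the data in this repository.

```python
def reorder_record(record):
    """Build Pre-Ins and Post-Ins prompts from one Alpaca-style record.

    Assumption: Pre-Ins = instruction first, Post-Ins = instruction last.
    The newline-joined layout is illustrative, not the repository's template.
    """
    instruction = record["instruction"].strip()
    source = record.get("input", "").strip()

    if not source:
        # Nothing to reorder when the record has no input text.
        pre_ins = post_ins = instruction
    else:
        pre_ins = f"{instruction}\n{source}"
        post_ins = f"{source}\n{instruction}"

    return {"pre_ins": pre_ins, "post_ins": post_ins, "output": record["output"]}


example = {
    "instruction": "Translate the sentence into German.",
    "input": "Good morning.",
    "output": "Guten Morgen.",
}
converted = reorder_record(example)
print(converted["post_ins"])
```

Records without an input are left unchanged here, since there is no source text for the instruction to follow.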

CNN/DailyMail

We follow the pre-processing scripts of ProphetNet and provide the processed data here.

MQM

We treat the processed MQM data provided by Parrot as Pre-Ins formatted and provide its Post-Ins conversion here.