The repository for ACL 2023 paper: Disentangled Phonetic Representation for Chinese Spelling Correction.
Data is downloaded from ACL 2021 paper ReaLiSe
The data processing code is data_process/data_processor.py
.
python data_process/data_processor.py
The code of Dorm is dorm.py
, and the code of training is dorm_finetune.py
.
The scripts of training and evaluation are train.sh
and test.sh
, respectively.
Our code was modified and developed based on ReaLiSe, and we would like to express our gratitude to their team.