This repository contains the data and models from the paper '“Is Whole Word Masking Always Better for Chinese BERT?”: Probing on Chinese Grammatical Error Correction', accepted to Findings of ACL 2022 as a short paper. The paper link is here.
The data was obtained from the CGED benchmark for Chinese Grammatical Error Diagnosis and subsequently processed by us. The checkpoint link of RoBERTa is here; you can load the model using the Hugging Face interfaces.
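Loading the released checkpoint can be sketched as follows, assuming the `transformers` package is installed. The path `"path/to/checkpoint"` and the helper name `load_checkpoint` are placeholders for illustration, not names from this repository; substitute the directory where you downloaded the checkpoint.

```python
def load_checkpoint(path):
    """Load a BERT-style masked-LM checkpoint with Hugging Face transformers.

    `path` is a placeholder for the local checkpoint directory (or a model
    identifier on the Hugging Face Hub). Requires `pip install transformers`.
    """
    # Imported lazily so the helper can be defined without transformers present.
    from transformers import AutoTokenizer, AutoModelForMaskedLM

    tokenizer = AutoTokenizer.from_pretrained(path)
    model = AutoModelForMaskedLM.from_pretrained(path)
    return tokenizer, model


if __name__ == "__main__":
    # Example usage (hypothetical path):
    # tokenizer, model = load_checkpoint("path/to/checkpoint")
    # inputs = tokenizer("我喜欢[MASK]京。", return_tensors="pt")
    # outputs = model(**inputs)
    pass
```

The `Auto*` classes resolve the concrete architecture (here, RoBERTa trained with a BERT-style tokenizer) from the checkpoint's config file, so the same snippet works for either the WWM or char-masked variant.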