Skip to content

nymwa/arterarejo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

arterarejo

Workspace for arteraro

  • ニューラル文法誤り訂正のための多様な規則を用いる人工誤り生成 (言語処理学会第27回年次大会)
    • Please use v1.0.0 to reproduce results of this paper.

how to use

1. download datasets

Read and follow corpora/README.md

2. prepare tokenized corpora in arterarejo/tokenized

Read and follow tokenized/README.md

3. prepare SpaCy labeled corpora in arterarejo/labeled

Read and follow labeled/README.md

4. prepare afiksilo model

Read and follow afiksilo/README.md

5. prepare BPE model under bpe

Read and follow bpe/README.md

6. prepare falsliter model

Read and follow falsliter/README.md

7. prepare ortobruilo model

Read and follow ortobruilo/README.md

8. prepare errant environment under arterarejo/errant

Read and follow errant/README.md

9. prepare m2scorer in arterarejo/m2scorer

Run sh download in /path/to/arterarejo/m2scorer.

10. generate artificial erronous data under arterarejo/noised

Read and follow noised/README.md

11. prepare fairseq preprocessed data under arterarejo/data

Read and follow data/README.md

12. run fairseq training, generation, and evaluation under arterarejo/expt

Read and follow expt/README.md

About

workspace for arteraro

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages