Skip to content

Constraint Grammar based pseudonymization method for IKDP Spoken Komi corpus.

License

Notifications You must be signed in to change notification settings

langdoc/langdoc-pseudonymization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Pseudonymization method for language documentation corpora

This repository accompanies the paper of Niko Partanen, Rogier Blokland and Michael Rießler "A pseudonymisation method for language documentation corpora: An experiment with spoken Komi" PDF.

@inproceedings{partanenEtAl2020a,
  title={A pseudonymisation method for language documentation corpora: An experiment with spoken Komi},
  author={Partanen, Niko and Blokland, Rogier and Rie{\ss}ler, Michael},
  booktitle={Proceedings of the Sixth International Workshop on Computational Linguistics of Uralic Languages},
  pages={1--8},
  year={2020}
}

Notes

  • As the newest version of uralicNLP allows using both Komi and Russian FST, we need to adjust the rules to take into account the Russian readings correctly.

About

Constraint Grammar based pseudonymization method for IKDP Spoken Komi corpus.

Resources

License

Stars

Watchers

Forks

Packages

No packages published