@cltk

Classical Language Toolkit

Natural language processing for Classical languages

  • Latin treebank from the Perseus Digital Library

    Python 2 1 Updated Jun 22, 2017
  • Collected Greek files from the Perseus Digital Library

    Python 4 2 Updated Jun 21, 2017
  • Collected Latin files from the Perseus Digital Library

    Python 5 4 Updated Jun 21, 2017
  • Collected texts from wikisource.org

    1 Updated Jun 19, 2017
  • Old French lexicon from wikisource.org

    Updated Jun 19, 2017
  • Training sets and tokenizer for the Latin language, for use with CLTK

    Python 3 3 MIT Updated May 26, 2017
  • Corpus for Classical Arabic

    1 Updated Mar 21, 2017
  • contains malayalam_text

    HTML Updated Mar 16, 2017
  • Classical Telugu texts from Wikisource

    Python 3 Updated Mar 16, 2017
  • HTML Updated Mar 15, 2017
  • This repository contains parallel Sanskrit and English documents.

    Python 2 1 Updated Mar 15, 2017
  • Python 1 3 Updated Mar 7, 2017
  • Python 2 Updated Feb 27, 2017
  • Structured Jewish texts and metadata exported from Sefaria's database.

    Python 1 29 Updated Feb 19, 2017
  • Chinese Buddhist scriptures from CBETA

    Python Updated Feb 19, 2017
  • Chinese Buddhist scriptures from CBETA

    Python 1 Updated Feb 19, 2017
  • Extracted Old Javanese text.

    HTML Updated Feb 19, 2017
  • Corpus for Italian Poetry in Latin

    HTML 2 Updated Feb 19, 2017
  • HTML 2 Unlicense Updated Feb 19, 2017
  • Pali Tipitaka packaged with the Digital Pali Reader

    JavaScript 3 GPL-2.0 Updated Feb 19, 2017
  • Punjabi files of Gurbani

    Python 1 Unlicense Updated Feb 19, 2017
  • Python 1 3 CC0-1.0 Updated Feb 19, 2017
  • Sanskrit monolingual corpus

    Python 3 1 Updated Feb 19, 2017
  • Texts from Corpus of Middle English Prose and Verse

    Perl 1 3 Updated Feb 18, 2017
  • Official releases of the TOROT treebank

    2 Updated Feb 14, 2017
  • Lexica and lemmata for the Ancient Greek language, from various sources

    Python 7 5 Updated Feb 8, 2017
  • Python Updated Nov 1, 2016
  • Python MIT Updated Oct 11, 2016
  • Sanskrit Corpus

    5 6 Updated Sep 29, 2016
  • Trained taggers, tokenizers, etc. for the CLTK

    1 MIT Updated Sep 26, 2016