Skip to content

Larz60p/NLTK-Corpora-Catalog

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NLTK-Corpora-Catalog

Creates a catalog of currently available NLTK corpora, and it's attributes It also generates a dictionary which is saved to a JSON file named corpora.json

Output sample

RecId: RecId57
    webpage: http://www.pascal-network.org/Challenges/RTE/
    unzipped_size: 1279930
    url: https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/corpora/rte.zip
    name: PASCAL RTE Challenges 1, 2, and 3
    id: rte
    size: 386303
    unzip: 1
    checksum: ca21663daa326a3bb53001c3d82e62d6
    subdir: corpora

About

Creates a catalog of currently available NLTK corpora, and it's attributes

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages