Creates a catalog of currently available NLTK corpora, and it's attributes It also generates a dictionary which is saved to a JSON file named corpora.json
Output sample
RecId: RecId57
webpage: http://www.pascal-network.org/Challenges/RTE/
unzipped_size: 1279930
url: https://raw.githubusercontent.com/nltk/nltk_data/gh-pages/packages/corpora/rte.zip
name: PASCAL RTE Challenges 1, 2, and 3
id: rte
size: 386303
unzip: 1
checksum: ca21663daa326a3bb53001c3d82e62d6
subdir: corpora