Skip to content

mediagestalt/Counting-Word-Frequencies

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

Counting-Word-Frequencies

iPython notebook that counts word frequencies for documents in a multi-file directory. The complete version is found here: http://mediagestalt.com/thesis/CountingWordFrequencies.html

The data is from the Canadian House of Commons Parliamentary debates, published as Hansard. It can be downloaded as a zip file here: https://dataverse.library.ualberta.ca/dvn/dv/hansard

The data includes transcripts for the years 2006 to 2015 (Parliaments 39-41) inclusive.

This repo can also be viewed in iPython notebook format at: http://nbviewer.ipython.org/github/mediagestalt/Counting-Word-Frequencies. Download the directory and explore the data in your own way.

About

iPython notebook that counts word frequencies for documents in a multi-file directory.

Resources

License

Stars

Watchers

Forks

Packages

No packages published