iPython notebook that counts word frequencies, comparing the ratio of the word privacy
to the total number of words in a corpus.
The data is from the Canadian House of Commons Parliamentary debates, published as Hansard. It can be downloaded as a zip file here: https://dataverse.library.ualberta.ca/dvn/dv/hansard
The data includes transcripts for the years 2006 to 2015 (Parliaments 39-41) inclusive.
This repo can also be viewed in iPython notebook format at: http://nbviewer.ipython.org/github/mediagestalt/Adding-Context. Download the directory and explore the data in your own way.