Text analysis project
Letters: plot and compare the frequencies of each alphabet letters in novels written in different languages. Use of Pandas and Matplotlib.
Words: compare the use of certain words in novels over time. Use of Matplotlib and nltk.
Python class to perform the cleaning of Gutenberg books.
- Script to get counts and frequencies.
- Using Pandas (with Matplotlib and ggplot) to visualise letters frequencies in three books (English language): ![img_bar_plot] (https://raw.github.com/Eleonore9/text_analysis/master/imgs/letters_freqs.png)
- Words: basic script for counts and frequencies. On hold until the part on "Letters" is over.