'Analyse' of words/letters in books of the Gutenberg project
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.ipynb_checkpoints
books
imgs
.gitignore
README.md
Test_textanalysis.py.ipynb
clean_text_class.py
count_words_letters_class.py
requirements.txt
text_letters_analysis.py

README.md

Text analysis project

Idea:

  • Letters: plot and compare the frequencies of each alphabet letters in novels written in different languages. Use of Pandas and Matplotlib.

  • Words: compare the use of certain words in novels over time. Use of Matplotlib and nltk.

Progress:

Python class to perform the cleaning of Gutenberg books.

My iPython notebook

  • Words: basic script for counts and frequencies. On hold until the part on "Letters" is over.