Time At Last

Purpose of the project.

The purpose of this project is evaluate the closeness of the writing composition between one book and a group of other books.

To do this I have broken down the steps into two parts in the code above.

Works with the content of each book. Every book that comes into the parser:

Has it's parts of speech and sentiment calculated.
Has it's stop words and proper nouns removed.
Remaining POS (Part of Speech) Tags, Words, and Sentiment is saved as raw json to be read by the analyzer.

Works with the content of each book in respect to every other book:

Look at the sentiment over time and mark period of book where it crossed certain thresholds.
Use TFIDF vectorization to compare common and unique words in all books and output most similar books.
Use cosine similarity to characterize defining words in each book that appear the least in the other books.

I have added an output of the text as well to this repo as results.txt.

A demo of this technique is below. Final product demo

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
archived		archived
parsed		parsed
screens		screens
.DS_Store		.DS_Store
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
analyzer.py		analyzer.py
analyzer_single.py		analyzer_single.py
changer.py		changer.py
parser-test.py		parser-test.py
parser.py		parser.py
results.txt		results.txt
tron.py		tron.py
viewer.py		viewer.py