NLP-Project Table of Contents Term Statistics Sentiment analysis Topic extraction Term Statistics Data available at: http://snap.stanford.edu/data/amazon/productGraph/categoryFiles/reviews_Office_Products_5.json.gz Text processing pipeline JSON formatting Tokenization Removal of stop words Lemmatization Stemming Frequency vs Rank POS collection 2-gram, 3-gram, 4-gram generation Sentiment analysis TF-IDF computation Naive Bayes classification Topic extraction