# Advanced Big Data Analytics

Building basic NLTK modules

  • Basic chunking has been implemented
  • Chunking could be used to figure out which other proper nouns Donald Trump's supporters talk about
  • Chinking may also be useful, but I plan to stick with chunking
  • Named entity recognition is complete and has a clear use case, though its best application still needs to be worked out (see the NER sketch after this list)
  • WordNet is a powerful tool for finding synonyms and antonyms. A possible use case is translating foreign-language words, but viability is limited by the fact that WordNet cannot detect the language on its own (the Google Translate API could be a better fit). A lookup sketch appears after this list
  • In better_text_classification.py, the code that saves the pickle is commented out (a generic pickle save/load sketch appears after this list)
  • The NLTK modules now include the completed sentiment analysis module
  • The module lives in senti_final.py
  • Add your text and run run_sentiment_analysis.py to check the sentiment analysis results
  • The file final_chunk.py contains the working chunking code (a minimal chunking sketch appears below)
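
The chunking code itself is not shown here, so the following is a minimal sketch of noun-phrase chunking with NLTK's `RegexpParser`; the grammar and sample sentence are assumptions, and `final_chunk.py` may differ.

```python
import nltk

# One-time NLTK data downloads (uncomment on first run):
# nltk.download("punkt"); nltk.download("averaged_perceptron_tagger")

sentence = "Donald Trump spoke to supporters in Ohio on Tuesday."
tokens = nltk.word_tokenize(sentence)
tagged = nltk.pos_tag(tokens)

# Chunk optional determiners/adjectives followed by one or more nouns into NPs,
# so proper-noun mentions can be collected and counted.
grammar = r"NP: {<DT>?<JJ>*<NN.*>+}"
parser = nltk.RegexpParser(grammar)
tree = parser.parse(tagged)

for subtree in tree.subtrees(filter=lambda t: t.label() == "NP"):
    print(" ".join(word for word, tag in subtree.leaves()))
```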
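
For the named entity recognition step, a hedged sketch using NLTK's built-in `ne_chunk`; the sample text is illustrative only.

```python
import nltk

# Also requires the "maxent_ne_chunker" and "words" NLTK data packages.
text = "Hillary Clinton debated Donald Trump in New York."
tagged = nltk.pos_tag(nltk.word_tokenize(text))
tree = nltk.ne_chunk(tagged)

# Print each recognised entity with its label (PERSON, GPE, ORGANIZATION, ...).
for subtree in tree.subtrees(filter=lambda t: t.label() != "S"):
    entity = " ".join(word for word, tag in subtree.leaves())
    print(subtree.label(), "->", entity)
```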
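
The WordNet synonym/antonym lookup mentioned above works roughly like this; the query word "good" is just an example.

```python
from nltk.corpus import wordnet  # requires nltk.download("wordnet")

synonyms, antonyms = set(), set()
for synset in wordnet.synsets("good"):
    for lemma in synset.lemmas():
        synonyms.add(lemma.name())
        for antonym in lemma.antonyms():
            antonyms.add(antonym.name())

print("synonyms:", sorted(synonyms))
print("antonyms:", sorted(antonyms))
```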
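
Finally, since the pickle-saving code in better_text_classification.py is commented out, here is only the generic pattern for persisting and reloading an NLTK classifier; the tiny training set and file name are illustrative assumptions, not the project's own data.

```python
import pickle
import nltk

def word_features(words):
    """Bag-of-words feature dict for an NLTK classifier."""
    return {word: True for word in words}

# Toy training data, purely for illustration.
train_set = [
    (word_features("i love this".split()), "pos"),
    (word_features("this is terrible".split()), "neg"),
]
classifier = nltk.NaiveBayesClassifier.train(train_set)

# Save the trained classifier so it can be reused without retraining.
with open("naive_bayes.pickle", "wb") as f:
    pickle.dump(classifier, f)

# Later (e.g. from a run_sentiment_analysis-style script), load and classify.
with open("naive_bayes.pickle", "rb") as f:
    loaded = pickle.load(f)

print(loaded.classify(word_features("i love it".split())))
```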