💬 Sentiment Analysis & NLP Pipeline

Applied to 15,000 Disneyland Reviews

Overview

Comprehensive NLP pipeline on 15,000 Disneyland park reviews covering sentiment classification, text summarization, named entity recognition, and n-gram analysis using BERT and traditional ML.

Dataset

Source: Disneyland Reviews (Kaggle)
Size: 15,000 reviews
Task: Sentiment classification (1-5 star ratings)

Results

BERT: 89% sentiment classification accuracy
4-7% accuracy improvement via T5 summarization preprocessing pipeline
NER identifies locations, organizations, entities
Word cloud and n-gram analysis of key themes

NLP Tasks

Text Preprocessing — contractions, tokenization, stopwords, lemmatization (NLTK + spaCy)
Exploratory Analysis — rating distribution, word clouds, n-gram analysis, NER
Sentiment Classification — BERT, Logistic Regression, SVM, Naive Bayes, TextBlob (TF-IDF)
Text Summarization — extractive (Summa) and abstractive (T5 transformer), ROUGE evaluation

Tech Stack

How to Run

git clone https://github.com/Phoenixking-04/Sentiment-Analysis-NLP.git
pip install pandas scikit-learn nltk spacy transformers torch wordcloud textblob sumy rouge-score matplotlib seaborn
python -m spacy download en_core_web_sm
jupyter notebook NLP.ipynb

🔗 Developer: Kalyankumar Sandireddy

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Final_NLP.ipynb		Final_NLP.ipynb
Processed_DisneylandReviews.csv		Processed_DisneylandReviews.csv
README.md		README.md
after_preprocessing.csv		after_preprocessing.csv
dataset_15k.csv		dataset_15k.csv
model_evaluation_results.csv		model_evaluation_results.csv
updated_sentiment_analysis_results.csv		updated_sentiment_analysis_results.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

💬 Sentiment Analysis & NLP Pipeline

Overview

Dataset

Results

NLP Tasks

Tech Stack

How to Run

About

Uh oh!

Releases

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

💬 Sentiment Analysis & NLP Pipeline

Overview

Dataset

Results

NLP Tasks

Tech Stack

How to Run

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Contributors

Uh oh!

Languages