Sentiment Analysis on Movie Reviews

created by : I Gusti Ngurah Ervan Juli Ardana

Description

I made this project for the ITS Internal Satria Data (Big Data Challenge) competition. The project uses machine learning to guess how people feel based on movie reviews. let's look deeper

Workflow

Import Libraries
Load the Data
Pre-process the Data

In the pre-processing stage, various approaches were employed to enhance accuracy:
- Lowercasing
- Removing punctuation
- Eliminating white spaces
- Removing numbers
- Eliminating stop words
- Tokenizing
- Stemming
Feature Extraction

Two feature extraction methods were explored: TF*IDF and Ngrams. The accuracy analysis indicated that TF*IDF yielded superior results.
Model Development

Several models were tested, including logistic regression, Naive Bayes, Random Forest, and SVM. Based on accuracy results, logistic regression was chosen as it demonstrated the highest accuracy.
Hyperparameter Tuning

Hyperparameter tuning was performed using gridsearchcv to optimize the logistic regression model's parameters.
Test Prediction

After completing all these steps, the model achieved an accuracy of 0.881.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data		data
result		result
README.md		README.md
Sentiment Analysis.ipynb		Sentiment Analysis.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentiment Analysis on Movie Reviews

Description

Workflow

About

Releases

Packages

Languages

NgurahErvan/Sentiment-Analysis-on-Movie-Reviews

Folders and files

Latest commit

History

Repository files navigation

Sentiment Analysis on Movie Reviews

Description

Workflow

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages