Skip to content

It offers an in-depth exploration into classifying IMDB movie reviews using machine learning and NLP techniques. It details steps from data preprocessing and feature extraction to model training with both classical and neural network approaches, aimed at predicting review sentiments.

Notifications You must be signed in to change notification settings

AICrafter08/IMDB-Sentiment-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

IMDB Sentiment Analysis

This repository contains code and resources for performing sentiment analysis on the IMDB dataset of 50K movie reviews. It explores different methodologies including Bag of Words (BoW), TF-IDF, and Neural Networks to classify reviews into positive or negative sentiments.

Dataset Overview

The IMDB dataset features 50,000 movie reviews, evenly split between 25,000 positive and 25,000 negative reviews. This balance makes it an excellent resource for binary sentiment classification, providing a standard benchmark for evaluating model performance in natural language processing tasks.

How to Use

  1. Clone the Repository:
    • git clone https://github.com/AICrafter08/IMDB-Sentiment-Analysis.git
  2. Navigate to the Repository Folder:
    • cd IMDB-Sentiment-Analysis.git

Folder Structure

  • src/ - Contains Python scripts:
    • BOW_classification.py: Contains code specific to Bag of Words classification.
    • TFIDF_classification.py: Dedicated to TF-IDF based classification.
    • Neuralnet_classification.py: Implements Neural Network for sentiment analysis.
  • IMDB_dataset_classifier.ipynb: Jupyter notebook with a comprehensive guide covering all methods.

Installation

Ensure Python 3.x is installed. Install the required libraries using:

pip install -r requirements.txt

Running the Codes

  • BoW Classification:
    • python BOW_classification.py
  • TF-IDF Classification:
    • python TFIDF_classification.py
  • Neural Network Classification:
    • python Neuralnet_classification.py
  • Comprehensive Analysis (includes all methods):
    • Open and run IMDB_dataset_classifier.ipynb in a Jupyter notebook environment or Google Colab.

Author

Tushar Chauhan

Give a ⭐ To This Repository!

About

It offers an in-depth exploration into classifying IMDB movie reviews using machine learning and NLP techniques. It details steps from data preprocessing and feature extraction to model training with both classical and neural network approaches, aimed at predicting review sentiments.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published