Sentiment Analysis using Support Vector Machines (SVM)

Overview

This project performs sentiment analysis on textual data using Support Vector Machines (SVM). Sentiment analysis is a natural language processing (NLP) technique used to determine the sentiment (positive, negative, or neutral) expressed in a piece of text.

The script utilizes the popular scikit-learn library for machine learning and NLP tasks. It demonstrates how to preprocess textual data, vectorize it using Term Frequency-Inverse Document Frequency (TF-IDF) representation, train a Linear Support Vector Machine classifier, and make predictions on new user input.

Dependencies

Ensure you have the following Python libraries installed:

scikit-learn
numpy
pandas
nltk

You can install the required dependencies using pip:

pip install scikit-learn numpy pandas nltk

Prepare Your Data

Make sure you have a CSV file named Sentimental Analysis Data.csv with two columns: text and sentiment. The text column should contain the textual data, and the sentiment column should contain the corresponding sentiment labels (positive, negative, neutral).
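As a minimal sketch of loading this file, assuming pandas is used to read it: the helper name load_dataset, the column check, and the dropping of incomplete rows are illustrative assumptions, not necessarily what the notebook does.

```python
import pandas as pd

# Columns the README says the CSV must contain.
REQUIRED_COLUMNS = {"text", "sentiment"}


def load_dataset(path: str = "Sentimental Analysis Data.csv") -> pd.DataFrame:
    """Read the CSV and check that the expected columns are present.

    Hypothetical helper for illustration only.
    """
    df = pd.read_csv(path)
    missing = REQUIRED_COLUMNS - set(df.columns)
    if missing:
        raise ValueError(f"CSV is missing required columns: {sorted(missing)}")
    # Assumption: rows without a text or a label are not usable for training.
    return df.dropna(subset=["text", "sentiment"]).reset_index(drop=True)
```

Calling load_dataset() with no arguments reads Sentimental Analysis Data.csv from the current working directory.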

Data Preprocessing

The text data is preprocessed before both training and prediction. The preprocessing steps are:

Removing punctuation, numbers and special characters.
Converting text to lowercase.
Tokenizing the text.
Applying stemming using Porter Stemmer.

These steps help to ensure that the text data is in a suitable format for the SVM classifier.
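The steps above can be sketched roughly as follows. This assumes NLTK's PorterStemmer and, for simplicity, whitespace tokenization in place of a dedicated tokenizer; the function name preprocess is hypothetical and the notebook's actual implementation may differ.

```python
import re

from nltk.stem import PorterStemmer

_stemmer = PorterStemmer()


def preprocess(text: str) -> str:
    """Clean, lowercase, tokenize, and stem a piece of text."""
    # Replace punctuation, digits, and special characters with spaces.
    text = re.sub(r"[^a-zA-Z\s]", " ", text)
    # Convert to lowercase.
    text = text.lower()
    # Tokenize (assumption: a plain whitespace split is sufficient here).
    tokens = text.split()
    # Stem each token and join back into a single string for vectorization.
    return " ".join(_stemmer.stem(token) for token in tokens)
```

Returning a single string rather than a token list keeps the output directly usable as input to a text vectorizer.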

Model Training and Prediction

The script uses the TfidfVectorizer to convert the text data into a numerical representation based on the TF-IDF algorithm. Subsequently, a Linear Support Vector Machine (LinearSVC) classifier is trained on the vectorized data to predict the sentiment labels.
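A minimal sketch of this step, assuming the vectorizer and classifier are chained with scikit-learn's make_pipeline and left at default settings apart from a fixed random_state. The handful of training sentences are invented purely for illustration and are not taken from the project's dataset.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Toy examples for illustration only (not from the project's CSV).
train_texts = [
    "i love this product",
    "what a great experience",
    "this is terrible",
    "i hate the service",
    "the package arrived on monday",
    "the store opens at nine",
]
train_labels = [
    "positive",
    "positive",
    "negative",
    "negative",
    "neutral",
    "neutral",
]

# TF-IDF turns each text into a sparse numeric vector; LinearSVC then
# learns a linear decision boundary per class on those vectors.
model = make_pipeline(TfidfVectorizer(), LinearSVC(random_state=0))
model.fit(train_texts, train_labels)

# Predict the sentiment of new input with the same fitted pipeline.
prediction = model.predict(["i love the service"])[0]
print(prediction)
```

Bundling both stages in one pipeline means new input is vectorized with the vocabulary learned during training, which avoids a mismatch between training and prediction features.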

Evaluation

To assess the model's performance, the script reports precision, recall, accuracy, and F1 score. The evaluation is conducted by predicting the sentiment label for user input and comparing it with the actual label.
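These metrics could be computed along the following lines, assuming scikit-learn's metrics module and macro averaging across the three classes (the averaging mode is an assumption; the source does not state one). The label lists are made up for illustration.

```python
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

# Illustrative labels only: actual sentiment vs. the model's prediction.
y_true = ["positive", "negative", "neutral", "positive"]
y_pred = ["positive", "negative", "positive", "positive"]

accuracy = accuracy_score(y_true, y_pred)

# Macro averaging weights every class equally; zero_division=0 scores a
# class with no predicted samples as 0 instead of raising a warning.
precision, recall, f1, _ = precision_recall_fscore_support(
    y_true, y_pred, average="macro", zero_division=0
)

print(f"accuracy={accuracy:.2f} precision={precision:.2f} "
      f"recall={recall:.2f} f1={f1:.2f}")
```

With only a few user-supplied inputs these scores are noisy, so a held-out portion of the CSV would give a more reliable estimate.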

License

This project is licensed under the MIT License - see the LICENSE file for details.



AnamolZ/Sentiment_Analysis
