GitHub - kaizenbit/Sentiment_Classification_Using_Embeddings: Embedding-based sentiment classification system for Twitter data using Gemini embeddings and machine learning, built and documented in VS Code.

🧠 Sentiment Classification Using Sentence Transformers

~ This project implements an embedding-based sentiment classification system that classifies Twitter tweets into Positive, Negative, or Neutral sentiments using Sentence Transformer embeddings and a machine learning classifier.

~ Unlike API-based solutions, this system works fully offline, making it efficient, scalable, and reproducible.

📌 Problem Statement

~ Social media platforms generate millions of posts daily, making manual sentiment analysis impractical. Understanding public sentiment helps brands, governments, and organizations make informed decisions.

🎯 Objective

~ The goal of this project is to build a sentiment classifier using:

Text preprocessing and cleaning
Transformer-based semantic embeddings
A machine learning classification model

📊 Dataset

~ Dataset: Twitter Tweets Sentiment Dataset ~ Size: ~27,000 tweets ~ Columns:

- textID

- text

- selected_text

- sentiment

~ Sentiment Labels:

 - Positive
  
 - Negative
  
 - Neutral

🔗 Dataset Link:

https://www.kaggle.com/datasets/yasserh/twitter-tweets-sentiment-dataset

🛠️ Technologies Used

Python
Pandas, NumPy
NLTK (text preprocessing)
Sentence Transformers (all-MiniLM-L6-v2)
Scikit-learn (Logistic Regression)
Matplotlib, Seaborn
WordCloud
VS Code (Jupyter Notebook)

🧠 Embedding Model Used

~ We use the lightweight transformer model:

all-MiniLM-L6-v2

~ Key Features:

384-dimensional semantic embeddings

~ Captures contextual meaning of sentences

Fast and lightweight

~ Works completely offline

~ No API dependency

🔄 Project Workflow

Exploratory Data Analysis (EDA)
Text preprocessing and cleaning
Word cloud visualization
Embedding generation using Sentence Transformers
Train-test split
Model training using Logistic Regression
Model evaluation using classification metrics
Custom tweet sentiment prediction

📈 Results

~ The model successfully classifies tweets into positive, negative, and neutral categories.

~ Transformer-based embeddings capture contextual meaning effectively.

~ The classifier performs strongly on short social media texts.

~ Custom user-defined tweets were accurately classified.

🧪 Sample Predictions

"I absolutely love this new phone!" → Positive

"This service is horrible and frustrating" → Negative

"The event happened yesterday" → Neutral

🚀 How to Run the Project

1️⃣ Clone the repository git clone https://github.com/coderShreyIn/Sentiment_Classification_Using_Embeddings.git cd Sentiment_Classification_Using_Embeddings 2️⃣ Install dependencies pip install -r requirements.txt 3️⃣ Run the Jupyter Notebook

Open in VS Code or Jupyter:

jupyter notebook

Run all cells sequentially.

📦 No API Key Required

This project uses offline Sentence Transformers, so:

❌ No Gemini API key needed

❌ No rate limits

❌ No internet dependency after first model download

📊 Model Performance

Logistic Regression trained on transformer embeddings
High accuracy on multi-class sentiment classification
Good generalization on unseen tweets

⚠️ Limitations

Sarcasm detection is challenging
Very short texts may reduce classification confidence
Mixed sentiment sentences can cause ambiguity

🔮 Future Improvements

Fine-tune a transformer model directly for sentiment classification
Add confidence score display
Implement hyperparameter tuning
Deploy as a web app (Streamlit/Flask)
Add real-time tweet streaming analysis

🌍 Real-World Applications

Social media sentiment monitoring
Brand reputation analysis
Customer feedback analytics
Political opinion mining
Product review classification
Chatbot emotion detection

👨‍💻 Author

~ Shrey Dak

AI & Machine Learning Enthusiast GitHub: https://github.com/coderShreyIn

⭐ If You Found This Useful

~ Please consider giving this repository a ⭐ on GitHub!

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
project_Sentiment_Classification_Using_Embeddings.ipynb		project_Sentiment_Classification_Using_Embeddings.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 Sentiment Classification Using Sentence Transformers

📌 Problem Statement

🎯 Objective

📊 Dataset

🔗 Dataset Link:

🛠️ Technologies Used

🧠 Embedding Model Used

🔄 Project Workflow

📈 Results

🧪 Sample Predictions

🚀 How to Run the Project

📦 No API Key Required

📊 Model Performance

⚠️ Limitations

🔮 Future Improvements

🌍 Real-World Applications

👨‍💻 Author

⭐ If You Found This Useful

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧠 Sentiment Classification Using Sentence Transformers

📌 Problem Statement

🎯 Objective

📊 Dataset

🔗 Dataset Link:

🛠️ Technologies Used

🧠 Embedding Model Used

🔄 Project Workflow

📈 Results

🧪 Sample Predictions

🚀 How to Run the Project

📦 No API Key Required

📊 Model Performance

⚠️ Limitations

🔮 Future Improvements

🌍 Real-World Applications

👨‍💻 Author

⭐ If You Found This Useful

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages