csesoumita/Reddit-Data-Classification

Reddit Data Classification

- Predicted the subreddit to which a single discussion thread belongs. Also performed comment disclosure classification.
- Performed tokenization and normalization on text fields such as title and body, then encoded features with One Hot Encoding and a TF-IDF vectorizer.
- Implemented classification algorithms including Logistic Regression, SVC, and MLP.
- Evaluated the models using the confusion matrix, F1 score, accuracy, precision, and recall.
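
As a worked illustration of the evaluation metrics listed above, the sketch below derives accuracy, precision, recall, and F1 from a confusion matrix in plain Python. The subreddit names and counts are made up for the example and are not results from this project; `per_class_metrics` is a hypothetical helper, not a function from the notebook.

```python
def per_class_metrics(confusion, labels):
    """Compute accuracy plus per-class precision, recall and F1.

    confusion[i][j] is the number of items whose true class is labels[i]
    and whose predicted class is labels[j].
    """
    n = len(labels)
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(n))
    report = {"accuracy": correct / total}

    for k, label in enumerate(labels):
        tp = confusion[k][k]
        fp = sum(confusion[i][k] for i in range(n)) - tp  # column k minus diagonal
        fn = sum(confusion[k]) - tp                       # row k minus diagonal
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        report[label] = {"precision": precision, "recall": recall, "f1": f1}
    return report


# Hypothetical labels and counts, for illustration only.
labels = ["askscience", "movies"]
confusion = [
    [8, 2],  # true askscience: 8 predicted correctly, 2 predicted as movies
    [1, 9],  # true movies: 1 predicted as askscience, 9 predicted correctly
]

report = per_class_metrics(confusion, labels)
print(report["accuracy"])
print(report["askscience"])
```

With these made-up counts, 17 of 20 threads are classified correctly, and F1 for each class is the harmonic mean of that class's precision and recall.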

Python libraries used: spaCy and scikit-learn. The project was implemented in Google Colab.
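
The following is a minimal sketch of how a TF-IDF plus Logistic Regression classifier could be put together with scikit-learn. It is not the notebook's actual code: the toy texts, the subreddit labels, and the hyperparameters are all assumptions chosen for illustration, and the spaCy preprocessing step is omitted in favour of the vectorizer's built-in tokenization.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix, f1_score
from sklearn.pipeline import make_pipeline

# Hypothetical toy data standing in for thread titles and bodies.
train_texts = [
    "Why does light bend when it passes through water?",
    "How do vaccines train the immune system?",
    "What causes the tides on Earth?",
    "Best science fiction film of the last decade?",
    "That director's new movie has an amazing soundtrack",
    "Which actor gave the best performance in a sequel?",
]
train_labels = [
    "askscience", "askscience", "askscience",
    "movies", "movies", "movies",
]

# TF-IDF features feed a Logistic Regression classifier.
model = make_pipeline(
    TfidfVectorizer(lowercase=True),
    LogisticRegression(max_iter=1000),
)
model.fit(train_texts, train_labels)

# Hypothetical held-out examples.
test_texts = [
    "How does gravity affect the tides?",
    "Favourite movie soundtrack by that director?",
]
test_labels = ["askscience", "movies"]

predicted = model.predict(test_texts)
class_order = ["askscience", "movies"]
cm = confusion_matrix(test_labels, predicted, labels=class_order)
macro_f1 = f1_score(test_labels, predicted, average="macro")

print(predicted)
print(cm)
print(macro_f1)
```

Under the same assumptions, the final pipeline step could be swapped for `sklearn.svm.SVC` or `sklearn.neural_network.MLPClassifier` to compare the other two model families named above.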

| File Name | File Description |
| --- | --- |
| TAD_Assignment.ipynb | Contains the overall implementation. |

About

Analysis of Reddit Data. One Hot Encoding.
