
Fake news detection using logistic regression


5l1v3r1/fake-news-detection

 
 


fake-news-detection

📈 Description Of The Model

Data Preprocessing


A major preprocessing step is filtering out data that carries little information. In natural language processing, such uninformative words are called stop words; keeping them would waste storage and CPU time without helping the classifier. This project uses NLTK to remove stop words. Null values are replaced with empty strings. The author name and the news title are merged into a single text field to improve performance. The Porter stemmer reduces each word to its stem, and TfidfVectorizer converts the resulting text into numeric feature vectors.
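The preprocessing steps above might look roughly like the following. This is a minimal sketch, not the notebook's actual code: the column names `author` and `title`, the toy rows, the `clean_text` helper, and the small fallback stop-word set are all assumptions made for illustration.

```python
import re

import pandas as pd
from nltk.corpus import stopwords
from nltk.stem.porter import PorterStemmer
from sklearn.feature_extraction.text import TfidfVectorizer

try:
    # Requires a one-time nltk.download("stopwords").
    STOP_WORDS = set(stopwords.words("english"))
except LookupError:
    # Assumption for this sketch only: a tiny stand-in list so the
    # example still runs when the NLTK corpus is not installed.
    STOP_WORDS = {"the", "a", "an", "is", "are", "in", "of", "to", "and"}

stemmer = PorterStemmer()


def clean_text(text: str) -> str:
    """Keep letters only, lowercase, drop stop words, stem what remains."""
    words = re.sub(r"[^a-zA-Z]", " ", text).lower().split()
    return " ".join(stemmer.stem(w) for w in words if w not in STOP_WORDS)


# Toy rows standing in for the real dataset (hypothetical values).
df = pd.DataFrame(
    {
        "author": ["Jane Doe", None, "John Smith"],
        "title": ["The markets are rising", "Aliens landed in the city", None],
    }
)

# Replace null values with empty strings.
df = df.fillna("")

# Merge author name and title into one text field, then clean it.
df["content"] = (df["author"] + " " + df["title"]).apply(clean_text)

# Convert the cleaned text into TF-IDF feature vectors (a sparse matrix).
vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(df["content"])
print(X.shape)
```

Each row of `X` corresponds to one article, and each column to one stemmed term in the vocabulary learned by the vectorizer.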

Training the Model

The dataset is split into a training set (80%) and a test set (20%). The model is a logistic regression classifier.
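As a rough illustration of the 80/20 split and the logistic regression step, here is a minimal sketch using scikit-learn. The toy texts and labels, the `stratify` argument, and the `random_state` value are assumptions for illustration and are not taken from the notebook.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Hypothetical, already-cleaned texts with made-up labels (two classes).
texts = [
    "senat pass budget bill",
    "alien land citi hall",
    "court rule tax case",
    "miracl pill cure diseas",
    "mayor open new school",
    "celebr clone secret lab",
    "bank report quarterli profit",
    "moon made chees scientist",
    "council approv road repair",
    "vampir spot downtown",
]
labels = [0, 1, 0, 1, 0, 1, 0, 1, 0, 1]

# Turn the text into TF-IDF features before splitting, mirroring the
# order described in this README.
X = TfidfVectorizer().fit_transform(texts)

# Hold out 20% of the rows for testing; stratify keeps the class
# balance the same in both splits.
X_train, X_test, y_train, y_test = train_test_split(
    X, labels, test_size=0.2, stratify=labels, random_state=2
)

model = LogisticRegression()
model.fit(X_train, y_train)

# Accuracy on the held-out rows; with toy data this number means little.
acc = accuracy_score(y_test, model.predict(X_test))
print(f"test accuracy: {acc:.2f}")
```

Fitting the vectorizer on the training split alone, and only transforming the test split, would keep test-set vocabulary from influencing the features.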

📰 Dataset

https://www.kaggle.com/c/fake-news/data?select=train.csv


🔎 For more...

