fake-news-detection

📈 Description Of The Model

Data Preprocessing

A major preprocessing step is filtering out uninformative data. In natural language processing, very common words that carry little meaning (such as "the", "is", and "and") are called stop words. Keeping them would enlarge the feature space and waste CPU time without helping the classifier, so this project removes them with NLTK. Null values are replaced with empty strings. The author name and news title are merged into a single text field to improve performance. The Porter stemmer reduces each word to its stem, and TfidfVectorizer converts the resulting text into numeric features.
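The cleaning steps described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's exact code: the column names `author` and `title`, the helper names (`load_stop_words`, `clean_text`, `preprocess`), and the choice to keep only alphabetic characters are assumptions made for the example.

```python
import re

import pandas as pd
from nltk.stem.porter import PorterStemmer

stemmer = PorterStemmer()


def load_stop_words():
    """Return NLTK's English stop words (downloads the corpus on first use)."""
    import nltk
    from nltk.corpus import stopwords

    nltk.download("stopwords", quiet=True)
    return set(stopwords.words("english"))


def clean_text(text, stop_words):
    """Lowercase, keep letters only (an assumption), drop stop words, stem the rest."""
    words = re.sub(r"[^a-zA-Z]", " ", text).lower().split()
    return " ".join(stemmer.stem(w) for w in words if w not in stop_words)


def preprocess(df, stop_words):
    """Build one cleaned text field per row from the assumed author/title columns."""
    df = df.fillna("")  # null values become empty strings
    content = df["author"] + " " + df["title"]  # merge author name and title
    return content.apply(lambda s: clean_text(s, stop_words))
```

In use, this would look something like `preprocess(pd.read_csv("train.csv"), load_stop_words())`, assuming the Kaggle file has been downloaded locally. Passing the stop word set in as an argument keeps the cleaning function easy to test without a network connection.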

Training the Model

The dataset is split into a training set (80%) and a test set (20%). The model is a logistic regression classifier.
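A rough sketch of the vectorization and training steps with scikit-learn is shown below. The function name `train_and_evaluate`, the stratified split, the `seed` value, and the default hyperparameters are illustrative assumptions; the notebook may differ in these details. The sketch also fits the vectorizer on the training portion only, which is a design choice of this example, made so that no information from the test set reaches the features.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split


def train_and_evaluate(texts, labels, seed=2):
    """Split 80/20, fit TF-IDF and logistic regression on the training part,
    and return (model, vectorizer, test accuracy)."""
    train_texts, test_texts, y_train, y_test = train_test_split(
        texts, labels, test_size=0.2, stratify=labels, random_state=seed
    )

    # Learn the vocabulary and IDF weights from the training texts only.
    vectorizer = TfidfVectorizer()
    X_train = vectorizer.fit_transform(train_texts)
    X_test = vectorizer.transform(test_texts)

    model = LogisticRegression()
    model.fit(X_train, y_train)

    accuracy = accuracy_score(y_test, model.predict(X_test))
    return model, vectorizer, accuracy
```

To classify a new headline, it would need the same cleaning as the training data and then a pass through the returned vectorizer, for example `model.predict(vectorizer.transform([cleaned_text]))`.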

📰 Dataset

https://www.kaggle.com/c/fake-news/data?select=train.csv
