Fake news classifier using Machine Learning

Detection of fake news online is important in today's society as fresh news content is rapidly being produced as a result of the abundance of technology that is present. Fake news is nothing new. But, what is new is how easy it's become to share information – both true and false – on a massive scale.

Social media platforms allow almost anyone to publish their thoughts or share stories to the world. The trouble is, most people don't check the source of the material that they view online before they share it, which can lead to fake news spreading quickly or even "going viral."

Our work will consist of building a fake news classifier using machine learning.

Authors:

Team #Sg_Spanish:

Armando Medina(@ketcx)
Jacqueline Susan Mejía (@susyjam)
Fernando Terrazas(@Fernando T)
Stanley Salvatierra(@Stanley Salvatierra)
Cesar Schiraldi(@Cesar S.)
Íñigo Lejarza(@Íñigo Lejarza)

Introduction: Business Problem

Typically spread over social media and traditional news outlets, misinformation remains rampant through the use of clickbait headlines and polarizing content. With recent world events, we’ve seen how much impact the news has on our lives. From understanding what is happening surrounding the pandemic to the movement of the stock market, I know that I rely heavily on the news, and I’m sure everyone else does too. However, it is often difficult to distinguish between articles with false information and those providing real, fact-checked news. Given that companies such as Facebook and Twitter deploy algorithms to ensure that people are receiving the right, correct information on their feeds, I wanted to explore utilizing Natural Language Processing and text analysis to build a fake news classifier.

Data

This dataset contains a list of articles considered as "fake" news For this example we are based on the Kaggle data https://www.kaggle.com/clmentbisaillon/fake-and-real-news-dataset#True.csv

Methodology

Sequential Model and Bi-Directional RNN and LSTM

Exploratory Data Analysis

In the data you can see the words "Fake"

In the data you can see the words "Real"

Applying Designer

We used the Designer inspired by the laboratory of the course on text classification to evaluate how the result would be obtained with the Two-Class Logistic Regression algorithm

Results and Discussion

If the predicted value is > 0.5 it is real else it is fake But in our example the Model Accuracy is 0.9985152190051967

Confusion matrix

Link FakeNewsClassifierPS.ipynb

https://github.com/ketcx/fake-news-classifier/blob/master/FakeNewsClassifierPS.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Data		Data
Images		Images
.gitattributes		.gitattributes
.gitignore		.gitignore
FakeNewsClassifierPS.ipynb		FakeNewsClassifierPS.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fake news classifier using Machine Learning

Authors:

Table of Contents

Introduction: Business Problem

Data

Methodology

Exploratory Data Analysis

Applying Designer

Results and Discussion

Confusion matrix

Link FakeNewsClassifierPS.ipynb

About

Releases

Packages

Contributors 2

Languages

ketcx/fake-news-classifier

Folders and files

Latest commit

History

Repository files navigation

Fake news classifier using Machine Learning

Authors:

Table of Contents

Introduction: Business Problem

Data

Methodology

Exploratory Data Analysis

Applying Designer

Results and Discussion

Confusion matrix

Link FakeNewsClassifierPS.ipynb

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages