Skip to content

Fake news classifier for Machine Learning Scholarship Program for Microsoft Azure

Notifications You must be signed in to change notification settings

ketcx/fake-news-classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Fake news classifier using Machine Learning

Detection of fake news online is important in today's society as fresh news content is rapidly being produced as a result of the abundance of technology that is present. Fake news is nothing new. But, what is new is how easy it's become to share information – both true and false – on a massive scale.

Social media platforms allow almost anyone to publish their thoughts or share stories to the world. The trouble is, most people don't check the source of the material that they view online before they share it, which can lead to fake news spreading quickly or even "going viral."

Our work will consist of building a fake news classifier using machine learning.

Authors:

Team #Sg_Spanish:

  1. Armando Medina(@ketcx)
  2. Jacqueline Susan Mejía (@susyjam)
  3. Fernando Terrazas(@Fernando T)
  4. Stanley Salvatierra(@Stanley Salvatierra)
  5. Cesar Schiraldi(@Cesar S.)
  6. Íñigo Lejarza(@Íñigo Lejarza)

Table of Contents

Show/Hide
  1. Introduction: Business Problem
  2. Data
  3. Methodology
  4. Analysis
  5. Applying Designer
  6. Results and Discussion

Introduction: Business Problem

Typically spread over social media and traditional news outlets, misinformation remains rampant through the use of clickbait headlines and polarizing content. With recent world events, we’ve seen how much impact the news has on our lives. From understanding what is happening surrounding the pandemic to the movement of the stock market, I know that I rely heavily on the news, and I’m sure everyone else does too. However, it is often difficult to distinguish between articles with false information and those providing real, fact-checked news. Given that companies such as Facebook and Twitter deploy algorithms to ensure that people are receiving the right, correct information on their feeds, I wanted to explore utilizing Natural Language Processing and text analysis to build a fake news classifier.

Data

This dataset contains a list of articles considered as "fake" news For this example we are based on the Kaggle data https://www.kaggle.com/clmentbisaillon/fake-and-real-news-dataset#True.csv


Methodology

Sequential Model and Bi-Directional RNN and LSTM

Exploratory Data Analysis

In the data you can see the words "Fake"

In the data you can see the words "Real"


Applying Designer

We used the Designer inspired by the laboratory of the course on text classification to evaluate how the result would be obtained with the Two-Class Logistic Regression algorithm


Results and Discussion

If the predicted value is > 0.5 it is real else it is fake But in our example the Model Accuracy is 0.9985152190051967

Confusion matrix


Link FakeNewsClassifierPS.ipynb

https://github.com/ketcx/fake-news-classifier/blob/master/FakeNewsClassifierPS.ipynb

About

Fake news classifier for Machine Learning Scholarship Program for Microsoft Azure

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published