Detection of fake news online is important in today's society as fresh news content is rapidly being produced as a result of the abundance of technology that is present. Fake news is nothing new. But, what is new is how easy it's become to share information – both true and false – on a massive scale.
Social media platforms allow almost anyone to publish their thoughts or share stories to the world. The trouble is, most people don't check the source of the material that they view online before they share it, which can lead to fake news spreading quickly or even "going viral."
Our work will consist of building a fake news classifier using machine learning.
Team #Sg_Spanish:
- Armando Medina(@ketcx)
- Jacqueline Susan Mejía (@susyjam)
- Fernando Terrazas(@Fernando T)
- Stanley Salvatierra(@Stanley Salvatierra)
- Cesar Schiraldi(@Cesar S.)
- Íñigo Lejarza(@Íñigo Lejarza)
Show/Hide
Typically spread over social media and traditional news outlets, misinformation remains rampant through the use of clickbait headlines and polarizing content.
With recent world events, we’ve seen how much impact the news has on our lives. From understanding what is happening surrounding the pandemic to the movement of the stock market, I know that I rely heavily on the news, and I’m sure everyone else does too. However, it is often difficult to distinguish between articles with false information and those providing real, fact-checked news. Given that companies such as Facebook and Twitter deploy algorithms to ensure that people are receiving the right, correct information on their feeds, I wanted to explore utilizing Natural Language Processing and text analysis to build a fake news classifier.
This dataset contains a list of articles considered as "fake" news For this example we are based on the Kaggle data https://www.kaggle.com/clmentbisaillon/fake-and-real-news-dataset#True.csv
Sequential Model and Bi-Directional RNN and LSTM
In the data you can see the words "Real"
We used the Designer inspired by the laboratory of the course on text classification to evaluate how the result would be obtained with the Two-Class Logistic Regression algorithm
If the predicted value is > 0.5 it is real else it is fake But in our example the Model Accuracy is 0.9985152190051967
https://github.com/ketcx/fake-news-classifier/blob/master/FakeNewsClassifierPS.ipynb