A two-step fact-checking procedure for medium-sized text documents in English language on the topic of Covid-19

Particularly during a time of a global pandemic, it is crucial to find efficient ways of determining the credibility of information. From fake news to conspiracy theories, it is hard to fight the “infodemic” which eventually makes the pandemic even more dangerous. Currently, fact-checking websites such as Snopes, FactCheck.org, etc. perform manual claim validation from articles, speeches, or even social media posts from known figures, etc. Of course, they cannot cover all the dubious claims that can be found on the internet, as they focus mainly on the ones that "go viral". For the general user, however, it is impossible to fact check every single statement or sentence on a specific topic he finds on the internet. A lot of research has been invested in both claim verification and fact-check-worthiness, but there is no work done yet on the detection and extraction of dubious claims, combined with their fact checking using external information sources, such as knowledge graphs and knowledge bases, especially on the COVID-19 domain. The solution to this problem is a two-step claim verification procedure, consisting of sequence classification using Transformer models and fact checking using the Google Fact Check Tools. The goal of this work is to develop a high-performance component for fact checking of small- to medium-sized documents in English language on the topic of COVID-19.
We curate a dataset from existing COVID-19 related datasets and perform multiple preprocessing steps to ensure all the data is uniform. The data pre-processing procedure and the final datasets can be found here.
The training code for 4 Transformer models and a simple LSTM (as a baseline) is available here.
For a quick tutorial on how to start and use the fact-checking app, go to the Streamlit app directory.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
dataset_preparation		dataset_preparation
fact_checking_app		fact_checking_app
training		training
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A two-step fact-checking procedure for medium-sized text documents in English language on the topic of Covid-19

About

Releases

Packages

Languages

elip06/covid19-fact-checking

Folders and files

Latest commit

History

Repository files navigation

A two-step fact-checking procedure for medium-sized text documents in English language on the topic of Covid-19

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages