by Anamaria Todor and Marcel Castro
Fake news, defined as news that conveys or incorporates false, fabricated, or deliberately misleading information, has been around since the emergence of the printing press. The rapid spread of fake news and disinformation online not only deceives the public but can also have a profound impact on society, politics, the economy, and culture. Examples include:
- Cultivating distrust in the media
- Undermining the democratic process
- Spreading false or discredited science, such as the anti-vax movement
Advances in Artificial Intelligence and Machine Learning have made developing tools for creating and sharing fake news even easier. Early examples include advanced social bots and automated accounts that supercharge the initial stage of spreading fake news. It is generally not trivial for the public to determine whether such accounts are run by people or by bots. Moreover, social bots are not illegal, and many companies legally purchase them as part of their marketing, so it is hard to curb their use systematically. More recent advances in Generative AI, in particular text-to-text and text-to-image models, make it possible to produce textual and rich content at an unprecedented pace with the help of Large Language Models (LLMs). LLMs are Generative AI text models with over a billion parameters that facilitate the synthesis of high-quality text.
In this blog post we explore how Large Language Models (LLMs) can be used to tackle the prevalent issue of detecting fake news, or in other words to "fight fire with fire". We argue that LLMs are sufficiently advanced for this task, especially when improved prompting techniques such as Chain-of-Thought and ReAct are used in conjunction with tools for information retrieval, as sketched below.
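The sketch below illustrates the ReAct idea: the model alternates between reasoning steps and tool calls (here, Wikipedia lookups) before issuing a verdict. It is a minimal example, not the exact code in this repository's scripts; it assumes a LangChain version that still ships `initialize_agent` and the `Bedrock` LLM wrapper, plus the `wikipedia` Python package for the retrieval tool.

```python
# Minimal sketch of a ReAct agent that consults Wikipedia before judging a claim.
# Assumes AWS credentials are configured for a region with Amazon Bedrock enabled.
from langchain.agents import initialize_agent, load_tools, AgentType
from langchain.llms import Bedrock

llm = Bedrock(model_id="anthropic.claude-v2")  # any Bedrock text model works
tools = load_tools(["wikipedia"])              # retrieval tool for gathering evidence

# ZERO_SHOT_REACT_DESCRIPTION interleaves Thought / Action / Observation steps.
agent = initialize_agent(
    tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True
)

claim = "The Eiffel Tower is located in Berlin."  # illustrative input
print(agent.run(f"Is the following claim true or false? {claim}"))
```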
We illustrate this by creating a LangChain application that, given a piece of news, tells the user in natural language whether the article is true or fake. The solution also makes use of Amazon Bedrock, a fully managed service that makes foundation models (FMs) from Amazon and third-party model providers easily accessible through the AWS console and API.
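For the Chain-of-Thought side, a fact check can be expressed as a single prompted chain against a Bedrock-hosted model. The following is a minimal sketch under the same assumptions as above; the prompt wording is illustrative and not the one used in this repository's scripts.

```python
# Sketch of a Chain-of-Thought fact-checking prompt against Amazon Bedrock.
from langchain.llms import Bedrock
from langchain.prompts import PromptTemplate
from langchain.chains import LLMChain

llm = Bedrock(model_id="anthropic.claude-v2")

# The "think step by step" framing elicits intermediate reasoning before the verdict.
prompt = PromptTemplate(
    input_variables=["news"],
    template=(
        "Assess whether the following news statement is true or fake. "
        "Think step by step: list the key factual assertions, check each one "
        "against your knowledge, then give a verdict with a short rationale.\n\n"
        "Statement: {news}\nAnswer:"
    ),
)

chain = LLMChain(llm=llm, prompt=prompt)
print(chain.run(news="NASA confirmed the Moon is made of cheese."))  # illustrative input
```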
For a step-by-step description of the solution, please check our AWS Machine Learning Blog post entitled Harness large language models in fake news detection.
This repository contains a Chain-of-Thought and a ReAct implementation.
To run the scripts:

```
python react/ReAct.py
```

or

```
python fact-checker/fact_checker.py
```
Both scripts use either a shorter (4 statements) or a longer (50 statements) subset of claims from the FEVER dataset; see knowledge_qa_test.json and knowledge_qa.json.
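To inspect the bundled subsets before wiring them into either script, a plain JSON load is enough. The file path and any field names are assumptions based on the standard FEVER format (claims paired with ground-truth labels); check the JSON files in this repository for the actual structure.

```python
# Minimal sketch for inspecting the bundled FEVER subsets.
import json

with open("knowledge_qa_test.json") as f:  # path assumed; adjust to the repo layout
    statements = json.load(f)

for item in statements:
    print(item)  # inspect the fields (e.g. claim and label in FEVER) before use
```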