Text Data Augmentation Techniques for Word Embeddings in Fake News Classification

Data Augmentation (DA) can be defined as any method for increasing the diversity of training examples without explicitly collecting new data. This repository contains the code for a research project focusing on coreference resolution in the field of natural language processing (NLP). The goal of the project was to investigate the impact of coreference resolution on classification of fake news. The research aimed to determine whether coreferencing the input data before classification is more effective than classifying them without coreference.

Dataset

We used two availabe datasets:

Getting real about fake news - training dataset.
WELFake dataset - used for classification tasks.

Repository Contents

The repository contains the following files:

📁 notebook: This directory contains the source code for the implementation of the proposed procedures.
📁 results: This directory stores the evaluation results and performance metrics of the implemented classifiers as .png and in .csv.

Authors

List of all authors who contributed to the project:

Jozef Kapusta (supervisor)
Dávid Držík (PhD. student)
Kirsten Šteflovič (PhD. student)
Kitti Szabó Nagy

Paper

You can read the article here.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
notebooks/data-augmentation		notebooks/data-augmentation
results		results
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text Data Augmentation Techniques for Word Embeddings in Fake News Classification

Dataset

Repository Contents

Authors

Paper

About

Releases

Packages

Languages

ksteflovic/data-augmentation_word-vectors

Folders and files

Latest commit

History

Repository files navigation

Text Data Augmentation Techniques for Word Embeddings in Fake News Classification

Dataset

Repository Contents

Authors

Paper

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages