Skip to content

This repository explores the application of data augmentation techniques to expand existing corpora. Data augmentation encompasses several methods, including Synonym Replacement, Back Translation, and Word removal function.

Notifications You must be signed in to change notification settings

ksteflovic/data-augmentation_word-vectors

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Text Data Augmentation Techniques for Word Embeddings in Fake News Classification

Data Augmentation (DA) can be defined as any method for increasing the diversity of training examples without explicitly collecting new data. This repository contains the code for a research project focusing on coreference resolution in the field of natural language processing (NLP). The goal of the project was to investigate the impact of coreference resolution on classification of fake news. The research aimed to determine whether coreferencing the input data before classification is more effective than classifying them without coreference.

Dataset

We used two availabe datasets:

Repository Contents

The repository contains the following files:

  • 📁 notebook: This directory contains the source code for the implementation of the proposed procedures.
  • 📁 results: This directory stores the evaluation results and performance metrics of the implemented classifiers as .png and in .csv.

Authors

List of all authors who contributed to the project:

Paper

You can read the article here.

About

This repository explores the application of data augmentation techniques to expand existing corpora. Data augmentation encompasses several methods, including Synonym Replacement, Back Translation, and Word removal function.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published