Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
-
Updated
Sep 27, 2022 - Python
Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
Fine-tuned pre-trained GPT2 for custom topic specific text generation. Such system can be used for Text Augmentation.
💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.
Augmenty is an augmentation library based on spaCy for augmenting texts.
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
Chinese Characters Visualization & Chinese Text Augmentation.
This library helps you to create random words i.e noise in text data. Helpful in many tasks like the generation of random authorization token generation of constant or variable length, text data augmentation, etc.
Use online translation tool to effectively generates new datasets in other language from original datasets, especially from those popular standard baseline datasets for specific tasks.
This repository contains the data and code for the paper "Self-training with Two-phase Self-augmentation for Few-shot Dialogue Generation" (EMNLP2022-Findings).
This repo offers a Python script using NLPAug library & RTT to augment text datasets. It processes TXT files in "data/" folder, translating text and creating augmented versions. Augmented data enhances NLP tasks like chatbot training & text classification. Includes overview of techniques, applications & implementation.
[WIP] Fast text augmentation for small text corvus
A PyPI package for augmenting text data using NLP techniques directly in your pandas dataframe.
Feature space Augmentation
Dritributed Text Augmentation Techniques (Appeared AAAI 2023)
Add a description, image, and links to the text-augmentation topic page so that developers can more easily learn about it.
To associate your repository with the text-augmentation topic, visit your repo's landing page and select "manage topics."