Data augmentation for NLP, presented at EMNLP 2019
Updated Mar 19, 2023 - Python
🐍💯 pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection library that works out of the box.
📃 Language-model-based sentence scoring library
ICLR 2018 Quick-Thought vectors
Extract Information from web corpus using Open Information Extraction.
🙊 Stop repeating yourself
Python API & command-line tool to easily transcribe speech-based video files into clean text
10,000 sentences: an Android app to help you learn new words in foreign languages
TensorFlow implementation of Variational Attention for Sequence-to-Sequence Models (COLING 2018)
Apache OpenNLP wrapper for Node.js
Russian language support for NLTK's PunktSentenceTokenizer
A sentence segmentation library with wide language support optimized for speed and utility.
Join all elements of an array and create a human-readable string
A web application that interfaces two GEC systems. [web instance is down]