Utility that automates name anonymizing over batches of text files
Updated Sep 29, 2017 - Python
A simple, easy-to-use text cleaning package for NLP, built in Python. It can clean and analyze your text data in one line of code.
Preprocess Package for https://bit.ly/intro_nlp (Text cleaning and preprocessing example)
👀 Everything Everyway All At Once Text Preprocessing for Natural Language Processing.
The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.
Starter repo covering recurrent neural networks, Word2Vec, Doc2Vec, TF-IDF vectors, and NLP basics
ValX is an open-source Python package for text cleaning tasks, including profanity detection and removal. It now also includes sensitive information detection and removal.
Utility that automates text cleaning over batches of text files
Python Text Cleaning ToolKit library (pyTCTK)
Code for an introductory blog post on text processing.
Utility that automates spelling correction over batches of text files
Common Text Pre-Processing for Portuguese
Corpora and scripts for cleaning political science texts. Scripts are translated into transformations that support SAGE Texti.
Korean text data preprocessing toolkit for NLP
A Python package to get useful information from documents using the TopicRank algorithm.
Text preprocessing package for use in NLP tasks https://pypi.org/project/textcl/
A Python toolkit for file processing, text cleaning and data splitting.
NLP pre-/post-processing tools.
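Several of the projects listed above mention text cleaning, most common words, and n-gram generation. The following is a minimal sketch of those three steps using only the Python standard library. It is not taken from any of the listed repositories: the function names `clean_text`, `ngrams`, and `most_common_words` are hypothetical, and the cleaning rules (lowercasing, dropping URLs, keeping only ASCII letters and digits) are assumptions chosen for illustration.

```python
import re
from collections import Counter


def clean_text(text):
    """Lowercase, strip URLs and punctuation, and collapse whitespace.

    Assumption: only ASCII letters and digits are kept, so accented or
    non-Latin characters would be dropped by this illustrative rule.
    """
    text = text.lower()
    text = re.sub(r"https?://\S+", " ", text)  # drop URLs
    text = re.sub(r"[^a-z0-9\s]", " ", text)   # drop punctuation and symbols
    return re.sub(r"\s+", " ", text).strip()   # collapse runs of whitespace


def ngrams(tokens, n):
    """Return all contiguous n-token windows as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def most_common_words(tokens, k):
    """Return the k most frequent tokens with their counts."""
    return Counter(tokens).most_common(k)


sample = "Clean text, clean data! See https://example.com for more clean text."
tokens = clean_text(sample).split()

print(tokens)
print(most_common_words(tokens, 2))
print(ngrams(tokens, 2)[:3])
```

Real packages typically add steps this sketch leaves out, such as stop-word removal, Unicode normalization, or language-specific tokenization.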