Shell
Updated Apr 28, 2019
#
text-processing
Repositories 514
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Python
Updated Apr 28, 2019
Intuitive find & replace CLI (sed alternative)
Rust
Updated Apr 15, 2019
Simple SQL-like syntax on top of Perl text processing.
Python
Updated Apr 2, 2019
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
natural-language-processing
deep-learning
nlp-library
nlp-parsing
chinese-nlp
text-classification
text-processing
Python
Updated May 3, 2019
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules u…
nlp
python
computational-linguistics
linguistics
library
folia
machine-learning
language-modelling
search-algorithms
evaluation-metrics
text-processing
nlp-library
natural-language-processing
Python
Updated Mar 13, 2019
Open Korean Text Processor - An Open-source Korean Text Processor
korean
korean-text-processing
natural-language-processing
text-processing
tokenizer
korean-tokenizer
Scala
Updated Aug 7, 2018
Natural language detection library for Go
Go
Updated Mar 6, 2019
A simple Python module for parsing human names into their individual components
Python
Updated Apr 20, 2019
Text Classification Algorithms: A Survey
text-classification
nlp-machine-learning
document-classification
text-processing
dimensionality-reduction
rocchio-algorithm
boosting-algorithms
logistic-regression
naive-bayes-classifier
k-nearest-neighbours
support-vector-machines
decision-trees
random-forest
conditional-random-fields
deep-learning
deep-neural-network
recurrent-neural-networks
convolutional-neural-networks
deep-belief-network
hierarchical-attention-networks
Python
Updated May 3, 2019
python
python2
python3
python-2
python-3
parser-combinators
parsing-expression-grammar
parsing
parsing-library
text-processing
Python
Updated Apr 17, 2019
machine-learning
classification
python
python3
python2
text
text-mining
adversarial-examples
spam
spam-filtering
spam-detection
spam-classification
text-classification
text-analysis
data-science
data-mining
text-processing
black-box-benchmarking
black-box-attacks
metrics
Python
Updated Oct 14, 2018
A fast implementation of Aho-Corasick in Rust.
Rust
Updated May 1, 2019
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis…
nlp
text-processing
nlp-library
spelling-correction
tokenizer
tokenization
word-segmentation
word-normalization
spell-corrector
text-segmentation
semeval
Python
Updated Nov 22, 2018
UNIC: Unicode and Internationalization Crates for Rust
unicode
internationalization
text-processing
crates
rust
cldr
locale-data
unic
unicode-characters
unicode-algorithms
Good first issues
Rust
Updated Mar 5, 2019
Util collection for Japanese text processing. Hiraganize, Katakanize, and Romanize.
JavaScript
Updated Apr 18, 2019
Stanford NLP group's shared Python tools.
Python
Updated Mar 14, 2018
Textpipe: clean and extract metadata from text
Good first issues
Add installation instructions in README file
good first issue enhancement#17 opened 10 months ago by dungchu
1
Python
Updated May 3, 2019
A low level regular expression library that uses deterministic finite automata.
Rust
Updated Feb 25, 2019
CogComp's light-weight Python NLP annotators
Python
Updated Feb 18, 2019
Text vectorization tool to outperform TFIDF for classification tasks
python
nlp
machine-learning
text-analysis
text-classification
text-processing
tf-idf
natural-language-processing
Python
Updated Sep 10, 2018
pyarabic
Python
Updated Mar 27, 2019
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such …
c-plus-plus
python
nlp
ngrams
skipgram
ngram
corpus
linguistics
library
text-processing
computational-linguistics
pattern-recognition
C++
Updated Jan 15, 2019
Multi-lingual Text Processing
Updated Jan 22, 2019
Extract indicators of compromise from text, including "escaped" ones.
ioc
iocs
extract
extraction
text-mining
text-processing
indicators-of-compromise
command-line-tool
command-line
defang
escaping
regex
regexp
data-mining
Go
Updated May 1, 2019
Vision Framework IOS WWDC 2017
image-analysis
face-detection
rectangle-detection
char-detection
ios-vision
visionframework
wwdc2017
ios
machine
learning
text-detection
text-processing
Swift
Updated Jun 26, 2017
A flexible Java text processor. BB, BBCode, BB-code, HTML, Textile, Markdown, parser, translator, converter.
Java
Updated Oct 19, 2016
Unix Text Processing Command Reference
Updated Sep 12, 2016
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku and Zenkaku
Python
Updated Feb 3, 2019
vims - use vim commands for pipeline filtering in terminal
Shell
Updated Aug 12, 2018