Skip to content
A simple package designed to be used for demonstrating basic Natural Language Processing (NLP) feature engineering in Python.
Python Jupyter Notebook Shell
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
docker
notebooks
tests
text2math
.gitignore
LICENSE
MANIFEST.in
README.md
get_data
gpl-3.0-Preamble.txt added license copy Feb 15, 2016
setup.cfg
setup.py

README.md

A simple package designed to be used for demonstrating basic Natural Language Processing (NLP) feature engineering in Python.

More Info:

Practice Dataset

Stack Exchange Data Dump

Text Encoding

The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) by Joel Spolsky

Packages

  • chardet - Universal encoding detector for Python 2 and 3
  • cchardet - Universal encoding detector. This library is faster than chardet
  • ftfy - fixes text for you
  • unidecode - ASCII transliterations of Unicode text

Natural Language Processing

Care and Feeding of Topic Models: Problems, Diagnostics, and Improvementes

Functional Programing in Python

Functional programming in Python Examine the functional aspects of Python: which options work well and which ones you should avoid By David Mertz

Packages

  • toolz - Toolz provides a set of utility functions for iterators, functions, and dictionaries.
  • functools - Higher-order functions and operations on callable objects.
  • itertools - Functions creating iterators for efficient looping.
You can’t perform that action at this time.