Skip to content
Permalink
Branch: master
Find file Copy path
Find file Copy path
Fetching contributors…
Cannot retrieve contributors at this time
58 lines (41 sloc) 1.99 KB

Text Transformations Tools

Please put in pull requests to add resources and links

RegEx

A syntax for search queries that match string patterns. Essentially a sophisticated find method for a document or text set.

Python

A high-level, multipurpose programming language useful for working with plain text and data. Used not only by DH practitioners, but by Google, NASA, and the scientific community.

Natural Language Toolkit (NLTK)

A Python library for working with natural language. NLTK can tokenize strings (create a list of words from a set of characters), idenfity parts of sppech, and perform operations based on a word's context.



MALLET
Unix/Linux
R
XLST
TEI
Git
Gephi
D3
TextWrangler
BBEdit
Excel
OpenRefine
Voyant Tools
R Studio
DH Box

Resources
The Programming Historian http://programminghistorian.org/

Workshops
http://gcdi.commons.gc.cuny.edu/events/

You can’t perform that action at this time.