____
___ .-~. /_"-._
`-._~-. / /_ "~o\ :Y
\ \ / : \~x. ` ')
] Y / | Y< ~-.__j
/ ! _.--~T : l l< /.-~
/ / ____.--~ . ` l /~\ \<|Y
/ / .-~~" /| . ',-~\ \L|
/ / / .^ \ Y~Y \.^>/l_ "--'
/ Y .-"( . l__ j_j l_/ /~_.-~ .
Y l / \ ) ~~~." / `/"~ / \.__/l_
| \ _.-" ~-{__ l : l._Z~-.___.--~
| ~---~ / ~~"---\_ ' __[>
l . _.^ ___ _>-y~
\ \ . .-~ .-~ ~>--" /||
\ ~---" / ./ _.-' || _____ ____ _____ _ _
"-.,_____.,_ _.--~\ _.-~ |||_ _| | _ \| ____| \ | | ___ _ __ __ _
~~ ( _} || | |_____| |_) | _| | \| | / _ \| '__/ _` |
`. ~( || | |_____| __/| |___| |\ || (_) | | | (_| |
) \ || |_| |_| |_____|_| \_(_)___/|_| \__, |
/,`--'~\--'~\ || |___/
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
->T-PEN<- Transcription for Paleographical and Editorial Notation
This fork is by Thom Hastings to integrate
- Tesseract-OCR for Optical Character Recognition via git svn and
- Moses-SMT for Statistical Machine Translation via GitHub
into the existing T-PEN framework.
- Tesseract is trained automatically with TesseractTrainer.
- A sample Tesseract data set, Tesseract-GLE-Uncial, is provided by Dr. Kevin Scannell:
svn checkout http://tesseract-gle-uncial.googlecode.com/svn/trunk/ tesseract-gle-uncial
- Tie in Tesseract's SVN codebase as a git submodule like this.
Furthermore, MATLAB hooks might be used for handwriting analysis.
A tool for the Digital Humanities
T-PEN is a web-based tool for working with images of manuscripts. Users attach transcription data (new or uploaded) to the actual lines of the original manuscript in a simple, flexible interface.
The Transcription for Paleographical and Editorial Notation (T-PEN) project is coordinated by the Center for Digital Theology at Saint Louis University and funded by the Andrew W. Mellon Foundation and the National Endowment for the Humanities. The Electronic Norman Anonymous Project developed several of the capabilities at the core of this project.
T-PEN is free and open-source software released under the Educational Community License v.2.0. The primary instance is maintained at T-PEN.org.