FromThePage is a wiki-like application for crowdsourcing transcription of handwritten documents.
saracarl Update production.rb
Turned off Ahoy geocoding.
Latest commit 520ac81 Aug 30, 2018
Failed to load latest commit information.
.autocode Initialize Autocode Feb 3, 2016
.settings Fixed encoding problems in OCR ingestion Jan 7, 2015
app Don't blow up if we can't find the subject title in the text Aug 5, 2018
bin Update to rails 4.0.4 May 8, 2014
config Update production.rb Aug 30, 2018
db Add `dictation_langauge` to user model Jul 27, 2018
doc remove old files Apr 1, 2014
gemfiles fix travisci for mysql57 Jul 26, 2018
javascripts Create gh-pages branch via GitHub Nov 15, 2014
lib Merge branch '874-contentdm-import' into ui-design Mar 8, 2018
log Got rid of extra log files with a different extension Nov 25, 2013
public deleted file Oct 24, 2017
spec Merge branch 'development' into 1096-user-dictation-langauge Jul 29, 2018
stylesheets Create gh-pages branch via GitHub Nov 15, 2014
test_data added new test, issue #460 Nov 27, 2016
travis Changed keyserver Jun 24, 2018
vendor/assets Remove plugins dir since Rails 4.1 no longer supports it May 27, 2014
.gitignore Removed schema.rb Jul 27, 2018
.rspec Good place again Nov 25, 2013
.ruby-gemset Replacing simple_captcha with recaptcha Nov 25, 2013
.ruby-version Bump to Ruby 2.3.7 Jul 17, 2018
.travis.yml Bump to Ruby 2.3.7 Jul 17, 2018
CNAME CNAME file for pointing to github pages Nov 15, 2014
Capfile Revert "Merge branch '' into development" May 14, 2018
Gemfile Add pry back in for dependency Jul 29, 2018
Gemfile.lock new spec test to replace textgrid, which went offline Jul 29, 2018
LICENSE Added AGPL license file Feb 11, 2010 Update Dec 4, 2017
README.rdoc Update README.rdoc May 8, 2014
Rakefile I renamed some files from .rhtml to .erb Feb 6, 2013 I renamed some files from .rhtml to .erb Feb 6, 2013
iiifDocumentEditing.md5 documentation for iiif support Sep 28, 2016
index.html Create gh-pages branch via GitHub Nov 15, 2014
params.json Create gh-pages branch via GitHub Nov 15, 2014

FromThePage is an open-source tool that allows volunteers to collaborate to transcribe handwritten documents.


  • Wiki-style Editing: Users add or edit transcriptions using simple, wiki-style syntax on one side of the screen while viewing a scanned image of the manuscript page on the other side.
  • Version Control: Changes to each page transcription are recorded and may be viewed to follow the edit history of a page.
  • Wikilinks: Subjects mentioned within the document may are indexed via simple wikilinks within the transcription. Users can annotate subjects with full subject articles.
  • Presentation: Readers can view transcriptions in a multi-page format or alongside page images. They can also read all the pages that mention a subject
  • Automatic Markup: FromThePage can suggest wikilinks to editors by mining previously edited transcriptions. This helps insure editorial consistency and vastly reduces the amount of effort involved in markup.
  • Internet Archive integration: FromThePage can be pointed at manuscripts hosted on It will import the page structure and any printed page titles into its native format for transcription, while serving page images from the Internet Archive.


FromThePage is currently issued under the Affero GPL. This license remains controversial, however, so we are trying to preserve the option to dual-license the code.


FromThePage has been run successfully under both Linux and Windows. It currently requires Ruby on Rails version 4.1.1 and the RMagick, hpricot, will_paginate, and OAI gems.


Detailed Installation Instructions are available in the wiki, inclusing a link to a Docker file.

If you install FromThePage, please join the low volume FromThePage Google Group so we can keep you informed of bug fixes and new releases.

Install Ruby, RubyGems, Bundler, ImageMagick, MySQL and Git

Clone the repository

git clone git://

Install required gems

bundle install

Install Graphviz

apt-get install graphviz (or see the graphviz documentation at

Configure MySQL

Create a database and user account for FromThePage to use.

Then update the config/database.yml file to point to the MySQL user account and database you created above.

Run rake db:migrate to load the schema definition into the database account.

Modify the configuration parameters in config/initializers/01fromthepage.rb.

Modify the config/environments/production.rb (or development.rb) file to configure your mailer. (Search for "action_mailer".)

If you wish to use latex formulas in your transcriptions, you'll need to install "pdflatex" and "pdfcrop". You can usually install them by typing: sudo apt-get install texlive-latex-base texlive-extra-utils

Finally, start the application

rails server