Tools for text tokenization and encoding
A full-text Bookworm on public domain HathiTrust works
Web app for browsing the HathiTrust Bookworm
GUI for a Bookworm web app
Parsing MARC records for Bookworm ingest
Script for installing prerequisites for Bookworm
Documentation for Bookworm, focusing particularly on creation aspects
An API implementing a grammar for text analysis
Sample scripts to compile jsoncatalog.txt file for Bookworm
Retrieve Ngram Data
Creates a catalog file for use in generating a Bookworm based on HTRC texts
Bookworm extension for Solr