Grow your team on GitHub
GitHub is home to over 28 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.Sign up
Extact all URLs from anchor and image tags within a html/xhtml page and its children.
Archive a URL in Internet Archive's Wayback Machine.
Statistics reporting system for authors.
Common setup for OBP's MySQL databases.
Generates a stylised HTML site from an ePub book
Remove suspicious requests from awstats CSV reports
Read book metadata from a spreadsheet an produce JSON-LD scripts in http://schema.org/Book format
Generates a stylised HTML site (instantiating the Internet Archive's Book Reader) from a PDF of a book
This Python library examines a PDF and extracts text blocks, videos, internal and external links, and their positions
Python PDF Parser
TEI to MediaWiki wikitext converter