Named Entity Recognition data for Europeana Newspapers
Named Entity Recognition tool for Europeana Newspapers
Europeana Newspapers Final Report
Named Entity Disambiguation for Europeana Newspapers
File Rename Tool
Binarization and Conversion Tool
File Analyzer Tool
Core libraries by the PRImA Research Lab
PAGE Metadata Scanner is a command-line tool that scans a single PAGE XML file (document layout and text content) and outputs its properties in CSV format.
Java-based viewer for PAGE XML files (layout and text content). Also supports ALTO XML, FineReader XML, and hOCR.