Grow your team on GitHub
GitHub is home to over 28 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.Sign up
Data which can be used for evaluation of matcher algorithm
This repository contains manually inspected datasets for evaluating the different steps during the citation information extraction process.
This repository contains all codes of annotator tools for generating gold standard.
Project for extracting reference strings from PDF publications.
This repository contains the evaluation results of our AMSD 2017 submission.
Contains code that we use to evaluate the quality of our PDF corpus. For example, we look at the amount of scanned PDFs as well as PDFs that contain OCR text.
Curated list of publicly available datasets and APIs