@KBNLresearch

National Library of the Netherlands / Research

National Library of the Netherlands / Research

Pinned repositories

  1. europeananp-ner

    Named Entities Recognition Annotator Tool for Europeana Newspapers

    Java 49 5

  2. KB-python-API

    Python API for KB data-services

    Python 10 6

  3. isolyzer

    Verify size of ISO 9660 image against Volume Descriptor fields

    Python 11 3

  4. ochre

    Toolbox for OCR post-correction

    Common Workflow Language 22 7

  5. keyword-generator

    Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf scores.

    Python 25 15

  • Apache-2.0 Updated Oct 19, 2018
  • Meresco Components are components to build searchengines, repositories and archives, based on Meresco Core

    Python 1 GPL-2.0 Updated Oct 16, 2018
  • Toolbox for OCR post-correction

    Common Workflow Language 22 7 Apache-2.0 Updated Oct 10, 2018
  • Meresco Lucene is a set of components and tools to integrate Lucene (based on PyLucene 4.3) into Meresco

    Java 4 GPL-2.0 Updated Oct 5, 2018
  • Loader software for automated imaging of optical media with Nimbie disc robot

    HTML 9 1 Apache-2.0 Updated Sep 27, 2018
  • Various resources and documentation related to the nl-menu recovery efforts

    Shell Apache-2.0 Updated Sep 5, 2018
  • Narralyzer is a narrative analyzer

    Python 4 1 GPL-3.0 Updated Sep 5, 2018
  • Tool for extracting topics, keywords and their collocates from a Dutch corpus. Includes and extends the functionality of the Keyword Generator.

    Python 7 1 GPL-3.0 Updated Sep 3, 2018
  • Create ingest-ready SIPs from batches of optical media images

    Python 8 Apache-2.0 Updated Aug 29, 2018
  • Saving URLs of Leesplein.nl to Wayback Machine of The Internet Archive

    Python 1 Updated Jul 18, 2018
  • Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia descriptions using either a binary SVM classifier or a neural net.

    Python 9 GPL-3.0 Updated Jul 17, 2018
  • Predict news article topics and DBpedia description topics and type.

    Jupyter Notebook 1 1 Apache-2.0 Updated Jul 13, 2018
  • Meresco Html is a template engine based on generators, and a sequel to Slowfoot. It is also known as DynamicHtml or Seecr Html.

    Python 1 GPL-2.0 Updated Jun 6, 2018
  • DHBenelux 2018

    Jupyter Notebook Updated Jun 6, 2018
  • Web interface to manually annotate named entity mentions in newspaper articles with the correct DBpedia link(s), if any. Produces labeled data sets for training and evaluating the DAC Entity Linker.

    Python 5 GPL-3.0 Updated Jun 1, 2018
  • Scripts for quality assessment of e-books

    XSLT 1 Updated May 31, 2018
  • Java MIT Updated May 17, 2018
  • Collection of Python scripts to build a Solr index from selected Dutch and English DBpedia dumps.

    Python GPL-3.0 Updated May 8, 2018
  • Bash script that performs file format identification on all files in a directory tree using Apache Tika

    Shell 1 Updated Apr 17, 2018
  • Shell Updated Mar 23, 2018
  • Classified Historical Newspaper Images

    HTML 2 Apache-2.0 Updated Mar 21, 2018
  • Book back recognition

    1 Updated Feb 14, 2018
  • Python 1 MIT Updated Feb 12, 2018
  • Verify size of ISO 9660 image against Volume Descriptor fields

    Python 11 3 Updated Jan 26, 2018
  • Python API for KB data-services

    Python 10 6 Updated Jan 12, 2018
  • Named Entities Recognition Annotator Tool for Europeana Newspapers

    Java 49 5 Updated Jan 12, 2018
  • Automated JP2 profiling for digitisation batches

    Python 1 1 Updated Oct 30, 2017
  • 😀😄😂😭 A curated list of Sentiment Analysis methods, implementations and misc. 😥😟😱😤

    112 CC-BY-SA-4.0 Updated Oct 20, 2017
  • Bulk downloader of web resources via OAI/PMH

    Java 2 1 MIT Updated Oct 6, 2017
  • Advertisement search interface based on image similarity.

    Python 3 1 Apache-2.0 Updated Sep 29, 2017
  • Top languages

    Loading…

    Most used topics

    Loading…