@ContentMine

The ContentMine

The ContentMine is extracting 100 million facts from the academic literature

  • Document processing including support libraries and PDFBox2

    Updated Oct 16, 2018
  • 🐳 Docker images and compose file for Wikibase and the query service

    Shell 16 Updated Oct 15, 2018
  • Get metadata, fulltexts or fulltext URLs of papers matching a search query

    JavaScript 134 27 MIT Updated Aug 31, 2018
  • A tool to convert a variety of inputs into normalized, tagged, XHTML (with embedded/linked SVG and PNG where appropriate).

    Updated Aug 23, 2018
  • Journal scraper definitions for the ContentMine framework

    Ruby 46 31 Updated Jul 12, 2018
  • Web services layer for ContentMine text and data mining tools and utilities

    JavaScript Apache-2.0 Updated May 30, 2018
  • Canary is a UI for the ContentMine tools getpapers, quickscrape, norma, and ami.

    HTML 4 5 MIT Updated May 30, 2018
  • ES Academic paper fact extraction - backend for canary

    JavaScript 1 1 Apache-2.0 Updated May 30, 2018
  • Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML

    HTML 23 21 Apache-2.0 Updated Apr 10, 2018
  • ContentMine Fork of the WWMM imageanalysis Package

    HTML 4 Updated Apr 10, 2018
  • ContentMine Fork of the WWMM pdf2svg Package

    Java 1 5 Updated Apr 10, 2018
  • ContentMine Fork of the WWMM svg2xml Package

    HTML 4 Apache-2.0 Updated Apr 10, 2018
  • Combined SVG and HTML repos and building functionality

    Java 2 Apache-2.0 Updated Apr 10, 2018
  • ArgProcessor and files for basic CMDirectories. Often subclassed. Needs to be separate from euclid and norma

    HTML 7 Apache-2.0 Updated Apr 10, 2018
  • ContentMine Fork of the WWMM Euclid Package

    Java 3 Updated Apr 10, 2018
  • Parent POM for ContentMine Java/MVN stack

    Shell 2 Apache-2.0 Updated Apr 6, 2018
  • Data and progress tracking for table extraction and semantically guided content enhancement

    HTML 2 Apache-2.0 Updated Apr 5, 2018
  • Release packages for ContentMine projects

    Shell 2 Apache-2.0 Updated Mar 8, 2018
  • Repository for executables (so as not to bloat projects)

    Updated Jan 24, 2018
  • Tools and code to run GROBID

    Apache-2.0 Updated Jan 21, 2018
  • ContentMine Fork of the WWMM html Package

    HTML 3 Updated Dec 13, 2017
  • Dictionaries for use with `ami`, including some management software

    JavaScript 2 3 Apache-2.0 Updated Nov 28, 2017
  • The WikifactMine API Endpoint

    JavaScript 5 1 Apache-2.0 Updated Oct 28, 2017
  • Repository for tracking bugs in the new CM website

    Updated Oct 18, 2017
  • HTML 1 Updated Sep 29, 2017
  • CLI tool for canary

    JavaScript Apache-2.0 Updated Aug 30, 2017
  • World Library and Information Congress 2017 ContentMine Workshop

    Updated Aug 23, 2017
  • Materials for the FutureTDM project

    Jupyter Notebook 7 4 Updated Aug 22, 2017
  • Extraction of data from Vector-based Funnel Plots in the scholarly literature

    Shell 1 2 Updated Jul 10, 2017
  • HTML 11 14 Apache-2.0 Updated Jul 2, 2017