Skip to content

The ContentMine

The ContentMine is extracting 100 million facts from the academic literature

Popular repositories

  1. A scraping command line tool for the modern web

    JavaScript 237 42

  2. Get metadata, fulltexts or fulltext URLs of papers matching a search query

    JavaScript 181 36

  3. Journal scraper definitions for the ContentMine framework

    Ruby 62 34

  4. This repository contains material helping you to set up a ContentMine workshop. It also includes tutorials for learning the ContentMine tools on your own.

    33 11

  5. The scraperJSON standard for defining web scrapers as JSON objects

    31 2

  6. norma Public

    Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML

    HTML 31 21


Top languages


Most used topics