Skip to content

Overview Docs

The open source document mining platform

Popular repositories

  1. Open source large document set visualization platform

    Scala 268 36

  2. Run Overview on your own system

    Shell 111 24

  3. Java library to detect files' MIME types

    Java 40 11

  4. Proof-of-concept document set visual exploration system

    Java 37 10

  5. docs2csv Public

    Scan a folder of document files of all types and extract the text into a CSV suitable for Overview

    Ruby 26 6

  6. pdfocr Public

    Scala library that shells to Tesseract to make PDFs searchable

    Scala 15 4


Top languages


Most used topics