Popular repositories

  1. behemoth (Public archive)

    Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.

    Java 282 59

  2. TextClassification (Public)

    A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and can be used as a front end to various ML algorithms. libSVM …

    Java 48 22

  3. textclassification-examples (Public)

    Use cases for DigitalPebble's TextClassification API

    Java 10 3

  4. stormcrawlerfight (Public)

    Crawl configurations for benchmarking / testing StormCrawler

    Shell 10 5

  5. stormcrawler-docker (Public)

    Resources for running StormCrawler with Docker services

    Dockerfile 10 3

  6. ansible-storm (Public)

    Ansible playbook for deploying a Storm cluster

    7 1

Repositories

Showing 10 of 27 repositories
