This repository has been archived by the owner. It is now read-only.
Commits on Sep 14, 2017
Commits on Feb 28, 2017
Commits on Feb 24, 2017
Commits on Feb 22, 2017
  1. Merge pull request #262 from hellowsummer/vis-fileMenu

    ianmilligan1 committed Feb 22, 2017
    visualization: add file menu
Commits on Feb 19, 2017
Commits on Feb 11, 2017
  1. better io (#261)

    zackwang authored and lintool committed Feb 11, 2017
    OpenJDK's implementation of I/O utilities sucks. It raises a java.lang.OutOfMemoryError: Requested array size exceeds VM limit exception when copying an array larger than 2 GB. This affects large WARC files. org.apache.commons.io provides more robust I/O utilities.
Commits on Oct 1, 2016
  1. Minor tweak to README.

    lintool committed Oct 1, 2016
Commits on Sep 30, 2016
  1. Merge pull request #253 from ukwa/master

    lintool committed Sep 30, 2016
    issue-252: Also allow XHTML through in keepValidPages
Commits on Sep 29, 2016
Commits on Sep 23, 2016
Commits on Aug 5, 2016
Commits on Aug 2, 2016
  1. Merge pull request #242 from yb1/checksum

    ianmilligan1 committed Aug 2, 2016
    Multiple partitions
  2. Multiple partitions

    youngbink committed Aug 2, 2016
Commits on Jul 31, 2016
  1. Merge pull request #241 from yb1/checksum

    ianmilligan1 committed Jul 31, 2016
    Changed output type to rdd
Commits on Jul 28, 2016
Commits on Jul 27, 2016
  1. checksum

    youngbink committed Jul 27, 2016
Commits on Jun 29, 2016
Commits on Jun 28, 2016
  1. Refactored Warcbase into multiple modules, upgraded to CDH 5.7.1 (w/ Spark 1.6.0).

    lintool committed Jun 28, 2016
    
    Closed following issues:
    Issue #236 Trantor upgraded to CDH 5.7.1
    Issue #235 Break Warcbase up into sub-artifacts
    Issue #231 Upgrade to Spark 1.6.1?
  2. Updated documentation.

    lintool committed Jun 28, 2016
Commits on Jun 24, 2016
  1. Upgraded to CDH 5.7.1.

    lintool committed Jun 24, 2016
Commits on Jun 22, 2016
Commits on Jun 17, 2016
Commits on Jun 16, 2016
Commits on Jun 15, 2016
  1. Fixed issue #234 Error handling for broken ARC/WARC files: empty content in ARC records.

    lintool committed Jun 15, 2016
  2. fixed broken link

    ianmilligan1 committed Jun 15, 2016