Permalink
Commits on Feb 24, 2017
Commits on Feb 22, 2017
  1. Merge pull request #262 from hellowsummer/vis-fileMenu

    visualization: add file menu
    ianmilligan1 committed on GitHub Feb 22, 2017
Commits on Feb 19, 2017
Commits on Feb 11, 2017
  1. better io (#261)

    OpenJDK's implementation of io utilities sucks. It will raise a java.lang.OutOfMemoryError: Requested array size exceeds VM limit exception when copying array larger than 2G. This will affect large warc files. org.apache.commons.io provides more robust io utilities.
    zackwang committed with Feb 11, 2017
Commits on Oct 1, 2016
  1. Minor tweak to README.

    committed Oct 1, 2016
Commits on Sep 30, 2016
  1. Merge pull request #253 from ukwa/master

    issue-252: Also allow XHTML through in keepValidPages
    committed on GitHub Sep 30, 2016
  2. Better error trapping for issue #244: java.util.zip.ZipException: inv…

    …alid distance code
    committed Sep 30, 2016
Commits on Sep 29, 2016
Commits on Sep 23, 2016
Commits on Aug 5, 2016
Commits on Aug 2, 2016
  1. Merge pull request #242 from yb1/checksum

    Multiple partitions
    ianmilligan1 committed on GitHub Aug 2, 2016
  2. Multiple partitions

    ybsimon committed Aug 2, 2016
Commits on Jul 31, 2016
  1. Merge pull request #241 from yb1/checksum

    Changed output type to rdd
    ianmilligan1 committed on GitHub Jul 31, 2016
  2. Changed output type to rdd

    ybsimon committed Jul 31, 2016
Commits on Jul 28, 2016
Commits on Jul 27, 2016
  1. checksum

    ybsimon committed Jul 27, 2016
Commits on Jun 29, 2016
Commits on Jun 28, 2016
  1. Refactored Warcbase into multiple modules, upgraded to CDH 5.7.1 (w/ …

    …Spark 1.6.0).
    
    Closed following issues:
    Issue #236 Trantor upgraded to CDH 5.7.1
    Issue #235 Break Warcbase up into sub-artifacts
    Issue #231 Upgrade to Spark 1.6.1?
    committed Jun 28, 2016
  2. Updated documentation.

    committed Jun 28, 2016
Commits on Jun 24, 2016
  1. Upgraded to CDH 5.7.1.

    committed Jun 24, 2016
Commits on Jun 22, 2016
Commits on Jun 17, 2016
Commits on Jun 16, 2016
  1. Created warcbase-core module.

    committed Jun 16, 2016
Commits on Jun 15, 2016
  1. Fixed issue #234 Error handling for broken ARC/WARC files: empty cont…

    …ent in ARC records.
    committed Jun 15, 2016
  2. fixed broken link

    ianmilligan1 committed Jun 15, 2016
Commits on May 21, 2016
Commits on May 20, 2016
  1. default values

    ybsimon committed May 20, 2016