Block or report user

Popular repositories

  1. ArchiveSpark

    An Apache Spark framework for easy data processing, extraction as well as derivation for Web archives and archival collections, developed by the Internet Archive and L3S Research Center.

    Jupyter Notebook 66 8

  2. Web2Warc

    An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)

    Scala 10 4

  3. HadoopConcatGz

    A Splitable Hadoop InputFormat for Concatenated GZIP Files and *.(w)arc.gz

    Java 6 2

  4. Exspec

    Don't write specs anymore, just save 'em while testing your code interactively. Specs will become a byproduct.

    Ruby 5

  5. internetarchive-transfer-scripts

    Scripts to transfer collections, using

    Python 5 3

  6. IABooksOnArchiveSpark

    Analyze digitized books from the Internet Archive remotely with ArchiveSpark

    Scala 4

92 contributions in the last year

Apr May Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Mon Wed Fri

Contribution activity First pull request First issue First repository Joined GitHub

March 2018

Created a pull request in archivesunleashed/aut that received 14 comments

make ArchiveRecord a trait

The title of this pull-request should be a brief description of what the pull-request fixes/improves/changes. Ideally 50 characters or less. GitHu…

+29 −8 14 comments

Seeing something unexpected? Take a look at the GitHub profile guide.