Skip to content
Branch: master
Find file History

Latest commit

Fetching latest commit…
Cannot retrieve the latest commit at this time.

Files

Permalink
Type Name Latest commit message Commit time
..
Failed to load latest commit information.
src/main/java/com/code402
.gitignore
README.md
go
pom.xml

README.md

Java

There are three implementations. Each uses the IIPC's jwarc library for parsing WARC files. They differ only in the regular expression engine that is used - the default one that ships with the JDK, Google's re2j, and Anders Møller's dk.brics.automaton library.

Running

./go JDK   # Use the JDK's engine

./go Brics # Use the Brics engine

./go Re2j  # Use the Re2j engine

You can optionally provide your own WARC file to be searched as a second argument.

You can’t perform that action at this time.