DocId set compression and set operation library
Java Shell
Pull request Compare This branch is 3 commits ahead, 1 commit behind hyan:master.
Latest commit 0a04ee6 Mar 8, 2014 @hyan hyan Update README.md
Permalink
Failed to load latest commit information.
.settings andnot Nov 1, 2011
bin clean up deploy script Feb 5, 2011
contrib/luceneCodec improved the hardcode of pfordelta and simple16, update the contrib t… Feb 19, 2011
src This change comments out the logging using standard output. The logging Sep 16, 2013
.classpath
.gitignore added output to git ignore Sep 8, 2012
.project andnot Nov 1, 2011
LICENSE refactored for maven Jan 21, 2011
NOTES.txt removed old build files Jan 21, 2011
NOTICE.txt first commit Feb 27, 2010
README.md
pom.xml change version Sep 25, 2013

README.md

What is Kamikaze

Kamikaze is a utility package wrapping set implementations on document lists.

It also implements the PForDelta compression algorithm for sorted integer segments to enable Inverted List compression for search engines like Lucene (http://lucene.apache.org/core/4_5_1/core/org/apache/lucene/util/PForDeltaDocIdSet.html).

Kamikaze is based on the PForDelta algorithm proposed in the following paper: Inverted Index Compression and Query Processing with Optimized Document Ordering Hao Yan, S.Ding and T.Suel. The 18th International World Wide Web Conference (WWW'09), Madrid, Spain, April 2009

Kamikaze is open sourced by LinkedIn Corp : http://data.linkedin.com/opensource/kamikaze.

The principal committer of Kamikaze is Hao Yan. If you have any questions regarding Kamikaze, please email him at hyan@linkedin.com.


Wiki

Wiki is available HERE

Issues

Issues are tracked HERE