A compressed alternative to the Java BitSet class

(c) 2009-2012
Daniel Lemire,
Cliff Moon,
David McIntosh,
Robert Becho,
and Colby Ranger

This code is licensed under Apache License, Version 2.0 (ASL2.0).

This is a word-aligned compressed variant of
the Java BitSet class. We provide both a 64-bit
and a 32-bit RLE-like compression scheme. It can
be used to implement bitmap indexes.
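
To make the bitmap-index use case concrete, here is a minimal sketch
that uses the standard, uncompressed java.util.BitSet rather than this
library. The class and method names (BitmapIndexSketch, buildIndex, and)
are hypothetical and exist only for illustration. The idea is one bitmap
per distinct column value, with conjunctive queries answered by a
bitwise AND:

```java
import java.util.BitSet;
import java.util.HashMap;
import java.util.Map;

// Illustration only: an uncompressed bitmap index built on java.util.BitSet.
// None of these names come from JavaEWAH.
final class BitmapIndexSketch {

    // One bitmap per distinct value: bit i is set when row i holds that value.
    static Map<String, BitSet> buildIndex(String[] column) {
        Map<String, BitSet> index = new HashMap<>();
        for (int row = 0; row < column.length; row++) {
            index.computeIfAbsent(column[row], k -> new BitSet()).set(row);
        }
        return index;
    }

    // Rows satisfying both predicates: a bitwise AND of the two bitmaps.
    // The inputs are left unmodified.
    static BitSet and(BitSet a, BitSet b) {
        BitSet result = (BitSet) a.clone();
        result.and(b);
        return result;
    }

    public static void main(String[] args) {
        String[] city = {"Paris", "Montreal", "Paris", "Montreal"};
        String[] status = {"active", "active", "inactive", "active"};
        BitSet rows = and(buildIndex(city).get("Montreal"),
                          buildIndex(status).get("active"));
        // Rows where city = Montreal AND status = active.
        System.out.println(rows); // prints {1, 3}
    }
}
```

A compressed bitmap plays the same role as the BitSet values here; the
point of compression is that long stretches of identical bits, which are
common in such indexes, need not be stored or scanned word by word.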

The goal of word-aligned compression is not to
achieve the best compression, but rather to
improve query processing time. Hence, we try
to save CPU cycles, possibly at the expense of
storage. However, the EWAH scheme we implemented
is always more efficient storage-wise than an
uncompressed bitmap (as implemented in the Java
BitSet class by Sun).

For better performance, use a 64-bit JVM on a
64-bit CPU when using the 64-bit scheme (javaewah.EWAHCompressedBitmap).

The 32-bit version (javaewah32.EWAHCompressedBitmap32) should
compress better but be comparatively slower.
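
The effect of word size on storage can be sketched with a deliberately
simplified model of word-aligned run-length encoding. This is not the
actual EWAH format (see the paper cited below for that), and the names
WordAlignedSizeSketch and compressedWords are hypothetical. The model
assumes that each maximal run of identical "clean" words (all zeros or
all ones) costs one word, and that each "dirty" word (mixed bits) is
stored verbatim:

```java
import java.util.BitSet;

// Illustration only: a simplified size model, not the EWAH encoding itself.
final class WordAlignedSizeSketch {
    static final int ZEROS = 0, ONES = 1, DIRTY = 2;

    // Classifies the word covering bits [from, from + wordBits).
    static int classify(BitSet bits, int from, int wordBits) {
        int set = bits.get(from, from + wordBits).cardinality();
        if (set == 0) return ZEROS;
        if (set == wordBits) return ONES;
        return DIRTY;
    }

    // Counts the words needed under the simplified model described above.
    static long compressedWords(BitSet bits, int wordBits) {
        int totalWords = (bits.length() + wordBits - 1) / wordBits;
        long words = 0;
        int previous = DIRTY;
        for (int w = 0; w < totalWords; w++) {
            int kind = classify(bits, w * wordBits, wordBits);
            // A dirty word always costs one word; a clean word costs one
            // only when it starts a new run.
            if (kind == DIRTY || kind != previous) words++;
            previous = kind;
        }
        return words;
    }

    public static void main(String[] args) {
        BitSet sparse = new BitSet();
        sparse.set(0);
        sparse.set(1_000_000);
        for (int wordBits : new int[] {64, 32}) {
            long bytes = compressedWords(sparse, wordBits) * (wordBits / 8);
            System.out.println(wordBits + "-bit words: " + bytes + " bytes");
        }
    }
}
```

Under this model the sparse bitmap above needs three words at either
word size (dirty, run of zeros, dirty), so the 32-bit variant takes half
the bytes. The real scheme has additional bookkeeping, so actual sizes
will differ.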

For more details regarding the compression format, please
see Section 3 of the following paper:

Daniel Lemire, Owen Kaser, Kamel Aouiche, Sorting improves word-aligned bitmap indexes. Data & Knowledge Engineering 69 (1), pages 3-28, 2010.
(The PDF file is freely available on the arXiv site.)

== Unit testing ==

As of October 2011, this package relies on Maven. To
test it:

mvn test

See the Maven documentation for details.

== Usage ==


== Maven central repository ==

You can download JavaEWAH from the Maven central repository:

You can also specify the dependency in the Maven "pom.xml" file:


Naturally, you should replace "version" with the version
you want.