LZF Compressor

Overview

LZF-compress is a Java library for encoding and decoding data in LZF format, written by Tatu Saloranta (tatu.saloranta@iki.fi)

Data format and algorithm based on original LZF library by Marc A Lehmann. See LZF Format Specification for full description.

Format differs slightly from some other adaptations, such as the one used by H2 database project (by Thomas Mueller); although internal block compression structure is the same, block identifiers differ. This package uses the original LZF identifiers to be 100% compatible with existing command-line lzf tool(s).

LZF algorithm itself is optimized for speed, with somewhat more modest compression. Compared to the standard Deflate (algorithm gzip uses) LZF can be 5-6 times as fast to compress, and twice as fast to decompress. Compression rate is lower since no Huffman-encoding is used after lempel-ziv substring elimination.

License

Apache License 2.0

Requirements

Versions up to 1.0.4 require JDK 6; versions from 1.1 on require JDK 8.

Library has no external dependencies.

Usage

See Wiki for more details; here's a "TL;DNR" version.

Both compression and decompression can be done either by streaming approach:

InputStream in = new LZFInputStream(new FileInputStream("data.lzf"));
OutputStream out = new LZFOutputStream(new FileOutputStream("results.lzf"));
InputStream compIn = new LZFCompressingInputStream(new FileInputStream("stuff.txt"));

or by block operation:

byte[] compressed = LZFEncoder.encode(uncompressedData);
byte[] uncompressed = LZFDecoder.decode(compressedData);

and you can even use the LZF jar as a command-line tool (it has manifest that points to 'com.ning.compress.lzf.LZF' as the class having main() method to call), like so:

java -jar compress-lzf-1.1.2.jar

(which will display necessary usage arguments for -c(ompressing) or -d(ecompressing) files.

Adding as Dependency

Maven

<dependency>
  <groupId>com.ning</groupId>
  <artifactId>compress-lzf</artifactId>
  <version>1.1.2</version>
</dependency>

Module info (JPMS)

Starting with version 1.1, module-info.class is included; module name is com.ning.compress.lzf so you will need to use:

requires com.ning.compress.lzf

Parallel processing

Since the compression is more CPU-heavy than decompression, it could benefit from concurrent operation. This works well with LZF because of its block-oriented nature, so that although there is need for sequential processing within block (of up to 64kB), encoding of separate blocks can be done completely independently: there are no dependencies to earlier blocks.

The main abstraction to use is PLZFOutputStream which a FilterOutputStream and implements java.nio.channels.WritableByteChannel as well. It use is like that of any OutputStream:

PLZFOutputStream output = new PLZFOutputStream(new FileOutputStream("stuff.lzf"));
// then write contents:
output.write(buffer);
// ...
output.close();

Interoperability

Besides Java support, LZF codecs / bindings exist for non-JVM languages as well:

C: liblzf (the original LZF package!)
C#: C# LZF
Go: Golly
Javascript(!): freecode LZF (or via SourceForge)
Perl: Compress::LZF
Python: Python-LZF
Ruby: glebtv/lzf, LZF/Ruby

More

Project Wiki.

Alternative High-Speed Lempel-Ziv Compressors

LZF belongs to a family of compression codecs called "simple Lempel-Ziv" codecs. Since LZ compression is also the first part of deflate compression (which is used, along with simple framing, for gzip), it can be viewed as "first-part of gzip" (second part being Huffman-encoding of compressed content).

There are many other codecs in this category, most notable (and competitive being)

Snappy
LZ4

all of which have very similar compression ratios (due to same underlying algorithm, differences coming from slight encoding variations, and efficiency differences in back-reference matching), and similar performance profiles regarding ratio of compression vs uncompression speeds.

Name		Name	Last commit message	Last commit date
Latest commit History 327 Commits
.github/workflows		.github/workflows
.mvn/wrapper		.mvn/wrapper
src		src
testdata		testdata
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
VERSION.txt		VERSION.txt
mvnw		mvnw
pom.xml		pom.xml
profile-comp-perf		profile-comp-perf
profile-skip		profile-skip
profile-uncomp-perf		profile-uncomp-perf
run-comp-perf		run-comp-perf
run-skip		run-skip
run-uncomp-perf		run-uncomp-perf

License

ning/compress

Folders and files

Latest commit

History

Repository files navigation

LZF Compressor

Overview

License

Requirements

Usage

Adding as Dependency

Maven

Module info (JPMS)

Parallel processing

Interoperability

Related

More

Alternative High-Speed Lempel-Ziv Compressors

About

Resources

License

Stars

Watchers

Forks

Languages