C implementation of SpeedyFx algorithm

SpeedyFx algorithm

Tokenize and hash large numbers of strings efficiently.

Original Java implementation ported to C by Stanislaw Pusep.

Compile with:

clang -O3 -o speedyfx speedyfx.c -lm

or:

gcc -O3 -o speedyfx speedyfx.c -lm

Then use as:

./speedyfx enwik9 > fv.bin

This generates a 128 KB feature vector for the enwik9 text file.
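
For orientation, here is a minimal sketch of how a table-driven tokenize-and-hash pass can fill a feature vector of that size. It is not the code in speedyfx.c. The names `build_code_table`, `hash_fv` and `set_bit`, the LCG constants, the seed handling, and the reading of "128 KB" as a bitmap of 2^20 bits are all assumptions made for illustration.

```c
#include <ctype.h>
#include <stddef.h>
#include <stdint.h>

#define FV_BITS  (1u << 20)      /* 1,048,576 bits (assumed bitmap layout) */
#define FV_BYTES (FV_BITS / 8)   /* 131,072 bytes = 128 KB */

/* Hypothetical helper: fill a 256-entry table with 0 for separator bytes
 * and a pseudo-random nonzero code for letters and digits. Upper and
 * lower case share one code, so hashing is case-insensitive. */
static void build_code_table(uint32_t table[256], uint32_t seed)
{
    uint32_t rnd[256];
    uint32_t x = seed;
    for (int i = 0; i < 256; i++) {
        x = x * 1664525u + 1013904223u; /* illustrative LCG constants */
        rnd[i] = x | 1u;                /* force nonzero: 0 means separator */
    }
    for (int i = 0; i < 256; i++)
        table[i] = isalnum(i) ? rnd[tolower(i)] : 0;
}

/* Hypothetical helper: set the bit selected by a word hash. */
static void set_bit(uint8_t fv[FV_BYTES], uint32_t hash)
{
    uint32_t bit = hash % FV_BITS;
    fv[bit >> 3] |= (uint8_t)(1u << (bit & 7));
}

/* Single pass over the buffer: one table lookup, one shift and one add
 * per byte. A word ends at the first separator byte; its rolling hash
 * then selects one bit in the feature vector. */
static void hash_fv(const uint32_t table[256], const unsigned char *buf,
                    size_t len, uint8_t fv[FV_BYTES])
{
    uint32_t wd = 0;
    int in_word = 0;
    for (size_t i = 0; i < len; i++) {
        uint32_t code = table[buf[i]];
        if (code) {
            wd = (wd >> 1) + code;
            in_word = 1;
        } else if (in_word) {
            set_bit(fv, wd);
            wd = 0;
            in_word = 0;
        }
    }
    if (in_word)                 /* flush a word that runs to end of input */
        set_bit(fv, wd);
}
```

A caller would zero a buffer of `FV_BYTES` bytes, build the table once, and then call `hash_fv` on each chunk of input. Because the inner loop has no per-token allocation or string comparison, its cost is dominated by memory reads, which is what makes this kind of approach fast on large inputs.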

Benchmark

Test data: https://cs.fit.edu/~mmahoney/compression/enwik9.bz2

Hardware: Intel(R) Xeon(R) CPU E5620 @ 2.40GHz

Average feature vector build speed: 213.83 MB/s
