Optimized implementation of BlaBla for SSE2/SSSE3/AVX2

This project is an optimized implementation of BlaBla for CPUs supporting SSE2, SSSE3 or AVX2 instructions. A reference C implementation is also provided for comparison. Another reference C implementation was written by Frank Denis.

The optimization strategy is inspired by the AVX2 ChaCha implementation by Samuel Neves.

Benchmarks

The project still lacks extensive benchmarks on multiple architectures, but current tests suggest ~15% performance improvement over AVX2 ChaCha implementation for the same number of rounds.

Testing

You can check that the code compiles and benchmark the various implementations as follows.

make
./bench-ref
./bench-opt-sse2
./bench-opt-ssse3
./bench-opt-avx2

Authors

Guillaume Endignoux, while intern at Kudelski Security

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
bench.c		bench.c
blabla-opt.c		blabla-opt.c
blabla-ref.c		blabla-ref.c
blabla.h		blabla.h
config.h		config.h
test.c		test.c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Optimized implementation of BlaBla for SSE2/SSSE3/AVX2

Benchmarks

Testing

Authors

About

Releases

Packages

Languages

License

kudelskisecurity/blabla-avx2

Folders and files

Latest commit

History

Repository files navigation

Optimized implementation of BlaBla for SSE2/SSSE3/AVX2

Benchmarks

Testing

Authors

About

Resources

License

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages