Bulk text diff using NVIDIA CUDA

This is an ongoing effort to accelerate diff-ing of text files in a data parallel fashion. In other words, this project tries to improve on diff-ing many files at once and not increasing the speed of a single diff operation.

The approach chosen is as follows:

Extract line endings.
Hash each line.
Apply Myers algorithm to hashes on the GPU.

Results

This project appeared to be 2x faster than a single-threaded libxdiff run. The profiling results showed that 25% of time is spent on line endings and hashing, which is done in OpenMP multithreaded mode.

License

MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
xdiffbench		xdiffbench
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
benchmark.cpp		benchmark.cpp
diff_myers.py		diff_myers.py
diffcuda.cpp		diffcuda.cpp
diffcuda.h		diffcuda.h
genbench.py		genbench.py
kernel.cu		kernel.cu
private.h		private.h
python.cpp		python.cpp
xxhash.c		xxhash.c
xxhash.h		xxhash.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Bulk text diff using NVIDIA CUDA

Results

License

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

src-d/diffcuda

Folders and files

Latest commit

History

Repository files navigation

Bulk text diff using NVIDIA CUDA

Results

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages