README

What is this repository for?

This repository contains implementation of hash tables that are aimed to better support kmer indexing. Specifically, the hash tables are based on open addressing and uses either linear probing or Robin Hood hashing.

The implemention is c++ 11 compliant and header only.

Repository Structure

The repository is organized as follows:

"include/kmerhash" contains the implementations, in particular hashmap_linearprobe_doubling.hpp and hashmap_robinhood_doubling.hpp.
"include/kmerhash/experimental" contains some experimental implementations.
"test/unit" contains the unit tests for the clases.
"benchmark" contains benchmarks, in particular BenchmarkHashTables.cpp is the primary benchmark tool.
"ext" contains dependencies as git submodules.

Dependencies

For the moment, this project depends on "kmerind", and indirectly on Kmerind's dependencies including google sparse hash map, google farm hash, murmur hash, mxx, google test, and tclap. These dependencies are likely to be refactored later.

The project requires a c++11 compliant compiler (4.8.4 or later) and cmake ver 2.8 or later.

To initialize the git submodules, invoke the following:

#!sh
cd {src}
git submodule init
git submodule update

cd ext/kmerind
git submodule init
git submodule update

Configuration

To compile, first create a build directory, preferably outside of the source directory.

#!sh
mkdir {build}
cd {build}
cmake {src} -DENABLE_TESTING=ON -DENABLE_BENCHMARKING=ON -DBUILD_EXAMPLE_APPLICATIONS=ON
make

Alternatively, you can use ccmake.

#!sh
mkdir {build}
cd {build}
ccmake {src}
make

How to run tests

The unit test are located in the "test" subdirectory inside the build directory. To run individual tests, the executable can be invoked directly. To run all tests, use

#!sh
cd {build}
make test

How to run benchmarks

There is currently only 1 benchmark and it is hard coded to insert 100M elements with average 5x repeats, and query with 10M elements. The following tests are run in sequence: insert, find, count, erase, count.

To run the benchmark, invoke the executable below from the build directory root.

#!sh
cd {build}
bin/benchmark_hashtables

Contribution guidelines

Writing tests
Code review
Other guidelines

Name		Name	Last commit message	Last commit date
Latest commit History 414 Commits
.vscode		.vscode
benchmark		benchmark
cmake		cmake
conf		conf
docs		docs
ext		ext
include/kmerhash		include/kmerhash
scripts		scripts
test		test
.cproject		.cproject
.gitignore		.gitignore
.gitmodules		.gitmodules
.project		.project
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
multithread.debug.18225d4.xlsx		multithread.debug.18225d4.xlsx
multithread.debug.19ca57f.xlsx		multithread.debug.19ca57f.xlsx
multithread.debug.574ddd1.xlsx		multithread.debug.574ddd1.xlsx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README

What is this repository for?

Repository Structure

Dependencies

Configuration

How to run tests

How to run benchmarks

Contribution guidelines

Who do I talk to?

About

Releases

Packages

Contributors 2

Languages

License

ParBLiSS/kmerhash

Folders and files

Latest commit

History

Repository files navigation

README

What is this repository for?

Repository Structure

Dependencies

Configuration

How to run tests

How to run benchmarks

Contribution guidelines

Who do I talk to?

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages