Skip to content

Cache-friendly, Parallel, and Samplesort-based Constructor for Suffix Arrays and LCP Arrays

License

Notifications You must be signed in to change notification settings

jamshed/CaPS-SA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

90 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CaPS-SA

CaPS-SA is a simple, parallel, and cache-friendly Suffix Array and LCP array construction algorithm.

Installation

From source:

git clone https://github.com/jamshed/CaPS-SA.git
cd CaPS-SA/
mkdir build && cd build/
cmake -DCMAKE_INSTALL_PREFIX=../ ..
make install
cd ..

This installs caps_sa in a sub-directory named bin, inside the project root directory.

Usage

export PARLAY_NUM_THREADS=<thread-count>
caps_sa <input_file> <output_file>

Note that by default the subproblem count is set to 8000. If caps_sa is run on small datasets it may produce a segmentation fault if a given subproblem is of size 0. Future releases will dynamically set subproblem count, but as of the version 1 release, please use a small subproblem count for datasets significantly smaller than the human genome.