CMS-BWT

Tool for computing the BWT of a set of highly similar strings using compressed matching statistics.

Installation

git clone https://github.com/fmasillo/CMS-BWT.git
git submodule update --init --recursive
cd CMS-BWT
make

Usage

Usage:

./cms_bwt [options] <input filename>
<input filename> is the name of the file containing paths to the reference sequence (in the first line) and to the collection file (in the second line).
  Options: 
        -p      read only a prefix of the file expressed in number of characters, def. whole file
        -b      size for the additional memory buffer in GB, def. 2 
        -r      outputs the run-length encoded BWT, def. false 
        -m      memory saving implementation, def. false 
        -o      basename for the output files, def. <input filename>

Command example for running the memory-saving implementation and outputting to my_output.rl_bwt the run-length encoded BWT of the first 100000000 characters using 1GB of extra space:

./cms_bwt -p 100000000 -b 1 -r -m -o my_output file_example.txt

file_example.txt content should look like this:

/data/reference.fa
/data/collection.fa

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
libsais		libsais
.DS_Store		.DS_Store
.gitattributes		.gitattributes
.gitignore		.gitignore
CMS-BWT-functions.cpp		CMS-BWT-functions.cpp
CMS-BWT.h		CMS-BWT.h
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
main.cpp		main.cpp
match.h		match.h
predecessor.h		predecessor.h
rmq_tree.h		rmq_tree.h
utils.h		utils.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CMS-BWT

Installation

Usage

About

Releases

Packages

Languages

License

fmasillo/CMS-BWT

Folders and files

Latest commit

History

Repository files navigation

CMS-BWT

Installation

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages