The aim of the project is to build a dictionary of all k-mers in an input file, count their multiplicities, calculate their Gödel numbers, and use these as features to create data matrices.
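As a rough illustration of the idea, the sketch below counts overlapping k-mers in a short sequence and assigns each one a Gödel number. It is not taken from the project's code. The symbol codes in `SYMBOL_CODE`, the helper names, and the classic prime-power encoding (product of the i-th prime raised to the code of the i-th symbol) are all assumptions, and the project may encode k-mers differently.

```python
from collections import Counter

# Assumed symbol codes; the project's actual encoding may differ.
SYMBOL_CODE = {"A": 1, "C": 2, "G": 3, "T": 4}


def first_primes(n):
    """Return the first n primes using simple trial division."""
    primes = []
    candidate = 2
    while len(primes) < n:
        if all(candidate % p for p in primes):
            primes.append(candidate)
        candidate += 1
    return primes


def count_kmers(sequence, k):
    """Count every overlapping k-mer of length k in the sequence."""
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


def godel_number(kmer):
    """Encode a k-mer as the product of prime_i ** code(symbol_i)."""
    number = 1
    for prime, symbol in zip(first_primes(len(kmer)), kmer):
        number *= prime ** SYMBOL_CODE[symbol]
    return number


# Map each 2-mer of a toy sequence to (Gödel number, multiplicity).
counts = count_kmers("ACGTAC", 2)
features = {kmer: (godel_number(kmer), n) for kmer, n in counts.items()}
```

With these assumed codes, the 2-mer `AC` occurs twice in `ACGTAC` and is encoded as 2^1 * 3^2 = 18.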
The input must be a single .fasta file. Run the project with:
cd localmachine/path/to/kmerGodel
python run.py kmin kmax /localmachine/path/to/input_file.fasta
- localmachine/path/to/kmerGodel: the full path of the local repository
- kmin and kmax: integers giving the smallest and largest k-mer length to consider
- /localmachine/path/to/input_file.fasta: the full path of the input .fasta file on your local machine
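For orientation, here is a minimal sketch of how the three arguments could be handled: reading records from the .fasta file and expanding the two integer bounds into a list of k values. The function names `read_fasta` and `k_values` are hypothetical, and treating the bounds as an inclusive range is an assumption; the actual behaviour of run.py may differ.

```python
def read_fasta(path):
    """Yield (header, sequence) pairs from a FASTA file."""
    header, chunks = None, []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            if line.startswith(">"):
                # A new header closes the previous record, if any.
                if header is not None:
                    yield header, "".join(chunks)
                header, chunks = line[1:], []
            else:
                chunks.append(line.upper())
    if header is not None:
        yield header, "".join(chunks)


def k_values(kmin, kmax):
    """Validate the bounds and return every k from kmin to kmax.

    Assumption: the range is inclusive at both ends.
    """
    kmin, kmax = int(kmin), int(kmax)
    if kmin < 1 or kmax < kmin:
        raise ValueError("expected 1 <= kmin <= kmax")
    return list(range(kmin, kmax + 1))
```

Sequence lines that span several lines are joined into one string per record, so k-mers that cross a line break in the file are not lost.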
The repository folder contains a .def file for building a Singularity image of the project, along with a README file with instructions on how to build and run the image.