A feature ranking tool based on binomial distribution
The binomial distribution is a feature selection technique that has been applied successfully in many studies. For the convenience of researchers, we developed a feature ranking tool based on it.
Download: GitHub

Environment (Python 3.8 recommended):
pip3 install -r requirements.txt

|Parameter|Description|
|---|---|
|-k|value of k for the k-mers|
|-t|type of sequence (required: DNA, RNA or protein)|
|-f|path to the sequence file|

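
To illustrate what `-k` and `-t` control, the sketch below counts overlapping k-mers in a single sequence. It is not taken from the tool's source: the helper name `kmer_counts`, the `ALPHABETS` table, and the choice to skip k-mers containing symbols outside the alphabet are all assumptions made for illustration.

```python
from itertools import product

# Alphabets assumed for illustration; how the tool itself treats
# ambiguous symbols (e.g. N) is not documented in this README.
ALPHABETS = {
    "DNA": "ACGT",
    "RNA": "ACGU",
    "protein": "ACDEFGHIKLMNPQRSTVWY",
}


def kmer_counts(sequence, k, seq_type):
    """Count overlapping k-mers in one sequence (hypothetical helper)."""
    alphabet = ALPHABETS[seq_type]
    # Start every possible k-mer at zero so the feature vector has a fixed size.
    counts = {"".join(p): 0 for p in product(alphabet, repeat=k)}
    seq = sequence.upper()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:  # skip k-mers with symbols outside the alphabet
            counts[kmer] += 1
    return counts


counts = kmer_counts("ACGTAC", k=2, seq_type="DNA")
print(len(counts))   # 16 possible 2-mers for DNA
print(counts["AC"])  # 2
```

With `-k 2` and `-t DNA`, each sequence would therefore be described by 4^2 = 16 k-mer features under these assumptions.
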
python3 BinomialDistribution.py -k 2 -t DNA -f C:\Users\DNASeqfile
or
BinomialDistribution.exe -k 2 -t DNA -f C:\Users\DNASeqfile

|File name|Description|
|---|---|
|feature.csv|the CSV feature file before sorting by BD|
|BDSortedfeature.csv|the CSV feature file after sorting by BD|
|BinomialDistributionCL.txt|rank file|
|clresult.txt|the details of the CL values|

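
As a rough illustration of how a CL (confidence level) value can be derived from a binomial distribution, the sketch below uses a formulation common in binomial-distribution feature selection: for feature i and class j, CL = 1 - P(X >= n_ij) with X ~ Binomial(N_i, q_j), where n_ij is the count of the feature in class j, N_i its count over all classes, and q_j the share of all feature occurrences that fall in class j. This is an assumption, not the tool's actual code; the function names, the input layout, and the choice to rank by the maximum CL over classes are hypothetical.

```python
from math import comb


def binomial_cl(n_ij, N_i, q_j):
    """CL = 1 - P(X >= n_ij) for X ~ Binomial(N_i, q_j)."""
    p = sum(
        comb(N_i, m) * q_j**m * (1 - q_j) ** (N_i - m)
        for m in range(n_ij, N_i + 1)
    )
    # Clamp to guard against tiny negative values from rounding.
    return max(0.0, 1.0 - p)


def rank_features(class_counts):
    """Rank features by their highest CL over all classes.

    class_counts: {class_label: {feature: count}} (assumed layout).
    """
    classes = list(class_counts)
    features = sorted({f for c in classes for f in class_counts[c]})
    class_totals = {c: sum(class_counts[c].values()) for c in classes}
    grand_total = sum(class_totals.values())

    scores = {}
    for f in features:
        N_i = sum(class_counts[c].get(f, 0) for c in classes)
        scores[f] = max(
            binomial_cl(class_counts[c].get(f, 0), N_i,
                        class_totals[c] / grand_total)
            for c in classes
        )
    # Higher CL means the feature is more strongly biased toward one class.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy counts, invented purely for this example.
toy = {
    "positive": {"AA": 40, "AC": 20, "AG": 20},
    "negative": {"AA": 10, "AC": 20, "AG": 25},
}
for feature, cl in rank_features(toy):
    print(feature, round(cl, 4))
```

In this toy input, `AA` occurs far more often in the positive class than its overall share would predict, so it ranks first.
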
Fu-Ying Dao, Hao Lv, Zhao-Yue Zhang, Hao Lin*. (2021) BDselect: a package for k-mer selection based on binomial distribution. Current Bioinformatics, 17(3): 238-244.