BPPart/BPMax

This repository includes the implementation of BPPart and BPMax, as well as the last stable version of piRNA tool from the Algorithmic Biology Laboratory (with the permission of Dr. Hamidreza Chitsaz)

To see the details about these tools and what they compute, please refer to below manuscripts. Please cite them if you use part of this repository:

Ebrahimpour-Boroojeny, A., Rajopadhye, S., & Chitsaz, H. (2019). BPPart and BPMax: RNA-RNA interaction partition function and structure prediction for the base pair counting model. arXiv preprint arXiv:1904.01235.
Chitsaz, H., Salari, R., Sahinalp, S. C., & Backofen, R. (2009). A partition function algorithm for interacting nucleic acid strands. Bioinformatics, 25(12), i365-i373.

Installation

To install the tools after cloning or downloading the repository, simply use the command:

  make

Input format

The input format to these tools have beem made the same to make it easier to replace them with one another in a pipeline. The input has to be a fasta file in which each entry contains the the pair of sequence we want to run these tools on. Those sequence should be separated with an &. test_input.fa is a sample input with two pair of sequences. To make the format of the input clear, here we show the content of the file:

  >example1
  ACCGCCGTCTTCGAGGAAAG&CCCGGCTGCTAGCTAGGAGAAATCGCGCATTT
  >example2
  CGCGCTGGATAAATATAGGACCAGGAAT&GCTCGGATAGAGCTAGGAGAAATCGCGCCGCTAGA

Running the tools

To run the tools with the default hyper-parameters use these commands (replace the test_input.fa with your desired input file):

  ./bppart test_input.fa

  ./bpmax test_input.fa

  ./pirna test_input.fa

Tunning the hyper-parameters

To get a list of the hyper-parameters of the each of the tools, simple use -h. As an example for bppart:

  ./bppart -h

As an example, to change the weights of AU and GU pairs to 1.0 and 2.5, respectively, use this command (keep in mind that the weight of CG pairs are considered to be 3):

  ./bppart -A=1 -G=2.5 test_input.fa

As another example, to run piRNA in 25 degrees Celsius, run this:

  ./pirna -t=25 -T=25 test_input.fa

Accumulating the results

The results of bppart and bpmax will be automatically accumulated in a tab-separated file, in which ear row contains the results for one entry (one pair of sequences) of the input file.

piRNA generates separate files for each of the entries in the input file. To make it easier to process the data, we have prepared a script to accumulate these results in single tab-separated file. To do so, use this command (replace the test_input.fa with your desired input file):

   python src_ext_res.py test_input.fa

Precomputed scores

The precomputed scores on the data from RISE database are available in the pre_computed folder. table_x.csv files have the scores of piRNA at temperature x. BPPart and BPMax scores are also available (they are not temperature-dependent).

To compute the correlations that are presented in the paper and generate the correlation plot of the paper, you can run the command below. Note that len_data_human_1_101.fa is the file that has the length information of the RNA pairs in order to normalize the scores.

   python correlation.py pre_computed/len_data_human_1_101.fa

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
data		data
pre_computed		pre_computed
.gitignore		.gitignore
.hgignore		.hgignore
.nfs00000000008c01f800000001		.nfs00000000008c01f800000001
Makefile		Makefile
README.md		README.md
alloc.C		alloc.C
alloc.h		alloc.h
appirna		appirna
appirna.C		appirna.C
appirna.h		appirna.h
birna		birna
birna.C		birna.C
birna.h		birna.h
birna2		birna2
birna2.C		birna2.C
birna2.h		birna2.h
bpmax		bpmax
bpmax.C		bpmax.C
bpmax.h		bpmax.h
bppart		bppart
bppart.C		bppart.C
bppart.h		bppart.h
bpscore.C		bpscore.C
bpscore.h		bpscore.h
collection.C		collection.C
collection.h		collection.h
config.h		config.h
correlation.py		correlation.py
energy.C		energy.C
energy.h		energy.h
extract_ranks.py		extract_ranks.py
extract_ranks_2col.py		extract_ranks_2col.py
generate_random_seqs.py		generate_random_seqs.py
getopt.C		getopt.C
getopt.h		getopt.h
jointprob.C		jointprob.C
jointprob.h		jointprob.h
mtree.C		mtree.C
mtree.h		mtree.h
partitionfunction.C		partitionfunction.C
partitionfunction.h		partitionfunction.h
pirna		pirna
pirna.C		pirna.C
pirna.h		pirna.h
probability.C		probability.C
probability.h		probability.h
sequence.C		sequence.C
sequence.h		sequence.h
src_ext_res.py		src_ext_res.py
table.C		table.C
table.h		table.h
test_input.fa		test_input.fa
ubpartitionfunction.C		ubpartitionfunction.C
ubpartitionfunction.h		ubpartitionfunction.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BPPart/BPMax

Installation

Input format

Running the tools

Tunning the hyper-parameters

Accumulating the results

Precomputed scores

About

Releases

Packages

Languages

Ali-E/bipart

Folders and files

Latest commit

History

Repository files navigation

BPPart/BPMax

Installation

Input format

Running the tools

Tunning the hyper-parameters

Accumulating the results

Precomputed scores

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages