EDLMFC

An ensemble deep learning with multi-scale feature combination for ncRNA-protein interaction prediction.

The untils, data and result directories contain model codes, tested data sets and generated results, respectively. The depended python packages are listed in requirements.txt. The package versions should be followed by users in their environments to achieve the supposed performance.

How to run

The program is in Python 3.6 using Keras and Tensorflow backends. Use the below bash command to run EDLMFC.

    python main.py -d dataset

The parameter of dataset could be RPI488, RPI1807 and NPInter v2.0. Then, EDLMFC will perform 5-fold cross validation on the specific dataset.

Two RPI datasets

The widely used RPI benchmark datasets are organized in the data directory.

Due to the limitation of the hardware conditions of the selected RNA secondary structure method, it can only predict the secondary structure of RNA with a length of no more than 500 nucleotides, so we preprocessed the data.

               Dataset    |  #Positive pairs  |  #Negative pairs  |  RNAs  |  Proteins  |Reference

Original set

             RPI1807             1807               1436            1078       3131        [1]
             
             NPInter v2.0        10412             10412            4636       449         [2] 
             
             RPI488              243                245             25         247         [1]

Optimal set

             RPI1807             652                 221            646        868         [1]
           
             NPInter v2.0        1943               1943            513        448         [2]  
            
             RPI488               43                 233            13         155         [1]

Help

For any questions, feel free to contact me by WangJingJing@emails.bjut.edu.cn or start an issue instead.

[1] Pan, X.Y.; Fan, Y.X.; Yan, J.C.; Shen, H.B. IPMiner: hidden ncRNA-protein interaction sequential pattern mining with stacked autoencoder for accurate computational prediction. Bmc Genomics 2016, 17. doi:ARTN 582 10.1186/s12864-016-2931-8.

[2] Yuan, J.;Wu,W.; Xie, C.Y.; Zhao, G.G.; Zhao, Y.; Chen, R.S. NPInter v2.0: an updated database of ncRNA interactions. Nucleic Acids Research 2014, 42, D104–D108. doi:10.1093/nar/gkt1057.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
data		data
result		result
script		script
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

EDLMFC

How to run

Two RPI datasets

Help

About

Releases

Packages

Contributors 2

Languages

JingjingWang-87/EDLMFC

Folders and files

Latest commit

History

Repository files navigation

EDLMFC

How to run

Two RPI datasets

Help

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages