I-DNAN6mA: Accurate identification of DNA N6-methyladenine sites using base-pairing map and deep learning.

Abstract

Motivation: The recent discovery of numerous DNA N6-methyladenine (6mA) sites has transformed our perception of the roles of 6mA in living organisms. However, our ability to understand them is hampered by the inability of existing experimental methods to identify 6mA sites rapidly and cost-efficiently. Developing a novel method to identify 6mA sites quickly and accurately is critical for accelerating the detection and understanding of their functions.

Results: We propose a novel computational method, I-DNAN6mA, that identifies 6mA sites and complements experimental methods by leveraging base-pairing rules and a well-designed three-stage deep learning model with pairwise inputs. The performance of the proposed method is benchmarked on four species: Arabidopsis thaliana, Drosophila melanogaster, rice, and Rosaceae. The experimental results demonstrate that I-DNAN6mA achieves accuracies of 91.5%, 92.7%, 88.2%, and 96.2%, Matthews correlation coefficient values of 0.855, 0.831, 0.763, and 0.924, and area under the receiver operating characteristic curve values of 0.967, 0.963, 0.947, and 0.990 on the four benchmark datasets, respectively, and outperforms several existing state-of-the-art methods. To our knowledge, I-DNAN6mA is the first approach to identify 6mA sites using a novel image-like representation of DNA sequences and a deep learning model with pairwise inputs. I-DNAN6mA is expected to be useful for locating functional regions of DNA.
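
The image-like representation described in the abstract can be pictured with a short sketch. The repository contains a script named `calculate_16xLxL_matrix_map.py`, which suggests a 16 x L x L tensor for a sequence of length L. The encoding below (one channel per ordered nucleotide pair, set to 1 at position (i, j) when the sequence carries that pair at those two positions) is an assumption made for illustration and may differ from what the authors' script actually computes. The names `pair_map` and `PAIR_INDEX` are hypothetical.

```python
from itertools import product

NUCLEOTIDES = "ACGT"
# Hypothetical channel layout: one channel per ordered pair (AA, AC, ..., TT).
PAIR_INDEX = {a + b: k for k, (a, b) in enumerate(product(NUCLEOTIDES, repeat=2))}


def pair_map(sequence):
    """Return a 16 x L x L nested list; entry [k][i][j] is 1 when the
    nucleotides at positions i and j form the k-th ordered pair."""
    seq = sequence.upper()
    length = len(seq)
    tensor = [[[0] * length for _ in range(length)] for _ in range(16)]
    for i, a in enumerate(seq):
        for j, b in enumerate(seq):
            channel = PAIR_INDEX.get(a + b)  # None for ambiguous bases such as N
            if channel is not None:
                tensor[channel][i][j] = 1
    return tensor


example = pair_map("GATC")
print(len(example), len(example[0]), len(example[0][0]))  # 16 4 4
```

Plain nested lists keep the sketch dependency-free; in practice such a tensor would more likely be built as a NumPy array or PyTorch tensor before being passed to a model.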

System Requirements

Hardware Requirements: I-DNAN6mA requires only a standard computer with around 32 GB of RAM to support in-memory operations.

Software Requirements:

Python 3.7
PyTorch
Anaconda
CUDA 10.0 (optional, for GPU use)
cuDNN (>= 7.4.1) (optional, for GPU use)

I-DNAN6mA has been tested on Ubuntu 18.04 and Windows 10.

Installation

To install I-DNAN6mA and its dependencies, run the following commands in a terminal:

git clone https://github.com/XueQiangFan/I-DNAN6mA.git
cd I-DNAN6mA

Then create a virtual environment and install the I-DNAN6mA dependencies with conda, following the steps below:

3.	`conda create -n venv python=3.7`
4.	`conda activate venv`
5.	To run I-DNAN6mA on CPU: `conda install pytorch torchvision torchaudio cpuonly -c pytorch`
	To run I-DNAN6mA on GPU: `conda install pytorch torchvision torchaudio cudatoolkit=10.2 -c pytorch`
6.	`while read p; do conda install --yes $p; done < requirements.txt`
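
After the installation steps, a quick check along the following lines can confirm that the environment looks usable. This is a minimal sketch, not part of the repository: the function name `check_environment` is hypothetical, and treating Python 3.7 as a minimum version is an assumption (the README only lists 3.7 as the required version).

```python
import importlib.util
import sys


def check_environment(min_python=(3, 7)):
    """Return a dict summarising whether the basic requirements look satisfied."""
    report = {
        # Assumption: 3.7 is treated as a lower bound, not an exact pin.
        "python_ok": sys.version_info[:2] >= min_python,
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "cuda_available": False,
    }
    if report["torch_installed"]:
        import torch  # imported lazily so the check also runs without PyTorch

        report["cuda_available"] = bool(torch.cuda.is_available())
    return report


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```

If `cuda_available` is reported as `False`, the CPU installation command and the CPU entry script are the ones to use.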

Usage

To run I-DNAN6mA, use main_GPU.py on a machine with a GPU or main_CPU.py on a CPU-only machine:

run: python main_GPU.py -test_path <test path> -result_path <result path>

    For example:
    python main_GPU.py -test_path /DNAN6mAsites/dataset/A.thaliana_test.xlsx -result_path ./result.csv
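
The same call can be scripted from Python, for instance when several test files need to be processed. The sketch below only assembles the command line shown above; it assumes the repository's entry scripts are `main_GPU.py` and `main_CPU.py` and that it is run from the repository root. The helper names `build_command` and `run_prediction` are hypothetical.

```python
import subprocess
import sys


def build_command(test_path, result_path, use_gpu):
    """Build the argument list for the GPU or CPU entry script."""
    script = "main_GPU.py" if use_gpu else "main_CPU.py"
    return [sys.executable, script, "-test_path", test_path, "-result_path", result_path]


def run_prediction(test_path, result_path, use_gpu=False):
    """Run the prediction script from the repository root; raises on a non-zero exit."""
    subprocess.run(build_command(test_path, result_path, use_gpu), check=True)


command = build_command(
    "/DNAN6mAsites/dataset/A.thaliana_test.xlsx", "./result.csv", use_gpu=False
)
# Show the command without the interpreter path.
print(" ".join(command[1:]))
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when paths contain spaces.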

Datasets

The following benchmark datasets were used for I-DNAN6mA: Datasets

Citation guide

If you use I-DNAN6mA in your research, please cite the following paper:

[1] Xue-Qiang Fan, Bing Lin, Jun Hu*, and Zhong-Yi Guo*. I-DNAN6mA: Accurate identification of DNA N6-methyladenine sites using base-pairing map and deep learning. Journal of Chemical Information and Modeling.

Licence

Mozilla Public License 2.0

Contact

Thanks for your attention. If you have any questions, please contact me by email at xstrongf@163.com
