ACP-DRL

About

ACP-DRL: An Anticancer Peptides Recognition Method Based on Deep Representation Learning.

Hardware Support

We ran ACP-DRL on a single node of the GPU cluster at the National Center for Protein Sciences (Beijing). The node has two 2.6 GHz Intel Xeon processors, eight Tesla V100 GPUs, and 256 GB of RAM, and runs CentOS 7.6.

Based on our practical experience, we recommend running on a GPU with at least 12 GB of memory.
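
To check a machine against this recommendation, a minimal sketch follows. It is not part of ACP-DRL: the helper names `meets_recommendation` and `report_gpus` are hypothetical, and it assumes "12 GB" means 12 * 10**9 bytes. It returns an empty list when PyTorch or a CUDA device is not available.

```python
# Assumption: "12 GB" is read as 12 * 10**9 bytes; adjust if you prefer GiB.
RECOMMENDED_BYTES = 12 * 10**9


def meets_recommendation(total_bytes, minimum=RECOMMENDED_BYTES):
    """Return True if a GPU with total_bytes of memory meets the recommendation."""
    return total_bytes >= minimum


def report_gpus():
    """Return (name, total_bytes, ok) for each visible CUDA device."""
    try:
        import torch  # imported lazily so the helper above works without PyTorch
    except ImportError:
        return []
    if not torch.cuda.is_available():
        return []
    report = []
    for index in range(torch.cuda.device_count()):
        props = torch.cuda.get_device_properties(index)
        report.append(
            (props.name, props.total_memory, meets_recommendation(props.total_memory))
        )
    return report


if __name__ == "__main__":
    for name, total_bytes, ok in report_gpus():
        status = "ok" if ok else "below recommendation"
        print(f"{name}: {total_bytes / 10**9:.1f} GB ({status})")
```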

Installation

Clone the repository with git, or download the zip file and extract it locally.

git clone https://github.com/shallFun4Learning/ACP-DRL.git

Dependency

Base dependencies:

Python 3.7.11/3.8.3

PyTorch 1.7.1/1.12.0

cudnn 7.6.5

transformers 4.32.1

tokenizers 0.13.2

datasets 2.16.1

scikit-learn 1.3.0

We recommend using conda for environment management.

a. Create a new environment.
```
conda create -n YOUR_ENV_NAME python=3.8.3
```
b. Switch to the created environment.
```
conda activate YOUR_ENV_NAME
```
c. Install PyTorch.
```
conda install pytorch==1.7.1 torchvision==0.8.2 torchaudio==0.7.2 cudatoolkit=10.2 -c pytorch
```
Alternatively, see the PyTorch website for other installation options.

d. Install the remaining dependencies.

First, install transformers:
```
pip install transformers
```
Then install the other dependencies listed above in the same way.
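
To confirm that the environment matches the versions listed under Dependency, a short check like the one below may help. It is a sketch, not part of the repository: `check_versions` is a hypothetical helper, and it relies on `importlib.metadata`, which is in the standard library from Python 3.8 onward.

```python
from importlib import metadata  # standard library in Python 3.8+

# Versions copied from the dependency list in this README.
EXPECTED = {
    "transformers": "4.32.1",
    "tokenizers": "0.13.2",
    "datasets": "2.16.1",
    "scikit-learn": "1.3.0",
}


def check_versions(expected):
    """Return {package: (expected, installed or None)} for every mismatch."""
    mismatches = {}
    for name, wanted in expected.items():
        try:
            found = metadata.version(name)
        except metadata.PackageNotFoundError:
            found = None  # package is not installed in this environment
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, found) in check_versions(EXPECTED).items():
        print(f"{name}: expected {wanted}, found {found or 'not installed'}")
```

An empty result means every listed package is installed at the listed version.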

Usage

Weights

The available weights files will be made available on our OneDrive.

and

Datasets

The available dataset files will be made available on our OneDrive.

Main

Alternate

IFPT

Fasta to CSV Conversion Script

This script converts .fasta files to .csv format. For each record, it treats the portion of the header line before the "|" as the title, the portion after the "|" as the label, and the next line as the sequence. These three values form the columns of the .csv file.

python fasta2csv.py /path/to/input1.fasta /path/to/input2.fasta ...

Arguments: --infiles: one or more paths to the .fasta files that you want to convert. Replace "/path/to/input1.fasta" and "/path/to/input2.fasta" in the sample command above with the actual paths to your files. You can pass as many .fasta file paths as you need.

Output: The script writes a .csv file to the same directory as each input file. The new file has the same name as the original .fasta file, with the .csv extension.
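
The conversion described above can be sketched as follows. This is an illustration of the described behaviour, not the code in the repository. The names `parse_fasta` and `convert`, the column order, and the column name "title" are assumptions. It also assumes that header lines start with ">" and that this character is not part of the title.

```python
import csv
import sys
from pathlib import Path


def parse_fasta(lines):
    """Return (title, label, sequence) tuples from header/sequence line pairs."""
    rows = []
    header = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            header = line[1:]  # assumption: drop the leading ">"
        elif header is not None:
            # Text before "|" is the title, text after it is the label.
            title, _, label = header.partition("|")
            rows.append((title.strip(), label.strip(), line))
            header = None  # only the line right after the header is used
    return rows


def convert(fasta_path):
    """Write a .csv next to fasta_path and return the new path."""
    fasta_path = Path(fasta_path)
    csv_path = fasta_path.with_suffix(".csv")
    with fasta_path.open() as src:
        rows = parse_fasta(src)
    with csv_path.open("w", newline="") as dst:
        writer = csv.writer(dst)
        writer.writerow(["title", "label", "sequence"])  # assumed column names
        writer.writerows(rows)
    return csv_path


if __name__ == "__main__":
    for path in sys.argv[1:]:
        print(convert(path))
```

The "sequence" and "label" column names match the two columns that the test dataset is required to have.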

Quick start

python run.py \
    --model_path YOUR_MODEL_PATH \
    --test_dataset_path YOUR_TEST_SET_PATH \
    --tokenizer_path YOUR_TOKENIZER_PATH \
    --outPutDir YOUR_OUTPUT_PATH

Explanation of each parameter:

--model_path YOUR_MODEL_PATH : The location of your model. Replace 'YOUR_MODEL_PATH' with the full path to the directory that contains your model file. For example, if the model file "model.pth" is stored in the directory "users/models", your model path would be "users/models".
--test_dataset_path YOUR_TEST_SET_PATH : The location of your test dataset file. Replace 'YOUR_TEST_SET_PATH' with the full path to your test dataset. For instance, if the file "test_dataset.csv" is stored in a directory named "data", your test set path would be "data/test_dataset.csv". The CSV file must have at least two columns: sequence and label.
--tokenizer_path YOUR_TOKENIZER_PATH : The location of your tokenizer configuration. Replace 'YOUR_TOKENIZER_PATH' with the full path to the directory that contains your tokenizer configuration file. For instance, if the file "tokenizer_config.json" is stored in the directory "users/tokenizer", your tokenizer path would be "users/tokenizer".
--outPutDir YOUR_OUTPUT_PATH : The directory where the results of the model run will be stored. Replace 'YOUR_OUTPUT_PATH' with the full path to your desired output location, for instance "output/directory".

Please make sure to replace 'YOUR_MODEL_PATH', 'YOUR_TEST_SET_PATH', 'YOUR_TOKENIZER_PATH', and 'YOUR_OUTPUT_PATH' with real paths in your environment.
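
Before running, it may be worth confirming that the test CSV has the two required columns. The sketch below is one way to do that and is not part of run.py. The names `read_header` and `missing_columns` are hypothetical, and it assumes the first row of the CSV is the header.

```python
import csv
import sys

# Column names required by this README for the test dataset.
REQUIRED_COLUMNS = {"sequence", "label"}


def read_header(csv_path):
    """Return the first row of the CSV file as a list of column names."""
    with open(csv_path, newline="") as handle:
        return next(csv.reader(handle), [])


def missing_columns(header):
    """Return the set of required columns that are absent from header."""
    return REQUIRED_COLUMNS - {name.strip() for name in header}


if __name__ == "__main__":
    for path in sys.argv[1:]:
        missing = missing_columns(read_header(path))
        if missing:
            print(f"{path}: missing columns {sorted(missing)}")
        else:
            print(f"{path}: ok")
```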

LICENSE

ACP-DRL is for non-commercial use only.

Support

Feel free to submit an issue or contact the author (sfun@foxmail.com) if you encounter any problems during use.

Happy New Year 2024

:-)
