A multichannel neural network to predict cellular response of drugs by integrating multidimensional data
- drug_response_data: drug response data from the PRISM database
- 388-cell-line-list: list of cell lines we use
- 470-734dim-miRNA: feature of miRNA
- 461-23316dim-copynumber: feature of copy number
- 1448-269dim-physicochemical: list of drugs and physicochemical properties
- 1448-881dim-fingerprint: feature of molecular fingerprint
- drug_smiles:SMILES (Simplified molecular input line entry system) of drugs
- preprocess.py: load data and convert to pytorch format
- train.py: train the model and make predictions
- functions.py: some custom functions
- simles2graph.py: convert SMILES sequence to graph
- AE.py: dimensionality reduction for ultra-high dimensional features
- NeRD_Net.py: multi-channel neural network model
- Unzip the data. Due to the large amount of data, part of the data is compressed when uploading.
- Install dependencies, including torch1.4, torch_geometric (you need to install torch_cluster, torch_scatter, torch_sparse, and torch_spline_conv before installation), matplotlib, scipy, sklearn, rdkit, and networkx.
- Run AE.py to reduce the dimensionality of the copynumber feature.
- Run preprocess.py to convert label data and feature data into pytorch format.
- Run train.py for training and prediction.
- Process the drug response data you want to use into csv format, with each entry containing the drug id, cell line id, and response value.
- Organize all cell line IDs and drug IDs into two lists and store them in two csv files respectively.
- Download the features of the drugs in the list, including SMILES and molecular fingerprints, from PubChem compound database. Then process them into the format of the feature data we uploaded.
- Download the features of the cell lines in the list, including miRNA and copy number, from CCLE database. Then process them into the format of the feature data we uploaded.
- Run the program as “Step-by-step instructions” (2)-(5).
torch1.4
torch_geometric (install torch_cluster, torch_scatter, torch_sparse, torch_spline_conv before installation) https://pytorch-geometric.readthedocs.io/en/latest/notes/installation.html
matplotlib
scipy
sklearn
rdkit
networkx