Deep learning approach for cancer subtype classification using high-dimensional gene expression data It is built with Tensorflow and Python 3.
- python 3.7, numpy, scipy, TensorFlow 2.4, pysam
- DCGN-new.ipynb
- Comparative methods.ipynb
- BLCAMDAsmote.mat
- BLCALundsmote.mat
- BLCAcitsmote.mat
- The BRCA dataset is too large, the original dataset can be downloaded from www.acsu.buffalo.edu/~yijunsun/lab/DeepType.html
- Read the data (.mat), check the data and label dimensions,
- Feature normalization before training
- Set a random seed and shuffle the dataset
- Change the shape according to different datasets in the model.build function
- The last layer of data set BLCALund data set must be greater than 10 nodes.
- Set the loss function and evaluation indicators
- Define the training and validation steps
- Defining and dividing the dataset
- Perform training and validation steps in a loop
- After the training and validation steps, the model is finally evaluated on the test set.