## Machine learning using convolutional neural networks for SERS analysis of biomarkers in medical diagnostics

Machine learning has previously been used to analyze dye-free SERS spectra, but it has not been applied to dye-labeled SERS spectra, whose spectral shapes are known.

The authors compare the performance of spectral decomposition, support vector regression (SVR), random forest regression (RFR), partial least squares regression (PLSR), and a convolutional neural network (CNN) for SERS “spectral unmixing” of a multiplexed mixture of 7 SERS-active “nanorattles” loaded with different dyes for mRNA biomarker detection.
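To make the unmixing task concrete, here is a minimal sketch that assumes a measured spectrum is a non-negative linear combination of known reference spectra and recovers the weights with non-negative least squares. The notes do not say how the paper's "spectral decomposition" baseline was implemented, so `scipy.optimize.nnls`, the wavenumber axis, and the single-peak Gaussian references are all illustrative assumptions.

```python
import numpy as np
from scipy.optimize import nnls

rng = np.random.default_rng(0)

# Hypothetical wavenumber axis and 7 synthetic "dye" reference spectra
# (one Gaussian peak each); real SERS references would be measured.
wavenumbers = np.linspace(400, 1800, 500)
centers = np.linspace(500, 1700, 7)
references = np.exp(-((wavenumbers[None, :] - centers[:, None]) / 15.0) ** 2)

# Simulate a multiplexed spectrum: non-negative weights times references, plus noise.
true_weights = rng.uniform(0.1, 1.0, size=7)
mixture = true_weights @ references + rng.normal(0.0, 1e-3, size=wavenumbers.size)

# Non-negative least squares: find w >= 0 minimizing ||references.T @ w - mixture||.
estimated_weights, residual_norm = nnls(references.T, mixture)
print(np.round(estimated_weights, 3))
print(np.round(true_weights, 3))
```

The regression models discussed below address the same problem from the other direction: they learn a mapping from spectrum to weights using labeled training spectra instead of fitting references directly.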

#### Partial least squares regression
PLSR is especially useful when the X values are not independent and exhibit multicollinearity, as is the case with Raman spectra.
PLSR analysis was conducted in Python with the scikit-learn package, using 29 components and a tolerance of 5.14 x 10^-66.

#### Support vector regression
SVR is a regression technique related to the support vector machine (SVM), a classification technique.
Unlike linear regression, the best fit is not found by minimizing squared error; instead, decision boundaries on either side of the hyperplane allow an
error tolerance. SVR analysis was done in Python using scikit-learn with a radial basis function kernel,
epsilon = 1.03 x 10^-3, C = 1.07, and gamma = 2.54 x 10^-2.
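The hyperparameters above map directly onto scikit-learn's `SVR`. Because `SVR` predicts a single target, the sketch below wraps it in `MultiOutputRegressor` to obtain one regressor per dye; that wrapper is an assumption, since the notes do not say how the seven outputs were handled, and the data are synthetic.

```python
import numpy as np
from sklearn.multioutput import MultiOutputRegressor
from sklearn.svm import SVR

rng = np.random.default_rng(0)

# Synthetic stand-in data: 200 "spectra" of 50 points, 7 targets each.
components = rng.random((7, 50))
y = rng.random((200, 7))
X = y @ components + rng.normal(0.0, 0.01, size=(200, 50))

# Kernel, epsilon, C, and gamma are the values quoted in the notes.
base = SVR(kernel="rbf", epsilon=1.03e-3, C=1.07, gamma=2.54e-2)

# Assumption: fit one independent SVR per dye contribution.
model = MultiOutputRegressor(base)
model.fit(X[:140], y[:140])

y_pred = model.predict(X[140:])
print(y_pred.shape)
```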

#### Random forest regression
RFR is an ensemble learning regression technique that uses decision trees and modified bootstrap aggregation (bagging). Ensemble learning combines the outputs of multiple weak learners to produce a strong learner. The modified bagging method randomly samples the dataset with replacement to produce multiple subsets of
the data, and each tree is grown on a different randomly sampled subset. RFR analysis was done in Python using scikit-learn with a max depth of 18, 36 max features, a min samples per leaf of 1, a min samples per split of 4, and 483 estimators.
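These settings correspond to arguments of scikit-learn's `RandomForestRegressor`, which handles multiple targets natively. The sketch below uses synthetic data with 60 features so that `max_features=36` is valid; the data shapes, `random_state`, and `n_jobs` are illustrative choices, not values from the paper.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Synthetic stand-in data: 200 "spectra" of 60 points, 7 targets each.
components = rng.random((7, 60))
y = rng.random((200, 7))
X = y @ components + rng.normal(0.0, 0.01, size=(200, 60))

forest = RandomForestRegressor(
    n_estimators=483,        # number of trees
    max_depth=18,
    max_features=36,         # features considered at each split
    min_samples_leaf=1,
    min_samples_split=4,
    bootstrap=True,          # each tree sees a resample drawn with replacement
    random_state=0,
    n_jobs=-1,
)
forest.fit(X[:140], y[:140])

y_pred = forest.predict(X[140:])
print(len(forest.estimators_), y_pred.shape)
```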


#### Convolutional neural network
The 1D-CNN was built using TensorFlow in Python and comprises an input layer, four 1D convolutional layers separated by three max pooling layers, followed by a fully connected dense layer and an output layer (Figure 3). The preprocessed SERS spectra are fed to the input layer, which passes them to the first convolutional layer, comprising
16 kernels of size 12. A convolutional layer moves a kernel over the spectrum with a stride of 1 and outputs a feature map that is fed to a ReLU nonlinear activation function. A max pooling layer with stride 2 reduces the dimensionality of the previous layer, decreasing the risk of overfitting and the computational burden. After four convolutional and four max pooling layers, the data are flattened before being fed into the dense layer with a dropout rate of 10%.
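The mechanics of one convolution, ReLU, and max pooling stage can be illustrated in plain NumPy. This is a sketch of the operations only, not the paper's TensorFlow model: the spectrum length of 500, the random kernel weights, the absence of padding, and the pool size of 2 are assumptions.

```python
import numpy as np

def conv1d_relu(signal, kernels):
    """Slide each kernel over a 1D signal with stride 1 (no padding), then apply ReLU."""
    kernel_size = kernels.shape[1]
    # windows has shape (len(signal) - kernel_size + 1, kernel_size)
    windows = np.lib.stride_tricks.sliding_window_view(signal, kernel_size)
    feature_map = windows @ kernels.T        # one column per kernel
    return np.maximum(feature_map, 0.0)      # ReLU

def max_pool1d(feature_map, pool_size=2, stride=2):
    """Keep the maximum of each window along the spectral axis."""
    n_out = (feature_map.shape[0] - pool_size) // stride + 1
    return np.stack([
        feature_map[i * stride : i * stride + pool_size].max(axis=0)
        for i in range(n_out)
    ])

rng = np.random.default_rng(0)
spectrum = rng.random(500)                   # hypothetical preprocessed spectrum
kernels = rng.normal(size=(16, 12))          # 16 kernels of size 12, random weights

feature_map = conv1d_relu(spectrum, kernels)
pooled = max_pool1d(feature_map)
print(feature_map.shape, pooled.shape)       # (489, 16) (244, 16)
```

Each pooling stage roughly halves the length of the feature map, which is how stacking several of them reduces the number of values reaching the dense layer.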

![image.png](attachment:1c4b29d8-be30-4b8e-9d81-2e890770b88e.png)

    CNN layer architecture. CNN, convolutional neural network.

## Results

![image.png](attachment:5118d92b-7628-4261-ae1e-e3f92c8616ab.png)

![image.png](attachment:08c865ab-b8bc-4a60-89df-c802aebaff45.png)

RMSEspectrum performance of n = 5 CNNs, n = 5 RFRs, SVR, and PLSR on the 100-mW test set (a) and the 80-mW test set (b) is plotted as circular markers for training sets of size n = 100, n = 500, n = 1000, n = 2000, n = 4000, n = 8000, and n = 16000 (70% training, 30% validation) on a semilog plot. Smoothed training sets are plotted with solid markers connected by solid lines, and unsmoothed training sets are plotted as hollow markers connected by dashed lines. Training time (c) and time per prediction (d) for PLSR, SVR, RFR, and CNN were averaged between models trained on smoothed and unsmoothed data with the above-mentioned training set sizes and plotted on a log-log and a semilog plot, respectively. CNN, convolutional neural network; SVR, support vector regression; PLSR, partial least squares regression; RFR, random forest regression.

Prediction performance of the CNN, SVR, and RFR models depends greatly on hyperparameter tuning. SVR relies heavily on tuning of gamma and epsilon to achieve optimal model performance. Important hyperparameters to tune in RFR include the number of decision trees in the forest, the maximum depth of the trees, and the minimum number of samples required to split an internal node. The CNN has many more tunable hyperparameters than the other algorithms because of its many possible architectures.
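As one illustration of such tuning, the sketch below runs a small cross-validated grid search over gamma and epsilon for an RBF-kernel SVR. The notes do not describe the paper's tuning procedure, so `GridSearchCV`, the grid values, the 3-fold split, and the synthetic single-target data are all assumptions.

```python
import numpy as np
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVR

rng = np.random.default_rng(0)

# Synthetic single-target regression problem standing in for one dye contribution.
X = rng.random((120, 20))
y = X @ rng.random(20) + rng.normal(0.0, 0.01, size=120)

# Hypothetical search grid; real ranges would be chosen for the data at hand.
param_grid = {
    "gamma": [1e-3, 1e-2, 1e-1],
    "epsilon": [1e-3, 1e-2, 1e-1],
}

search = GridSearchCV(
    SVR(kernel="rbf", C=1.0),
    param_grid,
    scoring="neg_root_mean_squared_error",   # higher (closer to 0) is better
    cv=3,
)
search.fit(X, y)
print(search.best_params_, -search.best_score_)
```

The same pattern applies to the RFR hyperparameters listed above by swapping the estimator and the grid.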

The authors show that the CNN most accurately determines the relative contribution of each individual dye-loaded nanoparticle. The CNN and the comparative models were then used to analyze the SERS spectrum from a single complex point-of-care assay that detected mRNA biomarkers for head and neck cancers in 20 samples. The CNN, trained on simulated multiplex data, identified precise dye contributions from a single-sided assay with RMSElabel = 6.42 x 10^-2. These results demonstrate the potential of CNN-based ML to promote SERS-based diagnosis.
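For reference, a root-mean-square error over label values can be computed as in the sketch below. It assumes RMSElabel denotes the RMSE between predicted and true dye contributions, a definition the notes do not state, and the numbers are made up for illustration.

```python
import numpy as np

def rmse(predicted, true):
    """Root-mean-square error between two arrays of equal shape."""
    predicted = np.asarray(predicted, dtype=float)
    true = np.asarray(true, dtype=float)
    return float(np.sqrt(np.mean((predicted - true) ** 2)))

# Made-up contributions for a 7-dye panel (not data from the paper).
true_labels = np.array([0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0])
pred_labels = np.array([0.02, 0.0, 0.95, 0.01, 0.0, 0.03, 0.0])

print(rmse(pred_labels, true_labels))
```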

### So the CNN is very good at unmixing SERS spectra for medical diagnostics.