
Deep learning for regression of cod otoliths

Highlights:

  • Deep learning models can accurately estimate age from images of broken otoliths
  • All modern architectures perform similarly, so we recommend simple architectures
  • Under-exposed images are better, but combining multiple exposures may be beneficial
  • Models are highly correlated, which has implications for selection into ensembles
  • Deep learning models could be added to routine otolith-based age reading procedures
[Example otolith images at middle, min, and max exposure]

The data set consists of 6 images of each otolith: first 3 images at 3 different exposures, then a 180-degree rotation followed by another 3 images. The project investigates both which architectures in the EfficientNet family are best for predicting otolith age as a regression task using Mean Squared Error (MSE), and which images in the protocol give the best results. The exposures investigated are min, middle, and max, as well as a 9-channel image combining all 3 exposures. The data set consists of 5150 images covering ages 1 to 13.

Figure 1: An example of a 9-channel image at the bottom right, where the image is represented as the expectation of the 3 images across channels.
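The repository text only describes the 9-channel input; the sketch below shows one way such an input could be assembled, assuming the three exposures of each otolith are stored as separate, aligned RGB files. The file names and the `load_rgb` helper are illustrative, not the repository's actual code.

```python
import numpy as np
from PIL import Image

def load_rgb(path, size=(456, 456)):
    """Load an image as a float RGB array of shape (H, W, 3) in [0, 1]."""
    return np.asarray(Image.open(path).convert("RGB").resize(size), dtype=np.float32) / 255.0

# Hypothetical file names for the three exposures of one otolith.
paths = ["otolith_0001_min.jpg", "otolith_0001_middle.jpg", "otolith_0001_max.jpg"]

# Stack the three RGB images along the channel axis -> shape (H, W, 9).
nine_channel = np.concatenate([load_rgb(p) for p in paths], axis=-1)

# For display (as in Figure 1), average the three exposures per RGB channel -> (H, W, 3).
preview = nine_channel.reshape(*nine_channel.shape[:2], 3, 3).mean(axis=2)
```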

Findings so far: B5 and B6 are better than V2-Large, and training on more data is better. Testing at the same image size as the training set gives higher accuracy than using the test image size described in the paper. Training on the 3 images (9 channels) with different lighting is better than any single lighting.

For full-precision results: readme details

Exploratory Data Analysis (EDA) on the models: readme eda

Final results: MSE loss, accuracy (Acc), and percentage agreement (PA)

Accuracy (%):

| light / CNN | B4 | B5 | B6 | Medium | Large | Mean |
|---|---|---|---|---|---|---|
| min | 72.8 | 74.4 | 73.4 | 74.0 | 72.0 | 73.3 |
| middle | 71.5 | 73.4 | 74.4 | 72.4 | 72.8 | 72.9 |
| max | 70.9 | 73.2 | 71.5 | 71.3 | 72.4 | 71.9 |
| 9 channels | - | - | - | 74.0 | 72.2 | 73.1 |
| Mean | 71.7 | 73.7 | 73.1 | 72.9 | 72.4 | 72.7 |

MSE:

| light / CNN | B4 | B5 | B6 | Medium | Large | Mean |
|---|---|---|---|---|---|---|
| min | .277 | .277 | .272 | .273 | .280 | .276 |
| middle | .285 | .273 | .262 | .278 | .275 | .275 |
| max | .291 | .359 | .305 | .289 | .286 | .306 |
| 9 channels | - | - | - | .273 | .271 | .272 |
| Mean | .284 | .303 | .280 | .278 | .278 | .284 |

Percentage agreement (%):

| light / CNN | B4 | B5 | B6 | Medium | Large | Mean |
|---|---|---|---|---|---|---|
| min | 89.5 | 89.3 | 88.2 | 89.7 | 89.9 | 89.3 |
| middle | 88.2 | 89.5 | 90.9 | 91.1 | 87.8 | 89.5 |
| max | 87.6 | 90.5 | 88.0 | 89.5 | 90.3 | 89.2 |
| 9 channels | - | - | - | 91.3 | 91.1 | 91.2 |
| Mean | 88.1 | 89.8 | 89.0 | 90.4 | 89.8 | 89.6 |

Accuracy of each network: the accuracy of the ensemble mean is higher than the mean accuracy of the 10 folds.
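The claim above can be checked with the simple averaging scheme sketched below, assuming each fold model's continuous age predictions on the shared test set are available as arrays. Rounding to the nearest year gives accuracy; treating percentage agreement as agreement within ±1 year is my assumption, not something stated in the repository.

```python
import numpy as np

def accuracy(pred, y):
    """Exact agreement after rounding the regression output to whole years."""
    return float(np.mean(np.round(pred) == y))

def percentage_agreement(pred, y, tol=1):
    """Agreement within +/- tol years (assumed definition of PA)."""
    return float(np.mean(np.abs(np.round(pred) - y) <= tol))

def ensemble_report(fold_preds, y_test):
    """fold_preds: list of per-fold continuous age predictions on the same test set."""
    per_fold_acc = [accuracy(p, y_test) for p in fold_preds]
    ensemble_pred = np.mean(fold_preds, axis=0)  # average the 10 regressors' outputs
    return {
        "mean of fold accuracies": float(np.mean(per_fold_acc)),
        "accuracy of ensemble mean": accuracy(ensemble_pred, y_test),
        "PA of ensemble mean": percentage_agreement(ensemble_pred, y_test),
    }
```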

Two-way ANOVA with interaction plot on the 10-fold ensemble averages (which are higher than the simple fold averages):

```
                Df Sum Sq Mean Sq F value Pr(>F)
dataf$network    4  6.409  1.6023   2.369 0.1391
dataf$exposure   2  5.649  2.8247   4.176 0.0573 .
Residuals        8  5.411  0.6763
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
```
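The output above comes from R's `aov`. A roughly equivalent additive two-way ANOVA can be run in Python with statsmodels, as sketched below. For illustration the sketch uses the accuracy values from the table above; the original analysis was run on the 10-fold ensemble averages, so the resulting statistics need not match exactly.

```python
import pandas as pd
import statsmodels.api as sm
from statsmodels.formula.api import ols

# Accuracy per (network, exposure) cell, taken from the accuracy table above.
cells = pd.DataFrame({
    "network":  ["B4", "B5", "B6", "Medium", "Large"] * 3,
    "exposure": ["min"] * 5 + ["middle"] * 5 + ["max"] * 5,
    "acc": [72.8, 74.4, 73.4, 74.0, 72.0,
            71.5, 73.4, 74.4, 72.4, 72.8,
            70.9, 73.2, 71.5, 71.3, 72.4],
})

# Additive two-way ANOVA (no interaction term: one observation per cell).
model = ols("acc ~ C(network) + C(exposure)", data=cells).fit()
print(sm.stats.anova_lm(model, typ=2))
```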

Summary of the best results from training on cod otoliths, compared to other projects:

| Species | Predict | valid. loss | MSE | MAPE | ACC | MCC | # trained | activation |
|---|---|---|---|---|---|---|---|---|
| Greenland Halibut (1) | age | x | 2.65 | 0.124 | 0.262 | x | 8875 | linear |
| Salmon | sea age | x | 0.239 | 0.141 | 0.822 | x | 9073 | linear |
| Salmon B4 | river age | 0.359 | 0.359 | 19.58 | 0.618 | x | 6246 | linear |
| Cod B5 | age | 0.277 | 0.8796 | - | 0.744 | x | 5150 | linear |

5-fold training - training set: 68%, validation set: 17%, test set 15%

| NN-config | fold 1 (MSE, Acc) | fold 2 | fold 3 | fold 4 | fold 5 | mean MSE | mean ACC | dataset size |
|---|---|---|---|---|---|---|---|---|
| B4 | 0.486, 0.649 | 0.469, 0.670 | 0.482, 0.663 | 0.488, 0.649 | 0.473, 0.658 | 0.422 | 0.697 | 5150 |
| B4, StandardScaler on target, StratifiedKFold | 0.490, 0.644 | 0.535, 0.623 | 0.497, 0.661 | 0.457, 0.698 | 0.513, 0.651 | 0.426 | 0.701 | 5150 |
| B4, StandardScaler on target, StratifiedKFold, pretraining on salmon scales (20 epochs) | 0.469, 0.663 | 0.469, 0.685 | 0.513, 0.651 | 0.474, 0.681 | 0.479, 0.650 | 0.433 | 0.689 | 5150 |
| B5, StandardScaler on target, StratifiedKFold | 0.435, 0.667 | 0.447, 0.683 | 0.451, 0.677 | 0.431, 0.675 | 0.441, 0.692 | 0.401 | 0.707 | 5150 |
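Several configurations above standardize the regression target and stratify the folds on the integer age. A minimal sketch of that setup with scikit-learn is shown below, assuming an `ages` array of integer labels; the placeholder data and the commented training step stand in for the repository's actual pipeline.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.preprocessing import StandardScaler

# Placeholder ages; in the real setup these are the 5150 integer age labels (1-13).
ages = np.random.randint(1, 14, size=5150)

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)

for fold, (train_idx, val_idx) in enumerate(skf.split(np.zeros(len(ages)), ages)):
    # Standardize the regression target on the training fold only, to avoid leakage.
    scaler = StandardScaler()
    y_train = scaler.fit_transform(ages[train_idx].reshape(-1, 1).astype(np.float32))
    y_val = scaler.transform(ages[val_idx].reshape(-1, 1).astype(np.float32))

    # ... train the CNN on the training images with y_train, validate with y_val ...
    # Predictions are mapped back to years with scaler.inverse_transform(...)
    # before computing MSE and accuracy.
```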

10-fold training, training set: 81%, validation set: 9%, test set: 10% (515 images) - no augmentation, on EfficientNetV2

| NN-config (MSE, Acc) | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | mean MSE | mean ACC | dataset size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| B4 with B5 img size, StandardScaler on target, StratifiedKFold | 0.320, 0.699 | 0.318, 0.689 | 0.306, 0.687 | 0.313, 0.683 | 0.322, 0.689 | 0.314, 0.701 | 0.315, 0.697 | 0.316, 0.668 | 0.306, 0.689 | 0.302, 0.724 | 0.277 | 0.728 | 5150 |
| B4 MLP(256,32,1) middle | 0.344, 0.685 | 0.328, 0.693 | 0.316, 0.730 | 0.334, 0.685 | 0.326, 0.678 | 0.320, 0.682 | 0.355, 0.672 | 0.326, 0.672 | 0.313, 0.683 | 0.325, 0.695 | 0.285 | 0.715 | 5150 |
| B4 MLP(256,32,1) max | 0.340, 0.641 | 0.317, 0.682 | 0.318, 0.672 | 0.347, 0.662 | 0.336, 0.678 | 0.336, 0.695 | 0.336, 0.672 | 0.320, 0.693 | 0.354, 0.662 | 0.336, 0.652 | 0.291 | 0.709 | 5150 |
| B5, StandardScaler on target, StratifiedKFold | 0.324, 0.718 | 0.322, 0.691 | 0.325, 0.693 | 0.336, 0.668 | 0.291, 0.736 | 0.314, 0.707 | 0.320, 0.662 | 0.331, 0.683 | 0.3298, 0.695 | 0.317, 0.687 | 0.277 | 0.744 | 5150 |
| B5, StandardScaler on target, StratifiedKFold, middle | 0.308, 0.703 | 0.286, 0.720 | 0.315, 0.678 | 0.349, 0.666 | 0.332, 0.674 | 0.310, 0.699 | 0.280, 0.718 | 0.275, 0.715 | 0.331, 0.682 | 0.288, 0.722 | 0.273 | 0.734 | 5150 |
| B6, StandardScaler on target, StratifiedKFold, min | 0.325, 0.683 | 0.329, 0.685 | 0.334, 0.664 | 0.293, 0.724 | 0.312, 0.707 | 0.290, 0.709 | 0.320, 0.693 | 0.306, 0.693 | 0.276, 0.720 | 0.300, 0.689 | 0.272 | 0.734 | 5150 |
| B6, StandardScaler on target, StratifiedKFold, middle | 0.323, 0.685 | 0.301, 0.699 | 0.312, 0.676 | 0.268, 0.736 | 0.294, 0.728 | 0.266, 0.720 | 0.309, 0.680 | 0.311, 0.693 | 0.278, 0.720 | 0.289, 0.711 | 0.262 | 0.744 | 5150 |
| B6, StandardScaler on target, StratifiedKFold, max | 0.435, 0.705 | 0.306, 0.682 | 0.306, 0.652 | 0.270, 0.732 | 0.390, 0.691 | 0.321, 0.678 | 0.411, 0.680 | 0.321, 0.680 | 0.294, 0.728 | 0.448, 0.685 | 0.305 | 0.715 | 5150 |
| EfficientNetV2-m baseline | 0.436, 0.586 | 0.329, 0.676 | 0.336, 0.678 | 0.374, 0.637 | 0.392, 0.625 | 0.361, 0.654 | 0.344, 0.660 | 0.375, 0.639 | 0.322, 0.658 | 0.328, 0.666 | 0.331 | 0.670 | 5150 |
| EfficientNetV2-l baseline | 0.363, 0.65 | 0.360, 0.652 | 0.435, 0.641 | 0.344, 0.670 | 0.381, 0.631 | 0.352, 0.664 | 0.377, 0.648 | 0.355, 0.658 | 0.339, 0.656 | 0.350, 0.658 | 0.348 | 0.676 | 5150 |
| EfficientNetV2-m exposure="middle" | 0.397, 0.608 | 0.374, 0.652 | 0.356, 0.660 | 0.384, 0.627 | 0.350, 0.654 | 0.337, 0.668 | 0.326, 0.658 | 0.365, 0.621 | 0.353, 0.664 | 0.335, 0.664 | 0.336 | 0.643 | 5150 |
| EfficientNetV2-m exposure="max" | 0.455, 0.588 | 0.369, 0.652 | 0.412, 0.610 | 0.351, 0.645 | 0.343, 0.680 | 0.413, 0.604 | 0.358, 0.658 | 0.365, 0.649 | 0.441, 0.581 | 0.354, 0.654 | 0.360 | 0.652 | 5150 |
| EfficientNetV2-m exposure="max" without mixed precision (amp.GradScaler()) | 0.456, 0.579 | 0.396, 0.639 | 0.387, 0.631 | 0.372, 0.643 | 0.395, 0.635 | 0.381, 0.631 | 0.369, 0.635 | 0.447, 0.579 | 0.433, 0.610 | 0.3631, 0.633 | 0.383 | 0.627 | 5150 |
| EfficientNetV2-l MLP(256,32,1) | 0.363, 0.664 | 0.378, 0.654 | 0.405, 0.662 | 0.342, 0.660 | 0.393, 0.654 | 0.370, 0.668 | 0.446, 0.639 | 0.344, 0.668 | 0.333, 0.666 | 0.363, 0.656 | 0.358 | 0.662 | 5150 |
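Several rows above replace the default classifier with an MLP(256, 32, 1) regression head on an EfficientNet backbone. The sketch below shows one way to build that with timm; the backbone name `tf_efficientnet_b4`, the input size, and the dummy batch are illustrative assumptions, not the repository's exact configuration.

```python
import timm
import torch
import torch.nn as nn

class OtolithRegressor(nn.Module):
    """EfficientNet backbone with an MLP(256, 32, 1) head for age regression."""

    def __init__(self, backbone="tf_efficientnet_b4", in_chans=3, pretrained=False):
        super().__init__()
        # num_classes=0 makes timm return pooled features instead of class logits;
        # in_chans=9 would accept the stacked three-exposure input.
        self.backbone = timm.create_model(
            backbone, pretrained=pretrained, num_classes=0, in_chans=in_chans
        )
        self.head = nn.Sequential(
            nn.Linear(self.backbone.num_features, 256), nn.ReLU(),
            nn.Linear(256, 32), nn.ReLU(),
            nn.Linear(32, 1),  # linear output unit, matching the summary table above
        )

    def forward(self, x):
        return self.head(self.backbone(x)).squeeze(1)

model = OtolithRegressor()
criterion = nn.MSELoss()
pred = model(torch.randn(2, 3, 380, 380))          # dummy batch at B4's native 380x380 size
loss = criterion(pred, torch.tensor([3.0, 7.0]))   # ages as regression targets
```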

10-fold training - test set 10%, on EfficientNetV2 with albumentations (-90, 90) rotation

| NN-config (val_mse, val_acc), (MSE, Acc) | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | mean MSE | mean ACC | dataset size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| EfficientNetV2-m exposure="max" | 0.371, 0.662 | 0.456, 0.623 | 0.355, 0.645 | 0.405, 0.614 | 0.886, 0.441 | 0.481, 0.623 | 0.370, 0.654 | 0.459, 0.633 | 0.803, 0.521 | 0.595, 0.6 | 0.381 | 0.658 | 5150 |
| EfficientNetV2-m exposure="max" MLE savepoints | 0.390, 0.635 | 0.398, 0.619 | 0.346, 0.650 | 0.392, 0.647 | 0.394, 0.619 | 0.365, 0.662 | 0.329, 0.672 | 0.459, 0.581 | 0.448, 0.614 | 0.381, 0.645 | 0.402 | 0.650 | 5150 |
| EfficientNetV2-m MLP(256,32,1) exposure="middle" MLE savepoints** | 0.321, 0.687 | 0.377, 0.676 | 0.332, 0.683 | 0.285, 0.711 | 0.285, 0.701 | 0.325, 0.705 | 0.311, 0.699 | 0.348, 0.683 | 0.295, 0.699 | 0.373, 0.660 | 0.292 | 0.724 | 5150 |
| EfficientNetV2-l MLP(256,32,1) MLE savepoints, same test image size, middle_old | 0.301, 0.697 | 0.281, 0.734 | 0.299, 0.691 | 0.318, 0.670 | 0.282, 0.718 | 0.305, 0.699 | 0.280, 0.726 | 0.334, 0.682 | 0.300, 0.705 | 0.310, 0.703 | 0.280 | 0.718 | 5150 |
| EfficientNetV2-l MLP(256,32,1) max, mse savepoints, max | 0.322, 0.711 | 0.295, 0.701 | 0.324, 0.699 | 0.353, 0.742 | 0.295, 0.728 | 0.306, 0.711 | 0.271, 0.722 | 0.292, 0.711 | 0.380, 0.711 | 0.299, 0.701 | 0.286 | 0.724 | 5150 |
| EfficientNetV2-l MLP(256,32,1) middle, mse savepoints, middle | 0.300, 0.687 | 0.332, 0.680 | 0.320, 0.697 | 0.300, 0.718 | 0.272, 0.711 | 0.302, 0.711 | 0.294, 0.697 | 0.285, 0.705 | 0.307, 0.711 | 0.285, 0.720 | 0.275 | 0.728 | 5150 |
| EfficientNetV2-l MLP(256,32,1) Reload weights, middle | 0.322, 0.666 | 0.3455, 0.636 | 0.428, 0.596 | - | - | - | - | - | - | - | - | - | - |
| EfficientNetV2-l MLP(256,32,1) Reload weights, test_img size 480x480, middle | 0.336, 0.656 | 0.331, 0.645 | 0.324, 0.648 | - | - | - | - | - | - | - | - | - | - |
| EfficientNetV2-l MLP(256,32,1) 9 channels, mse savepoints, test_img=384 | 0.292, 0.709 | 0.289, 0.707 | 0.289, 0.705 | 0.326, 0.707 | 0.307, 0.715 | 0.327, 0.693 | 0.283, 0.707 | 0.29997, 0.718 | 0.335, 0.697 | 0.295, 0.709 | 0.281 | 0.717 | 5150 |
| EfficientNetV2-m MLP(256,32,1) 9 channels, mse savepoints, test_img=384 | 0.289, 0.717 | 0.299, 0.707 | 0.303, 0.693 | 0.284, 0.713 | 0.292, 0.718 | 0.287, 0.718 | 0.303, 0.713 | 0.288, 0.717 | 0.289, 0.711 | 0.294, 0.707 | 0.273 | 0.740 | 5150 |
| EfficientNetV2-m MLP(256,32,1) max, mse savepoints, test_img=384, 150 epochs | 0.305, 0.689 | 0.413, 0.625 | 0.319, 0.668 | 0.327, 0.705 | 0.310, 0.689 | 0.284, 0.709 | 0.309, 0.693 | 0.315, 0.707 | 0.302, 0.697 | 0.287, 0.726 | 0.290 | 0.711 | 5150 |
| EfficientNetV2-m MLP(256,32,1) max, mse savepoints, test_img=384, 450 epochs | 0.337, 0.689 | 0.297, 0.701 | 0.302, 0.703 | 0.291, 0.713 | 0.315, 0.707 | 0.347, 0.685 | 0.338, 0.697 | 0.321, 0.680 | 0.313, 0.691 | 0.283, 0.718 | 0.289 | 0.713 | 5150 |
| EfficientNetV2-m MLP(256,32,1) min, mse savepoints, test_img=384 | 0.292, 0.7107 | 0.292, 0.711 | 0.294, 0.695 | 0.275, 0.734 | 0.298, 0.718 | 0.304, 0.709 | 0.304, 0.709 | 0.331, 0.697 | 0.3068, 0.701 | 0.295, 0.715 | 0.273 | 0.740 | 5150 |
| RexNet | 0.388, 0.616 | 0.446, 0.561 | 0.379, 0.61 | - | - | - | - | - | - | - | - | - | 5150 |
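The runs in this table use an albumentations rotation in (-90, 90) degrees, and one row in the previous table compares training with and without mixed precision (amp.GradScaler()). Below is a minimal sketch of both pieces, assuming a PyTorch model like the one sketched earlier; the transform pipeline and training step are illustrative, not the repository's exact code.

```python
import albumentations as A
from albumentations.pytorch import ToTensorV2
import torch
from torch.cuda import amp

# Training-time augmentation: random rotation drawn from (-90, 90) degrees.
train_transform = A.Compose([
    A.Rotate(limit=90, p=0.5),   # limit=90 -> angle sampled from (-90, 90)
    A.Normalize(),               # ImageNet mean/std by default
    ToTensorV2(),
])
# Usage on an HxWx3 uint8 numpy image: tensor = train_transform(image=img)["image"]

def train_step(model, optimizer, scaler, images, targets, device="cuda"):
    """One mixed-precision step; the 'without mixed precision' row skips autocast/GradScaler."""
    model.train()
    images, targets = images.to(device), targets.to(device)
    optimizer.zero_grad()
    with amp.autocast():
        loss = torch.nn.functional.mse_loss(model(images), targets)
    scaler.scale(loss).backward()
    scaler.step(optimizer)
    scaler.update()
    return loss.item()

# scaler = amp.GradScaler()  # created once, reused across steps
```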

Age distribution of the data set of 5150 images

{1: 382, 2: 522, 3: 509, 4: 624, 5: 805, 6: 540, 7: 544, 8: 477, 9: 327, 10: 217, 11: 122, 12: 55, 13: 26}

Figure 2: Age distribution of cod otoliths.
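For reference, the class counts above can be turned into a bar chart like Figure 2 with a few lines of matplotlib; this is just a plotting convenience, not part of the repository.

```python
import matplotlib.pyplot as plt

age_counts = {1: 382, 2: 522, 3: 509, 4: 624, 5: 805, 6: 540, 7: 544,
              8: 477, 9: 327, 10: 217, 11: 122, 12: 55, 13: 26}

plt.bar(list(age_counts.keys()), list(age_counts.values()))
plt.xlabel("Age (years)")
plt.ylabel("Number of otolith images")
plt.title("Age distribution of the 5150-image data set")
plt.show()
```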

Test-set age distribution (515 images)

{1: 41, 2: 59, 3: 52, 4: 60, 5: 90, 6: 52, 7: 55, 8: 47, 9: 23, 10: 19, 11: 13, 12: 2, 13: 2}

Figure 3: Test-set age distribution of cod otoliths.