Skip to content

thim0o/CoinDetector

Repository files navigation

A VGG16-based coins detector

Authors: Khanh Vu, Thimo Blom, Hannah Stone.
Institute: Vrije Universiteit Amsterdam.

Description

1. Images acquisition (tools: OpenCV, Python)

  1. Capture raw images from the webcam.
  2. Convert RGB to HSV color space.
    HSV
  3. Apply median blur in order to cancel noises.
  4. Apply image thresholding & background subtraction to capture existing coins.
    paramsthresholding
  5. Make bounding boxes and crop out coins.
    extracted_coins

2. Convolutional neural network - The VGG16 (Pre-trained model)

From https://neurohive.io/en/popular-networks/vgg16/

VGG16 is a convolutional neural network model proposed by K. Simonyan and A. Zisserman from the University of Oxford in the paper “Very Deep Convolutional Networks for Large-Scale Image Recognition”. The model achieves 92.7% top-5 test accuracy in ImageNet, which is a dataset of over 14 million images belonging to 1000 classes. It was one of the famous model submitted to ILSVRC-2014. It makes the improvement over AlexNet by replacing large kernel-sized filters (11 and 5 in the first and second convolutional layer, respectively) with multiple 3×3 kernel-sized filters one after another.
VGG16

We made few adjustions to the existing VGG16 model from Keras:

  1. We kept only the convolutional layers so we removed the last 4 layers (flatten + 2 x 4096 fully-connected + 1000 classes prediction layers).
  2. Added some fully-connected layers together with a batch_normalization layer and a dropout layer. model

3. Model training (tools: Google Colab)

Open In Colab

  • Dataset:

    • Total images: 2923 (~500 images per class).
    • 6 classes: '10cent', '1euro', '20cent', '2euro', '50cent', '5cent'.
    • Train/Validation ratio: 7:3.
    • Data augmentation: Rotation and Flip.
  • Model:

    • Optimizer: Adam - Learning rate = 0.0001.
    • Loss function: 'categorical_crossentropy'.
    • Transfer learning: We froze all the layers except for the last 7 hidden layers to accelerate the training process.
    • Metrics tracker: Tensorboard.

4. Deploy the model and classify coins (locally, tools: Jupyter notebook)

We downloaded the model params (model.h5) and ran it in our machine - source code.
demo

Evaluation

See our project report for more details.