Kervolutional Neural Networks

A TensorFlow implementation of Kervolutional Neural Networks (KNN).

Introduction

The paper introduces kervolution (kernel convolution), an alternative to the usual convolution operator in CNNs. The key idea is to use non-linear kernels to extract more complex features without adding any parameters.

Using kernels as a source of non-linearity is more effective than using activation functions and max pooling operations (see figure below).
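As a rough illustration of the idea, the NumPy sketch below compares an ordinary convolution with a polynomial kervolution on a single-channel image. It assumes the polynomial kernel has the form (x^T w + cp)^dp, where x is an image patch and w the filter. The helper names (`extract_patches`, `conv2d_valid`, `polynomial_kerv2d`) and the values cp=1.0 and dp=3 are illustrative assumptions, not code from this repository.

```python
import numpy as np

def extract_patches(image, k):
    """Collect all k x k patches of a 2-D image as flat rows (stride 1, no padding)."""
    h, w = image.shape
    rows = []
    for i in range(h - k + 1):
        for j in range(w - k + 1):
            rows.append(image[i:i + k, j:j + k].ravel())
    return np.array(rows), (h - k + 1, w - k + 1)

def conv2d_valid(image, weight):
    """Linear case: inner product <patch, weight> at every position."""
    patches, out_shape = extract_patches(image, weight.shape[0])
    return (patches @ weight.ravel()).reshape(out_shape)

def polynomial_kerv2d(image, weight, cp=1.0, dp=3):
    """Polynomial kervolution (assumed form): (<patch, weight> + cp) ** dp."""
    return (conv2d_valid(image, weight) + cp) ** dp

image = np.arange(16, dtype=np.float64).reshape(4, 4)
weight = np.full((3, 3), 0.1)

linear = conv2d_valid(image, weight)
kerv = polynomial_kerv2d(image, weight, cp=1.0, dp=3)
print(linear.shape, kerv.shape)
```

Both functions use the same filter `weight`, so the non-linear response comes from the kernel alone; the only extra quantities are the kernel's own cp and dp.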

Implementation

This code was tested with TensorFlow 2.0 and Python 3.6. To install the dependencies:

pip install -r requirements.txt

To launch training with LeNet-5 on the MNIST dataset, as described in Section 4 of the paper:

python train_evaluate.py --lr 0.003 --batch_size 50 --epochs 20 --model_name lenetknn --kernel polynomial

The figures below show the test accuracy for the first epoch.

The initial value of the learnable parameter cp of the polynomial kernel matters for fast convergence. The curve in the figure below uses an initial value of 0.5.
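One way to make cp learnable with a chosen initial value is sketched below as a small Keras layer. This is a minimal sketch, not the layer in layers.py: the class name `PolynomialKerv2D`, its arguments, and the choice dp=3 are hypothetical, and it assumes the polynomial kernel can be computed as a standard convolution followed by (. + cp)^dp.

```python
import tensorflow as tf

class PolynomialKerv2D(tf.keras.layers.Layer):
    """Hypothetical polynomial kervolution layer with a learnable scalar cp."""

    def __init__(self, filters, kernel_size, dp=3, cp_init=0.5, **kwargs):
        super().__init__(**kwargs)
        self.filters = filters
        self.kernel_size = kernel_size
        self.dp = dp
        self.cp_init = cp_init

    def build(self, input_shape):
        in_channels = int(input_shape[-1])
        # Filter weights, shaped as tf.nn.conv2d expects (H, W, in, out).
        self.w = self.add_weight(
            name="w",
            shape=(self.kernel_size, self.kernel_size, in_channels, self.filters),
            initializer="glorot_uniform",
            trainable=True,
        )
        # Scalar cp, trained with the filters and started at cp_init.
        self.cp = self.add_weight(
            name="cp",
            shape=(),
            initializer=tf.keras.initializers.Constant(self.cp_init),
            trainable=True,
        )

    def call(self, inputs):
        # Linear response <patch, w>, then the polynomial kernel on top of it.
        linear = tf.nn.conv2d(inputs, self.w, strides=1, padding="VALID")
        shifted = linear + self.cp
        return tf.math.pow(shifted, tf.cast(self.dp, shifted.dtype))

layer = PolynomialKerv2D(filters=4, kernel_size=3, cp_init=0.5)
outputs = layer(tf.zeros((1, 8, 8, 1)))
print(outputs.shape)
```

Because cp is an ordinary trainable weight, the optimizer updates it together with the filters, and `cp_init` only sets where that search starts.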

To test the impact of non-linearity on performance, the activations are removed and max pooling is replaced by average pooling. These experiments use a lower learning rate (0.0001).

Licence

MIT
