AlexNet in TensorFlow

This repository contains a TensorFlow implementation of AlexNet, the winner of the ILSVRC-2012 competition.

The original model introduced in the paper was split across two separate GPUs because of the limited GPU memory available at the time. Since that limitation no longer applies to AlexNet on current GPUs, this repository merges the two GPU-specific halves into a single model, as illustrated in the sketch below.
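For illustration, here is how the merge plays out for the fourth convolutional layer: in the paper's two-GPU layout it is two groups of 192 kernels, each seeing only the 192 feature maps on its own GPU, while the merged version convolves all 384 maps at once. A minimal TensorFlow 1.x sketch (an illustration of the idea, not the repository's actual code; shapes follow the paper):

import tensorflow as tf

# Two-GPU (grouped) layout from the paper: each half convolves only
# the 192 feature maps that live on its own GPU.
x_gpu0 = tf.placeholder(tf.float32, [None, 13, 13, 192])
x_gpu1 = tf.placeholder(tf.float32, [None, 13, 13, 192])
half0 = tf.layers.conv2d(x_gpu0, 192, 3, padding='same', activation=tf.nn.relu)
half1 = tf.layers.conv2d(x_gpu1, 192, 3, padding='same', activation=tf.nn.relu)

# Merged single-GPU layout used in this repository: one convolution
# sees all 384 feature maps at once (384 kernels of 3 x 3 x 384).
x = tf.placeholder(tf.float32, [None, 13, 13, 384])
merged = tf.layers.conv2d(x, 384, 3, padding='same', activation=tf.nn.relu)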

(Figure: AlexNet architecture)

Required Packages

  • scikit-image
  • pickle (included in the Python standard library)
  • tqdm
  • numpy
  • tensorflow-gpu (>1.7)
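
The pip-installable dependencies can be set up with, for example:

pip install scikit-image tqdm numpy "tensorflow-gpu>1.7"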

Usage

  • From command line
    • Downloads the CIFAR-10 dataset, preprocesses it, and runs training on AlexNet. This produces a checkpoint file for performing inference later.
python alexnet.py
  • From source code
    • Same behaviour as the command-line usage.
import cifar10_utils
from alexnet import AlexNet

...
valid_set = (valid_features, valid_labels)
...

alexNet = AlexNet('cifar10', learning_rate=0.0001)
alexNet.train(epochs=20, 
              batch_size=128, 
              valid_set=valid_set, 
              save_model_path='./model')

Experiment on CIFAR-10 dataset

  • Environment

    • FloydHub GPU2 instance (1 x Tesla V100)
  • Approximate running time

    • 1 hour 45 mins
  • Hyperparameters

    • Learning rate: 0.00005
    • Epochs: 18
    • Batch size: 64
  • Test Accuracy: 0.6549 (about 65.5%)
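
With the API shown above, this experiment corresponds roughly to the following call (a sketch; valid_set is prepared as in the usage example):

alexNet = AlexNet('cifar10', learning_rate=0.00005)
alexNet.train(epochs=18,
              batch_size=64,
              valid_set=valid_set,
              save_model_path='./model')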

Experiment Result

Resources

  • alexnet.py : provides the AlexNet class implementation
  • cifar10_utils.py : provides helper functions to download and preprocess the CIFAR-10 dataset
  • AlexNet.pdf : my own summary of the paper, focused on implementation details
  • AlexNet.ipynb : experimental workflow code on the CIFAR-10 dataset
  • External checkpoint files
    • pre-trained checkpoint file on the CIFAR-10 dataset
    • Download Link
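
A checkpoint saved via save_model_path can be restored with TensorFlow's standard Saver machinery. A minimal sketch, assuming the checkpoint was written to ./model as in the usage example (the exact tensor names depend on the saved graph):

import tensorflow as tf

with tf.Session() as sess:
    # Rebuild the graph from the .meta file, then restore the weights.
    saver = tf.train.import_meta_graph('./model.meta')
    saver.restore(sess, './model')
    graph = tf.get_default_graph()
    # Look up the input/output tensors by name before running inference.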

Overall Architecture

1. Input Layer of Image Size (224 x 224 x 3)

2. Convolutional Layer (96 x (11 x 11 x 3)) + stride size of 4

  • Bias with constant value of 1
  • ReLU Activation
  • Local Response Normalization
  • Max Pooling (Overlapping Pooling)

3. Convolutional Layer (256 x (5 x 5 x 48))

  • ReLU Activation
  • Local Response Normalization
  • Max Pooling (Overlapping Pooling)

4. Convolutional Layer (384 x (3 x 3 x 256))

  • Bias with constant value of 1

5. Convolutional Layer (384 x (3 x 3 x 192))

  • Bias with constant value of 1

6. Convolutional Layer (256 x (3 x 3 x 192))

  • Max Pooling (Overlapping Pooling)

7. Fully Connected Layer (4096)

  • Bias with constant value of 1
  • Dropout

8. Fully Connected Layer (4096)

  • Bias with constant value of 1
  • Dropout

9. Fully Connected Layer (1000)
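
Putting the stack together, here is a minimal TensorFlow 1.x sketch of the merged single-GPU version (an illustration, not the repository's actual code; the LRN constants k=2, n=5, alpha=1e-4, beta=0.75 and the 3x3, stride-2 overlapping pooling follow the paper):

import tensorflow as tf

def alexnet_merged(images, keep_prob, num_classes=1000):
    # images: [batch, 224, 224, 3]
    ones = tf.ones_initializer()
    # 2. Conv 96 x (11 x 11 x 3), stride 4 + ReLU + LRN + overlapping max pool
    net = tf.layers.conv2d(images, 96, 11, strides=4,
                           activation=tf.nn.relu, bias_initializer=ones)
    net = tf.nn.local_response_normalization(net, depth_radius=2, bias=2.0,
                                             alpha=1e-4, beta=0.75)
    net = tf.layers.max_pooling2d(net, pool_size=3, strides=2)
    # 3. Conv 256 x (5 x 5) + ReLU + LRN + overlapping max pool
    net = tf.layers.conv2d(net, 256, 5, padding='same', activation=tf.nn.relu)
    net = tf.nn.local_response_normalization(net, depth_radius=2, bias=2.0,
                                             alpha=1e-4, beta=0.75)
    net = tf.layers.max_pooling2d(net, pool_size=3, strides=2)
    # 4.-5. Conv 384 x (3 x 3), biases initialized to the constant 1
    net = tf.layers.conv2d(net, 384, 3, padding='same',
                           activation=tf.nn.relu, bias_initializer=ones)
    net = tf.layers.conv2d(net, 384, 3, padding='same',
                           activation=tf.nn.relu, bias_initializer=ones)
    # 6. Conv 256 x (3 x 3) + overlapping max pool
    net = tf.layers.conv2d(net, 256, 3, padding='same', activation=tf.nn.relu)
    net = tf.layers.max_pooling2d(net, pool_size=3, strides=2)
    # 7.-8. Two fully connected layers of 4096 units, each with dropout
    net = tf.layers.flatten(net)
    net = tf.layers.dense(net, 4096, activation=tf.nn.relu,
                          bias_initializer=ones)
    net = tf.nn.dropout(net, keep_prob)
    net = tf.layers.dense(net, 4096, activation=tf.nn.relu,
                          bias_initializer=ones)
    net = tf.nn.dropout(net, keep_prob)
    # 9. Output logits (1000-way for ImageNet; 10 for CIFAR-10)
    return tf.layers.dense(net, num_classes)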

Training

  • Optimizer (in this implementation) : AdamOptimizer (the original paper used SGD with momentum)
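
In TensorFlow 1.x the corresponding training step looks roughly like this (a sketch; in the repository the logits come from the AlexNet model, and the learning rate matches the usage example):

import tensorflow as tf

# Stand-ins for the network output and the one-hot CIFAR-10 targets.
features = tf.placeholder(tf.float32, [None, 4096])
labels = tf.placeholder(tf.float32, [None, 10])
logits = tf.layers.dense(features, 10)

# Cross-entropy loss, minimized with Adam.
loss = tf.reduce_mean(
    tf.nn.softmax_cross_entropy_with_logits_v2(labels=labels, logits=logits))
train_op = tf.train.AdamOptimizer(learning_rate=0.0001).minimize(loss)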

References

  • Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet Classification with Deep Convolutional Neural Networks. NIPS 2012.