# <font color='red'>Warning: fastai - v1 is used to train the model</font>

## Image classification with Convolutional Neural Networks

Welcome to the first week of the second deep learning certificate! We're going to use convolutional neural networks (CNNs) to allow our computer to see - something that is only possible thanks to deep learning.

## Introduction to our first task: 'Dogs vs Cats'

We're going to try to create a model to enter the Dogs vs Cats competition on Kaggle. There are 25,000 labelled dog and cat photos available for training, and 12,500 in the test set that we have to try to label for this competition. According to the Kaggle website, when this competition was launched (end of 2013): "State of the art: The current literature suggests machine classifiers can score above 80% accuracy on this task". So if we can beat 80%, then we will be at the cutting edge as of 2013!

In [1]:
# Put these at the top of every notebook, to get automatic reloading and inline plotting
%reload_ext autoreload
%autoreload 2
%matplotlib inline

Here we import the libraries we need. We'll learn about what each does during the course.

In [2]:
# This file contains all the main external libs we'll use
from fastai import *
from fastai.vision import *

`PATH` is the path to your data - if you use the recommended setup approaches from the lesson, you won't need to change this. `size` is the size (in pixels) that the images will be resized to in order to ensure that the training runs quickly. We'll be talking about this parameter a lot during the course. Leave it at `224` for now. `bs` is the batch size, i.e. the number of images processed in each training step.

In [3]:
PATH = "data/"
size=224
bs=64

It's important that you have a working NVIDIA GPU set up. The programming framework used behind the scenes to work with NVIDIA GPUs is called CUDA. Therefore, you need to ensure the following line returns `True` before you proceed. If you have problems with this, please check the FAQ and ask for help on [the forums](http://forums.fast.ai).

In [4]:
torch.cuda.is_available()

True

In [5]:
# Select the GPU with index 1 (the second GPU); use 0 on a single-GPU machine
torch.cuda.set_device(1)

In [6]:
torch.cuda.current_device()

1

In addition, NVIDIA provides special accelerated functions for deep learning in a package called cuDNN. Although not strictly necessary, it will improve training performance significantly, and is included by default in all supported fastai configurations. Therefore, if the following does not return `True`, you may want to look into why.

In [7]:
torch.backends.cudnn.enabled

True

## Our first model: quick start

We're going to use a <b>pre-trained</b> model, that is, a model created by someone else to solve a different problem. Instead of building a model from scratch to solve a similar problem, we'll use a model trained on ImageNet (1.2 million images and 1000 classes) as a starting point. The model is a Convolutional Neural Network (CNN), a type of Neural Network that builds state-of-the-art models for computer vision. We'll be learning all about CNNs during this course.

We will be using the <b>resnet34</b> model. resnet34 is a version of the model that won the 2015 ImageNet competition. Here is more info on [resnet models](https://github.com/KaimingHe/deep-residual-networks). We'll be studying them in depth later, but for now we'll focus on using them effectively.

Here's how to train and evaluate a *dogs vs cats* model in 3 lines of code, and under 20 seconds:

In [8]:
path = untar_data(PATH);
final_path = path/'dogscats/'
final_path.ls()

[PosixPath('/home/zhossain/myProjects/deeplearning/fastai-unofficial/data/dogscats/tmp'),
 PosixPath('/home/zhossain/myProjects/deeplearning/fastai-unofficial/data/dogscats/valid'),
 PosixPath('/home/zhossain/myProjects/deeplearning/fastai-unofficial/data/dogscats/train'),
 PosixPath('/home/zhossain/myProjects/deeplearning/fastai-unofficial/data/dogscats/sample'),
 PosixPath('/home/zhossain/myProjects/deeplearning/fastai-unofficial/data/dogscats/test1'),
 PosixPath('/home/zhossain/myProjects/deeplearning/fastai-unofficial/data/dogscats/models')]

In [9]:
(final_path/'train').ls()

[PosixPath('/home/zhossain/myProjects/deeplearning/fastai-unofficial/data/dogscats/train/cats'),
 PosixPath('/home/zhossain/myProjects/deeplearning/fastai-unofficial/data/dogscats/train/dogs')]
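
The listing above shows one sub-folder per class (`cats`, `dogs`) inside `train`. Before training, it can be useful to check how many images each class folder holds. The following is a minimal sketch using only the standard library; the helper name `count_images_per_class` and the set of file extensions are assumptions made for illustration, not part of fastai.

```python
from pathlib import Path

# Assumed set of image file extensions; adjust to match your dataset.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}

def count_images_per_class(split_dir):
    """Map each class sub-folder name to the number of image files it contains."""
    split_dir = Path(split_dir)
    counts = {}
    for class_dir in sorted(p for p in split_dir.iterdir() if p.is_dir()):
        counts[class_dir.name] = sum(
            1
            for f in class_dir.iterdir()
            if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
        )
    return counts
```

In this notebook it could be called as `count_images_per_class(final_path/'train')`; a large imbalance between the two counts would be worth knowing about before interpreting the accuracy metric.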

In [10]:
# Default data augmentation transforms, with random flipping disabled
tfms = get_transforms(do_flip=False)

In [17]:
# Load the images from the class-named folders, resized to `size`, in batches of `bs`
data = ImageDataBunch.from_folder(final_path, ds_tfms=tfms, size=size, bs=bs)

In [14]:
# Build a learner from the ImageNet pre-trained resnet34, reporting accuracy on the validation set
learn = create_cnn(data, models.resnet34, metrics=accuracy)

In [15]:
learn.fit_one_cycle(1)

epoch,train_loss,valid_loss,accuracy
1,0.056411,0.026544,0.991000

How good is this model? Well, as we mentioned, prior to this competition, the state of the art was 80% accuracy. But the competition resulted in a huge jump to 98.9% accuracy, with the author of a popular deep learning library winning the competition. Extraordinarily, less than 4 years later, we can now beat that result in seconds! Even last year in this same course, our initial model had 98.3% accuracy (a 1.7% error rate), which is nearly double the 0.9% error we're getting just a year later, and that took around 10 minutes to compute.
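
The "nearly double the error" comparison is easier to see when the two accuracy figures for this course (98.3% last year, 0.991 in the training output) are converted to error rates. This is a small arithmetic sketch; the helper `error_rate` is a hypothetical name introduced here for illustration.

```python
def error_rate(accuracy):
    """Return the fraction of misclassified examples for a given accuracy."""
    return 1.0 - accuracy

last_year = error_rate(0.983)  # initial model from the previous year's course
this_run = error_rate(0.991)   # accuracy column of the training output

print(f"last year: {last_year:.1%}, this run: {this_run:.1%}")
# prints "last year: 1.7%, this run: 0.9%"
print(f"ratio: {last_year / this_run:.2f}")
# prints "ratio: 1.89"
```

Comparing error rates rather than accuracies makes small differences near 100% visible: going from 98.3% to 99.1% accuracy removes almost half of the mistakes.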


## Analyzing results: looking at pictures

As well as looking at the overall metrics, it's also a good idea to look at examples of each of:
1. A few correct labels at random
2. A few incorrect labels at random
3. The most correct labels of each class (i.e. those with highest probability that are correct)
4. The most incorrect labels of each class (i.e. those with highest probability that are incorrect)
5. The most uncertain labels (i.e. those with probability closest to 0.5).
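
The five groups above can be selected from a list of predicted probabilities and true labels. The following is a minimal pure-Python sketch, not the fastai API: the function name `pick_examples`, the convention that `probs[i]` is the predicted probability of class 1 (say, dog) and `labels[i]` is 0 or 1, and the choice to group "per class" by the true label are all assumptions made for illustration.

```python
import random

def pick_examples(probs, labels, k=4, seed=0):
    """Return indices for the five groups of examples worth inspecting.

    probs[i]  : predicted probability that example i belongs to class 1
    labels[i] : true class of example i (0 or 1)
    """
    rng = random.Random(seed)
    preds = [1 if p > 0.5 else 0 for p in probs]
    indices = range(len(probs))
    correct = [i for i in indices if preds[i] == labels[i]]
    incorrect = [i for i in indices if preds[i] != labels[i]]

    def confidence(i):
        # Probability the model assigned to the class it predicted
        return probs[i] if preds[i] == 1 else 1.0 - probs[i]

    def top_by_class(candidates):
        # For each true class, the k candidates predicted with highest confidence
        return {
            c: sorted((i for i in candidates if labels[i] == c),
                      key=confidence, reverse=True)[:k]
            for c in (0, 1)
        }

    return {
        "random_correct": rng.sample(correct, min(k, len(correct))),
        "random_incorrect": rng.sample(incorrect, min(k, len(incorrect))),
        "most_correct": top_by_class(correct),
        "most_incorrect": top_by_class(incorrect),
        # Probabilities closest to 0.5 come first
        "most_uncertain": sorted(indices, key=lambda i: abs(probs[i] - 0.5))[:k],
    }
```

Each returned list holds positions into the validation set, so the corresponding images can then be displayed with whatever plotting helper is at hand.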