`scikit-learn`

Handwritten digit recognition with scikit-learn

`Installing scikit-learn`

Use pip virtualenv

pip install -U scikit-learn

Use conda

conda create -n sklearn-env -c conda-forge scikit-learn

conda activate sklearn-env

see the documentation for more information https://scikit-learn.org/stable/install.html

We trained a simple neural network to recognize the numbers in these images. This network will take 1D arrays of 8x8=64 values as input. We then converted these 2D images into 1D arrays

We start by loading the sample

from sklearn import datasets
digits = datasets.load_digits()

Then we print the first image

print(digits.images[0])

[[ 0.  0.  5. 13.  9.  1.  0.  0.]
 [ 0.  0. 13. 15. 10. 15.  5.  0.]
 [ 0.  3. 15.  2.  0. 11.  8.  0.]
 [ 0.  4. 12.  0.  0.  8.  8.  0.]
 [ 0.  5.  8.  0.  0.  9.  8.  0.]
 [ 0.  4. 11.  0.  1. 12.  7.  0.]
 [ 0.  2. 14.  5. 10. 12.  0.  0.]
 [ 0.  0.  6. 13. 10.  0.  0.  0.]]

Like all the images in the sample, this one is an 8x8 pixel image, black and white (a single color level per pixel). It can be displayed in the following way, also indicating the corresponding label (the number to which the image corresponds)

import matplotlib.pyplot as plt
plt.imshow(digits.images[0],cmap='binary')
plt.title(digits.target[0])
plt.axis('off')
plt.show()

We will train a simple neural network to recognize numbers in these images. This network will take 1D arrays of 8x8=64 values as input. So we need to convert our 2D images into 1D arrays

x = digits.images.reshape((len(digits.images), -1))

x contient toutes les images en version 1D

print(x[0])

[ 0.  0.  5. 13.  9.  1.  0.  0.  0.  0. 13. 15. 10. 15.  5.  0.  0.  3.
 15.  2.  0. 11.  8.  0.  0.  4. 12.  0.  0.  8.  8.  0.  0.  5.  8.  0.
  0.  9.  8.  0.  0.  4. 11.  0.  1. 12.  7.  0.  0.  2. 14.  5. 10. 12.
  0.  0.  0.  0.  6. 13. 10.  0.  0.  0.]

The network will act as a function allowing you to go from an array of 64 input values to an output value, its estimate of the figure. Here are the output values

y = digits.target
print(len(digits.images))

We will create a relatively simple neural network, with a single layer of 15 neurons.

from sklearn.neural_network import MLPClassifier

mlp = MLPClassifier(hidden_layer_sizes=(15,))

We will train this network on the first 1000 images of our sample, and reserve the following images to test the performance of the network

x_train = x[:1000]
y_train = y[:1000]
x_test = x[1000:]
y_test = y[1000:]

mlp.fit(x_train, y_train)

/home/samglish/.local/lib/python3.9/site-packages/sklearn/neural_network/_multilayer_perceptron.py:691: ConvergenceWarning: Stochastic Optimizer: Maximum iterations (200) reached and the optimization hasn't converged yet.

And There you go ! we can now look at what the network gives for the following images, which were not seen by the network during training

mlp.predict(x_test[:10])

array([1, 4, 0, 5, 3, 6, 9, 6, 1, 7])

y_test[:10]

array([1, 4, 0, 5, 3, 6, 9, 6, 1, 7])

For the first 10 test images, the estimates are excellent!

y_pred = mlp.predict(x_test)

Then search for the images for which the network made a mistake

error = (y_pred != y_test)

Here is the calculation of the error rate

import numpy as np
np.sum(error) / len(y_test)

0.09535759096612297

We can finally select the bad predictions to display them

x_error = x_test[error].reshape((-1, 8,8))
y_error = y_test[error]
y_pred_error = y_pred[error]
i = 1
plt.imshow(x_error[i],cmap='binary')
plt.title(f'cible: {y_error[i]}, prediction: {y_pred_error[i]}')
plt.axis('off')
plt.show()

As we can see, it is difficult to classify these images, even for a human For better performance, higher resolution images and a more complex neural network, such as a convolutional network, should be used.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
DigitRecognitionIA.ipynb		DigitRecognitionIA.ipynb
README.md		README.md
output.png		output.png
output1.png		output1.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

`scikit-learn`

`Installing scikit-learn`

We trained a simple neural network to recognize the numbers in these images. This network will take 1D arrays of 8x8=64 values as input. We then converted these 2D images into 1D arrays

We will create a relatively simple neural network, with a single layer of 15 neurons.

About

Releases

Packages

Languages

samglish/scikit-learn

Folders and files

Latest commit

History

Repository files navigation

scikit-learn

Installing scikit-learn

We trained a simple neural network to recognize the numbers in these images. This network will take 1D arrays of 8x8=64 values as input. We then converted these 2D images into 1D arrays

We will create a relatively simple neural network, with a single layer of 15 neurons.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

`scikit-learn`

`Installing scikit-learn`

Packages