Skip to content

Implementation of Visual transformer (ViT) on the MNIST dataset.

Notifications You must be signed in to change notification settings

PanithanS/Visual-Transformer-MNIST

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 

Repository files navigation

Visual transformer for MNIST handwriting digit classification

  • Here is a short explanation of how we can use a neural network based on the transformer architecture trained on a handwritten digit dataset to classify handwritten digits:
  1. We first break down the handwritten digit image into patches.
  2. We then pass the embedded patches to the transformer architecture.
  3. The transformer architecture learns long-range dependencies between the patches.
  4. The output of the transformer architecture is a vector that represents the handwritten digit image.
  5. We then use a classifier to predict the digit that the handwritten digit image represents.

image

Prediction output

image

About

Implementation of Visual transformer (ViT) on the MNIST dataset.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published