Skip to content

A beginner's attempt to understand and implement the Vision Transformer paper.

Notifications You must be signed in to change notification settings

kohlivrinda/Art-ViT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

Art-ViT

A beginner's attempt to understand and implement the Vision Transformer paper from scratch.

As of now the implemented ViT classifies Roccoco and Expressionalism art styles. The model has not been trained yet to its fullest capability due to limited computational resources avaialble at the moment.

Further plan of action is as follows:

  • to use a pre-trained ViT model from pytorch models and observe how well it performs.
  • run the same classification task on pretrained CNN models and compare results.

kaggle notebook: https://www.kaggle.com/code/vrindakohli/art-vit

dataset: https://www.kaggle.com/datasets/sivarazadi/wikiart-art-movementsstyles

Screenshot (226)

About

A beginner's attempt to understand and implement the Vision Transformer paper.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published