-
🏫 I'm currently pursuing B-Tech in Information Technology from VJTI Mumbai.
-
📫 How to reach me sneha.singh.31415@gmail.com
Highlights
- Pro
Pinned Loading
-
vision_transformers_from_scratch
vision_transformers_from_scratch PublicThis project aims to develop an image captioning model by leveraging the power of Vision Transformers (ViTs) as described in the 2020 paper "An Image is worth 16 x 16 words".
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.