Project_MLops_DTU

Fruit image classification using Vision Transformer

This repository contains the project work of our group for the DTU special course Machine Learning Operations for the Autumn semester 2023.

Group members:

Elena Muniz s213579
Theodoros Loukis s223526
Ioannis Louvis s222556
Ioannis Karampinis s222559

Overall goal

The goal of the project is to fine tune a deep learning model based on Vision Transformer (ViT) that classifies the quality of fruits by their image.

Framework

We plan to use the tranformer framework from Huggingface. Specifically, use the Vision Transformer based on the paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale.

How to include the framework

We want to use the Transformers framework that includes many pretrained models, which wil intend to use in order to transfer, learn and train our classification model in the dataset bellow.

Dataset

We plan to use the FRUIT CLASSIFICATION dataset from Kaggle. This is a dataset that contains a total of more than 14700 high quality fruit images of 6 different classes of fruits i.e. apple, banana, guava, lime, orange, and pomegranate. Our goal is to classify them to different classes based on their quality:

Good.
Bad.
Mixed.

Deep learning models

We expect to use the Vision Transformer (ViT) model, which is a deep learning model and is a transformer that is targeted at vision processing tasks such as image recognition. We might as well also try the BERT Pre-Training of Image Transformers (BEiT) and/or Data-efficient Image Transformers (DeiT) models, which are follow-up works on the original ViT model.

Name		Name	Last commit message	Last commit date
Latest commit History 185 Commits
.dvc		.dvc
.github/workflows		.github/workflows
data		data
docker		docker
docs		docs
models		models
notebooks		notebooks
reports		reports
src		src
.dvcignore		.dvcignore
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
cloudbuild_predict_gpu.yaml		cloudbuild_predict_gpu.yaml
cloudbuild_train_gpu.yml		cloudbuild_train_gpu.yml
config_cpu.yml		config_cpu.yml
config_gpu_cloud.yml		config_gpu_cloud.yml
dtumlops-406109-3703b69ca83d.json		dtumlops-406109-3703b69ca83d.json
dtumlops-406109-ac8fa5c6b516.json		dtumlops-406109-ac8fa5c6b516.json
fastapi_model.py		fastapi_model.py
latest		latest
main_copy.py		main_copy.py
profile_results.prof		profile_results.prof
requirements.txt		requirements.txt
requirements_gpu.txt		requirements_gpu.txt
requirements_tests.txt		requirements_tests.txt
setup.py		setup.py
test_environment.py		test_environment.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project_MLops_DTU

Fruit image classification using Vision Transformer

Overall goal

Framework

How to include the framework

Dataset

Deep learning models

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Project_MLops_DTU

Fruit image classification using Vision Transformer

Overall goal

Framework

How to include the framework

Dataset

Deep learning models

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages