Find Your Shoes with CLIP (On-going project)

This repository is about Find Your Shoes using CLIP(Contrastive Language-Image PreTraining) model which is from OpenAI.

We thought it would be a meaningful service if we could search for the shoes by image when we don't know the name of the shoes, and search again by changing some features in the shoes. So we developed the service by limiting the dataset to shoes.

For example, when text for different color info from original color and a user's shoes image were given as an input, the model finds the same kind of shoes in the given text color.

We developed this service inspired by Google's image search. We found that our service is very similar to NAVER OmniSearch but we developed this service because it could be challenging and fun to implement.

This Project is ongoing which is completed by 2023.1. This is to-do-list about our development. Our final presentation slide is completed!. This is link : slide link

(I need to refactorize codes. Current codes are messy.)

To do list for our project

Demo

Demo with Jordan 1 Retro High og Chicago (image : Jordan 1 Retro High og Chicago, text : I want same brand but color is different)
Demo with New Balance 574 (image : New Balance 574, text : I want similar shoes but brand is different)

Dataset

There were not existing shoes labeled dataset which include various features (e.g, brand, color, hightop, sole). And also, there were not stable crawler for crawling full size image from google. So we made our own Crawler using python, selenium. For filtering crawled dataset, we made crawling Rule for our dataset. We followed this rule for crawling and filtering images.

Crawling Rule

The number of shoes (one or two) doesn't matter.
Exclude photos which do not show the entire shoe appearance.
Exclude photos that cannot be identified whether they are high-top or low.
Exclude photos of shoes with heels only.
Crawling only for human-recognizable photos about the brand, color, high top, and sole features.
It doesn't matter if a person is wearing it.
The background doesn't matter.
It's good to have as many angles as possible.

We will not use datasets for commercial purposes and we are going to share dataset when it is collected.

Method

1. Prompt Ensemble

We basically experimented with the model over five steps. Let's assume that Shoes image is given.

Put the given shoe image and promptenseble together in the clip model to obtain each simularity score for brand (nike, adidas,...), color (red, blue, yellow,...), and heightop (low or high). For each type, a feature with a top 1 score is extracted and classified.
Filter the shoes that does not match the selected feature in the shoes table.
Calculate the similarity score by applying the prompt ensemble to the classified features (brand, color, hightop) and filtered shoe names.
Calculate the simiarity score by applying prompt ensemble to the entire shoe name.
Among the similarity scores calculated in 3 and 4, select the shoe type with the highest score.

Experiment with Large Dataset.
Inference without prompt learner

	Brand	Color	Hightop	Sole	Name
Top 1	89.75	59.23	94.69	14.71	44.48
Top 5	99.87	93.65	100	99.76	78.77

Inference with prompt learner

	Brand	Color	Hightop	Sole	Name
Top 1	97.47	95	98.43	99.25	93.19
Top 5	99.96	99.65	100	100	99.52

Quick Start

Download Dataset

(not implemented yet. it will be added.)

sh dataset.sh

Before run the model, You should generate table for shoes information.

(This require dataset. )

python3 csv_generator.py

Experiment for evaluating model

(This require dataset. )

using only prompt ensemble version(not CoOp)

python3 main.py

using CoOp

cd CoOp_trainer
python3 exp_CoOp.py

Inference

For inference, you should locate your image in img folder and run below command.

(This require dataset. )

using only prompt ensemble version(not CoOp)

python3 infer.py

using CoOp

cd CoOp_trainer
python3 CoOp_infer.py

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
CoOp_trainer		CoOp_trainer
config		config
legacy		legacy
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
best_log_large_exp.log		best_log_large_exp.log
best_log_small_exp.log		best_log_small_exp.log
csv_generator.py		csv_generator.py
data.py		data.py
demo_1.gif		demo_1.gif
demo_2.gif		demo_2.gif
inference.log		inference.log
inference.py		inference.py
main.py		main.py
meta_info.csv		meta_info.csv
meta_info_final.csv		meta_info_final.csv
meta_info_small.csv		meta_info_small.csv
metric.py		metric.py
prompt_compute.py		prompt_compute.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Find Your Shoes with CLIP (On-going project)

To do list for our project

Demo

Dataset

Crawling Rule

Method

1. Prompt Ensemble

Quick Start

Download Dataset

Before run the model, You should generate table for shoes information.

Experiment for evaluating model

Inference

Members

Reference

About

Releases

Packages

Languages

changhyeonnam/FindYourShoes_with_CLIP

Folders and files

Latest commit

History

Repository files navigation

Find Your Shoes with CLIP (On-going project)

To do list for our project

Demo

Dataset

Crawling Rule

Method

1. Prompt Ensemble

Quick Start

Download Dataset

Before run the model, You should generate table for shoes information.

Experiment for evaluating model

Inference

Members

Reference

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages