Cleaned Classes for Visual Genome

This repository contains the code used to generate the results reported in the paper: Cleaner Categories Improve Object Detection and Visual-Textual Grounding.
Most of our code is based on bottom-up-attention.pytorch. Thanks!
Follow the list of classes:

Cleaned classes: ./evaluation/objects_vocab.txt
Old classes: ./evaluation/old_objects_vocab.txt
Random classes: ./evaluation/objects_vocab_random.txt

Dependencies, Structure, Usage

For more details on repository structure, dependencies, datasets and usage, refer to the original README

Datasets

To download the Visual Genome dataset, follow the instruction presented here: README
To generate the new datasets:

python make_dataset_cleanv3.py --labels ./evaluation/objects_vocab.txt
python make_dataset_random.py --labels ./evaluation/objects_vocab_random.txt

or, download them from here.

Use this command to generate the list of random labels:

python make_categories_random.py    --labels ./evaluation/objects_vocab.txt --output_folder ./

Repository Branches

This repository has the following branches:

master: this branch includes all the experiments regarding the new 878 classes;
develop: this branch includes all the experiments performed with the old 1600 classes;
develop_postmapping: this branch includes all the experiments performed with the old 1600 classes which are then post-processed to map to the new 878 classes;

Checkpoints

Checkpoint of the model trained on the new 878 cleaned classes: weight

Example of commands

Follow some examples:

Tran and Test

Use the following command to train the model with either cleaned or random labels:

# for random labels
# CONFIG_FILE=./configs/d2/train-d2-r101_random.yaml
# OUTPUT_FOLDER=./output/output_random/
# for cleaned labels
CONFIG_FILE=./configs/d2/train-d2-r101_cleaned.yaml
OUTPUT_FOLDER=./output/output_new_classes_v3/
python train_net.py \
                    --mode d2 \
                    --config  ${CONFIG_FILE} \
                    --num-gpus 1 \
                    OUTPUT_DIR ${OUTPUT_FOLDER}

Use the following command to test the model with either cleaned or random labels:

# for random labels
# CONFIG_FILE=./configs/d2/test-d2-r101_random.yaml
# OUTPUT_FOLDER=./output/output_random/
# for cleaned labels
CONFIG_FILE=./configs/d2/test-d2-r101_cleaned.yaml
OUTPUT_FOLDER=./output/output_new_classes_v3/
python train_net.py --mode d2 \
                        --config-file ${CONFIG_FILE} \
                        --num-gpus 4 \
                        --eval-only \
                        MODEL.WEIGHTS ${OUTPUT_FOLDER}model_final.pth \
                        OUTPUT_DIR ${OUTPUT_FOLDER}

Extract Features

Use the following command to extract features the features:

# for random labels
# CONFIG_FILE=./configs/d2/test-d2-r101_random.yaml
# OUTPUT_FOLDER=./output/output_random/
# OUTPUT_FOLDER_FEATURES=./extracted_features/extracted_features_clean_VG_th02/
# for cleaned labels
CONFIG_FILE=./configs/d2/test-d2-r101_cleaned.yaml
OUTPUT_FOLDER=./output/output_new_classes_v3/
OUTPUT_FOLDER_FEATURES=./extracted_features/extracted_features_random_VG_th02/

python extract_features.py  --mode d2 \
                            --num-cpus 32 \
                            --extract-mode roi_feats \
                            --min-max-boxes 10,100 \
                            --config-file ${CONFIG_FILE} \
                            --image-dir ./datasets/visual_genome/images/ \
                            --out-dir ${OUTPUT_FOLDER_FEATURES} \
                            MODEL.ROI_HEADS.SCORE_THRESH_TEST 0.2  \
                            OUTPUT_DIR ${OUTPUT_FOLDER}  \
                            MODEL.WEIGHTS .${OUTPUT_FOLDER}model_final.pth

Plot Dataset Frequencies

Use the following command to plot the frequencies per datasets:

python plot_freq_class_dataset.py   --file ./analysis/noisy_classes_frequency.json \
                                    --compare ./analysis/cleaned_classes_frequency.json \
                                    --loglog

NOTE: See the folder ./cluster for more commands.

Information

For any questions and comments about the new classes, contact Davide Rigoni.

License

Apache License

Name		Name	Last commit message	Last commit date
Latest commit History 228 Commits
analysis		analysis
bua		bua
cluster		cluster
configs		configs
datasets		datasets
detectron2 @ be792b9		detectron2 @ be792b9
evaluation		evaluation
utils		utils
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
README_bu.md		README_bu.md
calc_features_comparisons.py		calc_features_comparisons.py
calc_features_comparisons_testing.py		calc_features_comparisons_testing.py
calc_features_knn_comparisons.py		calc_features_knn_comparisons.py
calc_freq_class_proposals.py		calc_freq_class_proposals.py
compare_intra_distances_files.py		compare_intra_distances_files.py
compare_knn_old_files.py		compare_knn_old_files.py
container.bash		container.bash
env.txt		env.txt
env.yml		env.yml
extract_features.py		extract_features.py
make_categories_random.py		make_categories_random.py
make_dataset_cleanv3.py		make_dataset_cleanv3.py
make_dataset_random.py		make_dataset_random.py
opts.py		opts.py
plot_features_comparisons.py		plot_features_comparisons.py
plot_freq_class_dataset.py		plot_freq_class_dataset.py
plot_pred_bestclass_scores.py		plot_pred_bestclass_scores.py
plot_pred_bestclass_scores_kdeplot.py		plot_pred_bestclass_scores_kdeplot.py
plot_pred_class_AP.py		plot_pred_class_AP.py
plot_pred_class_AP_by_bucket.py		plot_pred_class_AP_by_bucket.py
plot_tsne_features.py		plot_tsne_features.py
setup.py		setup.py
train_net.py		train_net.py

License

drigoni/bottom-up-attention.pytorch

Folders and files

Latest commit

History

Repository files navigation

Cleaned Classes for Visual Genome

Dependencies, Structure, Usage

Datasets

Repository Branches

Checkpoints

Example of commands

Tran and Test

Extract Features

Plot Dataset Frequencies

Information

License

About

Resources

License

Stars

Watchers

Forks

Languages