CLEVR Diff Dataset

Dataset Generation

Code to generate pairs of CLEVR style images with an obvious difference.

You can use this code to render pairs of synthetic images and difference for those images, like this:

For the above example, generated difference,

{
   "image_index": 0,
   "difference_type": "deleted",
   "image_filename_deleted_diff": "CLEVR_new_000000_del.png",
   "location": {
      "pixel_coords": [
         251,
         95,
         10.763978
      ],
      "rotation": 325.82462,
      "3d_coords": [
         1.627432,
         2.2852376,
         0.7
      ]
   },
   "attributes": {
      "color": "gray",
      "shape": "sphere",
      "material": "rubber",
      "size": "large"
   },
   "difference": "The large gray sphere made of rubber is missing",
   "image_filename": "CLEVR_new_000000.png"
}

To generate such pair of images, use the following command,

blender --background --python render_images.py -- --num_images 300 --del_images 1 --use_gpu 1

To generate the difference between the images,

python generate_questions.py --del_images 1

Refer to CLEVR Dataset Generation repo for setting up blender and learning more about the code.

Feature Extraction

Extract ResNet-101 features.

To extract the features for the generated image,

python extract_features.py --input_image_dir output_500/images/ --output_h5_file output_500/train_image_feats.h5 --batch_size 32
python extract_features.py --input_image_dir output_100/images/ --output_h5_file output_100/dev_image_feats.h5 --batch_size 32
python extract_features.py --input_image_dir output_200/images/ --output_h5_file output_200/test_image_feats.h5 --batch_size 32

To extract the features for the generated image with a difference,

python extract_features.py --input_image_dir output_500/images/ --output_h5_file output_500/train_image_feats.h5 --batch_size 32 --type del
python extract_features.py --input_image_dir output_100/images/ --output_h5_file output_100/dev_image_feats.h5 --batch_size 32 --type del
python extract_features.py --input_image_dir output_200/images/ --output_h5_file output_200/test_image_feats.h5 --batch_size 32 --type del

Baseline Model

Currently, a very simple model is coded to identify the shape of the object which is different in the two images.

To train and evaluate the model,

python main.py --mode 0                  # Train
python main.py --mode 1                  # Test

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
image_generation		image_generation
images		images
question_generation		question_generation
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
PATENTS		PATENTS
README.md		README.md
data_loader.py		data_loader.py
evaluator.py		evaluator.py
extract_features.py		extract_features.py
main.py		main.py
model.py		model.py
trainer.py		trainer.py
util.py		util.py

License

MysteryVaibhav/clevr-dataset-gen

Folders and files

Latest commit

History

Repository files navigation

CLEVR Diff Dataset

Dataset Generation

Feature Extraction

Baseline Model

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Languages