

##RefCOCOg-Adv: A Referring Expression Dataset

This folder contains adversarial annotations for a subset of the test images from the RefCOCOg dataset.

##Prepare Images:

Download the "mscoco" images into the Images folder; they are available from mscoco. Bounding box annotations are also from the mscoco dataset.

##Dataset Summary:

  • The RefCOCOg-adv dataset contains 976 unique images from the RefCOCOg test set. In total, 3704 referring expressions were annotated, with an average length of 11.3.
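The summary figures above could be recomputed along the following lines. This is a minimal sketch, not the authors' script: the helper name `summarize`, the choice of `raw_str` as the measured field, and whitespace tokenization are all assumptions, and the two records are toy data for illustration only.

```python
from statistics import mean


def summarize(annotations, text_key="raw_str"):
    """Return (unique image count, expression count, mean token length).

    Assumption: length is counted in whitespace-separated tokens of
    `text_key`; the README does not say how the average was measured.
    """
    image_ids = {ann["imgID"] for ann in annotations}
    lengths = [len(ann[text_key].split()) for ann in annotations]
    return len(image_ids), len(annotations), mean(lengths)


# Toy records (hypothetical, shortened) that share one image.
sample = [
    {"imgID": "254291", "raw_str": "two giraffes leaning over a fence"},
    {"imgID": "254291", "raw_str": "a woman in a red shirt"},
]

n_images, n_expressions, avg_length = summarize(sample)
print(n_images, n_expressions, avg_length)
```

To run this on the real data, pass the annotation records loaded from refcocog_adv_annotations.json as a list of dicts.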

#Example Annotation of RefCOCOg-adv:

refcocog_adv_annotations.json contains a dict of adversarial referring expressions, where each annotation is

    'shuffle': 'to giraffes in hat next red white fence over shirt a a a two and woman leaning', 
    'hit_id': '3K3G488TR2FROXACUQOSDKLYTK65Q8', 
    'hit_type_id': '3QH037L2CMDD4RRYVNJWX0CT0B0AWC', 
    'adj_noun': 'giraffes white hat shirt woman next fence red', 
    'GT_bbox_number': '3', 
    'most_confused_bbox_GT_desc': 'A woman in a red shirt, feeding giraffes through a fence.', 
    'raw_str': 'two giraffes leaning over a fence next to a woman in a red shirt and white hat.', 
    'noun_adj': ' giraffes fence woman red shirt white', 
    'batch_id': 'batch18', 
    'bbox_candidates_map': '{"1": [373.0, 78.04, 266.56, 348.96], "2": [0.96, 0.24, 294.58, 398.21], "3": [62.37, 0.14, 329.79, 292.73], "4": [455.08, 177.19, 50.35, 235.29]}', 
    'adj': 'next red white', 
    'most_confused_bbox_number': '1', 
    'assigns': [{'dropdownselect': '2'}, {'dropdownselect': '1'}, {'dropdownselect': '1'}], 
    'must_words': 'giraffes,fence,woman,red,shirt,white', 
    'bbox_html_dropdown': "<option value='0'>Select the Box here</option> <option value='1'>1</option> <option value='2'>2</option> <option value='3'>3</option> <option value='4'>4</option> ", 
    'shuffle_pos': 'two shirt leaning over a fence white to a giraffes in a next woman red hat', 
    'refID': '64809', 
    'stage': 'stage3', 
    'noun': 'giraffes fence woman shirt hat', 
    'imgID': '254291', 
    'annID': '595839', 
    'img_file_name': 'COCO_train2014_000000254291.jpg'

#From the above example

shuffle, adj_noun, noun_adj, noun, must_words, and shuffle_pos are parameters used to perturb the data for our adversarial data creation.
raw_str and GT_bbox_number are from the RefCOCOg dataset.
bbox_candidates_map: from mscoco.
most_confused_bbox_GT_desc: from our annotation.
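A short sketch of how one annotation record might be decoded. In the example above, `bbox_candidates_map` is stored as a JSON string and the box numbers are strings, so both need handling before use. The helper names `parse_annotation` and `xywh_to_xyxy` are hypothetical, and reading `assigns` as per-annotator box selections is an assumption. Boxes are treated as `[x, y, width, height]`, the usual mscoco convention.

```python
import json


def parse_annotation(ann):
    """Decode the box fields of a single annotation record."""
    # bbox_candidates_map is a JSON string: box number -> [x, y, w, h].
    boxes = {k: tuple(v) for k, v in json.loads(ann["bbox_candidates_map"]).items()}
    return {
        "boxes": boxes,
        "gt_box": boxes[ann["GT_bbox_number"]],
        "confused_box": boxes[ann["most_confused_bbox_number"]],
        # Assumption: each entry is one annotator's selected box number.
        "votes": [a["dropdownselect"] for a in ann["assigns"]],
    }


def xywh_to_xyxy(box):
    """Convert [x, y, width, height] to corner form (x1, y1, x2, y2)."""
    x, y, w, h = box
    return (x, y, x + w, y + h)


# Subset of the fields from the example annotation above.
record = {
    "GT_bbox_number": "3",
    "most_confused_bbox_number": "1",
    "bbox_candidates_map": '{"1": [373.0, 78.04, 266.56, 348.96], '
                           '"2": [0.96, 0.24, 294.58, 398.21], '
                           '"3": [62.37, 0.14, 329.79, 292.73], '
                           '"4": [455.08, 177.19, 50.35, 235.29]}',
    "assigns": [{"dropdownselect": "2"}, {"dropdownselect": "1"}, {"dropdownselect": "1"}],
}

parsed = parse_annotation(record)
print(parsed["gt_box"], parsed["votes"])
```

Keeping the box numbers as strings avoids a mismatch with the JSON keys, which are always strings after decoding.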


This library is licensed under the CC-BY-4.0 License.

