This is the official repository for Towards Visual Taxonomy Expansion.
The constructed Chinese taxonomy dataset is in ./Chinese taxonomy dataset.
Due to commercial licensing restrictions, we cannot upload the images for each term.
However, you can look up each term on a search engine to find a corresponding image.
In the training file, every hypernymy pair is stored as a list whose first item is the hyponym and whose second item is the hypernym.
In the test file, the first item is the hyponym and the second item is a potential parent derived from user click logs.
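As an illustration of the pair layout described above, here is a minimal Python sketch for reading such pairs. It assumes each line of the file holds one JSON-encoded list; the actual on-disk format in this repository may differ, so adjust the parsing accordingly. The function name `parse_pairs` and the sample terms are hypothetical and not taken from the dataset.

```python
import json
from typing import Iterable, List, Tuple


def parse_pairs(lines: Iterable[str]) -> List[Tuple[str, str]]:
    """Parse lines that are ASSUMED to each contain a JSON list of two items.

    For a training file the tuple is (hyponym, hypernym); for a test file it is
    (hyponym, potential parent derived from click logs).
    """
    pairs = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        item = json.loads(line)
        if len(item) != 2:
            raise ValueError(f"expected a two-item list, got: {item!r}")
        hyponym, parent = item
        pairs.append((hyponym, parent))
    return pairs


# Hypothetical sample lines, for illustration only.
sample = ['["golden retriever", "dog"]', '["espresso", "coffee"]']
pairs = parse_pairs(sample)
for hyponym, parent in pairs:
    print(f"{hyponym} -> {parent}")
```

To read a real file, pass an open file handle instead of the sample list, for example `parse_pairs(open(path, encoding="utf-8"))`, where `path` points at the training or test file.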
The original dataset can be viewed here. The images used for this dataset are uploaded here.
To reproduce the results reported in our paper, run:
python train.py \
--train_datapath $TRAIN_DATAPATH \
--dev_datapath $DEV_DATAPATH \
--test_datapath $TEST_DATAPATH \
--save_dir $YOUR_SAVE_DIR \
--batch_size $BATCH_SIZE \
--modal_integration add \
--auto_add \
--integration dot \
--train_epochs $EPOCHS
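
If you prefer to launch training from a Python script, the command above can be assembled as an argument list. This is only a sketch: the flags are copied from the command above, while the helper name `build_train_command`, the file paths, the batch size, and the epoch count are hypothetical placeholders, not the settings used in the paper.

```python
import shlex


def build_train_command(train_path, dev_path, test_path, save_dir, batch_size, epochs):
    """Return the train.py invocation as an argument list (flags from the README command)."""
    return [
        "python", "train.py",
        "--train_datapath", train_path,
        "--dev_datapath", dev_path,
        "--test_datapath", test_path,
        "--save_dir", save_dir,
        "--batch_size", str(batch_size),
        "--modal_integration", "add",
        "--auto_add",
        "--integration", "dot",
        "--train_epochs", str(epochs),
    ]


# Placeholder values; replace them with your own paths and hyperparameters.
command = build_train_command(
    train_path="data/train.json",
    dev_path="data/dev.json",
    test_path="data/test.json",
    save_dir="checkpoints",
    batch_size=32,
    epochs=10,
)
print(shlex.join(command))  # shows the shell-quoted command without running it
```

To actually start training, pass the list to `subprocess.run(command, check=True)` from the repository root.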