Skip to content

oierldl/vsts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

vSTS: Visual Semantic Textual Similarity

We present present Visual Semantic Textual Similarity (vSTS) that extends the Semantic Textual Similarity to the visual modality, a task and dataset which allows to study whether better sentence representations can be built when having access to the corresponding images, in contrast with having access to the text alone.

The vSTS dataset aims to become a standard benchmark to test the contribution of visual information when evaluating the similarity of sentences and the quality of multimodal representations, allowing to test the complementarity of visual and textual information for improved language understanding.

For more details check out our ECAI paper Evaluating Multimodal Representations on Visual Semantic Textual Similarity, and website.

The code and models presented in our paper, see our repository at: TBD

Full dataset can be downloaded here.

Data description and format

Each instance contains a pair of images and their description and a similarity value that ranges from 0 to 5. In total, we obtained 2677 pairs.

Each row contains one instance with tab separated values. Each row contains the following information:

  • id: instances ID.
  • source: Source of the sentence pairs: STS2014, STS2015, MS-COCO and Flickr30k.
  • sent1: First sentence of the pair.
  • image1: Path to the image file associated with the first sentence.
  • sent2: Second sentence of the pair.
  • image2: Path to the image file associated with the secpmd sentence.
  • sim: Similarity value that ranges from 0 to 5.

The file visual_sts.v2.0.all.tsv (download) contains the whole dataset without the actual images. Train, development, and test partition used in the paper can be found in train-dev-test folder in the GitHub repository. Partitions are divied in the following files:

  • Training set: dataset/train-dev-test/visual_sts.v2.0.train.tsv
  • Development set: dataset/train-dev-test/visual_sts.v2.0.dev.tsv
  • Test set: dataset/train-dev-test/visual_sts.v2.0.test.tsv

In order to obtain the images you can download the whole dataset from our servers: vSTS dataset.

About

vSTS: A dataset for evaluating multimodal representations on Visual Semantic Textual Similarity

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages