Skip to content

sandropezzelle/malevic

Repository files navigation

MALeViC

MALeViC: Modeling Adjectives Leveraging Visual Contexts

Code and data presented in:

  1. Pezzelle & Fernández (2019). Is the Red Square Big? MALeViC: Modeling Adjectives Leveraging Visual Contexts. Proceedings of EMNLP-IJCNLP 2019

Abstract This work aims at modeling how the meaning of gradable adjectives of size ('big', 'small') can be learned from visually-grounded contexts. Inspired by cognitive and linguistic evidence showing that the use of these expressions relies on setting a threshold that is dependent on a specific context, we investigate the ability of multi-modal models in assessing whether an object is 'big' or 'small' in a given visual scene. In contrast with the standard computational approach that simplistically treats gradable adjectives as fixed attributes, we pose the problem as relational: to be successful, a model has to consider the full visual context. By means of four main tasks, we show that state-of-the-art models (but not a relatively strong baseline) can learn the function subtending the meaning of size adjectives, though their performance is found to decrease while moving from simple to more complex tasks. Crucially, models fail in developing abstract representations of gradable adjectives that can be used compositionally.

diagram

@inproceedings{pezzelle-fernandez-2019-red,
    title = "Is the Red Square Big? {MAL}e{V}i{C}: Modeling Adjectives Leveraging Visual Contexts",
    author = "Pezzelle, Sandro  and
      Fern{\'a}ndez, Raquel",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)",
    month = nov,
    year = "2019",
    address = "Hong Kong, China",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/D19-1285",
    doi = "10.18653/v1/D19-1285",
    pages = "2865--2876"
}

  1. Pezzelle & Fernández (2019). Big Generalizations with Small Data: Exploring the Role of Training Samples in Learning Adjectives of Size. Proceedings of LANTERN 2019 co-located with EMNLP-IJCNLP 2019

Abstract In this paper, we experiment with a recently proposed visual reasoning task dealing with quantities – modeling the multimodal, contextually-dependent meaning of size adjectives ('big', 'small') – and explore the impact of varying the training data on the learning behavior of a state-of-art system. In previous work, models have been shown to fail in generalizing to unseen adjective-noun combinations. Here, we investigate whether, and to what extent, seeing some of these cases during training helps a model understand the rule subtending the task, i.e., that being big implies being not small, and vice versa. We show that relatively few examples are enough to understand this relationship, and that developing a specific, mutually exclusive representation of size adjectives is beneficial to the task.

@inproceedings{pezzelle-fernandez-2019-big,
    title = "Big Generalizations with Small Data: Exploring the Role of Training Samples in Learning Adjectives of Size",
    author = "Pezzelle, Sandro  and
      Fern{\'a}ndez, Raquel",
    booktitle = "Proceedings of the Beyond Vision and LANguage: inTEgrating Real-world kNowledge (LANTERN)",
    month = nov,
    year = "2019",
    address = "Hong Kong, China",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/D19-6403",
    doi = "10.18653/v1/D19-6403",
    pages = "18--23"
}

Best models and pretrained ResNet101 visual features for all the tasks can be downloaded from: MALeViC

For questions, info, feedback please contact: s.pezzelle@uva.nl

About

MALeViC: Modeling Adjectives Leveraging Visual Contexts

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published