Pragmatic-Image-Captioning

This codebase implements Bayesian pragmatics (i.e. the Rational Speech Acts model - RSA) over the top of a deep neural image captioning model. These are desirable to combine, since RSA gives rise to linguistically realistic effects, while deep models can capture (at least some) of the flexibility and expressivity of natural language.

Summary:

Suppose we have a space of possible sentences U
Choosing the sentence which is the most informative caption for identifying image w out of a set of images W is a useful task (moreover, it represents a key instance of natural language pragmatics)
Viewed as an inference problem (of a speaker agent P(U|W=w) ), this task is intractable when U is large.
But if the space of possible sentences U is recursively generated, there's a solution: at each stage of the recursive generation of a sentence u, we perform a local inference as to the most informative next step
Category theoretic perspective (very roughly): this amounts to mapping the inference onto the coalgebra of the anamorphism used to generate the distribution over U
Linguistic perspective (very broadly): we're pushing pragmatics into the lower levels of language, rather than adding it on top
Computational perspective: this provides us a way to get the power of Bayesian models of pragmatics (see Rational Speech Acts) with deep machine learning models powerful enough to model natural language

Setup:

To run the model, you'll need python3.6, and to have cloned the repo with git lfs. Run main.py - you can supply urls for your own images, but on the current settings, the captions probably won't be great (needs beam search, and the example uses greedy)

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
bayesian_agents		bayesian_agents
data		data
paper		paper
recursion_schemes		recursion_schemes
train		train
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pragmatic-Image-Captioning

About

Releases

Packages

Languages

reubenharry/Recurrent-RSA

Folders and files

Latest commit

History

Repository files navigation

Pragmatic-Image-Captioning

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages