1. Composing Text and Image for Image Retrieval

[1]. Vo N, Jiang L, Sun C, et al. Composing text and image for image retrieval - an empirical odyssey[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019: 6439-6448.[pdf][code]
[2]. Vo N, Jiang L, Hays J. Let's Transfer Transformations of Shared Semantic Representations[J]. arXiv preprint arXiv:1903.00793, 2019.[pdf]
[3]. Guo X, Wu H, Cheng Y, et al. Dialog-based interactive image retrieval[C]//Advances in Neural Information Processing Systems. 2018: 678-688.[pdf][code]
[4]. Perez E, Strub F, De Vries H, et al. FiLM: Visual reasoning with a general conditioning layer[C]//Thirty-Second AAAI Conference on Artificial Intelligence. 2018.[pdf][code]
[5]. Santoro A, Raposo D, Barrett D G, et al. A simple neural network module for relational reasoning[C]//Advances in neural information processing systems. 2017: 4967-4976.[pdf][code]
[6]. Noh H, Hongsuck Seo P, Han B. Image question answering using convolutional neural network with dynamic parameter prediction[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 30-38.[pdf]
[7]. Zhao B, Feng J, Wu X, et al. Memory-augmented attribute manipulation networks for interactive fashion search[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 1520-1528.[pdf]

2. Image-Text Match

[1]. Socher R, Karpathy A, Le Q V, et al. Grounded compositional semantics for finding and describing images with sentences[J]. Transactions of the Association for Computational Linguistics, 2014, 2: 207-218.[pdf]
[2]. Hu Z, Luo Y, Lin J, et al. Multi-level visual-semantic alignments with relation-wise dual attention network for image and text matching[C]//Proceedings of the 28th International Joint Conference on Artificial Intelligence. AAAI Press, 2019: 789-795.[pdf]
[3]. Nam H, Ha J W, Kim J. Dual attention networks for multimodal reasoning and matching[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 299-307.[pdf]
[4]. Qi J, Peng Y, Yuan Y. Cross-media multi-level alignment with relation attention network[J]. arXiv preprint arXiv:1804.09539, 2018.[pdf]
