gchb2012/VQA


1. Composing Text and Image for Image Retrieval

[1]. Vo N, Jiang L, Sun C, et al. Composing text and image for image retrieval - an empirical odyssey[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019: 6439-6448. [pdf][code]
[2]. Vo N, Jiang L, Hays J. Let's Transfer Transformations of Shared Semantic Representations[J]. arXiv preprint arXiv:1903.00793, 2019. [pdf]
[3]. Guo X, Wu H, Cheng Y, et al. Dialog-based interactive image retrieval[C]//Advances in Neural Information Processing Systems. 2018: 678-688. [pdf][code]
[4]. Perez E, Strub F, De Vries H, et al. FiLM: Visual reasoning with a general conditioning layer[C]//Thirty-Second AAAI Conference on Artificial Intelligence. 2018. [pdf][code]
[5]. Santoro A, Raposo D, Barrett D G, et al. A simple neural network module for relational reasoning[C]//Advances in Neural Information Processing Systems. 2017: 4967-4976. [pdf][code]
[6]. Noh H, Hongsuck Seo P, Han B. Image question answering using convolutional neural network with dynamic parameter prediction[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016: 30-38. [pdf]
[7]. Zhao B, Feng J, Wu X, et al. Memory-augmented attribute manipulation networks for interactive fashion search[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 1520-1528. [pdf]

2. Image-Text Matching

[1]. Socher R, Karpathy A, Le Q V, et al. Grounded compositional semantics for finding and describing images with sentences[J]. Transactions of the Association for Computational Linguistics, 2014, 2: 207-218. [pdf]
[2]. Hu Z, Luo Y, Lin J, et al. Multi-level visual-semantic alignments with relation-wise dual attention network for image and text matching[C]//Proceedings of the 28th International Joint Conference on Artificial Intelligence. AAAI Press, 2019: 789-795. [pdf]
[3]. Nam H, Ha J W, Kim J. Dual attention networks for multimodal reasoning and matching[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017: 299-307. [pdf]
[4]. Qi J, Peng Y, Yuan Y. Cross-media multi-level alignment with relation attention network[J]. arXiv preprint arXiv:1804.09539, 2018. [pdf]
