MAN

Multimodal Adversarial Network for Cross-modal Retrieval (PyTorch Code)

Abstract

Cross-modal retrieval aims to retrieve pertinent samples across different modalities, which is important in numerous multimodal applications. Correlating multimodal data is challenging due to the large heterogeneity gap between distinct modalities. In this paper, we propose a Multimodal Adversarial Network (MAN) to project multimodal data into a common space in which the similarities between different modalities can be computed directly with the same distance metric. The proposed MAN consists of multiple modality-specific generators, a discriminator, and a multimodal discriminant analysis (MDA) loss. Through adversarial learning, the generators are pitted against the discriminator to eliminate the cross-modal discrepancy. Furthermore, a novel MDA loss is proposed to preserve as much discrimination as possible in all available dimensions of the generated common representations. However, directly optimizing the MDA trace criterion is problematic: the discriminant function overemphasizes 1) the large distances between already well-separated classes and 2) the dominant eigenvalues, which can leave the common representations poorly discriminated. To address this, we propose a between-class strategy and an eigenvalue strategy that weaken the largest between-class differences and the dominant eigenvalues, respectively. To the best of our knowledge, MAN is among the first works designed specifically for multimodal representation learning (more than two modalities) with adversarial learning. To verify its effectiveness, extensive experiments are carried out on four widely used multimodal datasets, comparing MAN with 16 state-of-the-art approaches.
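
To make the generators-versus-discriminator idea concrete, here is a minimal PyTorch sketch of one adversarial training step over three modalities. It is illustrative only, not this repository's code: the class names, layer sizes, feature dimensions, and the negated cross-entropy generator objective are all assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Generator(nn.Module):
    """Projects one modality's features into the common space."""
    def __init__(self, in_dim, common_dim=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 512),
            nn.ReLU(),
            nn.Linear(512, common_dim),
        )

    def forward(self, x):
        return self.net(x)

class ModalityDiscriminator(nn.Module):
    """Predicts which modality a common representation came from."""
    def __init__(self, common_dim=256, n_modalities=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(common_dim, 128),
            nn.ReLU(),
            nn.Linear(128, n_modalities),
        )

    def forward(self, z):
        return self.net(z)

# Hypothetical feature sizes for three modalities (e.g. image/text/audio).
dims = [4096, 300, 128]
gens = [Generator(d) for d in dims]
disc = ModalityDiscriminator(n_modalities=len(dims))
opt_g = torch.optim.Adam([p for g in gens for p in g.parameters()], lr=1e-4)
opt_d = torch.optim.Adam(disc.parameters(), lr=1e-4)

feats = [torch.randn(8, d) for d in dims]  # one toy mini-batch per modality
labels = torch.cat([torch.full((8,), m, dtype=torch.long)
                    for m in range(len(dims))])

# Discriminator step: learn to tell the modalities apart in the common space.
z_detached = torch.cat([g(x).detach() for g, x in zip(gens, feats)])
d_loss = F.cross_entropy(disc(z_detached), labels)
opt_d.zero_grad()
d_loss.backward()
opt_d.step()

# Generator step: maximize the discriminator's error so the common
# representations become modality-invariant (the paper's MDA loss would be
# added to this objective; it is omitted here).
z = torch.cat([g(x) for g, x in zip(gens, feats)])
g_loss = -F.cross_entropy(disc(z), labels)
opt_g.zero_grad()
g_loss.backward()
opt_g.step()
```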
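
The MDA side can be sketched in the same spirit. The toy loss below maximizes a re-weighted trace criterion over labelled common representations: a saturating pair weight shrinks the contribution of class pairs that are already far apart, and a cap keeps the dominant eigenvalues from swamping the rest. Both weightings are illustrative stand-ins for the paper's between-class and eigenvalue strategies, whose exact formulations are given in the paper.

```python
import torch

def mda_style_loss(z, y, n_classes, tau=5.0, eps=1e-3):
    """Trace-style discriminant loss on common representations z of shape (N, d).

    Assumes every class appears at least once in the batch. The 1/(1+dist)
    pair weight and the eigenvalue cap tau are placeholder choices.
    """
    d = z.size(1)
    means = torch.stack([z[y == c].mean(0) for c in range(n_classes)])

    # Within-class scatter, regularized so the Cholesky factorization exists.
    Sw = eps * torch.eye(d)
    for c in range(n_classes):
        diff = z[y == c] - means[c]
        Sw = Sw + diff.t() @ diff

    # Between-class scatter from pairwise mean differences; the saturating
    # weight plays the role of the "between-class strategy".
    Sb = torch.zeros(d, d)
    for i in range(n_classes):
        for j in range(i + 1, n_classes):
            diff = (means[i] - means[j]).unsqueeze(1)
            w = 1.0 / (1.0 + diff.norm())
            Sb = Sb + w * (diff @ diff.t())

    # Whiten Sb by Sw so a symmetric eigendecomposition applies, then cap the
    # eigenvalues; the cap plays the role of the "eigenvalue strategy".
    L = torch.linalg.cholesky(Sw)
    Linv = torch.linalg.inv(L)
    evals = torch.linalg.eigvalsh(Linv @ Sb @ Linv.t())
    return -torch.clamp(evals, max=tau).sum()

# Toy usage: 32 samples in a 256-d common space, 4 classes.
z = torch.randn(32, 256, requires_grad=True)
y = torch.randint(0, 4, (32,))
loss = mda_style_loss(z, y, n_classes=4)
loss.backward()
```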

Framework

[Figure: overall framework of the proposed MAN]

Results

[Figure: experimental results; see the paper for the full comparison with 16 state-of-the-art approaches on the four datasets]

Citing MAN

If you find MAN useful in your research, please consider citing:

@article{hu2019multimodal,
  title={Multimodal adversarial network for cross-modal retrieval},
  author={Hu, Peng and Peng, Dezhong and Wang, Xu and Xiang, Yong},
  journal={Knowledge-Based Systems},
  volume={180},
  pages={38--50},
  year={2019},
  publisher={Elsevier}
}
