We propose the Incomplete Multimodality-Diffused emotion recognition (IMDer) method that maps input random noise to the distribution space of missing modalities and recovers missing data in accordance with their original distributions.
(1) IMDer maps input random noise to the distribution space of missing modalities and recovers missing data in accordance with their original distributions. (2) To minimize the semantic ambiguity between the missing and recovered modalities, IMDer utilize the available modalities as prior conditions to guide and refine the recovering process. Please refer to our paper for details.
- Python 3.8
- PyTorch 1.9.0
- CUDA 11.4
Data files can be downloaded from here, and you only need to download the aligned data.
You can put the downloaded datasets into dataset/
directory.
Before running missing cases, you should download the weights pretrained by complete multimodal data (i.e., MR=0.0).
You can put the downloaded weights into pt/
directory.
Running the following command:
python train.py
If you find the code helpful in your research or work, please cite the following paper.
@inproceedings{wang2023incomplete,
title={Incomplete Multimodality-Diffused Emotion Recognition},
author={Wang, Yuanzhi and Li, Yong and Cui, Zhen},
booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
year={2023}
}