The official implementation code for RMIB: Representation Matching Information Bottleneck for Matching Text Representations (ICML2024). This repo is build based on RE2. To ensure the reproducibility of the experiment, we set the seed of all experiments to 32. We also upload training logs for BERT and SBERT to the path logs/
.
-
Experimental env
- GPU: GeForce RTX 2080Ti
- CUDA Version: 11.0
- Python Version: 3.7.13
-
Running script
-
Git clone repo
git clone https://github.com/chenxingphh/rmib
-
Install related packages
pip install -r requirement.txt
-
Download GloVe word vectors (glove.840B.300d) to
resources/
-
Runing script (Using different configs to run different models and datasets)
python train.py configs/sbert_sick.json5
-
Asymmetrical text matching refers to two input texts from different domains. For example, the question and the candidate answers can be viewed as being sampled from two different distributions. For addressing the challenge of matching texts across domains, we introduce the Representation Matching Information Bottleneck (RMIB) framework.
Recent studies have shown that domain matching of text representation will help improve the generalization ability of text matching. We narrow the distribution between text representations by explicitly aligning text representations with a prior distribution in text matching.
We prove that domain matching in text matching is equivalent to optimizing the information bottleneck in text matching, which indicates that domain alignment of input texts can make the learned text representation forget the input redundant information as much as possible.
Since the interaction between text representations plays an important role in asymmetrical domains text matching, IB does not restrict the interaction between text representations. Therefore, we propose the adequacy of interaction and the incompleteness of a single text representation on the basis of IB and obtain the representation matching information bottleneck (RMIB).
Sufficient: The representations
Interaction: The interaction between text representations should be sufficient, which means there should be enough mutual information between the two text representations.
Inadequacy: The final correct result cannot be obtained only by using a single text representation in text matching.
The optimization objective of RMIB is:
We then prove the optimization objective of RMIB can also be expressed as:
⭐If you are interest in RMIB, please consider to cite this paper:
@inproceedings{
pan2024rmib,
title={RMIB: Representation Matching Information Bottleneck for Matching Text Representations},
author={Haihui Pan, Zhifang Liao, Wenrui Xie, Kun Han},
booktitle={International Conference on Machine Learning},
year={2024},
url={https://icml.cc/virtual/2024/poster/33374}
}