InfoOT: Information Maximizing Optimal Transport

Optimal transport aligns samples across distributions by minimizing the transportation cost between them, e.g., the geometric distances. Yet, it ignores coherence structure in the data such as clusters, does not handle outliers well, and cannot integrate new data points. To address these drawbacks, we propose InfoOT, an information-theoretic extension of optimal transport that maximizes the mutual information between domains while minimizing geometric distances. The resulting objective can still be formulated as a (generalized) optimal transport problem, and can be efficiently solved by projected gradient descent. This formulation yields a new projection method that is robust to outliers and generalizes to unseen samples.

InfoOT: Information Maximizing Optimal Transport ICML 2023 [paper]
Ching-Yao Chuang, Stefanie Jegelka, and David Alvarez-Melis

Prerequisites

Python 3.7
POT
tqdm
scikit-learn

Usage Examples

The code for InfoOT lie in infoot.py. For instance, the following code solves fused InfoOT given two data matrices:

# Xs: [n, d]
# Xt: [m, d]
from infoot import FusedInfoOT

ot = FusedInfoOT(Xs, Xt, h=0.5, reg=1.)
P = ot.solve()

If the source label is given, one can use it to refine the source pairwise distance as follows:

# Ys: [n]
ot = FusedInfoOT(Xs, Xt, Ys=Ys, h=0.5, reg=1.)
P = ot.solve()

Many applications of optimal transport involve mapping source points to a target domain. One can perform either barycentric or conditional projection with the following code. Note that the conditional projection can generalize to unseen samples.

# project the source onto target
ProjX1 = ot.project(Xs, method='barycentric')
ProjX2 = ot.project(Xs, method='conditional')

For aligning domains whose supports lie in different metric spaces, e.g., supports with different modalities or dimensionality, one can simply adopt the standar InfoOT:

# Xs: [n, d1]
# Xt: [m, d2]
# d1 != d2
from infoot import InfoOT

ot = InfoOT(Xs, Xt, h=0.5, reg=0.05)
P = ot.solve(numIter=100)

Other useful functions for computing kernels, the gradient w.r.t. mutual information, projection can also be found in infoot.py.

Domain Adaptation

Download the DeCAF feature for Office-Caltech dataset here and place the data in directory decaf6. The following script reproduces the result with barycentric and conditional projection.

python domain_adapt.py --src caltech --tgt dslr

Cross-Domain Retrieval

We will use the same data from the domain adaptation experiment. The following script reproduces the result with the conditional score.

python retrieval.py --src caltech --tgt dslr

Citation

@inproceedings{chuang2023info,
  title={InfoOT: Information Maximizing Optimal Transport},
  author={Chuang, Ching-Yao and Jegelka, Stefanie and Alvarez-Melis, David},
  booktitle={International Conference on Machine Learning},
  year={2023},
  organization={PMLR}
}

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
LICENSE		LICENSE
README.md		README.md
domain_adapt.py		domain_adapt.py
infoot.py		infoot.py
retrieval.py		retrieval.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

InfoOT: Information Maximizing Optimal Transport

Prerequisites

Usage Examples

Domain Adaptation

Cross-Domain Retrieval

Citation

About

Releases

Packages

Languages

License

chingyaoc/InfoOT

Folders and files

Latest commit

History

Repository files navigation

InfoOT: Information Maximizing Optimal Transport

Prerequisites

Usage Examples

Domain Adaptation

Cross-Domain Retrieval

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages