[MRG] Jcpot : Multi source DA with target shift #137

ievred · 2020-03-31T15:17:19Z

Added jcpot class in the da.py, jcpot_barycenter with the optimization routine, an example and a test

…ce_da

rflamary

still a few things to handle the PR is looking good.

rflamary · 2020-04-07T11:23:27Z

README.md

@@ -29,6 +29,7 @@ It provides the following solvers:
 * Non regularized free support Wasserstein barycenters [20].
 * Unbalanced OT with KL relaxation distance and barycenter [10, 25].
 * Screening Sinkhorn Algorithm for OT [26].
+* JCPOT algorithm for multi-source target shift [27].


Multi source domain adaptation with target shift

rflamary · 2020-04-07T11:24:52Z

ot/bregman.py

+
+        # build the cost matrix and the Gibbs kernel
+        M = dist(Xs[d], Xt, metric=metric)
+        M = M / np.median(M)


normalization by median should be optional.

removed median and adjusted parameters in the tests accordingly

rflamary

This is looking good. just a few more comments and we are done.

Thanks again @ievred

rflamary · 2020-04-08T11:36:27Z

test/test_da.py

@@ -549,3 +547,57 @@ def test_linear_mapping_class():
    Cst = np.cov(Xst.T)

    np.testing.assert_allclose(Ct, Cst, rtol=1e-2, atol=1e-2)
+
+
+def test_jcpot_transport_class():


could you also write a test for the jcpot function in addition to the class?

It's nice to chgeck both since the interface is available for both. you can also use it on another test dataset with obvious solutions (repeating samples and known proportions for instance).

rflamary · 2020-04-08T11:39:12Z

ot/da.py

+class JCPOTTransport(BaseTransport):
+
+    """Domain Adapatation OT method for multi-source target shift based on Wasserstein barycenter algorithm.
+


Could you add to the documentation what kind of mzpping is used? barycentric it seems but you could also so label prop by keeping the target position and providong non binary one hot encoding no? This couls be a parameter to give to the method.

added label prop for base class and jcpot + tests for all otda methods

rflamary · 2020-04-08T11:40:48Z

ot/bregman.py

+    The problem consist in solving a Wasserstein barycenter problem to estimate the proportions :math:`\mathbf{h}` in the target domain.
+
+    The algorithm used for solving the problem is the Iterative Bregman projections algorithm
+    with two sets of marginal constraints related to the unknown vector :math:`\mathbf{h}` and uniform tarhet distribution.


target distribution

rflamary

This is very nice.

Just a few more comments related mostly to the new transfer_labels functions

rflamary · 2020-04-15T08:26:32Z

ot/bregman.py

+
+    bary = bary / np.sum(bary)
+
+    if log:


Hum, do we really need to return the gammas by default?

The name of the function is barycenter which is the weights h in this case.
I would put the gammas in the log or at least second position.

as far as I understand we have to return gamma as in documentation of the BaseTransport of da.py it is said that: "fit method should estimate a coupling matrix and store it in a coupling_ attribute"

rflamary · 2020-04-15T08:32:10Z

ot/da.py

+            # compute transported samples
+            transp_ys = np.dot(D1, transp)
+
+            return np.argmax(transp_ys, axis=0)


why return argmax? I woul return the smooth label estimations and let the user do the argmax if he really wants one label.

Done, changed documentation to "soft labels"

rflamary · 2020-04-15T08:32:52Z

ot/da.py

+            # compute transported samples
+            transp_ys = np.dot(D1, transp.T)
+
+            return np.argmax(transp_ys, axis=0)


same comment

Done, changed documentation to "soft labels"

rflamary · 2020-04-15T08:34:11Z

ot/da.py

+                transp[~ np.isfinite(transp)] = 0
+
+                # compute transported labels
+                transp_ys.append(np.argmax(np.dot(D1, transp.T), axis=0))


same argmax comment

Done, changed documentation to "soft labels"

ievred added 2 commits March 31, 2020 09:43

added jcpot

171b962

v1 jcpot example test

6aa0f1f

rflamary changed the title ~~Jcpot~~ [WIP] Jcpot Mar 31, 2020

ievred added 5 commits March 31, 2020 17:36

readme move to bregman

ba493aa

fix imports remove checks

4398606

fix test example add M to log

547a03e

add dataset clean plot

b1f8736

pep8

6b8477d

rflamary added the new feature label Apr 2, 2020

ievred added 8 commits April 2, 2020 15:29

laplace v1

9200af5

laplace emd+sinkhorn

90f5d5f

v2 laplace emd sinkhorn

fa99199

Delete da.py

0baef79

autopep+remove sinkhorn+add simtype

98b68f1

Merge branch 'laplace_da' of https://github.com/ievred/POT into lapla…

e48eebb

…ce_da

remove commented line

b32c815

Merge branch 'master' into laplace_da

bf9b170

rflamary reviewed Apr 7, 2020

View reviewed changes

ievred added 4 commits April 7, 2020 13:29

add sklearn to travis yml

ed34704

pep bregman

d52a78d

pep bregman

34e13d4

upd

2c9f992

ievred force-pushed the jcpot branch from a5e1bcc to 2c9f992 Compare April 8, 2020 07:20

ievred added 3 commits April 8, 2020 10:08

remove laplace from jcpot

c68b52d

test+utils+readme

5592651

remove laplacian

a5dbac1

rflamary reviewed Apr 8, 2020

View reviewed changes

Ievgen Redko added 3 commits April 8, 2020 14:10

Merge branch 'master' into jcpot

08d0bf9

added test barycenter + modif target

bc51793

add label prop + inverse

0b402fd

added label normalization to utils

1a4c264

rflamary changed the title ~~[WIP] Jcpot~~ [WIP] Jcpot : Multi source DA with target shift Apr 10, 2020

rflamary reviewed Apr 15, 2020

View reviewed changes

ievred added 3 commits April 15, 2020 11:12

fix soft labels, remove gammas from jcpot

749378a

fix jcpot_bary test

54a129f

fix indent test_da

7889484

rflamary changed the title ~~[WIP] Jcpot : Multi source DA with target shift~~ [MRG] Jcpot : Multi source DA with target shift Apr 15, 2020

rflamary merged commit adc5570 into PythonOT:master Apr 15, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MRG] Jcpot : Multi source DA with target shift #137

[MRG] Jcpot : Multi source DA with target shift #137

ievred commented Mar 31, 2020

rflamary left a comment

rflamary Apr 7, 2020

ievred Apr 7, 2020

rflamary Apr 7, 2020

ievred Apr 7, 2020

rflamary left a comment

rflamary Apr 8, 2020

ievred Apr 8, 2020

rflamary Apr 8, 2020

ievred Apr 8, 2020

rflamary Apr 10, 2020

rflamary Apr 8, 2020

ievred Apr 8, 2020

rflamary left a comment

rflamary Apr 15, 2020

ievred Apr 15, 2020

ievred Apr 15, 2020

rflamary Apr 15, 2020

ievred Apr 15, 2020

rflamary Apr 15, 2020

ievred Apr 15, 2020

rflamary Apr 15, 2020

ievred Apr 15, 2020

		class JCPOTTransport(BaseTransport):

		"""Domain Adapatation OT method for multi-source target shift based on Wasserstein barycenter algorithm.

[MRG] Jcpot : Multi source DA with target shift #137

[MRG] Jcpot : Multi source DA with target shift #137

Conversation

ievred commented Mar 31, 2020

rflamary left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rflamary left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rflamary left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment