
PR: inverse_transform is implemented for scikit-learn utility #12

Merged
merged 2 commits into hagax8:master on Aug 22, 2020

Conversation

sshojiro
Contributor

@sshojiro sshojiro commented May 1, 2020

Hello,

I have found that there is no inverse_transform method, such as the one PCA provides in scikit-learn.
The new inverse_transform method in this PR maps responsibilities back onto the original data space.

The short sample code is here:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from ugtm.ugtm_sklearn import eGTM
X,y = load_iris(return_X_y=True)
Xtrain, Xtest = train_test_split(X, test_size=0.20,
                random_state=42)
model = eGTM(model='responsibilities')
_ = model.fit_transform(Xtrain)
matR = model.transform(Xtest)
Xhat = model.inverse_transform(matR)
print("original data", Xtest.shape)
print("projected data", Xhat.shape)
# original data (30, 4)
# projected data (30, 4)
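To illustrate what such an inverse mapping does conceptually, here is a minimal NumPy sketch (this is an assumed illustration, not ugtm's actual implementation): in GTM, transform() with model='responsibilities' yields a matrix R of shape (n_samples, n_nodes), and a reconstruction can be formed as the responsibility-weighted average of the node centers mapped into data space.

```python
import numpy as np

# Hypothetical sketch: reconstructing data-space points from a
# responsibility matrix. The node centers here are random stand-ins;
# in GTM they come from the learned latent-to-data mapping.
rng = np.random.default_rng(42)
n_samples, n_nodes, n_features = 30, 256, 4

# Node centers in data space (assumed, for illustration only).
node_centers = rng.normal(size=(n_nodes, n_features))

# Responsibilities: each row is a probability distribution over nodes.
R = rng.random(size=(n_samples, n_nodes))
R /= R.sum(axis=1, keepdims=True)

# Reconstruction: responsibility-weighted average of node centers.
Xhat = R @ node_centers
print(Xhat.shape)  # (30, 4)
```

This matches the shapes in the example above: 30 test samples come back with the original 4 features.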

This may help analyses with GTM.
Thank you in advance for your consideration.

feat: inverse_transform maps responsibilities onto the original data space

refactor: transform and fit_transform are available even in sklearn.pipeline.Pipeline, which ignores `model` input

1. transform(self, X, model) => transform(self, X),
   because extra parameters are not allowed in Pipeline.transform
2. eGTM.__init__(..., model="means"),
   because the output format must be set at initialization
   or via `set_params`
3. fit(self, X) => fit(self, X, y=None),
   because the fit method is expected to accept (self, X, y),
   with y defaulting to None for unsupervised methods
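The three signature changes above can be sketched with a minimal stand-in estimator (MiniGTM is a hypothetical name for illustration, not ugtm code):

```python
# Minimal sketch of the Pipeline-compatible estimator API described above.
class MiniGTM:
    def __init__(self, model="means"):
        # (2) The output format is a constructor parameter, not a
        # transform() argument, since Pipeline.transform passes only X.
        self.model = model

    def set_params(self, **params):
        # Lets a Pipeline address this step, e.g. egtm__model=...
        for key, value in params.items():
            setattr(self, key, value)
        return self

    def fit(self, X, y=None):
        # (3) y=None so the unsupervised estimator still accepts the
        # (X, y) call signature used by Pipeline.fit.
        self.n_features_ = len(X[0])
        return self

    def transform(self, X):
        # (1) No extra `model` argument: the format is read from self.
        return [[self.model] for _ in X]

    def fit_transform(self, X, y=None):
        return self.fit(X, y).transform(X)

est = MiniGTM().set_params(model="responsibilities")
out = est.fit_transform([[1.0, 2.0], [3.0, 4.0]])
print(out)  # [['responsibilities'], ['responsibilities']]
```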
@sshojiro
Contributor Author

sshojiro commented May 2, 2020

The commit 3d1086e (formerly e8f6459) adds sklearn.pipeline.Pipeline integration.
The eGTM class now switches its output format via eGTM.set_params and the eGTM.model parameter.
eGTM.model takes one of 'means', 'modes', 'responsibilities', or 'complete'.

The new feature is confirmed to work as follows:

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from ugtm.ugtm_sklearn import eGTM
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler 
X,y = load_iris(return_X_y=True)
model = make_pipeline(StandardScaler(), eGTM())
Xtrain, Xtest = train_test_split(X, test_size=0.20,
                random_state=42)

model.set_params(**{"egtm__model": "responsibilities"})
model.fit_transform(Xtrain) # pass 
model.fit(Xtrain)           # pass 
for mtype in ['means', 'modes', 'responsibilities']:
    model.set_params(**{"egtm__model": mtype})
    Xttest = model.transform(Xtest)
    print(mtype, Xttest.shape)
# means (30, 2)
# modes (30, 2)
# responsibilities (30, 256)

@hagax8 hagax8 merged commit ff4d7d7 into hagax8:master Aug 22, 2020