Clarify semantics of Model.predict_real #21

lsc36 · 2015-12-17T13:22:04Z

Currently Model.predict_real is connected to predict_proba in scikit-learn, which returns an array of n_classes floats standing for probabilities of corresponding labels. But decision_function is another candidate whose returning shapes vary from model to model, for example (in our case n_samples = 1):

LogisticRegression: (n_samples,) if n_classes == 2 else (n_samples, n_classes)
C-SVC: (n_samples, n_classes * (n_classes-1) / 2)

We have to make sure what we want in order to well-define the interface. @hsuantien can you give us some advice on this?

The text was updated successfully, but these errors were encountered:

hsuantien · 2015-12-17T13:47:34Z

I suggest that we consider the logistic regression type only. Technically,
the C-SVC one can be converted to the LogisticRegression one in some way.
Thanks.

--HT

SC Lee notifications@github.com 於 2015年12月17日週四下午9:22寫道：

Currently Model.predict_real is connected to predict_proba in
scikit-learn, which returns an array of n_classes floats standing for
probabilities of corresponding labels. But decision_function is another
candidate whose returning shapes vary from model to model, for example (in
our case n_samples = 1):

LogisticRegression: (n_samples,) if n_classes == 2 else (n_samples,
n_classes)

C-SVC: (n_samples, n_classes * (n_classes-1) / 2)

We have to make sure what we want in order to well-define the interface.
@hsuantien https://github.com/hsuantien can you give us some advice on
this?

—
Reply to this email directly or view it on GitHub
#21.

yangarbiter · 2015-12-18T04:56:25Z

Actually the output of C-SVC differs with different multi class method (OVO, OVR).

yangarbiter · 2015-12-18T08:00:49Z

I tried to fix it in this branch https://github.com/ntucllab/libact/tree/predict_real_interface

Though I am not entirely sure the implementation of the largest margin method for now.

lsc36 · 2015-12-18T08:16:05Z

We should determine the interface before writing code. Is the "LogReg-style conversion" generally applicable?

yangarbiter · 2015-12-18T09:13:28Z

For binary classification case, svm and logReg-style are able to convert.

For multiclass case logReg-style supports only OVR method for SVM, but not OVO (it seems sklearn's logReg didn't support OVO).

As for other classifier, we might have to discuss case by case.

hsuantien · 2015-12-18T22:18:40Z

Let's use OVR-style for the interface now, I suggest. Thanks.

On Fri, Dec 18, 2015 at 5:13 PM, yangarbiter notifications@github.com
wrote:

For binary classification case, svm and logReg-style are able to convert.

For multiclass case logReg-style supports only OVR method for SVM, but not
OVO (it seems sklearn's logReg didn't support OVO).

As for other classifier, we might have to discuss case by case.

—
Reply to this email directly or view it on GitHub
#21 (comment).

Hsuan-Tien Lin htlin@csie.ntu.edu.tw

http://www.csie.ntu.edu.tw/~htlin

Associate Professor
Dept. of Computer Science and Information Engineering
& Graduate Institute of Networking and Multimedia

National Taiwan University

yangarbiter · 2015-12-22T14:06:58Z

I think for now we can make predict_real output ndarray with shape (n_sample, n_classes) (even n_classes=2)

but another thing might be defining the meaning of predict_real. For LogisticRegression and SVM like algorithm, their value may be more positive more towards label 1 and negative towards label -1.

How about other algorithms? Will they always be in this case?
@hsuantien

lsc36 · 2015-12-30T15:22:31Z

Consider as solved. Closing.

lsc36 added the bug label Dec 17, 2015

lsc36 added this to the v0.1 release milestone Dec 17, 2015

lsc36 mentioned this issue Dec 17, 2015

SVM: use scikit-learn instead of LIBSVM #15

Closed

lsc36 mentioned this issue Dec 21, 2015

scikit-learn model adapter #20

Closed

yangarbiter mentioned this issue Dec 22, 2015

Predict real interface #26

Merged

lsc36 closed this as completed Dec 30, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarify semantics of Model.predict_real #21

Clarify semantics of Model.predict_real #21

lsc36 commented Dec 17, 2015

hsuantien commented Dec 17, 2015

yangarbiter commented Dec 18, 2015

yangarbiter commented Dec 18, 2015

lsc36 commented Dec 18, 2015

yangarbiter commented Dec 18, 2015

hsuantien commented Dec 18, 2015

yangarbiter commented Dec 22, 2015

lsc36 commented Dec 30, 2015

Clarify semantics of Model.predict_real #21

Clarify semantics of Model.predict_real #21

Comments

lsc36 commented Dec 17, 2015

hsuantien commented Dec 17, 2015

yangarbiter commented Dec 18, 2015

yangarbiter commented Dec 18, 2015

lsc36 commented Dec 18, 2015

yangarbiter commented Dec 18, 2015

hsuantien commented Dec 18, 2015

http://www.csie.ntu.edu.tw/~htlin

National Taiwan University

yangarbiter commented Dec 22, 2015

lsc36 commented Dec 30, 2015