-
Notifications
You must be signed in to change notification settings - Fork 174
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Clarify semantics of Model.predict_real #21
Comments
I suggest that we consider the logistic regression type only. Technically, --HT SC Lee notifications@github.com 於 2015年12月17日 週四 下午9:22寫道:
|
Actually the output of C-SVC differs with different multi class method (OVO, OVR). |
I tried to fix it in this branch https://github.com/ntucllab/libact/tree/predict_real_interface Though I am not entirely sure the implementation of the largest margin method for now. |
We should determine the interface before writing code. Is the "LogReg-style conversion" generally applicable? |
For binary classification case, svm and logReg-style are able to convert. For multiclass case logReg-style supports only OVR method for SVM, but not OVO (it seems sklearn's logReg didn't support OVO). As for other classifier, we might have to discuss case by case. |
Let's use OVR-style for the interface now, I suggest. Thanks. On Fri, Dec 18, 2015 at 5:13 PM, yangarbiter notifications@github.com
Hsuan-Tien Lin htlin@csie.ntu.edu.tw http://www.csie.ntu.edu.tw/~htlinAssociate Professor National Taiwan University |
I think for now we can make predict_real output ndarray with shape (n_sample, n_classes) (even n_classes=2) but another thing might be defining the meaning of predict_real. For LogisticRegression and SVM like algorithm, their value may be more positive more towards label 1 and negative towards label -1. How about other algorithms? Will they always be in this case? |
Consider as solved. Closing. |
Currently
Model.predict_real
is connected topredict_proba
in scikit-learn, which returns an array of n_classes floats standing for probabilities of corresponding labels. Butdecision_function
is another candidate whose returning shapes vary from model to model, for example (in our case n_samples = 1):We have to make sure what we want in order to well-define the interface. @hsuantien can you give us some advice on this?
The text was updated successfully, but these errors were encountered: