Update of the GLM models, OLS+Binary #31

tlienart · 2019-08-13T17:42:54Z

Probabilistic OLS regression
Probabilistic binary classification (probit, logit, ...)

Future

(will be another pr)

add count regression (Poisson)
add multinomial classification (in another PR, after Feature Request: Multiclass Classification, MLJ.jl#175)

Notes

decode is now used properly; thanks for the feedback.
MLJBase 0.4 seems to require different metadata (scitype_union -> scitype, no input_multivariate, etc.), so I added a few fixes for those. Since 0.4 has not been merged yet but will be soon, I've added a statement that checks the version and acts accordingly; it will be easy to remove later on.
- This caused a few missed commits, where things were passing locally but not on Travis. I'll bring this up in the next call; it mostly seems to be a versioning issue.

* proper use of decode git commit -am

tlienart · 2019-08-19T20:52:06Z

I'll wait until you've pushed your changes corresponding to 0.4; then I'll merge master here, clean up, and re-submit :)

tlienart · 2019-08-22T20:43:06Z

OK, it passes on Julia 1.0 and 1.1 on Travis. It fails on linux-nightly, apparently due to an issue with the conda installation as far as I can tell, but it works on mac-nightly, so I reckon the failure can be ignored.

The PR should be ok for reviewing @ablaom

src/GLM.jl

ablaom

This is really great. My annotations are just editorial.

I see that the old OLS, which was deterministic, is gone. I guess this is because you can just use NormalRegressor with predict_mean?

Since we are changing names anyway, I would prefer to insert "Linear" in all the names. I mean, we have "DecisionTreeRegressor", and so forth. In the context of 50-odd models, "NormalRegressor" is a bit mysterious. So LinearNormalRegressor, LinearBinaryClassifier, and so forth?

Can I suggest adding "Linear" to the names to distinguish these Regressors and Classifiers from other Regressors and Classifiers (e.g., DecisionTreeRegressor/Classifier)? Also, I suggest the name reflect the scitype rather than the distribution being fit (one model might fit multiple distributions, but we generally have a separate model for each target scitype). Here are my suggestions:

LinearRegressor (for what is now NormalRegressor)
LinearBinaryClassifier as now
LinearCountRegressor (instead of PoissonRegressor)
LinearMulticlassRegressor (for targets with possibly more than one level, unordered)

Please don't use "Multivariate" for multi-level targets, as I expect this would cause a lot of confusion. I expect "Multivariate" would normally refer to more than one feature (for inputs, the usual case) or more than one target column (for targets). At least, that is how the term is used in the documentation.

These are my suggestions. Go ahead and do what you feel is best. This is ready to pull. Before registering and tagging a new release, we might want to wait for the Count and Multiclass models, yes?

tlienart · 2019-08-26T15:01:00Z

Ok, I've implemented your suggestions; will wait for tests to pass and merge.

Re Count, I'll do that in another PR.

Re Multiclass, is that supported now? Could you point me to the equivalent of MultivariateFinite? Thanks!
Edit: oh ok there's coerce(Multiclass, y), I'll try using that

ablaom · 2019-08-26T19:56:56Z

Never mind about MultivariateFinite; I had forgotten our earlier discussion.

I'm puzzled why you have moved LIBSVM from an optional dependency to an actual dependency.

Update of the GLM models, OLS+Binary

ff06e96

tlienart mentioned this pull request Aug 17, 2019

Implement MLJ interface for linear models JuliaAI/MLJ.jl#35

Closed

7 tasks

tlienart added 2 commits August 17, 2019 17:04

adding RDatasets in the extras

de80f98

# Fixes to GLM PR + metadata fixes

c860522

* proper use of decode git commit -am

tlienart marked this pull request as ready for review August 19, 2019 09:44

tlienart requested a review from ablaom August 19, 2019 09:44

tlienart added 4 commits August 19, 2019 12:43

fixing mljbase 0.3-0.4 errors

be2559a

more 0.3-0.4 fix

e21b9f8

tests is not broken with 0.3

396d48c

reverting placement of LIBSVM in proj.toml

23b9141

tlienart removed the request for review from ablaom August 19, 2019 20:52

tlienart added 2 commits August 22, 2019 21:51

merge master here

3a425e7

GLM PR cleanup, ok locally

8f7df81

tlienart requested a review from ablaom August 22, 2019 20:43

ablaom reviewed Aug 26, 2019

src/GLM.jl (outdated, resolved)

ablaom reviewed Aug 26, 2019

src/GLM.jl (resolved)

ablaom approved these changes Aug 26, 2019

tlienart added 2 commits August 26, 2019 16:31

Merge branch 'master' into glm_2

9db7411

implementing requested changes

a241efc

tlienart merged commit a79056a into master Aug 26, 2019

tlienart deleted the glm_2 branch September 10, 2019 13:37
