Requested changes made #17

chriswbartley · 2017-11-24T02:11:59Z

Christoph, I finally got to do all those changes. I have done everything as requested with the exception of adding the 'Cs' parameter into fit(). I put that in the constructor for RuleFit to match all the sklearn standard. Also, FYI LassoCV uses alphas and n_alphas so I had to convert Cs to alphas=1/Cs and n_alphas as needed.

Hopefully I haven't missed anything.

Cheers, Chris

- Fix: uses binomial (log) loss for classification - Added: use of Friedman standardisation on linear variables (Winsorised and scaled by 0.4/stdev) - Added: use of Friedman randomisation of number of terminal nodes using exponential distrbution - Fixed: use of set for rules sometimes caused wrong coeficients to be associiated with the wrong rules! Rules are now stored as a list (ie ordered) - Added ability for certain features to be constrained monotone (upcoming paper!) - Improved: sped up prediction by not evaluating rules with zero coefficients - Added: Max rules parameter like Friedman - Added: Invisible use of BoostingRegressor/Classifier (created according to constructor parameters)

Added a lot of features to make it more like the original paper, and the interface like Friedmans R implementation (http://statweb.stanford.edu/~jhf/r-rulefit/RuleFit_help.html) - Added: binomial (log) loss for classification (using glmnet_py) - Added: use of Friedman standardisation on linear variables (Winsorised and scaled by 0.4/stdev) - Added: use of Friedman randomisation of number of terminal nodes using exponential distrbution - Fixed: use of a set (i.e. unordered) for rules sometimes caused wrong coeficients to be associiated with the wrong rules! Rules are now stored as a list (ie ordered) - Improved: sped up prediction by not evaluating rules with zero coefficients - Added: Max rules parameter like Friedman - Added: Invisible use of BoostingRegressor/Classifier (created according to constructor parameters, like Friedman's R implementation)

Updated comments to describe rulefit constructor. Wherever possible it now matches Friedman's R library (http://statweb.stanford.edu/~jhf/r-rulefit/RuleFit_help.html).

Removed some testing guff

chriswbartley · 2017-11-24T02:21:33Z

Oops, I'm new to github I just realised I didn't need to close the last pull request for it to update...

christophM · 2017-11-24T11:38:28Z

Great! Thanks a lot for this contribution

chriswbartley and others added 8 commits September 1, 2017 14:00

Updated boston example to show current usage

fff44db

Basic Documentation included

166074d

Updated comments to describe rulefit constructor. Wherever possible it now matches Friedman's R library (http://statweb.stanford.edu/~jhf/r-rulefit/RuleFit_help.html).

Now uses scikit LogisticRegressionCV

2848a65

Tidied Up

aa0976c

Removed some testing guff

Most requested changes made

3462bcc

Final requested changes

84ea849

christophM merged commit 20350e9 into christophM:master Nov 24, 2017

christophM mentioned this pull request Nov 29, 2017

Implement trees with with random depth #4

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Requested changes made #17

Requested changes made #17

chriswbartley commented Nov 24, 2017

chriswbartley commented Nov 24, 2017

christophM commented Nov 24, 2017

Requested changes made #17

Requested changes made #17

Conversation

chriswbartley commented Nov 24, 2017

chriswbartley commented Nov 24, 2017

christophM commented Nov 24, 2017