New classifiers #147
Conversation
    Also adds the classifier's predictions as a 'SyntheticFeature' column.
    """
    return self._train_model_and_predict(input_df, AdaBoostClassifier,
`learning_rate` should be capped at > 0. Looking at the docs further, I also think that for the AdaBoostClassifier, we should allow `n_estimators` to be evolved as well, with a max of 500 estimators. The AdaBoostClassifier is one unique case where there is a tradeoff between `n_estimators` and `learning_rate`.
Should I just do `max(learning_rate, 0.001)`? Not sure what the exact minimum should be here.
Check `xgradient_boosting` for an example: https://github.com/rhiever/tpot/blob/master/tpot/tpot.py#L582

`learning_rate = max(0.0001, learning_rate)`

And for the `C` param, check `_logistic_regression` for an example: https://github.com/rhiever/tpot/blob/master/tpot/tpot.py#L516

`C = max(0.0001, C)`

0.0001 seems like a fine minimum value for now until we finish the sklearn benchmark and figure out an ideal range.
When writing test cases for classifiers, test with normal parameters as well as extreme parameters: negative values, out-of-bounds values, etc. That will help catch issues where we're allowing invalid parameters to be passed to the various models.

Now addresses #151
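The extreme-parameter testing suggested above could be sketched like this; the clamp helper is illustrative (mirroring the `max(0.0001, learning_rate)` pattern from the thread), not TPOT's actual test suite:

```python
import numpy as np
from sklearn.ensemble import AdaBoostClassifier

def clamp_learning_rate(learning_rate):
    # Illustrative clamp, following the max(0.0001, learning_rate)
    # pattern discussed in this thread.
    return max(0.0001, learning_rate)

def test_extreme_learning_rate():
    # A tiny linearly separable dataset is enough to exercise fit/predict.
    X = np.array([[0.0], [1.0], [2.0], [3.0]])
    y = np.array([0, 0, 1, 1])
    # Negative or zero learning rates would raise inside sklearn;
    # clamping first keeps the pipeline from crashing on evolved values.
    for raw in (-1.0, 0.0, 0.5):
        clf = AdaBoostClassifier(learning_rate=clamp_learning_rate(raw),
                                 n_estimators=10)
        clf.fit(X, y)
        assert set(clf.predict(X)) <= {0, 1}
```

Parametrizing over both normal and out-of-bounds values in one loop catches the case where a guard exists for one parameter but not another.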
What does this PR do?
Adds 7 new classifiers to TPOT
Where should the reviewer start?
At the `_ada_boost()` method in the TPOT class, going all the way down to the `_p_aggr()` method.
There is also relevant export code and docs that should be checked for an LGTM.
How should this PR be tested?
Travis should test the classifiers themselves, but a few pipelines could be made and exported to confirm that the code indeed works.
What are the relevant issues?
#128
Questions:
Yes. I have already updated them.
No.