
Ensembles parallel #1116

Merged
merged 26 commits into from
Mar 10, 2017

Conversation

zmjones
Contributor

@zmjones zmjones commented Aug 11, 2016

this is a clone of #615 in a branch in this repo so that @PhilippPro can help me fix the multilabel breakage

@zmjones
Contributor Author

zmjones commented Aug 11, 2016

thanks for the help @PhilippPro

@berndbischl
Sponsor Member

  1. please remove the stacking stuff. we have that completely refactored in another PR; this will create too many problems. store the diff, keep the branch, and document for now that it is not parallel.

  2. make sure that users understand, in the tutorial, what the current parallelization level now does. list the ensemble learners there.

  3. remove the partial dependence stuff, but do a new PR for this. it seems nearly finished.

  4. remove the ensemble prediction stuff. we will do general parallel prediction.

 - skipped stacking due to incoming refactoring
 - skipped prediction (left in branch) due to proposed
   parallelization of all prediction functions
@zmjones
Contributor Author

zmjones commented Aug 11, 2016

all that should be done now (minus tutorials and new pr)

@jakob-r
Sponsor Member

jakob-r commented Aug 11, 2016

there might still be some cleanup necessary. but i am out for the moment.

@zmjones
Contributor Author

zmjones commented Aug 12, 2016

@PhilippPro looks like the remaining errors are related to the multilabel stuff. help?

@PhilippPro
Member

ok. first have to look on my own stuff but will try it later.


@zmjones
Contributor Author

zmjones commented Aug 12, 2016

thank you!

@zmjones
Contributor Author

zmjones commented Aug 17, 2016

if someone else wants to finish this up i would be grateful. i don't know the multilabel stuff (which is what i think is broken).

@PhilippPro
Member

PhilippPro commented Aug 17, 2016

I think there are more things that do not work. As you can see in the current Travis build, the error also occurs in generatePartialDependence and MulticlassWrapper (and MultilabelDBRWrapper). I just solved the problem with the multilabel example in the previous commit.

@zmjones
Contributor Author

zmjones commented Aug 17, 2016

ah ok i will take a look at it again then. thanks

@PhilippPro
Member

I can look at the multilabel part if everything else is ok. It's just unpleasant to look at these errors, because you have to step deeply into the infrastructure of predictLearner, predictLearner2, etc. to find the problems, and it takes a long time.

@jakob-r
Sponsor Member

jakob-r commented Sep 6, 2016

I fixed many misnamed variables and other stuff. But I cannot figure out why the "MultilabelDBRWrapper" fails if it wasn't even touched!

@PhilippPro
Member

It depends on MultilabelBinaryRelevanceWrapper and this was touched.

@jakob-r
Sponsor Member

jakob-r commented Sep 6, 2016

Oh, I indeed overlooked this. I guess I know the issue then.

@jakob-r
Sponsor Member

jakob-r commented Feb 6, 2017

Will auto merge in 24 hours if no objections are raised.

@berndbischl
Sponsor Member

Will auto merge in 24 hours if no objections are raised.

no, you can raise concerns that we are wasting your time. shout at me.
but this i really don't want.

i will try to review now.

@berndbischl
Sponsor Member

the indentation is wrong in several places.

more importantly:
where are the docs for what is parallelized now?

@jakob-r
Sponsor Member

jakob-r commented Feb 7, 2017

True, documentation is totally missing. Any hint of where it would make sense to add it? Just add a note for each wrapper that it can be parallelized with the specific tag?

indentation... 🙄 I will take care of it

@jakob-r jakob-r self-assigned this Feb 7, 2017
@berndbischl
Sponsor Member

berndbischl commented Feb 7, 2017

True, documentation is totally missing. Any hint of where it would make sense to add it? Just add a note for each wrapper that it can be parallelized with the specific tag?

well, i think it needs to go here:
https://mlr-org.github.io/mlr-tutorial/release/html/parallelization/index.html

BUT:

I think we should also have this in R. makes it much easier to review stuff like this, whether it is "complete". and complete documentation about behavior of options should always be available in R.

but we don't want to copy-paste docs to 2 places. I would suggest:
we create a mini doc page ?mlrParallel, copy the tutorial LEVEL definition info there, and you add your stuff.
EDIT: and we remove the parallel levels table from the tutorial and simply link to it

for completeness we can add a sentence and a link to the wrapper that is affected by your change here.

what do you think?
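For the tutorial side, a minimal sketch of what the user-facing usage might look like. This is illustrative, not taken from the PR: the level name "mlr.ensemble" and the bagging example are assumptions about how the new level could be registered and exercised.

```r
library(parallelMap)
library(mlr)

# Restrict parallelization to the (assumed) ensemble level, so only the
# ensemble members are fitted in parallel, not e.g. resampling iterations.
parallelStartSocket(2, level = "mlr.ensemble")

lrn = makeBaggingWrapper(makeLearner("classif.rpart"), bw.iters = 10)
mod = train(lrn, iris.task)

parallelStop()
```

Passing `level` to `parallelStart*` is the standard parallelMap mechanism for scoping parallelism, so a `?mlrParallel` page listing the registered level names would make sketches like this discoverable from within R.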

@jakob-r
Sponsor Member

jakob-r commented Feb 24, 2017

So. Documentation is done, tests are running. Should be merged soon.

@jakob-r jakob-r removed their assignment Feb 24, 2017
@jakob-r
Sponsor Member

jakob-r commented Feb 27, 2017

@berndbischl ping


@larskotthoff larskotthoff left a comment


It would be good to extend the tests to check the predictions as well.

@jakob-r jakob-r merged commit a49bf1f into master Mar 10, 2017
@jakob-r
Sponsor Member

jakob-r commented Mar 10, 2017

We agreed that we do not need to check the result of the predictions, as the parallelMap code is also run in the normal tests of the ensembles.
Actually it can be argued that the parallel test can be seen as superfluous, as it just tests the parallelMap behavior.

@mllg mllg deleted the ensembles_parallel branch March 10, 2017 14:37
@berndbischl
Sponsor Member

Actually it can be argued that the parallel test can be seen as superfluous, as it just tests the parallelMap behavior.

not really. i detected many bugs with this test that were not parallelMap bugs but problems in mlr.
most problematic thing: exporting options to the slaves and so on.
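The export issue mentioned above can be made concrete with a small parallelMap sketch (illustrative, not from the PR): with a socket backend, objects that exist only on the master are not visible on the workers and must be shipped explicitly, which is exactly the kind of bug a dedicated parallel test catches.

```r
library(parallelMap)

parallelStartSocket(2)

threshold = 0.5               # exists only in the master session so far
parallelExport("threshold")   # without this, the workers cannot see it

res = parallelMap(function(x) x > threshold, 1:4)

parallelStop()
```

The same mechanism applies to mlr's global options: anything the ensemble code reads on a worker has to be exported to the slaves first, so a test that only passes sequentially can still fail in parallel.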

7 participants