
time to ensemble!! #7

Closed
ClimbsRocks opened this issue Nov 20, 2015 · 1 comment

@ClimbsRocks
Owner

focus:
within the validations folder:

  1. read in each validation file.
  2. create a list of lists (predictionsAllRows).
  3. create a list for each row (predictionsForRow).
  4. append each algo's prediction to predictionsForRow.
  5. once we have read in all the predictions, read in the validation dataset (with all the features).
  6. append predictionsAllRows to our validationData using hstack.
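the steps above could be sketched roughly like this in python (the per-algo prediction lists here are made-up stand-ins for the files in the validations folder, and the feature matrix is a random placeholder):

```python
import numpy as np
from scipy.sparse import csr_matrix, hstack

# hypothetical stand-ins for the validation files: each inner list is
# one algo's predictions over the validation rows.
algo_predictions = [
    [0.9, 0.1, 0.8],   # e.g. random forest predictions
    [0.7, 0.2, 0.6],   # e.g. gradient boosting predictions
]

# steps 1-4: one predictionsForRow list per row; append each algo's
# prediction to it, collecting everything into predictionsAllRows.
n_rows = len(algo_predictions[0])
predictions_all_rows = [[] for _ in range(n_rows)]
for preds in algo_predictions:
    for row_idx, pred in enumerate(preds):
        predictions_all_rows[row_idx].append(pred)

# steps 5-6: the validation dataset with all the original features
# (random placeholder here), stacked column-wise with the predictions.
validation_data = csr_matrix(np.random.rand(n_rows, 4))
combined = hstack([validation_data, csr_matrix(predictions_all_rows)])

print(combined.shape)  # (3, 6): 4 original features + 2 prediction columns
```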

post-MVP:
break out the data at this point into validationTrain and validationTest.
test is just going to be our actual test data.

from there, run a RF over the data.
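a minimal sketch of the split + RF step, assuming scikit-learn; the feature matrix and labels below are synthetic placeholders for the combined validation data assembled above:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# synthetic stand-in for the validation features + first-round predictions
rng = np.random.RandomState(0)
X = rng.rand(100, 6)
y = (X[:, 0] + X[:, 4] > 1).astype(int)

# post-MVP: break the data out into validationTrain and validationTest
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

# run a RF over the data
rf = RandomForestClassifier(n_estimators=100, random_state=0)
rf.fit(X_train, y_train)
print(rf.score(X_test, y_test))
```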

post-post-post MVP:
run a modified version of machineJS over the data. see how we can train the best classifiers possible over the validation data and the predictions from the first round of machineJS.

then ensembler will simply average the results together.
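the averaging step itself is just a per-row mean over the second-round classifiers' outputs; a tiny illustration with made-up prediction vectors:

```python
import numpy as np

# hypothetical second-round predictions, one row per classifier;
# ensembler simply averages them per validation row.
round_two_predictions = np.array([
    [0.9, 0.8, 0.7],   # classifier A
    [0.5, 0.6, 0.9],   # classifier B
])

ensembled = round_two_predictions.mean(axis=0)
print(ensembled)  # [0.7 0.7 0.8]
```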

what this means:
leave the current ensembler flow untouched. we will use that again on the second round, once we have run things back through machineJS.
we need to create a new workflow in machineJS to accommodate this (no splitData or dataFormatting. that might be the only difference)

http://docs.scipy.org/doc/scipy-0.14.0/reference/generated/scipy.sparse.hstack.html

@ClimbsRocks
Owner Author

whew, finished!
