New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TPOT command line usage help #34
Comments
Hi @SimplyAhmazing, The first thing I should clarify is that TPOT will be quite slow on large data sets such as the full MNIST data set. Using the default TPOT settings, each iteration of the algorithm is evaluating 100 pipelines on the training set, many of which are training multiple classifiers on the data. This statement is true of most Evolutionary Computation-based methods, where it's not uncommon to allow the algorithm to run for several hours, days, or even weeks. Your best bet is to set TPOT to running on the data set and give it a couple days to crunch on the data. Regarding outputting the pipeline: Currently, TPOT on the command line only outputs the best pipeline in terms of TPOT functions at the end of the run. If you terminate a command line version early, you won't see the final pipeline. I've raised #36 as a suggestion to fix that. We're still working on outputting the pipelines as sklearn code. You can see the latest on this branch. It turned out to be quite tricky to convert these pipelines to useable Python code, so that feature is somewhat delayed. |
@rhiever Thanks! I figured that it would indeed be slow to run iterations on the training set. I think this is a really cool project. |
I downloaded a sample mnist data set into a CSV and installed TPOT and all the dependencies.
I tried running it through the command line and below is the command I ran and the results I got,
It took a couple of hours until TPOT returned the stats summary and then after an hour or so it was still running so I terminated it. I'm curious as to what a TPOT run looks like? And for some reason I was expecting code to be written to a directory,
Or maybe this python source is printed to the terminal? Going to start reading the tpot source more thoroughly.
The text was updated successfully, but these errors were encountered: