Prevent TPOT from creating multiple copies of the data set (except at leaf nodes) #78

Closed
rhiever opened this issue Feb 8, 2016 · 1 comment

Comments

Contributor

rhiever commented Feb 8, 2016

In many cases, TPOT creates multiple copies of the data set as it runs through multiple pipeline operators. The only time TPOT should need to hold more than one copy is when a pipeline has multiple leaf nodes, each of which introduces its own copy. Make sure TPOT cleans up data set copies as it processes them.
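
To illustrate the idea, here is a rough sketch of evaluating a pipeline tree so that only leaf nodes copy the source data and each operator's inputs are released once it has produced its output. This is not TPOT's actual code: `Node`, `evaluate`, and the `live` counter are hypothetical names made up for the example, and plain Python lists stand in for the real data set.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional

Dataset = List[float]  # stand-in for a real feature matrix (assumption)


@dataclass
class Node:
    """Hypothetical pipeline node: a leaf loads data, an operator transforms inputs."""
    name: str
    op: Optional[Callable[..., Dataset]] = None
    children: List["Node"] = field(default_factory=list)


def evaluate(node: Node, source: Dataset, live: List[int]) -> Dataset:
    """Evaluate the tree bottom-up while counting live data set copies.

    live[0] is the number of copies currently held, live[1] is the peak.
    """
    if not node.children:
        # Leaf node: the only place a fresh copy of the source data is made.
        result = list(source)
        live[0] += 1
        live[1] = max(live[1], live[0])
        return result

    inputs = [evaluate(child, source, live) for child in node.children]
    result = node.op(*inputs)
    live[0] += 1
    live[1] = max(live[1], live[0])

    # The operator is done with its inputs, so drop them right away
    # instead of keeping every intermediate result around.
    live[0] -= len(inputs)
    del inputs
    return result


# Two leaves feed one combining operator; one leaf is scaled first.
pipeline = Node(
    "add",
    op=lambda a, b: [x + y for x, y in zip(a, b)],
    children=[
        Node("scale", op=lambda a: [2 * x for x in a], children=[Node("leaf_a")]),
        Node("leaf_b"),
    ],
)

live = [0, 0]
output = evaluate(pipeline, [1.0, 2.0, 3.0], live)
print(output, "copies still held:", live[0], "peak:", live[1])
```

With this bookkeeping, only the final result is still held once evaluation finishes, and the peak is reached where the two leaf branches meet, which is the one case where extra copies are expected.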

Contributor Author

rhiever commented Aug 19, 2016

This will be in the 0.5 release because we work directly with sklearn Pipelines now.
