Need for programmatic setup of datasets #14

Closed
lukehutch opened this issue May 5, 2016 · 2 comments

Comments

@lukehutch

Is there any way to build datasets in memory using the API, or do I need to write a file out to disk and read it back in?

I tried creating a dataset using the API, but the methods and constructors of Attribute are not visible, so I can't create a List<Attribute>, so I can't create an Instances object, so I can't create cross-validation folds.

@yinlou
Owner

yinlou commented May 5, 2016

Yes. Note that Attribute is an abstract class, so you can't create an instance of it directly, but you can create a NumericalAttribute or NominalAttribute. Remember to set the class attribute. For cross-validation, you can use InstancesSplitter under mltk.core.processor.
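
For readers who want to see the shape of this, here is a rough, library-independent sketch of the pattern described above: an abstract attribute type with concrete numeric and nominal subclasses, a dataset built in memory with a class attribute set, and a simple split into cross-validation folds. Every name in it (Attr, NumericAttr, NominalAttr, Dataset, splitIntoFolds) is a hypothetical stand-in invented for illustration. None of it is MLTK's API, so check the actual constructors of NumericalAttribute, NominalAttribute, Instances, and InstancesSplitter in the MLTK source before adapting it.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Random;

// Illustrative stand-ins only; none of these types are MLTK classes.
class InMemoryDatasetSketch {

    // Abstract base type: it cannot be instantiated, only subclassed.
    abstract static class Attr {
        final String name;
        Attr(String name) { this.name = name; }
    }

    // Concrete attribute holding real-valued data.
    static final class NumericAttr extends Attr {
        NumericAttr(String name) { super(name); }
    }

    // Concrete attribute with a fixed set of named states.
    static final class NominalAttr extends Attr {
        final List<String> states;
        NominalAttr(String name, String... states) {
            super(name);
            this.states = Arrays.asList(states);
        }
    }

    // In-memory dataset: feature attributes, a class attribute, and rows.
    static final class Dataset {
        final List<Attr> attributes;
        final Attr classAttribute;
        final List<double[]> rows = new ArrayList<>();
        Dataset(List<Attr> attributes, Attr classAttribute) {
            this.attributes = attributes;
            this.classAttribute = classAttribute;
        }
    }

    // Builds a small dataset without touching the file system.
    static Dataset buildDataset() {
        List<Attr> attributes = new ArrayList<>();
        attributes.add(new NumericAttr("height"));
        attributes.add(new NominalAttr("color", "red", "green"));
        Attr label = new NominalAttr("label", "no", "yes");
        Dataset data = new Dataset(attributes, label);
        for (int i = 0; i < 10; i++) {
            // Row layout: height, color state index, class state index.
            data.rows.add(new double[] {150.0 + i, i % 2, i % 2});
        }
        return data;
    }

    // Shuffles the rows with a fixed seed and deals them round-robin into k folds.
    static List<List<double[]>> splitIntoFolds(Dataset data, int k, long seed) {
        List<double[]> shuffled = new ArrayList<>(data.rows);
        Collections.shuffle(shuffled, new Random(seed));
        List<List<double[]>> folds = new ArrayList<>();
        for (int i = 0; i < k; i++) {
            folds.add(new ArrayList<>());
        }
        for (int i = 0; i < shuffled.size(); i++) {
            folds.get(i % k).add(shuffled.get(i));
        }
        return folds;
    }

    public static void main(String[] args) {
        Dataset data = buildDataset();
        List<List<double[]>> folds = splitIntoFolds(data, 5, 42L);
        for (int i = 0; i < folds.size(); i++) {
            System.out.println("fold " + i + " size: " + folds.get(i).size());
        }
    }
}
```

In the real library the fold creation would go through InstancesSplitter rather than a hand-written helper like splitIntoFolds; the helper here only shows what the split does conceptually.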

@lukehutch
Author

OK, thanks for the explanation. I didn't see those concrete classes. It would be good to have documentation about this use case.
