Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Separate datasets from MLChallenge #205

Closed
pvk-developer opened this issue Jul 27, 2020 · 1 comment · Fixed by #204
Closed

Separate datasets from MLChallenge #205

pvk-developer opened this issue Jul 27, 2020 · 1 comment · Fixed by #204
Assignees
Labels
internal improvement Improvements that do not change the public API
Milestone

Comments

@pvk-developer
Copy link
Collaborator

  • BTB version: 3.11
  • Python version: 3.6
  • Operating System: Ubuntu 20.04

Description

We want to separate the datasets list from MLChallenge in order to keep everyting more organized.
Additionally, ensure that we have the OpenML 100 datasets and create a separated list for them.

Proposal

Create a datasets.py module in benchmark/btb_benchmark/challenges/.
Create two lists inside:

  1. One that contains all the datasets sorted by execution time, slowest first.
  2. One that contains the OpenML 100 datasets

Then create a function that returns one of those collections.

@pvk-developer pvk-developer added the internal improvement Improvements that do not change the public API label Jul 27, 2020
@pvk-developer pvk-developer added this to the 0.3.12 milestone Jul 27, 2020
@pvk-developer pvk-developer self-assigned this Jul 27, 2020
@pvk-developer
Copy link
Collaborator Author

Solved at #204

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
internal improvement Improvements that do not change the public API
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant