Information on how these datasets were chosen #2

ledell · 2020-10-05T18:09:12Z

Hi there,

I realize this benchmark is a few years old now, but can you explain how these datasets from OpenML were selected for this benchmark? If they were not randomly selected (using a seed, sampling from OpenML ids), then it would be good to know how/why each dataset was chosen to be included in the benchmark. Thanks!

pplonski · 2020-10-06T06:29:06Z

Hey @ledell! Good question, I've probably taken them from one of the Frank Hutter articles (right now I don't remember which one, probably about auto-sklearn).

ledell · 2020-10-07T00:01:36Z

Ok, thanks for the info! I'll take a look at the paper and see if they match up.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Information on how these datasets were chosen #2

Information on how these datasets were chosen #2

ledell commented Oct 5, 2020

pplonski commented Oct 6, 2020

ledell commented Oct 7, 2020

Information on how these datasets were chosen #2

Information on how these datasets were chosen #2

Comments

ledell commented Oct 5, 2020

pplonski commented Oct 6, 2020

ledell commented Oct 7, 2020