We need to figure out how we handle character features and levels #275

mb706 · 2019-10-01T22:41:33Z

We should probably treat character features as something that may have an unlimited number of levels, leave it out from encoding, fixing factors etc. For factors and ordereds we should trust that levels(task$data()[[feature]]) is the same as task$levels()[[feature]]. In that case we can remove the "levels" argument from the data.table functions of PipeOpTaskPreproc.

The text was updated successfully, but these errors were encountered:

mllg · 2019-10-22T09:02:44Z

We can do this, if this helps. Currently blocked by mlr3db: Many databases do not provide a native type for factors, everything is a character. At least we need an option there to auto-convert character -> factor in the backend.

mb706 · 2020-02-10T17:24:36Z

Since mlr-org/mlr3#369 @mllg do you believe this is no longer blocked?

mllg · 2020-02-10T17:30:01Z

I guess so.

mb706 · 2020-06-21T23:35:10Z

We now consistently handle character features as features without levels.

mllg added Priority: Medium Status: Blocked Type: Enhancement Type: Maintenance labels Oct 22, 2019

mb706 removed the Status: Blocked label Feb 10, 2020

mb706 self-assigned this Feb 10, 2020

mb706 closed this as completed Jun 21, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

We need to figure out how we handle character features and levels #275

We need to figure out how we handle character features and levels #275

mb706 commented Oct 1, 2019

mllg commented Oct 22, 2019

mb706 commented Feb 10, 2020

mllg commented Feb 10, 2020 •

edited

mb706 commented Jun 21, 2020

We need to figure out how we handle character features and levels #275

We need to figure out how we handle character features and levels #275

Comments

mb706 commented Oct 1, 2019

mllg commented Oct 22, 2019

mb706 commented Feb 10, 2020

mllg commented Feb 10, 2020 • edited

mb706 commented Jun 21, 2020

mllg commented Feb 10, 2020 •

edited