The HMA1 algorithm uses the GaussianCopula class, which by default uses the one_hot_encoding categorical transformer.
This can cause high-dimensionality problems when modeling tables that have many categorical columns with many levels, as reported in #209
To fix this, the default model_kwargs for the tabular model should set the categorical transformer to categorical_fuzzy, which does not create new columns.
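A minimal pandas sketch (independent of SDV; the table and column names are hypothetical) of why one-hot encoding inflates dimensionality while a single-column encoding does not:

```python
# Illustrative sketch, not SDV code: one-hot encoding adds one column per
# (column, level) pair, so column count grows with the number of levels.
import pandas as pd

# Hypothetical table: 3 categorical columns, each with 4 distinct levels.
df = pd.DataFrame({
    f"cat_{i}": [f"level_{j}" for j in (0, 1, 2, 3)]
    for i in range(3)
})

one_hot = pd.get_dummies(df)  # expands each categorical column into one column per level
print(df.shape[1], one_hot.shape[1])  # 3 original columns -> 12 one-hot columns
```

A transformer like categorical_fuzzy instead maps each category to a numeric value within a single column, so the column count stays fixed regardless of how many levels each categorical column has.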
I have high-dimensional data with over 2000 categorical variables, but I still have memory problems. I wonder what the limit of the fuzzy transformer is. Here is my code:
from sdv.relational import HMA1
fuzzy = {'categorical_transformer': 'categorical_fuzzy'}
model = HMA1(metadata1, model_kwargs=fuzzy)