You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If I am using the CTGANsynthesizer, then there is no need to apply any transformers to discrete data. Currently, the data preprocessor assigns None (no transformer) to categorical columns but it assigns LabelEncoder(add_noise=True) to boolean columns. This makes it inefficient to model boolean columns, as the CTGANSynthesizer then one hot encodes all those values (see #1450 as to why)
Workaround
Just update the sdtype from boolean to categorical and then the preprocessing will work as intended.
Environment Details
Error Description
If I am using the CTGANsynthesizer, then there is no need to apply any transformers to discrete data. Currently, the data preprocessor assigns
None
(no transformer) to categorical columns but it assignsLabelEncoder(add_noise=True)
to boolean columns. This makes it inefficient to model boolean columns, as the CTGANSynthesizer then one hot encodes all those values (see #1450 as to why)Workaround
Just update the sdtype from
boolean
tocategorical
and then the preprocessing will work as intended.Steps to reproduce
Output:
The text was updated successfully, but these errors were encountered: