How to treat missing continuous data in data set #39

Closed
peter-jay-aut opened this issue Apr 1, 2020 · 3 comments
Labels
question General question about the software

Comments

@peter-jay-aut

  • CTGAN version: 0.2.2
  • Python version: 3.6.8
  • Operating System: Tensorflow Docker

Description

Hi, I'm curious how to treat missing continuous values in a training dataset. The placeholder used in the demo file ('?') for missing discrete values won't work for continuous columns. Any suggestions on how to deal with this issue?
Kind Regards!

@leix28
Collaborator

leix28 commented Apr 13, 2020

Hi @peter-jay-aut

CTGAN currently does not support missing values. Using "?" is not the best way to handle missing values in discrete columns either, because "?" is treated as a normal category. I would suggest running missing value imputation before training CTGAN.

We may consider adding missing value imputation in the future.
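
A minimal sketch of what imputing before training could look like, using plain pandas. Nothing here is part of CTGAN: the helper name `impute_missing`, the `_missing` indicator columns, and the choice of median/mode imputation are assumptions made for illustration, and it assumes every column has at least one observed value. More careful imputation methods may suit your data better.

```python
import numpy as np
import pandas as pd


def impute_missing(df, discrete_columns, placeholder="?"):
    """Return a copy of df without missing values (hypothetical helper)."""
    out = df.copy()
    for col in df.columns:
        if col in discrete_columns:
            # Treat the placeholder as missing, then fill with the most
            # frequent observed category.
            values = out[col].mask(out[col] == placeholder)
            out[col] = values.fillna(values.mode().iloc[0])
        else:
            # Non-numeric entries (e.g. "?") become NaN.
            values = pd.to_numeric(out[col], errors="coerce")
            # Keep a 0/1 flag so the information "was missing" is not lost.
            out[col + "_missing"] = values.isna().astype(int)
            # Fill the gaps with the column median.
            out[col] = values.fillna(values.median())
    return out


# Toy data: one continuous and one discrete column, each with a gap.
df = pd.DataFrame({"age": [20.0, np.nan, 40.0], "job": ["a", "?", "a"]})
clean = impute_missing(df, discrete_columns=["job"])
```

The resulting frame contains no NaN values and can then be used for training. If you keep the `_missing` flags, they would need to be listed among the discrete columns as well.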

@leix28 leix28 added the internal The issue doesn't change the API or functionality label Apr 13, 2020
@peter-jay-aut
Author

Thank you very much for your reply!

@csala csala added question General question about the software and removed internal The issue doesn't change the API or functionality labels Jun 22, 2020
@csala
Contributor

csala commented Jun 22, 2020

Closing this issue, as the question has already been answered.

I would also add that if you want to work with data that is not clean and completely ready for modeling, you might want to have a look at the sibling library, SDV.
