Is there an Ability to explicitly define Distributions? #119

Closed
clj2567 opened this issue Sep 23, 2019 · 2 comments

clj2567 commented Sep 23, 2019

When I create synthetic rowsets using sample_rows, I noticed that it assumes my data follows a Normal distribution, which distorts the sampled values. For example, the secs elapsed field has no negative values in my original dataset, but in the synthetic data it takes values ranging from -4000 to +6000. Is there any way I can specify the distribution for each column?

csala added the question (General question about the software) label Oct 17, 2019
csala self-assigned this Oct 17, 2019

csala commented Oct 17, 2019

Hi @clj2567

There is a way to indicate which distribution to use, but currently this setting applies to all columns at once, so you cannot choose a custom distribution per column. This feature will be added in the future.
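
To make the idea of a per-column distribution concrete, here is a minimal standard-library sketch. It is not SDV's API: the column names, the `FITTERS` mapping, the helper names, and the choice of an exponential fit for the non-negative column are all assumptions made for illustration.

```python
import random
import statistics


def fit_normal(values):
    """Fit a normal distribution and return a function that draws from it."""
    mu = statistics.mean(values)
    sigma = statistics.stdev(values)
    return lambda rng: rng.gauss(mu, sigma)


def fit_exponential(values):
    """Fit an exponential distribution (rate = 1 / mean); draws are never negative."""
    rate = 1.0 / statistics.mean(values)
    return lambda rng: rng.expovariate(rate)


# Hypothetical per-column choice: each column gets its own distribution family.
FITTERS = {
    "secs_elapsed": fit_exponential,  # assumed name for the non-negative column
    "age": fit_normal,                # assumed second column, for contrast
}


def fit_columns(table, fitters):
    """Fit every column independently with the fitter chosen for it."""
    return {name: fitters[name](values) for name, values in table.items()}


def draw_rows(samplers, num_rows, seed=0):
    """Draw synthetic rows, one independent value per column."""
    rng = random.Random(seed)
    return [
        {name: draw(rng) for name, draw in samplers.items()}
        for _ in range(num_rows)
    ]


# Made-up example data, not taken from the original dataset.
table = {
    "secs_elapsed": [12.0, 340.0, 5.0, 2900.0, 77.0, 1500.0],
    "age": [23.0, 35.0, 41.0, 29.0, 52.0, 38.0],
}
samplers = fit_columns(table, FITTERS)
rows = draw_rows(samplers, 1000)
```

Because the exponential family has no mass below zero, the `secs_elapsed` values drawn here cannot be negative, whereas a normal fit on the same column could produce them. The sketch samples columns independently and ignores correlations between them, which a real synthetic data model would also need to capture.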

Also, regarding the negative sampled values, this would be better handled by the data constraints functionality that is also being developed. See #70
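
As a rough illustration of what a constraint on sampled values could look like, the sketch below enforces a lower bound by rejection sampling: draws from a fitted normal distribution that fall below the bound are discarded and redrawn. This is only one possible approach and is not taken from SDV or from #70; the function name, its parameters, and the example data are assumptions.

```python
import random
import statistics


def sample_with_lower_bound(values, num_rows, lower=0.0, seed=0, max_tries=100_000):
    """Sample from a normal fit of `values`, rejecting draws below `lower`."""
    mu = statistics.mean(values)
    sigma = statistics.stdev(values)
    rng = random.Random(seed)
    accepted = []
    tries = 0
    while len(accepted) < num_rows:
        if tries >= max_tries:
            # Guard against bounds that almost no draw can satisfy.
            raise RuntimeError("too many rejected draws")
        tries += 1
        value = rng.gauss(mu, sigma)
        if value >= lower:
            accepted.append(value)
    return accepted


# Made-up, skewed, non-negative data standing in for an elapsed-seconds column.
observed = [12.0, 340.0, 5.0, 2900.0, 77.0, 1500.0]

# Unconstrained normal sampling: negative values appear.
rng = random.Random(0)
mu = statistics.mean(observed)
sigma = statistics.stdev(observed)
unconstrained = [rng.gauss(mu, sigma) for _ in range(1000)]

# Constrained sampling: every value respects the lower bound.
constrained = sample_with_lower_bound(observed, 1000)
```

Note that rejection changes the sampled distribution into a truncated normal, so the mean of the constrained values shifts upward relative to the fitted mean. Whether that trade-off is acceptable depends on the use case.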

I hope this clarifies your doubts!

csala commented Oct 13, 2020

This topic is now fully covered by the recently introduced tabular models, the constraints, and the new HMA1 relational algorithm.

csala closed this as completed Oct 13, 2020