Skip to content

How are synthetic datasets generated? #252

Discussion options

You must be logged in to vote

Oftentimes synthetic datasets are generated with GAN models, generative adversarial networks. GANs are independent from differential privacy. They consist of two sub-networks, a generator and a discriminator. The generator feeds into the discriminator.

The generator has a small number of input nodes and widens over multiple layers to the number of columns in your dataset. The generator learns the distribution of the underlying data such that, when you feed it a batch of noise, it emits a batch of synthetic data.

The discriminator is the opposite shape- it has just as many inputs as the generator has outputs, and narrows down into a single output node that discriminates if the input is fak…

Replies: 3 comments 8 replies

Comment options

You must be logged in to vote
2 replies
@Shoeboxam
Comment options

@Shoeboxam
Comment options

Answer selected by anniewu332
Comment options

anniewu332
Sep 1, 2021
Maintainer Author

You must be logged in to vote
1 reply
@Shoeboxam
Comment options

Comment options

You must be logged in to vote
5 replies
@Shoeboxam
Comment options

@sidify
Comment options

@Shoeboxam
Comment options

@sidify
Comment options

@Shoeboxam
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
3 participants