TAEGAN

Pre-requisites

Python>=3.10.
Python package torch, tqdm, sklearn, numpy.

Execution

Prepare Data

python run.py -o OUT_DIR prepare -s DATA_FILE

What the prepare step does is described in run.py under prepare function. Relevant code can be easily reconstructed from open-source CTAB-GAN+ implementation. We decided not to expose our processing code because we are using code source from enterprise, which is not supposed to be exposed, The part of code actually called from this private code essentially does the same thing as CTAB-GAN+ data preprocessing, with only code structure and efficiency improvements.

Train Model

python run.py -o OUT_DIR train -b BATCH_SIZE -e EPOCHS -w WARMUP_EPOCHS

The trained model weights are found in OUT_DIR/generator.pt and OUT_DIR/discriminator.pt.

Sample Tensor

python run.py -o OUT_DIR sample -n N_ROWS -b BATCH_SIZE

The sampled tensor is found in OUT_DIR/sampled.pt. Note that the output of this step is still a transformed tensor with one-hot and VGM data encoding.

Recover Sampled Data

python run.py -o OUT_DIR recover

This is the inverse step of prepare, with its description also found inside run.py, in function recover. Since this is the inverse step of prepare, which is dependent on the data processing module, this part of code is also not exposed, but would be very easy to reconstruct. The output should then be saved as a csv file, which is the actually generated tabular data.

Example

A running example is provided in diabetes-demo using the diabetes dataset from OpenML. Content inside is the outcome of prepare step.

One can directly run the following to get the sampled tensor:

python run.py -o diabetes-demo train
python run.py -o diabetes-demo sample -n 256

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
diabetes-demo		diabetes-demo
result_figures		result_figures
README.md		README.md
dataset.py		dataset.py
networks.py		networks.py
run.py		run.py
taegan.py		taegan.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TAEGAN

Pre-requisites

Execution

Prepare Data

Train Model

Sample Tensor

Recover Sampled Data

Example

About

Uh oh!

Releases

Packages

Languages

BetterdataLabs/taegan

Folders and files

Latest commit

History

Repository files navigation

TAEGAN

Pre-requisites

Execution

Prepare Data

Train Model

Sample Tensor

Recover Sampled Data

Example

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages