Skip to content

Commit

Permalink
update readme&bash files
Browse files Browse the repository at this point in the history
  • Loading branch information
ImKeTT committed May 19, 2022
1 parent 8b2e266 commit e85f453
Show file tree
Hide file tree
Showing 11 changed files with 833 additions and 480 deletions.
Binary file added .DS_Store
Binary file not shown.
2 changes: 1 addition & 1 deletion .idea/bert_adapter.iml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

9 changes: 8 additions & 1 deletion .idea/deployment.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

6 changes: 0 additions & 6 deletions .idea/inspectionProfiles/profiles_settings.xml

This file was deleted.

2 changes: 1 addition & 1 deletion .idea/misc.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

1,281 changes: 817 additions & 464 deletions .idea/workspace.xml

Large diffs are not rendered by default.

3 changes: 2 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,5 @@
## AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling

**[Repo In progress]** Official implementation for **AdaVAE**, check the paper on arxiv [https://arxiv.org/pdf/2205.05862.pdf](https://arxiv.org/pdf/2205.05862.pdf).
**[Repo In Progress]** Official implementation for **AdaVAE**, check the paper on arxiv [https://arxiv.org/pdf/2205.05862.pdf](https://arxiv.org/pdf/2205.05862.pdf).

*This repo takes some practices from [https://github.com/fangleai/TransformerCVAE](https://github.com/fangleai/TransformerCVAE)*. Many thanks !!
Binary file added low_nlu/.DS_Store
Binary file not shown.
Binary file added src/.DS_Store
Binary file not shown.
7 changes: 3 additions & 4 deletions src/run0.sh
Original file line number Diff line number Diff line change
@@ -1,4 +1,3 @@
python adaVAE.py --batch-sizes 100 --dataset yelp_data --max_length 32 --pre_enc_iter start --add_attn --beta_0 1 --fb 1 --adapter_size 128 --iterations 15000 --weighted_sample --latent_size 32 --encoder_n_layer 8 --decoder_n_layer 12 --adapter_init bert --attn_mode none --kl_rate 0.50 &&\
python adaVAE.py --batch-sizes 100 --dataset yahoo_data --max_length 32 --pre_enc_iter start --add_attn --beta_0 1 --fb 1 --adapter_size 128 --iterations 22000 --weighted_sample --latent_size 32 --encoder_n_layer 8 --decoder_n_layer 12 --adapter_init bert --attn_mode none --latent_gen linear --kl_rate 0.50 &&\
python adaVAE.py --batch-sizes 100 --dataset yahoo_data --max_length 32 --pre_enc_iter start --add_attn --beta_0 1 --fb 1 --adapter_size 128 --iterations 22000 --weighted_sample --latent_size 32 --encoder_n_layer 8 --decoder_n_layer 12 --adapter_init bert --attn_mode none --latent_gen linear --add_mem --kl_rate 0.50 &&\
python adaVAE.py --batch-sizes 100 --dataset yahoo_data --max_length 32 --pre_enc_iter start --add_mem --beta_0 1 --fb 1 --adapter_size 128 --iterations 22000 --weighted_sample --latent_size 32 --encoder_n_layer 8 --decoder_n_layer 12 --adapter_init bert --attn_mode none --latent_gen linear --kl_rate 0.50
python adaVAE.py --batch-sizes 90 --dataset yelp_data --max_length 32 --pre_enc_iter start --add_attn --add_z2adapters --beta_0 1 --fb 1 --adapter_size 128 --iterations 22000 --weighted_sample --latent_size 32 --encoder_n_layer 8 --decoder_n_layer 12 --adapter_init bert --attn_mode none --kl_rate 0.50 &&\
python adaVAE.py --batch-sizes 90 --dataset yelp_data --max_length 32 --pre_enc_iter start --add_attn --beta_0 1 --fb 1 --adapter_size 128 --iterations 22000 --latent_gen mean_max_linear --weighted_sample --latent_size 32 --encoder_n_layer 8 --decoder_n_layer 12 --adapter_init bert --attn_mode none --kl_rate 0.50 &&\
python adaVAE.py --batch-sizes 90 --dataset yelp_data --max_length 32 --pre_enc_iter start --add_attn --beta_0 1 --fb 1 --adapter_size 128 --iterations 22000 --latent_gen linear --weighted_sample --latent_size 32 --encoder_n_layer 8 --decoder_n_layer 12 --adapter_init bert --attn_mode none --kl_rate 0.50
3 changes: 1 addition & 2 deletions src/run1.sh
Original file line number Diff line number Diff line change
@@ -1,2 +1 @@
python adaVAE.py --batch-sizes 100 --dataset cola --max_length 32 --pre_enc_iter start --add_attn --beta_0 1 --fb 1 --adapter_size 128 --iterations 15000 --weighted_sample --latent_size 768 --encoder_n_layer 8 --decoder_n_layer 12 --adapter_init bert --attn_mode none --kl_rate 0.10 &&\
python adaVAE.py --batch-sizes 100 --dataset sst-2 --max_length 32 --pre_enc_iter start --add_attn --beta_0 1 --fb 1 --adapter_size 128 --iterations 15000 --weighted_sample --latent_size 768 --encoder_n_layer 8 --decoder_n_layer 12 --adapter_init bert --attn_mode none --kl_rate 0.10
python adaVAE.py --batch-sizes 90 --dataset yelp_data --max_length 32 --pre_enc_iter start --add_attn --add_z2adapters --beta_0 1 --fb 1 --adapter_size 128 --iterations 22000 --weighted_sample --latent_size 32 --encoder_n_layer 8 --decoder_n_layer 12 --adapter_init bert --attn_mode none --kl_rate 0.50

0 comments on commit e85f453

Please sign in to comment.