
about the train loss #13

Closed
Usernamezhx opened this issue Oct 14, 2022 · 2 comments

Comments


Usernamezhx commented Oct 14, 2022

Thanks for your work. I trained the model with the following command:

python train.py --dataset rico --batch_size 64 --iteration 200000 --latent_size 4 --lr 1e-05 --G_d_model 256 --G_nhead 4 --G_num_layers 8 --D_d_model 256 --D_nhead 4 --D_num_layers 8

The resulting loss curves are shown below. Does this look normal? The generator loss becomes large and the discriminator loss becomes small:
image
image

Owner

ktrk115 commented Dec 26, 2022

Sorry for the late response.

The generator seems relatively weak, but I think it is a natural behavior. You may update the generator two or more times for each update of the discriminator. The number of updates is a tunable hyperparameter.
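
To illustrate the suggestion, here is a minimal, framework-agnostic sketch of a loop that runs several generator updates per discriminator update. It is not taken from this repository's train.py: `train_gan`, `d_step`, `g_step`, and `g_steps_per_d_step` are hypothetical names, and the two step callables stand in for whatever code performs one optimizer update for each network.

```python
from typing import Callable, Dict


def train_gan(
    num_iterations: int,
    d_step: Callable[[], float],
    g_step: Callable[[], float],
    g_steps_per_d_step: int = 2,
) -> Dict[str, int]:
    """Alternate discriminator and generator updates.

    d_step and g_step are hypothetical callables (assumptions, not part of
    the repository) that each perform one update and return a loss value.
    Returns how many updates each network received.
    """
    if g_steps_per_d_step < 1:
        raise ValueError("g_steps_per_d_step must be at least 1")

    counts = {"D": 0, "G": 0}
    for _iteration in range(num_iterations):
        # One discriminator update per outer iteration.
        d_step()
        counts["D"] += 1

        # Several generator updates for each discriminator update,
        # giving a relatively weak generator more chances to catch up.
        for _g_update in range(g_steps_per_d_step):
            g_step()
            counts["G"] += 1
    return counts


# Stand-in steps that only return a dummy loss value.
counts = train_gan(3, d_step=lambda: 0.0, g_step=lambda: 0.0, g_steps_per_d_step=2)
print(counts)  # {'D': 3, 'G': 6}
```

In a real training script the two callables would wrap the existing discriminator and generator update code, and `g_steps_per_d_step` would be tuned like any other hyperparameter.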

@Usernamezhx
Author

> Sorry for the late response.
>
> The generator seems relatively weak, but I think it is a natural behavior. You may update the generator two or more times for each update of the discriminator. The number of updates is a tunable hyperparameter.

Thanks for your reply. I will try that.

ktrk115 closed this as completed Dec 26, 2022