Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Current status of model training #737

Closed
kuke opened this issue Mar 15, 2018 · 1 comment
Closed

Current status of model training #737

kuke opened this issue Mar 15, 2018 · 1 comment
Labels

Comments

@kuke
Copy link
Collaborator

kuke commented Mar 15, 2018

We are training the DeepASR model on the whole training dataset with duration 2000h. After 12 epoches' training, the model has well converged already.

Settings:

batch_size: 128
device: GPU
hidden_dim: 1024
learning_rate: 0.00016
minimum_batch_size: 1
proj_dim: 512
stacked_num: 5
optimizer: Adam

Env: 4 P40 GPUs, 15h per epoch.

training_acc_on_all_data

After the decoder is ready, we will continue to fine tune this model to catch up with the performance in accuracy of benchmark.

@kuke kuke added the DeepASR label Mar 15, 2018
@shanyi15
Copy link
Collaborator

您好,此issue在近一个月内暂无更新,我们将于今天内关闭。若在关闭后您仍需跟进提问,可重新开启此问题,我们将在24小时内回复您。因关闭带来的不便我们深表歉意,请您谅解~感谢您对PaddlePaddle的支持!
Hello, this issue has not been updated in the past month. We will close it today for the sake of other user‘s experience. If you still need to follow up on this question after closing, please feel free to reopen it. In that case, we will get back to you within 24 hours. We apologize for the inconvenience caused by the closure and thank you so much for your support of PaddlePaddle Group!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants