modify lstm output for aggregated model #45

Merged
merged 2 commits into from Apr 15, 2017


rickymf4
Contributor

Using the h value (hidden state) of the top LSTM layer as the input to the aggregated model works better and converges more easily.

  • Train log:
    INFO:tensorflow:/job:master/task:0: training step 218206| Hit@1: 0.87 PERR: 0.72 GAP: 0.81 Loss: 5.09105
    INFO:tensorflow:/job:master/task:0: training step 218207| Hit@1: 0.85 PERR: 0.73 GAP: 0.81 Loss: 5.16746
    INFO:tensorflow:/job:master/task:0: training step 218208| Hit@1: 0.79 PERR: 0.68 GAP: 0.80 Loss: 5.13253
    INFO:tensorflow:/job:master/task:0: training step 218209| Hit@1: 0.90 PERR: 0.76 GAP: 0.82 Loss: 5.16415
    INFO:tensorflow:/job:master/task:0: training step 218210| Hit@1: 0.88 PERR: 0.76 GAP: 0.80 Loss: 5.36496
    INFO:tensorflow:/job:master/task:0: training step 218211| Hit@1: 0.83 PERR: 0.72 GAP: 0.82 Loss: 4.48507
    INFO:tensorflow:/job:master/task:0: training step 218212| Hit@1: 0.85 PERR: 0.71 GAP: 0.83 Loss: 4.91971

  • Eval on the frame-level validation set:
    Avg_Hit@1: 0.838 | Avg_PERR: 0.705 | MAP: 0.367 | GAP: 0.780 | Avg_Loss: 5.795533

  • Training status:
    The orange curves: original code, using all states concatenated.
    The blue curves: modified code, using only the top layer's h state.
    image
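
The difference between the two inputs can be illustrated with a minimal, framework-free sketch. It is not the code from this pull request: `LayerState`, `concatenated_state`, and `top_layer_h` are hypothetical names, the numbers are made up, and the c-then-h ordering within each layer is an assumption made only for illustration.

```python
from collections import namedtuple

# Hypothetical stand-in for one LSTM layer's final state:
# c is the cell state, h is the hidden state.
LayerState = namedtuple("LayerState", ["c", "h"])


def concatenated_state(states):
    """Original approach as described: concatenate c and h of every layer."""
    features = []
    for layer in states:
        features.extend(layer.c)
        features.extend(layer.h)
    return features


def top_layer_h(states):
    """Modified approach: keep only the hidden state h of the top (last) layer."""
    return list(states[-1].h)


# Two stacked layers with 3 units each (illustrative values only).
states = [
    LayerState(c=[0.1, 0.2, 0.3], h=[0.4, 0.5, 0.6]),
    LayerState(c=[0.7, 0.8, 0.9], h=[1.0, 1.1, 1.2]),
]

full = concatenated_state(states)  # 2 layers * 2 vectors * 3 units = 12 values
top = top_layer_h(states)          # 3 values
```

In this sketch the aggregated model would receive a vector of size `num_units` instead of `2 * num_layers * num_units`, so its input is smaller and excludes the cell states.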

last cell of lstm as the input for aggregated model
using the h value of the last lstm layer as the input for aggregated model is better, and easier to converge.
@LeegleechN
Collaborator

Thanks! I've wanted to investigate this for a while.

@LeegleechN LeegleechN merged commit 6d0c91d into google:master Apr 15, 2017
@csyanbin

Hi, what are the base_learning_rate and num_moe_experts in video_level_models?
