Show and Tell: A Neural Image Caption Generator #1056

VamWolf · 2017-01-03T09:47:32Z

https://github.com/tensorflow/models/blob/master/im2txt/g3doc/show_and_tell_architecture.png

paddle 实现这个结构，是不是
image embeding 一层
word embeding 一层
两层连接，concat_layer
然后，作为lstm 的输入，lstm层得到encoded vector.
decode ，参考seqToseq 的解码部分就可以了

Zrachel · 2017-01-03T10:53:06Z

有一点不一样，
两层连接，应该用SequenceConcatLayer (concat_layer是指两个拼起来的size是两个相加的size)

VamWolf · 2017-01-03T12:47:24Z

SequenceConcatLayer 在 trainer_config_helpers 里目前没有，./trainer/config_parser.py 里有，应该是可以按照 paddle v1 的方式使用？？？

Zrachel · 2017-01-04T02:26:55Z

是的

Yancey1989 · 2017-07-28T10:12:25Z

这issue太久没更新了，如果有更新再打开吧，多谢。

Yancey1989 closed this as completed Jul 28, 2017

wangxicoding pushed a commit to wangxicoding/Paddle that referenced this issue Dec 9, 2021

Speed up for hybrid parallel (PaddlePaddle#1056)

15693c7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Show and Tell: A Neural Image Caption Generator #1056

Show and Tell: A Neural Image Caption Generator #1056

VamWolf commented Jan 3, 2017

Zrachel commented Jan 3, 2017

VamWolf commented Jan 3, 2017

Zrachel commented Jan 4, 2017

Yancey1989 commented Jul 28, 2017

Show and Tell: A Neural Image Caption Generator #1056

Show and Tell: A Neural Image Caption Generator #1056

Comments

VamWolf commented Jan 3, 2017

Zrachel commented Jan 3, 2017

VamWolf commented Jan 3, 2017

Zrachel commented Jan 4, 2017

Yancey1989 commented Jul 28, 2017