
Error when generating with long prefix #38

Open
anishthite opened this issue May 9, 2019 · 4 comments
Labels
bug Something isn't working

Comments


anishthite commented May 9, 2019

Hi! When I generate text with a prefix longer than 4 characters, I get the following error:
tensorflow.python.framework.errors_impl.InvalidArgumentError: indices[0,0] = 1024 is not in [0, 1024)
  [[{{node sample_sequence_6/while/model/GatherV2_1}}]]

It does not occur if the prefix is "Hi!", but it does occur when it is "Hi, this is a longer piece of text"
Do you know why this may be happening?
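An illustrative sketch (not code from this repo) of why the GatherV2 lookup reports `indices[0,0] = 1024 is not in [0, 1024)`: GPT-2's positional-embedding table has exactly 1024 rows, so the error fires as soon as any token's position index reaches 1024. The function name and prefix lengths below are hypothetical, for illustration only.

```python
# GPT-2's context window: the positional-embedding table has 1024 rows,
# so valid position indices are 0..1023.
N_CTX = 1024

def max_tokens_generatable(prefix_len):
    """How many tokens can be generated before position indices overflow."""
    return max(N_CTX - prefix_len, 0)

# A very short prefix (e.g. "Hi!" encodes to roughly 2 tokens) leaves room
# for ~1022 generated tokens, while a longer prefix leaves less -- so a long
# default generation length pushes a position index to 1024 and triggers
# the out-of-range gather.
print(max_tokens_generatable(2))   # 1022
print(max_tokens_generatable(10))  # 1014
```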

@anishthite anishthite changed the title Error when generating wiht long prefix Error when generating with long prefix May 9, 2019
anishthite (Author) commented:

The issue does not occur when using the original generate script from OpenAI.

anishthite (Author) commented:

It seems to be occurring on this line:

out = sess.run(output, feed_dict={context: batch_size * [context_tokens]})

The OpenAI script has:

for _ in range(nsamples // batch_size):
    out = sess.run(output, feed_dict={
        context: [context_tokens for _ in range(batch_size)]
    })[:, len(context_tokens):]

Could this difference be causing the issue?
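A quick check (illustrative, with made-up token ids) suggests the list construction itself is unlikely to be the culprit: both expressions build the same batch. The substantive difference in the OpenAI script is the trailing slice, which only trims the echoed prefix from the output.

```python
# Hypothetical example token ids, just to compare the two constructions.
context_tokens = [17250, 0]
batch_size = 4

a = batch_size * [context_tokens]                    # this repo's form
b = [context_tokens for _ in range(batch_size)]      # OpenAI script's form
assert a == b  # identical contents fed to sess.run either way

# The OpenAI script additionally slices the result with
# out[:, len(context_tokens):], which drops the prefix from the returned
# sample but does not change what is fed into the graph.
```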

minimaxir (Owner) commented:

Looking into this now since it is affecting the cloud API. It's possible this difference could affect it.

@minimaxir minimaxir added the bug Something isn't working label Jun 16, 2019
minimaxir (Owner) commented:

Able to reproduce, but only if the generated length is long as well (which is the default, and explains why I haven't seen the issue, since I work with smaller texts).

It's possible the prefix length in tokens plus the generated length in tokens exceeds 1024, GPT-2's maximum context size. It might be a good idea to cap the generation length at that difference.
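A minimal sketch of the suggested cap, assuming `requested_length` is the number of tokens to generate and `context_tokens` is the encoded prefix (the names are hypothetical, not the repo's actual API):

```python
# Cap generation so prefix tokens + generated tokens never exceed the
# 1024-token context window, avoiding the out-of-range gather.
N_CTX = 1024

def capped_length(requested_length, context_tokens):
    return min(requested_length, N_CTX - len(context_tokens))

# A 50-token prefix leaves room for at most 974 generated tokens,
# so a default-length request gets clamped:
print(capped_length(1023, list(range(50))))  # 974
```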
