
Inference Speed for Long Articles #68

Open
saptarshi059 opened this issue Jan 13, 2023 · 2 comments
@saptarshi059

Is there any way to increase the generation speed for extremely long articles, e.g., around 5000 tokens? I've been trying several optimization tricks, but none seems to work. Or is it simply the case that text generation over such long spans will be slow and there's no way around it?

@magicknight

> Is there any way to increase the generation speed for extremely long articles, e.g., around 5000 tokens? I've been trying several optimization tricks, but none seems to work. Or is it simply the case that text generation over such long spans will be slow and there's no way around it?

How do you generate 5000 tokens in the first place?

@saptarshi059
Author

So there's no way to directly generate ~5000 tokens; that's a limitation of any decoder-based model, since it can only process tokens up to its maximum input length, which in this case is 2048. What I did instead was generate 2048 tokens and then use the last N (say 150) tokens as the input for generating the next chunk, almost like a sliding window. With this approach, the final text was reasonably coherent; a sketch of the loop is below.
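
For anyone trying the same thing, here is a minimal sketch of that sliding-window loop using the Hugging Face transformers API. The model name, window size, per-step length, and target length are placeholder assumptions for illustration, not this repo's actual code; substitute whichever decoder-only model you are using.

```python
# Hypothetical sketch of the sliding-window generation described above.
# MODEL_NAME, WINDOW, and TARGET_TOKENS are assumptions for illustration.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "gpt2"       # placeholder; any causal LM works the same way
CONTEXT_LIMIT = 2048      # the model's maximum input length
WINDOW = 150              # trailing tokens carried into the next step
TARGET_TOKENS = 5000      # total tokens we want to end up with

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME).eval()

def generate_long(prompt: str) -> str:
    generated = tokenizer(prompt, return_tensors="pt").input_ids[0]
    while generated.size(0) < TARGET_TOKENS:
        # Feed only the last WINDOW tokens so the input never exceeds the
        # context limit, then extend until the sequence reaches that limit.
        context = generated[-WINDOW:].unsqueeze(0)
        with torch.no_grad():
            output = model.generate(
                context,
                max_length=CONTEXT_LIMIT,
                do_sample=True,
                top_p=0.9,
                pad_token_id=tokenizer.eos_token_id,
            )
        # Keep only the newly generated continuation (drop the re-fed window).
        new_tokens = output[0, context.size(1):]
        generated = torch.cat([generated, new_tokens])
    return tokenizer.decode(generated, skip_special_tokens=True)

print(generate_long("In a distant future, "))
```

The trade-off is that each step only conditions on the last WINDOW tokens, so long-range coherence depends on how much carry-over you keep versus how much fresh context budget you leave for generation.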
