Sorry, I am not quite familiar with inference: in fine-tuning/training, I simply use the concept of max_seq_length. Are [Prompt len] and [Tokens to Generate] the same as max_seq_length? How could they be different?
You are right that in fine-tuning/training there is no concept of "prompt len" & "tokens to generate"; only max_seq_length is required. When you use the GitHub site, max_seq_length = prompt len + tokens to generate.
The "prompt len" & "tokens to generate" concepts only apply at inference time. For example, if your question is made of 100 words (tokens) and you want to generate an answer of 500 tokens, the first 100 tokens are processed all at once, while the next 500 tokens are generated one token at a time. That is why the distinction is needed: the first 100 tokens are your "prompt len" and the next 500 tokens are your "tokens to generate".
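Here is a minimal sketch (not the tool's actual code) of how that split looks in practice, assuming the Hugging Face transformers API; the model name and token counts are purely illustrative:

```python
# Sketch: prompt tokens are processed in one forward pass (prefill),
# generated tokens are produced one at a time (decode).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder model, for illustration only
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

prompt = "Explain the difference between prefill and decode."
inputs = tokenizer(prompt, return_tensors="pt")

prompt_len = inputs["input_ids"].shape[1]  # "Prompt len": handled in a single pass
tokens_to_generate = 500                   # "Tokens to Generate": decoded token by token

outputs = model.generate(**inputs, max_new_tokens=tokens_to_generate)

# The total sequence the model (and its KV cache) must accommodate is
# prompt_len + tokens_to_generate, which plays the role of max_seq_length.
total_len = outputs.shape[1]
print(prompt_len, tokens_to_generate, total_len)
```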
Thanks for your reply! So if I want to get the memory result for fine-tuning, I should set "tokens to generate" to 0, right? However, that is forbidden (a warning says it has to be positive).