
No way to get ONLY the generated text, not including the prompt. #17117

Closed

monsieurpooh opened this issue May 6, 2022 · 5 comments

monsieurpooh commented May 6, 2022

System Info

- `transformers` version: 4.15.0
- Platform: Windows-10-10.0.19041-SP0
- Python version: 3.8.5
- PyTorch version (GPU?): 1.10.2+cu113 (True)
- Tensorflow version (GPU?): 2.5.1 (True)
- Flax version (CPU?/GPU?/TPU?): not installed (NA)
- Jax version: not installed
- JaxLib version: not installed
- Using GPU in script?: yes
- Using distributed or parallel set-up in script?: no

Who can help?

@Narsil @patrickvonplaten

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

At first I thought I could just take a substring starting at the prompt's length. This doesn't work because there's a bug where decoding converts every instance of " ," to "," in the generated text.

For example, "Characters in this scene: King , Gertrude." becomes "Characters in this scene: King, Gertrude."

In https://github.com/huggingface/transformers/blob/main/src/transformers/generation_utils.py there are tons of options but not a single one of them allows us to specify it to ONLY return the generated text, not including the prompt.

I could do a workaround where I replace all the " ," with "," myself, but I'm sure this is a code smell which could lead to future problems.

Example code:

gen_tokens = model.generate(
    input_ids,
    do_sample=specifiedDoSample,
    temperature=specifiedTemperature,
    max_length=calculated_max_length,
    min_length=calculated_min_length,
    repetition_penalty=specifiedRepetitionPenalty,
    bad_words_ids=badWordsTokens,
)

# gen_text = tokenizer.batch_decode(gen_tokens)[0]

Expected behavior

Two possibilities: Either don't modify the prompt at all so I can substring by the prompt's length, or have an option where we get only the generated text not including the prompt.

@Narsil
Contributor

Narsil commented May 9, 2022

Hi @monsieurpooh ,

generate will not change, since it's a relatively low-level function; it does exactly what it should to the relevant tensors (encoder-decoder and decoder-only models don't work the same way, for instance).

Two suggestions:

  • Simple modification gen_text = tokenizer.batch_decode(gen_tokens[input_ids.shape[0]:])[0] (Ignore the first ids you sent)
  • Use a pipeline:
from transformers import pipeline

# This will remove the text for you.
pipe = pipeline(model="gpt2", return_full_text=False)
print(pipe("This is a test"))

Does that solve your issue?

@monsieurpooh
Author

Thanks so much for your help Narsil! After a tiny bit of debugging and learning how to slice tensors, I figured out the correct code is: tokenizer.batch_decode(gen_tokens[:, input_ids.shape[1]:])[0]
It returns the correct tokens even when there's a space after some commas and periods.
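For readers landing here, a minimal end-to-end sketch of that slicing approach; the model name and generation settings are illustrative, not from the thread:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompt = "Characters in this scene: King , Gertrude."
input_ids = tokenizer(prompt, return_tensors="pt").input_ids

with torch.no_grad():
    gen_tokens = model.generate(input_ids, do_sample=True, max_length=60)

# Drop the prompt tokens (the first input_ids.shape[1] positions of each row)
# and decode only the newly generated continuation.
gen_text = tokenizer.batch_decode(gen_tokens[:, input_ids.shape[1]:])[0]
print(gen_text)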

@Narsil
Contributor

Narsil commented May 12, 2022

Thank you for giving the correct code here, it will help other users for sure! :)

@GonyRosenman

is there a fix for this?

I'm using a workaround like this:
encoding = tokenizer(batch['prompt'], return_tensors='pt', padding=True).to(device)
with torch.no_grad():
    generated_ids = model.generate(**encoding)
generated_ids = generated_ids[:, encoding.input_ids.shape[1]:]
generated_texts = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)

@srama2512

Thanks so much for your help Narsil! After a tiny bit of debugging and learning how to slice tensors, I figured out the correct code is: tokenizer.batch_decode(gen_tokens[:, input_ids.shape[1]:])[0] It returns the correct tokens even when there's a space after some commas and periods.

Small observation. This works only if one of the following holds (for the general case, see the left-padding sketch after this list):

  • batch size = 1, or
  • all elements of the batch have the same input context length.
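One common way to handle a batch with different prompt lengths is left padding, so every prompt ends at the same position and the same slice works for every row. A sketch under that assumption (model choice and generation settings are illustrative):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2", padding_side="left")
tokenizer.pad_token = tokenizer.eos_token  # gpt2 has no pad token by default
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompts = ["Characters in this scene:", "To be, or not to be, that is"]
encoding = tokenizer(prompts, return_tensors="pt", padding=True)

with torch.no_grad():
    generated_ids = model.generate(
        **encoding, max_new_tokens=20, pad_token_id=tokenizer.pad_token_id
    )

# With left padding, every prompt occupies the last encoding.input_ids.shape[1]
# positions, so one slice removes the prompt (and its padding) for every row.
new_ids = generated_ids[:, encoding.input_ids.shape[1]:]
print(tokenizer.batch_decode(new_ids, skip_special_tokens=True))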
