documents not being applied in apply_chat_template #33421

selkordy · 2024-09-11T04:49:30Z

System Info

transformers version: 4.44.2
Platform: macOS-14.6.1-arm64-arm-64bit
Python version: 3.12.5
Huggingface_hub version: 0.24.6
Safetensors version: 0.4.4
Accelerate version: 0.33.0
Accelerate config: not found
PyTorch version (GPU?): 2.4.0 (False)
Tensorflow version (GPU?): not installed (NA)
Flax version (CPU?/GPU?/TPU?): not installed (NA)
Jax version: not installed
JaxLib version: not installed
Using distributed or parallel set-up in script?: parallel

Who can help?

@ArthurZucker

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)

Reproduction

I'm trying to apply documents in the chat template as per the chat_templating article, however it seems to be ignored. Passing documents has no effect on the chat template.

https://huggingface.co/docs/transformers/en/chat_templating

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("HuggingFaceH4/zephyr-7b-beta")

chat1 = [
    {"role": "user", "content": "Which is bigger, the moon or the sun?"},
    {"role": "assistant", "content": "The sun."},
]
chat2 = [
    {"role": "user", "content": "Which is bigger, a virus or a bacterium?"},
    {"role": "assistant", "content": "A bacterium."},
]

document1 = {
    "title": "The Moon: Our Age-Old Foe",
    "contents": "Man has always dreamed of destroying the moon. In this essay, I shall...",
}
document2 = {
    "title": "The Sun: Our Age-Old Friend",
    "contents": "Although often underappreciated, the sun provides several notable benefits...",
}

# Render both chats as text; the documents do not show up in the result.
model_input = tokenizer.apply_chat_template(
    [chat1, chat2],
    tokenize=False,
    add_generation_prompt=False,
    documents=[document1, document2],
)
```

Expected behavior

I expect the chat template to include the documents

A-Duss · 2024-09-11T07:31:23Z

I've been trying to pinpoint where documents should be passed to the model. From what I gathered after exploring the Jinja templates, it seems they should be included in the chat_template key within the tokenizer_config.json file. However, it appears that Zephyr-7B-beta's chat template doesn't currently support this, as is.
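One rough way to check this for a given model is to look at the `chat_template` entry of its tokenizer_config.json and see whether the Jinja source mentions `documents` at all. The sketch below is only a heuristic under stated assumptions: `named_templates` and `mentions_documents` are hypothetical helpers (not part of transformers), the two configs are toy stand-ins rather than the real Zephyr or Command-R templates, and it assumes a config stores either a single template string or a list of `{"name", "template"}` entries.

```python
import re


def named_templates(tokenizer_config: dict) -> dict:
    """Return {name: template source} from a parsed tokenizer_config.json.

    Assumes `chat_template` is either one string or a list of
    {"name": ..., "template": ...} entries.
    """
    raw = tokenizer_config.get("chat_template")
    if raw is None:
        return {}
    if isinstance(raw, str):
        return {"default": raw}
    return {entry["name"]: entry["template"] for entry in raw}


def mentions_documents(template: str) -> bool:
    """Heuristic text search: does the Jinja source contain the word `documents`?"""
    return re.search(r"\bdocuments\b", template) is not None


# Toy configs for illustration only (not the real model templates).
single_template_config = {
    "chat_template": "{% for message in messages %}"
    "<|{{ message['role'] }}|>\n{{ message['content'] }}</s>\n"
    "{% endfor %}"
}
multi_template_config = {
    "chat_template": [
        {
            "name": "default",
            "template": "{% for message in messages %}{{ message['content'] }}{% endfor %}",
        },
        {
            "name": "rag",
            "template": "{% for document in documents %}{{ document['title'] }}\n{% endfor %}"
            "{% for message in messages %}{{ message['content'] }}{% endfor %}",
        },
    ]
}

for label, config in [("single", single_template_config), ("multi", multi_template_config)]:
    for name, template in named_templates(config).items():
        print(label, name, mentions_documents(template))
```

With a loaded tokenizer, the same text search could be applied to `tokenizer.chat_template` instead of the raw config file.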

Even if it did, I'm unsure how documents would be integrated into the chat template. The code snippets in the documentation (mentioned by @selkordy) don't clearly specify which model is used. I noticed 'NousResearch/Hermes-2-Pro-Llama-3-8B' was the last model loaded in the documentation code, but its tokenizer is currently broken due to a typo in the latest commit to its tokenizer_config.json file. Anyway, I didn't find any reference to documents in the chat template for that model either.

@Rocketknight1, I saw you implemented this in #30621—thanks for the great work, must have been a heck of a headache!
Would you be able to provide any insights into how this feature is supposed to work?

Rocketknight1 · 2024-09-11T12:57:56Z

Hi @selkordy @A-Duss, the cause of this problem is simply that documents is not supported by many models, and as a result, their chat templates discard this input. I should probably update the documentation to make this clearer, and maybe reduce the emphasis on documents because it's not widely supported.

However, two models that do support it are Command-R and Command-R+, using the rag ("retrieval-augmented generation") template. You can see it used in the "grounded generation" examples in their model cards.

A-Duss · 2024-09-11T13:25:34Z

@Rocketknight1 Thanks for the clarification! I think it might be helpful to use Command-R as the model in the example within the documentation then, while noting that not all models support this feature. I’m happy to assist with this if you’re short on time.

Rocketknight1 · 2024-09-11T14:22:36Z

@A-Duss sure! If you want to open a PR to update the chat template docs and tag me, that'd be great. However, we'd prefer to avoid apply_grounded_generation_template(), since it's very specific to Command-R. You can get the same effect for Command-R's models using the standard apply_chat_template() function like so:

tokenizer.apply_chat_template(messages=messages, documents=documents, chat_template="rag")
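For anyone landing here later, the call above can be wrapped as in the minimal sketch below. `build_rag_prompt` is a hypothetical helper name, and the choices of `tokenize=False` and `add_generation_prompt=True` are assumptions for readability, not something stated in this thread. The checkpoint name in the commented-out usage is also an assumption, and the document keys a given template expects should be checked against the model card.

```python
def build_rag_prompt(tokenizer, messages, documents):
    """Render a prompt using the tokenizer's template named "rag".

    Only useful for tokenizers that ship a "rag" template (for example
    Command-R, per the comment above). Other chat templates may simply
    ignore `documents`.
    """
    return tokenizer.apply_chat_template(
        messages,
        documents=documents,
        chat_template="rag",
        tokenize=False,              # return a string instead of token ids
        add_generation_prompt=True,  # end the prompt where the model should answer
    )


messages = [
    {"role": "user", "content": "Which is bigger, the moon or the sun?"},
]
documents = [
    {
        "title": "The Moon: Our Age-Old Foe",
        "contents": "Man has always dreamed of destroying the moon. In this essay, I shall...",
    },
    {
        "title": "The Sun: Our Age-Old Friend",
        "contents": "Although often underappreciated, the sun provides several notable benefits...",
    },
]

# Usage (downloads the tokenizer; the checkpoint name is an assumption):
# from transformers import AutoTokenizer
# tokenizer = AutoTokenizer.from_pretrained("CohereForAI/c4ai-command-r-v01")
# print(build_rag_prompt(tokenizer, messages, documents))
```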

selkordy · 2024-09-12T01:39:06Z

Looking at the tokenizer_config, I see that the Jinja template does not reference documents anywhere. I now have a better understanding of how the library works.

Thank you @A-Duss and @Rocketknight1

A-Duss · 2024-09-15T05:29:32Z

Noted, I'm working on it. I will open a PR once it's looking decent.

github-actions · 2024-10-11T08:03:15Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

selkordy added the bug label Sep 11, 2024

A-Duss mentioned this issue Sep 16, 2024: Add explicit example for RAG chat templating #33503 (Merged)

github-actions bot closed this as completed Oct 19, 2024
