
[Chatllama] it seems you don't support flan_t5_xl to generate rewards training data #241

Open
lonelydancer opened this issue Mar 10, 2023 · 3 comments

Comments

@lonelydancer

The README claims: "If you prefer avoiding external paid APIs, we suggest using HuggingFace's models (e.g. flan_t5_xl) as described in more detail in the Supported models section."

However, running:

python artifacts/generate_rewards.py ./artifacts/datasets/reward_training_data.json --model flan_t5_xl

still builds the chain with the OpenAI LLM:

self.llm = LLMChain(llm=openai_llm, prompt=prompt_template)
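One way the script could avoid hardcoding the OpenAI backend is to dispatch on the model name before constructing the chain. The sketch below is purely illustrative: the function `select_backend` and the set `HF_MODELS` are hypothetical names, not part of chatllama's actual code.

```python
# Hypothetical sketch: route the --model flag to the appropriate LLM
# backend instead of always constructing an OpenAI-based chain.
# select_backend and HF_MODELS are illustrative names, not chatllama's API.

HF_MODELS = {"flan_t5_xl", "flan_t5_xxl"}  # assumed HuggingFace model ids


def select_backend(model_name: str) -> str:
    """Return which LLM backend a given model name should use."""
    if model_name in HF_MODELS:
        return "huggingface"  # e.g. a local HuggingFace pipeline
    return "openai"           # default: the OpenAI wrapper


# The reported bug: generate_rewards.py takes the "openai" path even
# when --model flan_t5_xl is passed on the command line.
print(select_backend("flan_t5_xl"))   # -> huggingface
print(select_backend("text-davinci-003"))  # -> openai
```

With a dispatch like this, `--model flan_t5_xl` would construct a HuggingFace-backed chain rather than failing on the OpenAI one.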

@PierpaoloSorbellini
Collaborator

PierpaoloSorbellini commented Mar 10, 2023

Hi @lonelydancer thank you for the feedback.
Yes, you are right: we have so far added T5 support only in extend_rlhf_dataset.py. The error in the README came from trying to update the documentation and code as quickly as possible to fix as many issues as we could in a short time; unfortunately, we didn't manage to add everything we wanted, and that sentence was not properly removed from the README.md. Thank you for spotting the mistake. We will fix this inconsistency as soon as possible and will likely add the feature.

@avacaondata

I've faced the same issue

@PierpaoloSorbellini PierpaoloSorbellini changed the title it seems you don't support flan_t5_xl to generate rewards training data [Chatllama] it seems you don't support flan_t5_xl to generate rewards training data Mar 14, 2023
@pengwei-iie

I've faced the same issue; has it been fixed?
