add belebele #885
Thanks for adding this dataset! The prompt is a bit different from the one in the paper; they use
|
@ManuelFay I'm still seeing |
OK, I'm removing the description. From my understanding of the paper, there was a 0-shot scenario with a description and a 5-shot scenario without one. Since the exact content of the description was not stated explicitly, let's just do the 5-shot I guess. |
I changed the |
Maybe we can tag @satyanshukla, the author for his input ! |
Hey one of the authors of the paper here, happy to help ! Though it's unclear to me what the question is, is it just about If so, I know we were trying out different prompts and the one reported is the one that worked best though idk if we tried using |
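Hey Lucas ! Yes it's essentially for:
- Are there spaces between \n like in the paper or was it just for readability but the real prompt has no added whitespaces?
- Is there a description of the task in 0-shot settings ? If so, which is it ? I did not find it in the paper ?
- Lastly, just nice to get your approval for the task implementation so we can get an "author approved" benchmark implementation, makes it more official !
Thanks again for everything |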
1. there were \n in the real prompt
2. the end of Section 4.2 has a quick paragraph on the zero-shot evaluation,
but I can get you the exact natural language prompts we used
3. Just to clarify: am I giving EleutherAI the authority to use my
dataset, or am I approving that the implementation is what I want?
…On Thu, Oct 5, 2023 at 6:28 PM Manuel Faysse ***@***.***> wrote:
Hey Lucas ! Yes it's essentially for:
- Are there spaces between \n like in the paper or was it just for
readability but the real prompt has no added whitespaces?
- Is there a description of the task in 0-shot settings ? If so, which
is it ? I did not find it in the paper ?
- Lastly, just nice to get your approval for the task implementation
so we can get an "author approved" benchmark implementation, makes it more
official !
Thanks again for everything
|
This would simply be agreeing that the implementation matches what you used in your paper! |
To clarify 1, the doubt is not if \n was used in the prompt. Our doubt is if there is any whitespace between \n and text. |
Ahhh I see, yeah there was probably no extra whitespace
Ok let me parse through the implementation and try to understand deeper and
will get the zero-shot natural language instructions for y'all
…On Fri, Oct 6, 2023 at 11:41 AM Julen Etxaniz ***@***.***> wrote:
To clarify 1, the doubt is not if \n was used in the prompt. Our doubt is
if there is any whitespace between \n and text.
|
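The whitespace question above can be illustrated with a small sketch. It assumes the template printed in the paper pads each \n with spaces for readability, and it removes that padding so text starts directly after each newline. The field labels in `spaced` and the helper name are hypothetical placeholders, not the paper's exact prompt.

```python
import re


def strip_space_around_newlines(template: str) -> str:
    """Remove spaces and tabs that directly precede or follow a newline."""
    return re.sub(r"[ \t]*\n[ \t]*", "\n", template)


# Hypothetical template with readability padding around each newline.
spaced = "P: {passage} \n Q: {question} \n Answer:"

# Same template with no whitespace between \n and the text.
compact = strip_space_around_newlines(spaced)
```

A model sees the padded and compact forms as different token sequences, which is why the thread settles this point before merging.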
cc-ing @davisliang
f"{instruction}\n###\nPassage:\n{passage}\n###\nQuery:\n{query}\n###\nChoices:\n(A) {A}\n(B) {B}\n(C) {C}\n(D) {D}\n###\nAnswer:\n"
Example:
Processing the outputs:
|
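As a minimal sketch of how the template quoted above could be filled in: the template string is copied from the comment, while the function name, argument names, and the four-choice check are assumptions made for illustration. The instruction text itself is not given in this thread, so the caller has to supply it.

```python
# Template string as quoted in the comment above.
TEMPLATE = (
    "{instruction}\n###\nPassage:\n{passage}\n###\nQuery:\n{query}\n###\n"
    "Choices:\n(A) {A}\n(B) {B}\n(C) {C}\n(D) {D}\n###\nAnswer:\n"
)


def build_prompt(instruction: str, passage: str, query: str, choices: list[str]) -> str:
    """Fill the template with one passage, one query, and four answer choices.

    Hypothetical helper: the name and signature are not from the thread.
    """
    if len(choices) != 4:
        raise ValueError("expected exactly four answer choices")
    a, b, c, d = choices
    return TEMPLATE.format(
        instruction=instruction, passage=passage, query=query, A=a, B=b, C=c, D=d
    )
```

The output-processing step mentioned in the same comment is not reproduced here, since its details did not survive in the thread.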
Thank you! If I understand correctly, that is the prompt for instruction/chat models, right? The prompt used for 5-shot in-context learning is the one you mention in the paper (removing the extra spaces between \n and text). |
Correct
…On Tue, Oct 10, 2023 at 5:24 AM Julen Etxaniz ***@***.***> wrote:
Thank you! If I understand correctly, that is the prompt for
instruction/chat models, right? The prompt used for 5-shot in-context
learning is the one you mention in the paper (removing the extra spaces
between \n and text).
|
Okay then, I guess we are good for the 5-shot one (the one adapted for the lm eval harness), let's merge ? |
Big-Refactor version of https://github.com/EleutherAI/lm-evaluation-harness/pull/882/files