What content is encoded when Llama13B encoded a sentence #8

Closed
cgpeter96 opened this issue Jan 11, 2024 · 3 comments

Comments

@cgpeter96

Is only the prompt encoded, or the prompt + response?

@VPeterV
Collaborator

VPeterV commented Jan 11, 2024

The prompt is already built into our code. To use our scorers, you only need to provide the query for complexity scoring, or the query + response for quality scoring.

Our code converts your input (i.e. query or query + response) into the format the scorers expect before scoring each sample.
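To make the two input modes concrete, here is a minimal sketch of how such a conversion could look. The template strings and the `build_scorer_input` helper are hypothetical and written only for illustration; they are not the project's actual prompts or API.

```python
from typing import Optional

# Hypothetical templates for illustration only; NOT the project's real prompts.
COMPLEXITY_TEMPLATE = "Rate the complexity of this query.\nQuery: {query}\nScore:"
QUALITY_TEMPLATE = (
    "Rate the quality of this response.\n"
    "Query: {query}\nResponse: {response}\nScore:"
)


def build_scorer_input(query: str, response: Optional[str] = None) -> str:
    """Wrap the raw input in a scorer prompt (hypothetical helper).

    Query only          -> complexity-scoring input.
    Query plus response -> quality-scoring input.
    """
    if response is None:
        return COMPLEXITY_TEMPLATE.format(query=query)
    return QUALITY_TEMPLATE.format(query=query, response=response)


complexity_input = build_scorer_input("Explain how quicksort works.")
quality_input = build_scorer_input(
    "Explain how quicksort works.",
    "Quicksort picks a pivot and partitions the list around it.",
)
```

The point of the sketch is only that the caller passes raw text and the wrapping happens inside the scorer, so the same query can be scored for complexity alone or for quality together with its response.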

@edbeeching

Hi @VPeterV. Related question: In the Score-First, Diversity-Aware Data Selection algorithm are the llama-13b sentence embeddings a mean of the token embeddings, the last token embedding, or something else?

@VPeterV
Collaborator

VPeterV commented Jan 16, 2024

Hi @edbeeching! We encode each sentence using the hidden state of its last token (i.e. the last token of query + response).
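For readers comparing the two pooling options raised in the question, here is a small sketch of last-token pooling next to mean pooling. It uses plain Python lists in place of model tensors and assumes right padding and final-layer hidden states; neither detail is stated in this thread, and the function names are made up for the example.

```python
from typing import List

Vector = List[float]


def last_token_embedding(hidden_states: List[Vector], attention_mask: List[int]) -> Vector:
    """Return the hidden state of the last non-padding token.

    hidden_states: one vector per token position (final layer assumed).
    attention_mask: 1 for real tokens, 0 for padding (right padding assumed).
    """
    num_real_tokens = sum(attention_mask)
    if num_real_tokens == 0:
        raise ValueError("sequence contains no real tokens")
    return hidden_states[num_real_tokens - 1]


def mean_token_embedding(hidden_states: List[Vector], attention_mask: List[int]) -> Vector:
    """Average the hidden states of all real tokens (shown only for contrast)."""
    real = [h for h, m in zip(hidden_states, attention_mask) if m == 1]
    if not real:
        raise ValueError("sequence contains no real tokens")
    dim = len(real[0])
    return [sum(h[i] for h in real) / len(real) for i in range(dim)]


# Toy input: three real tokens followed by one padding position, hidden size 2.
hidden = [[1.0, 0.0], [0.0, 2.0], [5.0, 4.0], [9.0, 9.0]]
mask = [1, 1, 1, 0]

sentence_vec = last_token_embedding(hidden, mask)  # vector of the third token
mean_vec = mean_token_embedding(hidden, mask)      # average over the first three
```

With a causal model such as LLaMA, the last token is the only position that has attended to the whole query + response, which is a common reason for preferring it over a mean of token states.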
