The effect of Clustering via Pooling may be greater? #2

Open
HarryWu99 opened this issue Apr 27, 2024 · 1 comment

Comments

@HarryWu99

Just a guess.

What would happen if H2O also used Clustering via Pooling in the comparison? It seems that Clustering via Pooling could improve the effectiveness of token-dropping methods like this one.

@leeyeehoo
Collaborator

As we stated in the paper, the generated answers are highly query-dependent, so evicting KV entries during generation may lose information. As a high-level example, suppose a user gives the model a book and the first question is about the first chapter, so the model evicts the other parts. If the user then asks about the last chapter, the model will have very limited knowledge to answer from.
Pooling is a very interesting observation: without pooling, the model already performs perfectly on easier tasks such as the original haystack task. But when you switch to more challenging tasks, the method with pooling is significantly better than the one without.
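
To make the pooling idea concrete, here is a minimal NumPy sketch, not the repository's actual code. It assumes each prompt position already has a single importance score (for example, attention weight aggregated over some set of queries). It compares picking the top positions directly with picking them after a stride-1 max pool, which tends to keep the neighbors of a high-scoring token as well. The function names `pool1d_max` and `select_kept_positions`, the kernel size, and the toy scores are all hypothetical.

```python
import numpy as np

def pool1d_max(scores, kernel_size):
    """Stride-1 max pool over a 1-D score vector, output length == input length.

    kernel_size is assumed odd so the window is centered on each position.
    """
    assert kernel_size % 2 == 1, "this sketch only handles odd kernel sizes"
    scores = np.asarray(scores, dtype=float)
    pad = kernel_size // 2
    # Pad with -inf so the borders never win the max by accident.
    padded = np.pad(scores, pad, mode="constant", constant_values=-np.inf)
    windows = np.lib.stride_tricks.sliding_window_view(padded, kernel_size)
    return windows.max(axis=1)

def select_kept_positions(scores, budget, kernel_size=None):
    """Return the sorted indices of the `budget` positions to keep.

    If kernel_size is given, scores are max-pooled first (the hypothetical
    "clustering via pooling" step); otherwise plain top-k is used.
    """
    scores = np.asarray(scores, dtype=float)
    if kernel_size is not None:
        scores = pool1d_max(scores, kernel_size)
    # Stable sort keeps the earliest position first when scores tie.
    top = np.argsort(-scores, kind="stable")[:budget]
    return np.sort(top)

# Toy scores with one sharp peak at position 2 (made-up numbers).
toy_scores = [0.1, 0.2, 0.9, 0.15, 0.05, 0.3, 0.1, 0.1]
plain = select_kept_positions(toy_scores, budget=3)
pooled = select_kept_positions(toy_scores, budget=3, kernel_size=3)
print("plain top-k :", plain.tolist())   # [1, 2, 5]
print("with pooling:", pooled.tolist())  # [1, 2, 3]
```

In this toy case plain top-k keeps scattered positions, while the pooled variant keeps a contiguous span around the peak. Whether the same step would help an eviction method such as H2O depends on how and when its scores are computed, which this sketch does not model.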
