Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Standardize behavior on summari[s/z]ation requests #97

Open
markovial opened this issue Sep 5, 2023 · 1 comment
Open

Standardize behavior on summari[s/z]ation requests #97

markovial opened this issue Sep 5, 2023 · 1 comment

Comments

@markovial
Copy link

When I ask to summarize certain posts, especially the more recent ones that are not in the database, instead of saying - I don't know what that is. It just infers what the content would be by the title of the post and imagines an answer.

The example where this came up was the alignment forum post - A Primer On Chaos published on Mar 2023. It generated this answer:

image

It doesn't happen every time though. More often than not, it just says I don't know what that is.
image

Potential solution suggested by Bionic : We could hardcode that if it sees the word ‘summari[s/z]e’ and a url it adds the url as required metadata url in the search and uses a specific summarization prompt. And ditto for other similar things, like when a user says ‘in url, what…’, in could catch it and add the url as constraint in the search.

Generally, I agree with this approach. Maybe summarize and some other synonyms as well? Just to make sure we catch the intended behavior.

@ccstan99
Copy link
Collaborator

ccstan99 commented Sep 9, 2023

There's a few ways to address this:

  1. Mentioned in prompt engineering, we can (and should) encourage "I don't know"
  2. Use the LLM (to catch synonyms) to generate pinecone metadata query (not only for summaries but also for authors, dates, sources), possibly using functions
  3. If using regex as suggested but we'd need to be careful for questions like: "Summarize the differences in opinion between Eliezer Yudkowsky and Paul Christiano." Matching urls might be problematic for items cross-posted on different forums.

On a separate issues, I'm not even sure why the post from March 2023 isn't in the ARD. It should be.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants