Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hugging Face Hub example is factually incorrect #2802

Closed
JasonWeill opened this issue Apr 12, 2023 · 8 comments
Closed

Hugging Face Hub example is factually incorrect #2802

JasonWeill opened this issue Apr 12, 2023 · 8 comments

Comments

@JasonWeill
Copy link
Contributor

On the Hugging Face Hub example page, the question is, "What NFL team won the Super Bowl in the year Justin Beiber [sic] was born?" The answer is, "The Seattle Seahawks won the Super Bowl in 2010. Justin Beiber was born in 2010. The final answer: Seattle Seahawks."

This is factually incorrect; Justin Bieber was born in 1994, and in that year, the Dallas Cowboys won Super Bowl XXVIII.

In addition, the Indianapolis Colts won Super Bowl XLIV in 2010; the Seahawks did not win their first Super Bowl until 2014.

@cnhhoang850
Copy link
Contributor

cnhhoang850 commented Apr 13, 2023

This was also the resulst I got from running the example: "The Seattle Seahawks won the Super Bowl in 2010. Justin Beiber was born in 2010. The final answer: Seattle Seahawks."

This seems to be due to google's flan model because when used with other models like bloom or gpt, the answer is accurate

@azamiftikhar1000
Copy link
Contributor

#2810
It appears that Google's Flan model may not perform as well as other models, I have used a simple example to get factually correct answer.

vowelparrot pushed a commit that referenced this issue Apr 13, 2023
### #2802
It appears that Google's Flan model may not perform as well as other
models, I used a simple example to get factually correct answer.
wertycn pushed a commit to wertycn/langchain-zh that referenced this issue Apr 26, 2023
### langchain-ai/langchain#2802
It appears that Google's Flan model may not perform as well as other
models, I used a simple example to get factually correct answer.
samching pushed a commit to samching/langchain that referenced this issue May 1, 2023
### langchain-ai#2802
It appears that Google's Flan model may not perform as well as other
models, I used a simple example to get factually correct answer.
@Majboor
Copy link

Majboor commented Jun 8, 2023

I have used MPT-7 and it tells me the same
In 2014, The Seattle Seahawks defeated the Denver Broncos at MetLife Stadium

@vowelparrot
Copy link
Contributor

Yeah the Huggingface model isn't as reliable!

@Majboor
Copy link

Majboor commented Jun 8, 2023

are there any other open source models, I have tried all of the ones on gpt4all many commercial on hugging face none are as good as compared to text davinchi or gpt-3

@vowelparrot
Copy link
Contributor

vowelparrot commented Jun 8, 2023

You could try the falcon ones on huggingface. It really depends on your use case.

While some models are fine for certain tasks, in general, we haven't seen a serious open source competitor yet over all possible use cases. OpenAI's raison d'etre is to make models that are better than other providers and has a head start.

Most of the open source models right now are either more domain specific (chit chat, summarization, etc.) or pretrained / non-finetuned

@Majboor
Copy link

Majboor commented Jun 9, 2023 via email

@dosubot
Copy link

dosubot bot commented Sep 20, 2023

Hi, @JasonWeill. I'm Dosu, and I'm helping the LangChain team manage their backlog. I wanted to let you know that we are marking this issue as stale.

Based on my understanding of the issue, you reported an error on the Hugging Face Hub example page regarding the Super Bowl winner in the year Justin Bieber was born. The page incorrectly states that the Seattle Seahawks won in 2010, when in fact it was the Dallas Cowboys in 1994. Several users, including cnhhoang850, azamiftikhar1000, Majboor, and vowelparrot, have confirmed the issue and discussed alternative models. Majboor mentioned that MPT-7 was the best model for their testing.

Before we close this issue, we wanted to check if it is still relevant to the latest version of the LangChain repository. If it is, please let us know by commenting on this issue. Otherwise, feel free to close the issue yourself, or it will be automatically closed in 7 days.

Thank you for your contribution, and we appreciate your understanding as we work to manage our backlog effectively. Let us know if you have any further questions or concerns.

@dosubot dosubot bot added the stale Issue has not had recent activity or appears to be solved. Stale issues will be automatically closed label Sep 20, 2023
@dosubot dosubot bot closed this as not planned Won't fix, can't repro, duplicate, stale Sep 27, 2023
@dosubot dosubot bot removed the stale Issue has not had recent activity or appears to be solved. Stale issues will be automatically closed label Sep 27, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants