Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Datasets: MedQA, MedMCQA, PubmedQA #68

Closed
matthias-samwald opened this issue Jan 11, 2023 · 1 comment
Closed

Datasets: MedQA, MedMCQA, PubmedQA #68

matthias-samwald opened this issue Jan 11, 2023 · 1 comment
Labels

Comments

@matthias-samwald
Copy link
Contributor

The CoTs for these datasets come from Lievin et al 2022. https://arxiv.org/abs/2207.08143

@matthias-samwald
Copy link
Contributor Author

Just a minor observation of the MedMCQA source data (not an issue pertaining to our code): in the gold-standard CoTs, certain citations re-appear a lot (e.g. "Ref Harrison20th edition pg 2456" appears over >60 times). I'm pretty sure that some of these citations are not correct, since it appears in a wide variety of contexts.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant