Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clarification Needed on the Specificity of test_dataset #22

Closed
ViceSilva opened this issue Apr 16, 2024 · 1 comment
Closed

Clarification Needed on the Specificity of test_dataset #22

ViceSilva opened this issue Apr 16, 2024 · 1 comment

Comments

@ViceSilva
Copy link

Hello,

I am currently working with the project and have a question regarding the test_dataset used within. Could you please clarify whether the test_dataset needs to be domain-specific, particularly tailored to the RAG domain, or if a generic labeled dataset is suitable for this purpose?

@robbym-dev
Copy link
Collaborator

For evaluating Retrieval-Augmented Generation (RAG) models like the ones you’re working with, the choice of a validation dataset can significantly influence how well the model’s performance generalizes across different types of data and use cases.

If your RAG model is intended to be used in a specific domain (like medical, legal, or technical documents), it would be beneficial to use a domain-specific validation dataset. This approach helps ensure that the model performs well on the type of content it will encounter in its expected environment.

However, if the model is intended for more general use, a generic labeled dataset could suffice. This kind of dataset helps evaluate the model’s ability to handle a broad range of topics and types of queries.

In your case, it be beneficial to use a domain-specific validation dataset tailored to the RAG domain to accurately evaluate your RAG model using ARES.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants