-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Description
[ ] I have checked the documentation and related resources and couldn't resolve my bug.
Describe the bug
The answer generated by my model is similar to the ground_truths, but my answer_correctness score is very poor eventhough I have good score in the other metrics like context_recall, precision and faithfulness.
Ragas version:0.1.18
Python version:3.12
Code to Reproduce
scores = evaluate(
eval_dataset,
metrics=[context_recall, context_precision, faithfulness, answer_correctness],
llm = llm,
embeddings=GoogleGenerativeAIEmbeddings(google_api_key="AIzaSyCJFrruyyyuuuytuuiysJnFcQZJa-E-QJOxyjK_X8rpBkIQ", model="models/embedding-001"),
)
Error trace
No error but poor answer correctness metric score
Expected behavior
To get a good score for the answer_correctness like I got for the others


Additional context
Add any other context about the problem here.