Migrate factual correctness #2401

rhlbhatnagar · 2025-11-05T04:26:27Z

No description provided.

anistark

ascore implementation executes verification sequentially.

We can probably execute these in parallel, which is sequential right now:

Verify response claims against reference
Verify reference claims against response

thoughts?

anistark · 2025-11-05T06:41:05Z

src/ragas/metrics/collections/_factual_correctness.py

+    statements: List[StatementFaithfulnessAnswer]
+
+
+def claim_decomposition_prompt(


do we need this method here or can we place it somewhere in common ragas.prompt.metrics.common

anistark · 2025-11-05T06:44:00Z

tests/e2e/metrics_migration/test_factual_correctness_migration.py

+
+                # Ensure implementations give reasonably similar scores
+                # Factual correctness may have more variation due to claim decomposition and different LLM behavior
+                assert score_diff < 0.35, (


35% tolerance is too high, no? Is it intended? Can it be lowered to 10-15 % ?

anistark · 2025-11-05T06:47:28Z

src/ragas/metrics/collections/_factual_correctness.py

+
+        return MetricResult(value=float(np.round(score, 2)))
+
+    async def _decompose_claims(self, response: str) -> List[str]:


We're missing callbacks in this and _verify_claims methods.

Is it intended? Callbacks would help in analysis and tracing.

Migrate factual correctness

ee40879

dosubot bot added the size:XL This PR changes 500-999 lines, ignoring generated files. label Nov 5, 2025

anistark reviewed Nov 5, 2025

View reviewed changes

rhlbhatnagar marked this pull request as draft November 5, 2025 07:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Migrate factual correctness #2401

Migrate factual correctness #2401

rhlbhatnagar commented Nov 5, 2025 •

edited

Loading

Uh oh!

anistark left a comment

Uh oh!

anistark Nov 5, 2025

Uh oh!

anistark Nov 5, 2025

Uh oh!

anistark Nov 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		statements: List[StatementFaithfulnessAnswer]


		def claim_decomposition_prompt(


		return MetricResult(value=float(np.round(score, 2)))

		async def _decompose_claims(self, response: str) -> List[str]:

Migrate factual correctness #2401

Are you sure you want to change the base?

Migrate factual correctness #2401

Conversation

rhlbhatnagar commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

anistark left a comment

Choose a reason for hiding this comment

Uh oh!

anistark Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

anistark Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

anistark Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rhlbhatnagar commented Nov 5, 2025 •

edited

Loading