Add danFEVER #97
Conversation
```python
for claim, evidence, label_id in zip(claims, evidences, labels):
    claim_is_supported = class_labels[label_id] == "Supported"
    sim = 1 if claim_is_supported else 0  # negative for refuted claims - is that what we want?
```
@Muennighoff - The DanFEVER dataset is similar to the FEVER dataset. I just wanted to make sure that this dataset is constructed fairly similarly to FEVER.
I use the claim as the query to all the evidence segments as the corpus. The relevance score is then determined by whether the claim is supported.
However, I am unsure if assigning 0 to "not supported" and "not enough evidence" is meaningful.
What are your thoughts?
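A minimal sketch of the qrels construction described above — 1 for supported claims, 0 for both "refuted" and "not enough evidence". The variable names (`claims`, `evidences`, `labels`, `class_labels`) and the toy data are assumptions mirroring the snippet under review, not the actual MTEB loader code:

```python
# Hypothetical stand-ins for the DanFEVER dataset fields (not the real data).
claims = ["claim-1", "claim-2", "claim-3"]
evidences = ["ev-1", "ev-2", "ev-3"]
labels = [0, 1, 2]
class_labels = ["Supported", "Refuted", "NotEnoughInfo"]

qrels = {}
for qid, (claim, evidence, label_id) in enumerate(zip(claims, evidences, labels)):
    # Relevance is 1 only when the claim is supported by the evidence;
    # refuted and not-enough-evidence pairs both get 0.
    claim_is_supported = class_labels[label_id] == "Supported"
    qrels[f"q{qid}"] = {f"d{qid}": 1 if claim_is_supported else 0}

print(qrels)
# → {'q0': {'d0': 1}, 'q1': {'d1': 0}, 'q2': {'d2': 0}}
```

Under this scheme only the supported pairs count as relevant at evaluation time, which is the open question in the thread.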
> However, I am unsure if assigning 0 to "not supported" and "not enough evidence" is meaningful.
If that's the same way it is done for FEVER, then I think it's okay!
I am unsure how it is done for FEVER (can I find the processing script somewhere?)
This is what it says in BEIR, so it does seem like everything that's not the evidence is a 0:
> FEVER [60] The Fact Extraction and VERification dataset is collected to facilitate the automatic fact checking. We utilize the original paper splits as queries Q and retrieve evidences from the pre-processed Wikipedia Abstracts (June 2017 dump) as our corpus T
Looks good, feel free to merge
Added danfever as a retrieval dataset
fixes #93