CDQA: F1-recall

F1-recall measures the overlap between model-generated responses and ground truth, focusing on the model's ability to reproduce key elements from the reference.

  1. Tokenization: Both the generated text and ground truth are segmented into token lists using word segmentation tools.
  2. Calculation: Count the tokens in the model's output that also appear in the ground truth token list, then take the ratio of that count to the total number of tokens in the ground truth.
  3. Formula: F1-recall = (Number of common tokens) / (Total number of tokens in ground truth)
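
The three steps above can be sketched in Python as follows. This is a minimal illustration, not the benchmark's reference implementation. It makes two assumptions the text does not state: the hypothetical `whitespace_tokenize` helper stands in for whichever word segmentation tool is actually used, and repeated tokens are matched by multiset overlap, so a token counts as common at most as many times as it occurs in the ground truth.

```python
from collections import Counter
from typing import Callable, List


def whitespace_tokenize(text: str) -> List[str]:
    """Placeholder tokenizer (assumption): lowercase, then split on whitespace.

    Swap in a real word segmentation tool for languages without spaces.
    """
    return text.lower().split()


def f1_recall(
    prediction: str,
    ground_truth: str,
    tokenize: Callable[[str], List[str]] = whitespace_tokenize,
) -> float:
    """Return (number of common tokens) / (number of ground truth tokens)."""
    # Step 1: tokenize both texts.
    pred_tokens = tokenize(prediction)
    truth_tokens = tokenize(ground_truth)

    # An empty reference has nothing to recall; return 0.0 by convention
    # (assumption: the source does not define this case).
    if not truth_tokens:
        return 0.0

    # Step 2: count common tokens with multiset intersection (assumption),
    # which takes the minimum count of each token across the two lists.
    common = Counter(pred_tokens) & Counter(truth_tokens)
    num_common = sum(common.values())

    # Step 3: divide by the ground truth length.
    return num_common / len(truth_tokens)


if __name__ == "__main__":
    # 3 of the 6 ground truth tokens are covered by the prediction.
    print(f1_recall("the cat sat", "the cat sat on the mat"))  # 0.5
```

Because the denominator is the ground truth length, a longer prediction is never penalized for extra tokens; only missing reference tokens lower the score.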