The corpus is split into training (220'000 texts), evaluation (10'000 texts) and test (10'000 texts) sets. For each text (text_id.src) there is a corresponding reference summary (text_id.tgt).
See http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.821.pdf for more information.