About using the dataset NYT-FB #1

vhientran · 2021-01-06T13:37:50Z

Sorry for disturbing you. However, I wonder about using the dataset NYT-FB in your experiment. While TACRED test set provides the relation type for each sentence, I cannot find each relation type for each sentence in NYT-FB. I already got the NYT-FB dataset from Diego Marcheggiani, but most sentences are without relation type as (https://github.com/diegma/relation-autoencoder/blob/master/data-sample.txt). I wonder how to evaluate your system on NYT-FB without labels?

Thanks for your help!

ttthy · 2021-01-09T15:45:56Z

Hi @angelotran05 ,

Please find the statistics of positive sentences (labelled sentences) in our paper, Table 3 Appendix A.
There are 262 relation types in NYT-FB, "...2.1% of the sentences in NYT-FB were aligned against Freebase’s triplets" (page 3, section 3 Experiments and results, datasets).
All data are used during training, but only the labelled sentences are used for evaluation (7,793 and 33,808 sentences in dev and test set, respectively).
Let me know if you have other questions.

Best,

https://www.aclweb.org/anthology/2020.acl-main.669.pdf

vhientran · 2021-01-10T06:33:20Z

Hi @ttthy ,

I got it. Thank you very much for your help.

All the best,

vhientran closed this as completed Jan 10, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About using the dataset NYT-FB #1

About using the dataset NYT-FB #1

vhientran commented Jan 6, 2021

ttthy commented Jan 9, 2021

vhientran commented Jan 10, 2021

About using the dataset NYT-FB #1

About using the dataset NYT-FB #1

Comments

vhientran commented Jan 6, 2021

ttthy commented Jan 9, 2021

vhientran commented Jan 10, 2021