meta_data_vocab comprises sentences, not tokens #16

Open
Aatlantise opened this issue Jan 21, 2022 · 2 comments

@Aatlantise

Hello,

It looks like the meta_data_vocab used as an argument for model declaration is not in a format familiar to me: the vocabulary seems to be composed of sentences rather than tokens.

I tried omitting meta_data_vocab as an input, since it seems to be an optional argument, but that also fails due to a snippet of code that invokes meta_data_vocab.itos.

>>> META_DATA.vocab.itos
['<unk>', 'A trial run run on this initialization sentence initializes the OpenIE6 open information extractor .']
>>> meta_data_vocab.itos
['<unk>', 'A trial run run on this initialization sentence initializes the OpenIE6 open information extractor .']

Is meta_data_vocab meant to look like this? I was trying to declare a model that could be used for predicting any given input text, but meta_data_vocab seems to prevent this, tying each model instance to one specific predict_fp.

Many thanks!

@SaiKeshav
Collaborator

Hi, thank you for your interest in our work. Yes, the meta_data_vocab you have is correct: it contains the actual sentences themselves, so that when we print the final predictions of the system, we can print the corresponding sentence along with them.
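
For context, a sentence-level itos like this arises naturally when the meta-data field is built without tokenization. A minimal sketch, assuming a legacy torchtext Field with sequential=False (an assumption about the setup, not OpenIE6's actual data code):

```python
# A minimal sketch, assuming the legacy torchtext Field API (the import path
# is torchtext.legacy.data on torchtext >= 0.9), of how a non-sequential
# field yields sentence-level vocab entries: with sequential=False the string
# is never tokenized, so each full sentence becomes a single itos entry.
from torchtext import data

META_DATA = data.Field(sequential=False, use_vocab=True)
fields = [("meta_data", META_DATA)]
sent = ("A trial run run on this initialization sentence initializes "
        "the OpenIE6 open information extractor .")
examples = [data.Example.fromlist([sent], fields)]
META_DATA.build_vocab(data.Dataset(examples, fields))
print(META_DATA.vocab.itos)  # ['<unk>', 'A trial run run on ... extractor .']
```

Because the whole string is treated as one token, the vocab doubles as an index-to-sentence lookup table, which is exactly what the printing step needs.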

To achieve what you want, one simple solution is to pass the meta_data_vocab to the forward function instead of at initialization time. Does that make sense?
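
A minimal sketch of this idea, with hypothetical class and argument names (Extractor, meta_indices) rather than OpenIE6's actual signatures:

```python
# Accept meta_data_vocab in forward() instead of in __init__, so one model
# instance can serve any predict file. All names here are illustrative.
import torch
import torch.nn as nn

class Extractor(nn.Module):
    def __init__(self, hidden_dim=8):
        super().__init__()
        # note: no meta_data_vocab stored at construction time
        self.proj = nn.Linear(hidden_dim, hidden_dim)

    def forward(self, embeddings, meta_indices, meta_data_vocab):
        scores = self.proj(embeddings)
        # recover the original sentences so predictions can be printed
        # alongside the text they came from
        sentences = [meta_data_vocab.itos[i] for i in meta_indices]
        return scores, sentences

class StubVocab:  # stand-in for the torchtext vocab
    itos = ['<unk>', 'A trial run run on this initialization sentence ...']

model = Extractor()
scores, sents = model(torch.randn(1, 8), [1], StubVocab())
print(sents)  # ['A trial run run on this initialization sentence ...']
```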

@Aatlantise
Author

Many thanks for your input! I was able to do what I wanted by declaring the model without meta_data_vocab and then setting model._meta_data_vocab at a later time, before each inference. A natural follow-up question: would I be able to run inference on each input text without having to declare a new trainer object every time? The software allows me to do so, but it seems to reuse some indexing from previous dataloader objects.
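
A sketch of this workaround; the model and vocab builder below are stand-ins for OpenIE6's real objects, but _meta_data_vocab is the attribute name used above. The model is created once, and the vocab for the current input is attached right before each inference:

```python
# Declare the model once without meta_data_vocab, then swap in a fresh vocab
# per input. SimpleNamespace objects stand in for the real model and vocab.
from types import SimpleNamespace

def build_meta_vocab(sentences):
    # stand-in for the torchtext vocab built over the current predict file
    return SimpleNamespace(itos=['<unk>'] + list(sentences))

model = SimpleNamespace()  # declared once, without any meta_data_vocab
for predict_fp_sentences in (["Sentence one ."], ["Sentence two ."]):
    model._meta_data_vocab = build_meta_vocab(predict_fp_sentences)
    print(model._meta_data_vocab.itos)  # vocab now matches this input only
```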
