I followed the code here and generated all 3 tsv files under DisExtract/data/books/ALL18_2019jan02_[valid, train, test].tsv. However, the tsv format does not match the json format required to run pretraining for the Factual Adapter.
The content format of the generated tsv files after executing python producer.py is as follows:
[Sentence 1]\t[Sentence 2]\t[Marker]
...
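To make the tsv layout concrete, here is a minimal sketch of reading rows in this format into Python dicts and dumping them as json. The key names `sentence1`, `sentence2`, and `marker`, the helper `tsv_rows_to_records`, and the sample rows are all assumptions made up for illustration; they are not the schema the Factual Adapter pretraining code expects, so the keys would need to be changed to match the required format.

```python
import csv
import io
import json


def tsv_rows_to_records(tsv_text):
    """Parse lines of the form sentence 1 <TAB> sentence 2 <TAB> marker into dicts."""
    # QUOTE_NONE keeps quotation marks inside sentences as literal characters.
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE)
    records = []
    for row in reader:
        if len(row) != 3:
            continue  # skip blank or malformed lines
        sentence1, sentence2, marker = row
        # Hypothetical key names, NOT the Factual Adapter json schema.
        records.append(
            {"sentence1": sentence1, "sentence2": sentence2, "marker": marker}
        )
    return records


# Made-up sample rows in the same three-column layout as the generated files.
sample = "I stayed home\tit was raining\tbecause\nShe smiled\tHe frowned\tbut\n"
records = tsv_rows_to_records(sample)
print(json.dumps(records, indent=2))
```

For a real file, the same function could be applied to the contents of one of the generated tsv files, for example by passing `open(path, encoding="utf-8").read()`.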
The required json file format should be as follows:
Is there a conversion script that converts the generated tsv format to json?
theblackcat102 changed the title from "In consistency in pretraining data format for Factual Adapter" to "Incorrect pretraining data format for Factual Adapter" on Jan 5, 2021