Negative sampling #1

TGDivy · 2020-07-09T16:18:47Z

Hi,
In the dataset file for the baseline, line 128:

    knowledge_key = "{}__{}__{}".format(knowledge["domain"], knowledge["entity_id"], knowledge["doc_id"])
    # find snippets with same entity as candidates
    prefix = "{}__{}".format(knowledge["domain"], knowledge["entity_id"])

I think the code is intended to select knowledge snippets that belong to the same entity as the label, i.e. if the label is hotel__1__15, it should select everything matching hotel__1__*. However, because the prefix has no trailing "__", matching on it (assuming candidates are picked by prefix) also selects snippets from other entities, such as hotel__11__*, hotel__12__*, etc. This could be fixed, for example by ending the prefix with the "__" delimiter.
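To illustrate the problem, here is a minimal sketch. It assumes the candidates are picked with a plain `startswith` check on the prefix, which the line quoted above does not show. The snippet keys and the helper names (`make_key`, `candidates_loose`, `candidates_strict`) are hypothetical and not taken from the baseline.

```python
def make_key(domain, entity_id, doc_id):
    # Same key layout as the quoted line: domain__entity_id__doc_id
    return "{}__{}__{}".format(domain, entity_id, doc_id)


# Hypothetical snippet keys for three hotel entities (1, 11, 12)
snippet_keys = [make_key("hotel", e, d) for e in (1, 11, 12) for d in (0, 15)]


def candidates_loose(keys, domain, entity_id):
    # Prefix without a trailing delimiter: "hotel__1" also matches "hotel__11__..."
    prefix = "{}__{}".format(domain, entity_id)
    return [k for k in keys if k.startswith(prefix)]


def candidates_strict(keys, domain, entity_id):
    # Prefix ending in "__": only keys of exactly this entity match
    prefix = "{}__{}__".format(domain, entity_id)
    return [k for k in keys if k.startswith(prefix)]


loose = candidates_loose(snippet_keys, "hotel", 1)
strict = candidates_strict(snippet_keys, "hotel", 1)
print(loose)   # includes keys from entities 11 and 12 as well
print(strict)  # only keys from entity 1
```

An alternative with the same effect would be to split each key on "__" and compare the domain and entity parts exactly, which avoids relying on the delimiter not appearing inside an entity id.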

chaoweihuang mentioned this issue Jul 27, 2020

fix negative sampling issue #2

Merged

TGDivy closed this as completed Jul 27, 2020
