Skip to content

Issues with dangling zshot.linker.LinkerRegen #74

Answered by marmg
louisguitton asked this question in Q&A
Discussion options

You must be logged in to vote

Hello. The max_input_len is the maximum number of tokens the model can use as input. When we create the input sentences, we need to add special tokens to specify which are the entities to classify, we create one different sentence for each entity in the text.

In the case that the input text is longer than the max_input_len, we truncate the input text, keeping both left and right context of the entity, with the same number of tokens in both left and right side.

The linker will see all the mentions as we are creating one input sentence for each mention, but the context will be limited to that number of tokens. If the text is too long and there are too many mentions in it, this will result i…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by louisguitton
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants