Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update mentions extractors #21

Merged
merged 4 commits into from
Oct 24, 2022
Merged

Update mentions extractors #21

merged 4 commits into from
Oct 24, 2022

Conversation

marmg
Copy link
Collaborator

@marmg marmg commented Oct 20, 2022

Status Type ⚠️ Core Change Issue
Ready Feature Yes Link

Problem

As discussed in #18, some Mentions extractor may use entities with descriptions that may be different from the ones used by the lnker.

Solution

Add a new field, mentions, to allow the usage of different entities in the mentions extractors.

Also, added TARS and SMXM models as mentions extractors.

Other changes (e.g. bug fixes, small refactors)

  • Fixed bug in REGEN when there are no mentions to link
  • Refactor MentionsExtractor to use zshot.utils.data_models.span.Span instead of spacy.tokens.span as discussed in Refactor mentions extractor #19

Closes #18, #19

Signed-off-by: Marcos Martinez <Marcos.Martinez.Galindo@ibm.com>
Signed-off-by: Marcos Martinez <Marcos.Martinez.Galindo@ibm.com>
…can use entities with descriptions. Added TARS and SMXM as mentions extractors

Signed-off-by: Marcos Martinez <Marcos.Martinez.Galindo@ibm.com>
… force garbage collector due to memory problems

Signed-off-by: Marcos Martinez <Marcos.Martinez.Galindo@ibm.com>
@marmg marmg force-pushed the feat/update-mentions-extractor branch from 7dc4bdd to 59fea41 Compare October 21, 2022 14:44
@marmg marmg merged commit 708b895 into main Oct 24, 2022
@marmg marmg deleted the feat/update-mentions-extractor branch October 24, 2022 10:07
GabrielePicco pushed a commit that referenced this pull request Oct 28, 2022
* Added POS Mentions extractor

* Added v2 pos mentions extractor

* Added ZShot Param class to help user to register new components

* Added ZShot PipelineConfig

* Added zshot serialization

* Updated zshot seriailzation

* Fixed test_zshot

* blank

* ♻️ Refactor code

* ♻️ Use tempdir for temporary directory

* ♻️ Rename disable_default_ner field

* Updated serialization test

* ⏪ Remove external code

Co-authored-by: Gabriele Picco <Gabriele.Picco@ibm.com>
GabrielePicco pushed a commit that referenced this pull request Oct 28, 2022
* Added POS Mentions extractor

* Added v2 pos mentions extractor

* Added ZShot Param class to help user to register new components

* Added ZShot PipelineConfig

* Added zshot serialization

* Updated zshot seriailzation

* Fixed test_zshot

* blank

* ♻️ Refactor code

* ♻️ Use tempdir for temporary directory

* ♻️ Rename disable_default_ner field

* Updated serialization test

* ⏪ Remove external code

Co-authored-by: Gabriele Picco <Gabriele.Picco@ibm.com>
GabrielePicco pushed a commit that referenced this pull request Oct 28, 2022
* 🎨 Refactored mentions extractor to work with zshot spans.
* 🐛 Fixed bug in linker_regen when there are no mentions to link
* ✨ Added mentions to PipelineConfig and ZShot, now mentions extractor can use entities with descriptions. Added TARS and SMXM as mentions extractors
* ✅ Added tests for SMXM and TARS mentions extractors. Updated tests to force garbage collector due to memory problems
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Adding SMXM model as a mentions extractor
1 participant