Modifing the weights of words in the models #335

SeonggwanAhn · 2023-07-24T05:11:20Z

Hi. Your work MedCAT is so impressive.
I want to ask you a question.

Are the weights of words in the model changeable?
If possible, please let me know how to modify the weights of words in model.

Thanks

mart-r · 2023-07-25T12:39:35Z

Hi!

I'm not entirely sure what you're asking.
Are you trying to add more weight to a specific meaning of an ambiguous word (name)?
Are you trying to avoid recognition of certain words (names) altogether?
Something else?

But in general, there is no way to add more weights to any specific concept or a specific name of a concept.
With that said, the training set will have a significant impact on which concepts and/or names the model is able to effectively identify.

Then again, if you wish to limit the concepts your model is working with, you can always filter out the CUIs you don't need (CDB.filter_by_cui).
Or you could add the CUIs to a filter in the config (config.linking.filters.cuis).

SeonggwanAhn · 2023-08-21T05:00:48Z

Thanks for your reply.
What I mean is whether I can 'modify the concept vector' of a specific word in vocabulary.

Or Can I further train an already trained model(download completed) with my additional document?
I want to transfer and adjust this model for my experiment.

mart-r · 2023-08-21T08:13:03Z

Yes, you are more than welcome to further train and/or fine tune an existing model. The additional training data can change what the model can recognise significantly. But it all depends on the training route you're taking (whether unsupervised or supervised) as well as the specific training set.

So all in all, by using your own dataset to further train the model, you can probably achieve what you're trying to do.
But it almost certainly won't be possible with a single document.

SeonggwanAhn closed this as completed Sep 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modifing the weights of words in the models #335

Modifing the weights of words in the models #335

SeonggwanAhn commented Jul 24, 2023 •

edited

Loading

mart-r commented Jul 25, 2023

SeonggwanAhn commented Aug 21, 2023

mart-r commented Aug 21, 2023

Modifing the weights of words in the models #335

Modifing the weights of words in the models #335

Comments

SeonggwanAhn commented Jul 24, 2023 • edited Loading

mart-r commented Jul 25, 2023

SeonggwanAhn commented Aug 21, 2023

mart-r commented Aug 21, 2023

SeonggwanAhn commented Jul 24, 2023 •

edited

Loading