
How did you limit the amount of added data to approximately 10% of the original data? #2

Closed
XY-1 opened this issue Feb 14, 2020 · 0 comments

XY-1 commented Feb 14, 2020

I am deeply interested in your research paper, and I would like to conduct some follow-up research.
Could you tell me, in detail, how you limited the amount of added data to approximately 10% of the original data? I sketch the alternatives I have in mind in code after the questions below.

  1. Did you ignore every occurrence of a matched term that corresponds to a specific entry?
    -- For example, if the word "thank" is chosen as an entry to be ignored, did you ignore every occurrence of "thank" in the corpus?
    Or did you decide per occurrence whether to ignore a matched term?
    -- For example, "thank" in the first sentence is ignored, but "thank" in the second sentence may not be.

  2. Did you decide in advance which sentences would have their matched terms ignored?
    In other words, before term matching, did you split the sentences into a 90% set and a 10% set, and match terms only against the 10% set?
    Or, as a result of ignoring some matches, did the sentences containing term annotations simply end up amounting to approximately 10% of the original data?

  3. Is it possible to ignore specific matched terms when multiple terms match in one sentence?
    -- For example, if the words "thank", "common", and "vote" are all matched in one sentence, is it possible to ignore only "thank"?
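
To make the alternatives concrete, here is a minimal Python sketch of how I currently understand each option. This is my own hypothetical code, not taken from your paper or repository; the `<term>...</term>` markup, the `keep_prob` and `fraction` knobs, and all function names are placeholders I made up for illustration:

```python
import random

TERM_DICT = {"thank", "common", "vote"}  # toy dictionary entries

def annotate_token_level(sentence, keep_prob=0.1):
    """Alternative (1b): decide per occurrence. Each matched word is
    annotated with probability keep_prob, so "thank" may be annotated
    in one sentence and ignored in another."""
    out = []
    for tok in sentence.split():
        if tok in TERM_DICT and random.random() < keep_prob:
            out.append(f"<term>{tok}</term>")
        else:
            out.append(tok)
    return " ".join(out)

def annotate_presplit(corpus, fraction=0.1):
    """Alternative (2a): pre-split the corpus into a 90% plain set and
    a 10% set, and run term matching only on the 10% set."""
    corpus = corpus[:]
    random.shuffle(corpus)
    cut = max(1, int(fraction * len(corpus)))
    annotated = [annotate_token_level(s, keep_prob=1.0) for s in corpus[:cut]]
    return annotated + corpus[cut:]

def annotate_selective(sentence, ignore=frozenset({"thank"})):
    """Alternative (3): when several terms match in one sentence,
    annotate all of them except those in an explicit ignore set."""
    out = []
    for tok in sentence.split():
        if tok in TERM_DICT and tok not in ignore:
            out.append(f"<term>{tok}</term>")
        else:
            out.append(tok)
    return " ".join(out)

if __name__ == "__main__":
    corpus = ["thank you for the common vote", "please vote today"]
    print(annotate_presplit(corpus))
    print(annotate_selective("thank you for the common vote"))
```

Which of these (if any) corresponds to your actual procedure?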
