Combining both model- and rule-based sentencizers #11107
Hi everyone. When I iterate over all sentences in a doc using the
Code for adding the new symbols:
Am I doing something wrong? Or is there another way to add exceptions to the model-based sentencizer? Thanks,
You should be able to use both the sentence recognizer and the sentencizer together without issue. When you say "only one can be used" what do you mean - do you get an error or something? The issue might be that if you don't set the
Thank you @polm for your answer. I've actually used the `overwrite` setting before, but it didn't help. Let me work through an example to show what I'm doing and the outputs:
Experiment text:
I will identify sentences using:
When I use the `en_core_web_lg` language model I'm getting:
After adding the sentencizer:
The order of pipeline components:
The new language model gives the output:
When I say "only one can be used", I mean that I would expect the combination of both splitting by
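To make the expected behaviour concrete, here is a minimal, library-independent sketch of what "combining both" could mean: take the union of the sentence starts proposed by the model and those implied by the extra split symbols. This is only an illustration under stated assumptions, not spaCy's actual implementation and not the code from this thread. The function names (`rule_based_starts`, `combine_starts`, `split_sentences`), the token list, and the model output are all hypothetical.

```python
from typing import Iterable, List, Set


def rule_based_starts(tokens: List[str], punct_chars: Iterable[str]) -> Set[int]:
    """Return indices of tokens that start a sentence under the rule
    'a new sentence begins right after a split symbol'."""
    punct = set(punct_chars)
    starts: Set[int] = {0} if tokens else set()
    for i in range(1, len(tokens)):
        # Start a sentence after a split symbol, but not on another symbol.
        if tokens[i - 1] in punct and tokens[i] not in punct:
            starts.add(i)
    return starts


def combine_starts(model_starts: Set[int], rule_starts: Set[int]) -> List[int]:
    """Union of both boundary sets: a token starts a sentence if either
    the model or the rule says so."""
    return sorted(model_starts | rule_starts)


def split_sentences(tokens: List[str], starts: Iterable[int]) -> List[str]:
    """Cut the token list at the given start indices."""
    if not tokens:
        return []
    bounds = sorted(set(starts) | {0}) + [len(tokens)]
    return [" ".join(tokens[a:b]) for a, b in zip(bounds, bounds[1:])]


# Hypothetical example: the "model" only splits after the period,
# while the rule adds ";" as an extra split symbol.
tokens = ["Buy", "milk", ";", "call", "Bob", ".", "Then", "rest"]
model_starts = {0, 6}  # assumed model output, not produced by a real model
rule_starts = rule_based_starts(tokens, [";"])
combined = combine_starts(model_starts, rule_starts)
sentences = split_sentences(tokens, combined)
```

In this sketch neither source overrides the other: a boundary from either one is kept. Whether a real pipeline behaves this way depends on the component order and on whether later components preserve or overwrite boundaries set earlier.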