Speaker name extraction doesn't consider more than one word. #1

saurabhshri · 2017-05-16T20:02:43Z

The parser works fine when the speaker name is a single word, but when the name has more than one word, only the last word is extracted.

For example, "Saurabh: Oompa Loompa" is extracted as "Oompa Loompa", while
"Saurabh Shri: Oompa Loompa" is extracted as "Saurabh Oompa Loompa".

Fixing this is just a matter of searching until the start of the statement or the appearance of "\n".
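A rough sketch of that idea in Python, for illustration only. It is not the parser's actual code: split_speaker is a hypothetical helper, and it assumes the search runs backward from the first colon to the previous "\n" or the start of the string.

```python
def split_speaker(line):
    """Split 'Name: text' into (name, text); name is None if no speaker is found.

    Hypothetical helper for illustration, not part of the parser.
    """
    colon = line.find(":")
    if colon == -1:
        return None, line
    # Search backward from the colon to the start of the statement:
    # either the previous "\n" or the beginning of the string.
    start = line.rfind("\n", 0, colon) + 1
    name = line[start:colon].strip()
    if not name:
        return None, line
    # Keep anything before the speaker's line, drop the "Name:" prefix.
    text = line[:start] + line[colon + 1:].lstrip()
    return name, text
```

With this, split_speaker("Saurabh Shri: Oompa Loompa") yields the whole name "Saurabh Shri" rather than only "Shri". Note that the sketch treats any colon as a speaker separator, so a line such as "It is 10:30" would need an extra check.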



saurabhshri · 2017-05-20T12:58:56Z

Resolved in 3f3e2a3.

Jonathhhan · 2022-03-27T00:11:17Z

I have a similar issue. For example, with getDialogue(), "MAN 1: Looks like a tie." becomes "MAN: Looks like a tie." when it should become "Looks like a tie." Also, "BLANEY: Rusk did it." is left unchanged when it should become "Rusk did it." And somehow your pull request above gives me a "duplicate symbols" error message. It would be great if this could be fixed.
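The expected behavior for these cases could be pinned down with something like the following Python sketch. This is an assumption about the desired result, not how getDialogue() is implemented: strip_speaker and SPEAKER_PREFIX are hypothetical names, and the pattern assumes a speaker label is an upper-case word sequence (digits allowed) at the start of a line, followed by a colon.

```python
import re

# Assumed convention: label starts with an upper-case letter, continues with
# upper-case letters, digits, spaces, dots, apostrophes or hyphens, ends with ":".
SPEAKER_PREFIX = re.compile(r"^[A-Z][A-Z0-9 .'-]*:[ \t]*", re.MULTILINE)

def strip_speaker(dialogue):
    """Remove a leading speaker label from each line (hypothetical helper)."""
    return SPEAKER_PREFIX.sub("", dialogue)
```

Under that assumption, both "MAN 1: Looks like a tie." and "BLANEY: Rusk did it." lose their labels, while a line without a label is left alone. Mixed-case names such as "Saurabh Shri:" are deliberately not covered by this pattern.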

saurabhshri referenced this issue May 20, 2017

Improving Speaker Name detection.

3f3e2a3

Speaker Name detection on steroids :P :P

saurabhshri closed this as completed May 20, 2017
