Named Entity Recognition coherence #2

jesus-seijas-sp · 2018-08-18T22:23:07Z

Describe the bug
Named Entity Recognition is not working as expected:

If an enumerated entity is added, it only finds the first occurance
If there is a regular expression entity that overlaps with an enumerated entity, if returns both instead of finding edges

To Reproduce

const { NlpManager } = require('node-nlp');

const manager = new NlpManager({ languages: ['en'] });
manager.addRegexEntity('mail', /\b(\w[-._\w]*\w@\w[-._\w]*\w\.\w{2,3})\b/ig);
manager.addNamedEntityText('location', 'barcelona', ['en'], ['Barcelona', 'Barna']);
manager.addNamedEntityText('location', 'madrid', ['en'], ['Madrid']);

const result = manager.process('en', 'My mail is barcelona@barcelona.es and i live in madrid', {});
console.log(result);

Expected behavior
Currently it returns:

[ { start: 11,
       end: 20,
       levenshtein: 0,
       accuracy: 1,
       option: 'barcelona',
       sourceText: 'Barcelona',
       entity: 'location',
       utteranceText: 'barcelona' },
     { start: 11,
       end: 33,
       accuracy: 1,
       sourceText: 'barcelona@barcelona.es',
       utteranceText: 'barcelona@barcelona.es',
       entity: 'mail' } ]

It should return:

[ { start: 11,
       end: 33,
       accuracy: 1,
       sourceText: 'barcelona@barcelona.es',
       utteranceText: 'barcelona@barcelona.es',
       entity: 'mail' },
{ start: 48,
       end: 53,
       levenshtein: 0,
       accuracy: 1,
       option: 'madrid',
       sourceText: 'Madrid',
       entity: 'location',
       utteranceText: 'madrid' }
   ]

The text was updated successfully, but these errors were encountered:

jesus-seijas-sp · 2018-08-26T14:11:38Z

Solved by two strategies:

Refactored Named Entities and Manager so find several occurences per utterance
Implemented a reduce edges strategy: https://github.com/axa-group/nlp.js/blob/master/lib/util/similar-search.js#L242

jesus-seijas-sp closed this as completed Aug 26, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Named Entity Recognition coherence #2

Named Entity Recognition coherence #2

jesus-seijas-sp commented Aug 18, 2018

jesus-seijas-sp commented Aug 26, 2018

Named Entity Recognition coherence #2

Named Entity Recognition coherence #2

Comments

jesus-seijas-sp commented Aug 18, 2018

jesus-seijas-sp commented Aug 26, 2018