You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Look at this example : the result should be Language.FRENCH without any doubt, but it's Language.UNKNOWN :
LanguageDetectordetector = LanguageDetectorBuilder.fromLanguages(Language.ENGLISH, Language.FRENCH).build();
LanguagedetectedLanguage = detector.detectLanguageOf("Découverte du château grâce à l'application visite virtuelle");
assertEquals(Language.FRENCH, detectedLanguage);
Expected :FRENCH
Actual :UNKNOWN
The text was updated successfully, but these errors were encountered:
Thank you for this report, @bdecarne. This is actually a bug in the rule-based filter engine which is supposed to classify the character â as a possible indicator for French. Unfortunately, I missed to include French as a possible language for this character. This is easy to fix and I will do that soon.
pemistahl
changed the title
False negative example
French not treated as possible language for character 'â'
Nov 14, 2021
Hello !
Look at this example : the result should be
Language.FRENCH
without any doubt, but it'sLanguage.UNKNOWN
:The text was updated successfully, but these errors were encountered: