Some of the English words detect as different language #76

kooshansari · 2022-11-24T05:24:02Z

Please check the below sheet. For most of the simple English words it detects as different language

janheinrichmerker · 2023-02-16T23:56:41Z

Most language detectors don't work well on very short texts (in this case a single word).
You could use the model's output scores to define a threshold under which no language is detected. Otherwise the language labels on short texts will probably be noisy.

bfischer1121 · 2023-06-19T11:24:38Z

Why are language detectors so bad on short text? I get that the sample size is small but one would think they would switch approaches to a basic sanity check. e.g., the characters "age" have absolutely no correlation with the characters found in Korean. This seems to be an issue with every language detection library we've used -- pure randomness!

AmitMY · 2023-07-07T10:53:52Z

I feel like this one might be a little better - https://mediapipe-studio.webapps.google.com/demo/language_detector

bfischer1121 · 2023-07-07T11:49:17Z

Nice suggestion (detects 6/7 correctly)

kooshansari changed the title ~~Some of the English words~~ Some of the English words detect as different language Nov 24, 2022

AmitMY mentioned this issue Nov 7, 2023

[Feature] Replace Spoken Language Identification with MediaPipe Solutions sign/translate#116

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some of the English words detect as different language #76

Some of the English words detect as different language #76

kooshansari commented Nov 24, 2022

janheinrichmerker commented Feb 16, 2023

bfischer1121 commented Jun 19, 2023

AmitMY commented Jul 7, 2023

bfischer1121 commented Jul 7, 2023

Some of the English words detect as different language #76

Some of the English words detect as different language #76

Comments

kooshansari commented Nov 24, 2022

janheinrichmerker commented Feb 16, 2023

bfischer1121 commented Jun 19, 2023

AmitMY commented Jul 7, 2023

bfischer1121 commented Jul 7, 2023