You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When the probability of a given document for a certain language is over X, the process could exit early and return the language.
It’s at least interesting to research, maybe by splitting the input in parts of N characters/trigrams...
This also touches on another problem: normalising probability-values. Currently, franc returns [languageCode, 1] for a very-probable language, which might be confusing for further processing.
The text was updated successfully, but these errors were encountered:
When the probability of a given document for a certain language is over X, the process could exit early and return the language.
It’s at least interesting to research, maybe by splitting the input in parts of N characters/trigrams...
This also touches on another problem: normalising probability-values. Currently, franc returns
[languageCode, 1]
for a very-probable language, which might be confusing for further processing.The text was updated successfully, but these errors were encountered: