Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate language clustering using an ontology such as Glottolog #377

Open
twagoo opened this issue Feb 23, 2024 · 0 comments
Open

Investigate language clustering using an ontology such as Glottolog #377

twagoo opened this issue Feb 23, 2024 · 0 comments

Comments

@twagoo
Copy link
Member

twagoo commented Feb 23, 2024

From the feedback document "Unexpected behaviour of the VLO":

For some languages (such as German), a lot of different variants are available, including variants in different languages (“Mitteldeutsch”, “Oberdeutsch”, “German”, “German, Middle High (ca. 1050-1500)”, ...). They are not all standard German and not all tagged as German, so it isn’t easy to find them all. Typological hierarchies can be disputed, but the ~5438 different languages currently listed are difficult to navigate.

We would need to think how to present this and make it “facet ready”. One way could be to predefine a set of relevant language groups (perhaps determined by frequency??) and enhance the index with these.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant