-
Notifications
You must be signed in to change notification settings - Fork 186
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Greek support #282
Comments
Thanks for the link, that's very useful. Reviewing it, however, the list of English stopwords (at 895 words) is far longer and more extensive than the fairly conservative lists currently available in quanteda through However the Greek list from the source you listed has only 79 words, so it is much smaller. However this also shows that the coverage of the different languages from your source is highly imbalanced. What do you think of the 79 words, compared to the other language lists in |
To be honest I have absolutely no idea. I'm in the middle of processing greek parliamentary written questions and only see a bunch of weird characters ;-). |
EL_stopwords.xlsx |
There is still one mistake in the stopword list: "τηs", which is the translation of "hers", should be replaced with "της". |
Thanks, fixed in 8fc9954 |
Hi, are there any plans to fully support Greek as a language? From what I can tell there are no stopwords available although some sources already exist.
The text was updated successfully, but these errors were encountered: