fix bad char encoding for italian stopwords #39

Merged
merged 1 commit into from
Dec 21, 2020

moty66
Contributor

@moty66 moty66 commented Dec 21, 2020

The Italian stop word list has many problems with the accented characters ò à è é ù ì, so some stop words are wrong. For example:
ahimè instead of ahimé
più instead of più

This pull request manually replaces these badly encoded sequences with the correct accented characters.
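
For reference, sequences like these are typical of UTF-8 text that was decoded as Latin-1 at some point. The fix in this pull request was done by hand; the snippet below is only a minimal sketch of how the same kind of damage could be detected and reversed programmatically. The helper name `fix_mojibake` and the sample word list are hypothetical and are not part of this repository.

```python
def fix_mojibake(word: str) -> str:
    """Undo UTF-8 text that was mistakenly decoded as Latin-1.

    Hypothetical helper for illustration; not part of the project.
    """
    try:
        # Recover the original bytes, then decode them as UTF-8.
        return word.encode("latin-1").decode("utf-8")
    except UnicodeError:
        # The word was not double-encoded (or is not Latin-1); keep it as-is.
        return word


# Escapes are used so the broken input survives any editor re-encoding:
# "\u00c3\u00b9" is the garbled form of "ù", "\u00c3\u00a8" of "è".
broken = ["pi\u00c3\u00b9", "ahim\u00c3\u00a8", "ciao"]
fixed = [fix_mojibake(w) for w in broken]
print(fixed)  # ['più', 'ahimè', 'ciao']
```

Words that are already correct raise a decode error on the round trip and are returned unchanged, so the helper can be run over a whole list. It only restores the original bytes; choosing between è and é for a given word is still a manual spelling decision.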

@arianpasquali
Contributor

Thank you for pointing that out.

@arianpasquali arianpasquali reopened this Dec 21, 2020
@arianpasquali arianpasquali merged commit 75eea57 into LIAAD:master Dec 21, 2020