Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chore(nlp): Greek tokenizer, stemmer and stopwords added #1110

Merged
merged 2 commits into from Nov 19, 2020

Conversation

elozano98
Copy link
Contributor

@elozano98 elozano98 commented Nov 18, 2020

Depends on #1109. Please review it first. ⚠️

Description

Greek tokenizer, stemmer, and stopwords have been added to contentful nlp.

Context

Adding them will make it possible to process Greek text.

Approach taken / Explain the design

The tokenizer and the stemmer used are from the nlpjs library while the stopwords have been collected from the nltk python library.

Testing

The pull request...

  • ✔️ has unit tests

@elozano98 elozano98 requested review from vanbasten17 and dpinol and removed request for vanbasten17 November 18, 2020 16:37
Base automatically changed from contentful/ro to master November 19, 2020 14:35
@elozano98 elozano98 merged commit 02a4972 into master Nov 19, 2020
@elozano98 elozano98 deleted the contentful/el branch November 19, 2020 14:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants