Add text cleaner node #75
Comments
@lalitpagaria Could you list the text cleaning features we are looking for here, e.g. stemming, lemmatisation, stop word removal, etc.?
Thank you @shahrukhx01. Please find the requested details below. A few possible cleaning features (not exhaustive; if you have more ideas, please add them as well):
The design considerations are as follows:
I know it is a very extensive list; it helped me express what I have in mind. We do not need to implement all of it. We can start by adding basic cleaning and then enhance it. Please let me know whether you would like to work on it and create a PR.
Sounds good. I'll start working on it and open a PR with the first draft I come up with.
Closed with #110.
The idea is to have a configurable text cleaning node.
This node would also have predefined templates for cleaning tweets, Facebook feeds, app reviews, etc.
For details, refer to #75 (comment).
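To make the idea concrete, here is a minimal sketch of what a configurable cleaning node with predefined templates could look like. It is not the implementation from #110: the names `TextCleaner`, `TEMPLATES`, `from_template`, and the individual step functions are all hypothetical, and the stop word list is a tiny illustrative sample. It uses only the Python standard library.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# A cleaning step is any function that maps text to text (hypothetical convention).
CleaningFunction = Callable[[str], str]

# Tiny illustrative stop word list, not a real linguistic resource.
STOP_WORDS = frozenset({"a", "an", "the", "is", "are", "of", "to", "and"})


def to_lower(text: str) -> str:
    return text.lower()


def remove_urls(text: str) -> str:
    return re.sub(r"https?://\S+", " ", text)


def remove_mentions(text: str) -> str:
    return re.sub(r"@\w+", " ", text)


def remove_punctuation(text: str) -> str:
    # Replace every character that is neither a word character nor whitespace.
    return re.sub(r"[^\w\s]", " ", text)


def remove_stop_words(text: str) -> str:
    return " ".join(w for w in text.split() if w not in STOP_WORDS)


def collapse_whitespace(text: str) -> str:
    return " ".join(text.split())


# Predefined templates: an ordered list of steps per source type (assumed names).
TEMPLATES: Dict[str, List[CleaningFunction]] = {
    "tweet": [to_lower, remove_urls, remove_mentions,
              remove_punctuation, collapse_whitespace],
    "app_review": [to_lower, remove_punctuation,
                   remove_stop_words, collapse_whitespace],
}


@dataclass
class TextCleaner:
    """Applies a configurable, ordered list of cleaning steps."""

    steps: List[CleaningFunction] = field(default_factory=list)

    @classmethod
    def from_template(cls, name: str) -> "TextCleaner":
        # Copy the list so callers can extend it without mutating the template.
        return cls(steps=list(TEMPLATES[name]))

    def clean(self, text: str) -> str:
        for step in self.steps:
            text = step(text)
        return text


tweet_cleaner = TextCleaner.from_template("tweet")
print(tweet_cleaner.clean("Check THIS out @bob https://example.com/x #wow!!"))
```

Keeping each step as a plain function means a user can start from a template and append or reorder steps, or build a cleaner from scratch with `TextCleaner(steps=[...])`. Stemming or lemmatisation could be added the same way as further steps backed by an NLP library.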