You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Implement NLP to extract keywords from SPAM and HAM corpus.
A frequency vector of these keywords would be a great feature for our model. To make sure, we have keywords specific to SPAM and HAM characteristics of the PR, we decide to do the following.
N = complexity of the model (starting with 30, might change iteratively to achieve better results)
A = Top N keywords list from SPAM dataset
B = Top N keywords list from HAM dataset
SPAM_KEYWORDS = (A - B)
HAM_KEYWORDS = (B - A)
Suggest using multi-rake for rapid keyword extraction from corpus
The text was updated successfully, but these errors were encountered:
Implement NLP to extract keywords from SPAM and HAM corpus.
A frequency vector of these keywords would be a great feature for our model. To make sure, we have keywords specific to SPAM and HAM characteristics of the PR, we decide to do the following.
Suggest using multi-rake for rapid keyword extraction from corpus
The text was updated successfully, but these errors were encountered: