Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: improvements to @stdlib/nlp-expand-contractions #497

Closed
wants to merge 8 commits into from
Closed

feat: improvements to @stdlib/nlp-expand-contractions #497

wants to merge 8 commits into from

Conversation

titanism
Copy link

Resolves #496

titanism added a commit to spamscanner/spamscanner that referenced this pull request Jun 13, 2022
"cannot": "can not",
"cant": "can not",
"cant've": "cannot have",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would this not be "can not have"?

"cannot": "can not",
"cant": "can not",
"cant've": "cannot have",
"cantve": "cannot have",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same comment as above.

"couldntve": "could not have",
"couldve": "could have",
"cuppa": "cup of",
"cya": "see ya",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

s/ya/you/?

"arent": "are not",
"asap": "as soon as possible",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm...some of these seem more like abbreviations, rather than contractions.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

True, if we go the route of strict contractions, then we need to make it clear in the README too I think. Other packages have these and I was just trying to merge everything together (so then we can review like this). Thank you for looking into all this btw.

"bday": "birthday",
"bo's'n": "bosun",
"bo's'ns": "bosuns",
"btw": "by the way",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

More an acronym/abbreviation than a contraction.

"dyou": "did you",
"e'en": "even",
"e'er": "ever",
"everybody's": "everybody is",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is ambiguous. Could be a possessive.

"e'en": "even",
"e'er": "ever",
"everybody's": "everybody is",
"everyone's": "everyone is",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same comment as above.

"itsn't": "it is not",
"iunno": "I do not know",
"kinda": "kind of",
"kk": "okay",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also a bit tricky to distinguish a contraction from slang.

"mayn't": "may not",
"methinks": "I think",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is old English, the modern translation of which is "I think".

@titanism
Copy link
Author

You and other maintainers are welcome to make edits / delete changes, or delete certain commits altogether if need be.

This pull request was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[RFC]: Improvements to @stdlib/nlp-expand-contractions
2 participants