Acronym: add underscore test case #1436

link2xt · 2019-01-08T16:55:01Z

Many regular expression libraries have a way to detect word boundaries,
but their definition of word characters includes underscores.

\b and \w metacharacters were designed to detect programming language keywords,
and it is a common mistake to use them to match words in natural languages.

rpottsoh · 2019-01-08T17:06:16Z

@link2xt thanks for opening the PR. The Travis CI build failed. Please check the details and correct the issue that it is reporting. Let me know if you have any questions.

sshine · 2019-01-08T20:56:09Z

Excellent!

Should the acronym of "Foo_Bar" be "FB" and not "F"? And should the acronym of "_Foo" be well-defined?

This does not appear to be a clarification of the exercise, but an extension of it. And it seems to test two different things. So it would warrant two test cases and a major version bump.

I am personally content with not having them, but I'm not seeing a lot of solutions to this exercise. I think your suggestion to teach that \w in regex includes _ is one that benefits many tracks.

And I think that if the acronym of "Foo-Bar" is "FB", you can make the case for _. Is that what you're doing?

petertseng · 2019-01-08T21:30:16Z

\b and \w metacharacters were designed to detect programming language keywords,
and it is a common mistake to use them to match words in natural languages.

This has subtly pointed out that up to this point, all inputs have been written in a form that one might see in natural language. I think FOO_BAR is something I would not expect to see in natural language (but I could be convinced about _BAZ_ if we assume that someone is trying to emulate underlining a word). So FOO_BAR would be a departure from existing test cases for this reason. Be prepared to accept this departure.

The above comment must not be read as either variant of "I {support, oppose} adding FOO_BAR".

link2xt · 2019-01-08T21:35:39Z

This has subtly pointed out that up to this point, all inputs have been written in a form that one might see in natural language.

I guess I will change this to look like markdown-emphasized word then.

Many regular expression libraries have a way to detect word boundaries, but their definition of word characters includes underscores. \b and \w metacharacters were designed to detect programming language keywords, and it is a common mistake to use them to match words in natural languages.

link2xt · 2019-01-08T21:43:56Z

@sshine

Should the acronym of "Foo_Bar" be "FB" and not "F"?

I guess it is better not to test it then, because it is ambiguous, just like "FooBar", which is also not tested. But _foo_ should not be treated different from *foo* simply because your language has regex library.

add underscore test case fix the text example to pass the newly added test case exercism/problem-specifications#1436

rpottsoh added the new test case idea label Jan 8, 2019

link2xt mentioned this pull request Jan 8, 2019

Acronym mentoring: add another (better?) regexp exercism/website-copy#691

Merged

rpottsoh approved these changes Jan 8, 2019

View reviewed changes

sshine merged commit cacf1f1 into exercism:master Jan 9, 2019

link2xt deleted the acronym_underscore branch January 9, 2019 05:17

This was referenced Jan 11, 2019

Acronym: update to v1.7.0 exercism/delphi#356

Merged

acronym: add leading / trailing and multiple separator case #1432

Open

This was referenced Mar 25, 2019

acronym: Update and exclude new tests exercism/ruby#950

Merged

acronym: Update and exclude new tests exercism/ruby#953

Merged

sshine pushed a commit to exercism/haskell that referenced this pull request Oct 10, 2019

acronym: upgrade to 1.7.0 (add extra test case) (#861)

9252572

add underscore test case fix the text example to pass the newly added test case exercism/problem-specifications#1436

sharno mentioned this pull request Oct 10, 2019

upgrade acronym to 1.7.0 (add extra test case) exercism/haskell#861

Merged

petertseng mentioned this pull request Dec 10, 2021

Word Count - Adds multiple apostrophe test case #1628

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Acronym: add underscore test case #1436

Acronym: add underscore test case #1436

link2xt commented Jan 8, 2019

rpottsoh commented Jan 8, 2019 •

edited

sshine commented Jan 8, 2019

petertseng commented Jan 8, 2019

link2xt commented Jan 8, 2019

link2xt commented Jan 8, 2019

Acronym: add underscore test case #1436

Acronym: add underscore test case #1436

Conversation

link2xt commented Jan 8, 2019

rpottsoh commented Jan 8, 2019 • edited

sshine commented Jan 8, 2019

petertseng commented Jan 8, 2019

link2xt commented Jan 8, 2019

link2xt commented Jan 8, 2019

rpottsoh commented Jan 8, 2019 •

edited