dash and lowered_split fix for inflect.py _plnoun() #127

picobyte · 2021-03-16T19:52:05Z

In `_plnoun()`, the variable `lowered_split` is produced by splitting on dashes, but the code that later reuses it needs the lowercased words split on whitespace, not on dashes.
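To illustrate the distinction, here is a minimal sketch using only plain `str.split`. It is not the actual `_plnoun()` code, and the sample words are picked purely for illustration.

```python
# Illustrative only: this is NOT the inflect.py implementation.
word = "Man-of-War"
lowered = word.lower()

# Splitting on dashes breaks a hyphenated compound into its parts.
dash_split = lowered.split("-")

# Splitting on a space leaves the hyphenated compound in one piece.
space_split = lowered.split(" ")

print(dash_split)   # ['man', 'of', 'war']
print(space_split)  # ['man-of-war']

# For a space-separated phrase the two splits behave the other way round.
phrase = "prima donna"
print(phrase.split("-"))  # ['prima donna']
print(phrase.split(" "))  # ['prima', 'donna']
```

Code that expects one kind of split but receives the other will index into the wrong pieces.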

jaraco · 2022-02-01T02:40:26Z

Can you add a test that captures the missed expectation prior to the patch?

jaraco · 2022-03-23T21:46:49Z

I regret not having responded to this sooner. It does now appear to be abandoned. Please feel free to revive the effort at some point.

picobyte · 2022-05-01T09:17:24Z

I'm guessing one would have to trigger this with 'prima-donna'; then one would get back something like 'prima donnas' and the singular 'prima donna' in tests/test_pl_si.py when it is added to tests/words.txt. Maybe there would be more cases if pl_sb_irregular_compound contained more words in the future, like man-of-war or jack-in-the-pulpit.

What is the concern with this change? It seems to fix a dash/space confusion. Are you inclined to think this was intentional?

jaraco · 2022-05-01T14:20:09Z

> What is the concern with this change?

It's changing behavior in a function that's already unmanageably complex without any tests. That is, I could revert the change and the tests will still pass. To be sure, it's not your fault the implementation is so complex.

Moreover, ~~this PR barely has a problem description~~. It alludes to a problem, but does not provide even a single example of a failed expectation. Now I have one example, 'prima-donna', but I'm not even sure that's a good example. When I look up the phrase, it appears it's supposed to be "prima donna", so it would be incorrect to hyphenate the phrase. I also observe that the dash separation logic only applies to two or more dashes.

Man-of-war may be a better example.

Still, I'd like to see some tests added that capture not only the missed expectations, but also expectations around proximate concerns (such as space-separated words or compound subjects with only one dash).
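As a rough sketch of how those categories of input could be enumerated before writing expectations for them: the helper name `separator_kind` and the sample phrases below are hypothetical and not part of inflect, and no expected plural forms are asserted here.

```python
# Hypothetical helper, not part of inflect: buckets a phrase by its separators
# so that each bucket can get its own test cases.
def separator_kind(phrase: str) -> str:
    dashes = phrase.count("-")
    if dashes >= 2:
        return "multi-dash"       # e.g. prepositional compounds
    if dashes == 1:
        return "single-dash"      # compound with only one dash
    if " " in phrase:
        return "space-separated"  # whitespace-separated words
    return "single-word"


samples = ["man-of-war", "prima-donna", "prima donna", "donna"]
for sample in samples:
    print(sample, "->", separator_kind(sample))
```

Each bucket would then need at least one case in the test data, with the expected plural supplied by someone who has verified it.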

jaraco · 2022-05-01T16:00:50Z

Looking at this a bit more, I see that there's similar code in _sinoun that has already been refactored to handle spaces and dashes differently. It would be nice if there were a way to unify this logic too.

jaraco · 2022-05-01T17:11:02Z

See #153, where I've refactored the code to unify the logic around prepositional phrases.

jaraco · 2022-05-01T18:26:15Z

My mistake. I see now that #126 captures the issue. Let's continue the conversation there.

Update inflect.py

0c0769e

picobyte mentioned this pull request Mar 16, 2021

possible dash/whitespace issue in _plnoun() #126

Closed

jaraco closed this Mar 23, 2022

jaraco reopened this May 1, 2022

jaraco closed this May 1, 2022
