495: Default string values with dashes are mistakenly treated as dynamic defaults #578

lindsay-stevens · 2022-01-17T08:54:52Z

Closes #495

Why is this the best possible solution? Were any other approaches considered?

Approaches the problem by adding a lexer, whose parsing rules are used to identify expression tokens that indicate a dynamic expression. These tokens are: math operators, union operator, xpath predicates, pyxform references, and function calls. There may be others but the lexer provides a framework to add to the parsing rules, or the business logic using them.

As mentioned in #495, a regular regex isn't adequate to identify the various tokens that indicate a dynamic expression. I looked for "simple" lexer libraries that could parse XPath in Python. Although that isn't really what is needed, there were some big hints in eulxml for identifying common tokens and especially valid XML/XPath names. These searches also revealed the presence of a regex scanner in the Python standard library, which is used here. The eulxml library uses PLY, but it seemed like overkill (if this isn't already) to learn and bring in a dependency for this heuristic check.
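For readers unfamiliar with the standard library scanner, here is a minimal sketch of the idea using `re.Scanner` (an undocumented but long-standing class in the `re` module). The token names, the rules, and the `looks_dynamic` helper are assumptions made up for illustration; they are not the rules in this PR, and they leave out cases such as the `mod` and `div` operators.

```python
import re
from typing import NamedTuple


class Token(NamedTuple):
    kind: str
    text: str


def _emit(kind):
    """Build a scanner action that tags the matched text with a token kind."""
    def action(scanner, text):
        return Token(kind, text)
    return action


# Hypothetical rules, for illustration only. Order matters: at each position
# the first rule that matches wins, so NUMBER and NAME are tried before
# OPS_MATH and therefore absorb hyphens that are part of a value.
LEXICON = [
    (r"\$\{[^}]*\}", _emit("PYXFORM_REF")),       # ${question_name}
    (r"[A-Za-z_][\w.-]*\(", _emit("FUNC_CALL")),  # name followed by "("
    (r"'[^']*'", _emit("STRING")),
    (r'"[^"]*"', _emit("STRING")),
    (r"-?\d+(?:\.\d+)?", _emit("NUMBER")),        # optional leading minus
    (r"[A-Za-z_][\w.-]*", _emit("NAME")),         # names may contain "-" and "."
    (r"[*+-]", _emit("OPS_MATH")),
    (r"\|", _emit("OPS_UNION")),
    (r"\[[^\]]*\]", _emit("XPATH_PRED")),
    (r"\s+", None),                               # skip whitespace
    (r".", _emit("OTHER")),                       # anything else, one char at a time
]

# Token kinds that this sketch treats as evidence of a dynamic expression.
DYNAMIC_KINDS = {"PYXFORM_REF", "FUNC_CALL", "OPS_MATH", "OPS_UNION", "XPATH_PRED"}

SCANNER = re.Scanner(LEXICON)


def looks_dynamic(default: str) -> bool:
    """Return True if any token in the default suggests a dynamic expression."""
    tokens, _remainder = SCANNER.scan(default)
    return any(token.kind in DYNAMIC_KINDS for token in tokens)


print(looks_dynamic("a-2"))                  # False: a single NAME token
print(looks_dynamic("https://my-site.com"))  # False: NAME and OTHER tokens only
print(looks_dynamic("7 - 4"))                # True: standalone "-" is OPS_MATH
print(looks_dynamic("${t3}"))                # True: pyxform reference
```

The design point is that a hyphen only counts as an operator when no earlier rule can absorb it into a name or a number, which is something a single regex search for "-" cannot express.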

Includes tests to reproduce the issues described in #495 and #533, as well as a range of cases which should / should not be considered dynamic. Some of these overlap with existing tests but I didn't want to go as far as refactoring them until I got some feedback on this approach.

What are the regression risks?

There currently aren't tests to benchmark performance for huge forms full of complex default expressions. I found that using a shared / pre-compiled scanner speeds things up significantly vs. making a new one for each question, but I'm not sure about the remaining overhead for huge forms.
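One way to share a pre-compiled scanner is to build it behind a cached factory function, so the regex compilation cost is paid once per process rather than once per question. The sketch below assumes a toy two-rule lexicon and hypothetical function names (`get_scanner`, `scan_default`); it only shows the caching pattern, not the PR's code.

```python
import re
from functools import lru_cache


@lru_cache(maxsize=1)
def get_scanner():
    """Build the scanner once; later calls return the cached instance."""
    # Hypothetical two-rule lexicon, just enough to show the caching pattern.
    return re.Scanner([
        (r"[*+-]", lambda scanner, text: ("OPS_MATH", text)),
        (r"[^*+-]+", lambda scanner, text: ("OTHER", text)),
    ])


def scan_default(default):
    """Tokenize one default value with the shared scanner."""
    tokens, _remainder = get_scanner().scan(default)
    return tokens


# Every question reuses the same compiled scanner object.
for value in ["7 - 4", "plain text"]:
    print(scan_default(value))
```

A module-level constant would work equally well; the cached function just defers compilation until the first default is checked.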

It's possible that some cases where an expression is or is not a dynamic default may be missed. I think this PR covers all the known cases, so it would not be a regression at least.

Does this change require updates to documentation? If so, please file an issue here and include the link below.

Probably not, but it might be worth making a note of how Pyxform decides what is dynamic, to preempt forum questions.

Before submitting this PR, please make sure you have:

included test cases for core behavior and edge cases in tests
run nosetests and verified all tests pass
run black pyxform tests to format code
verified that any code or assets from external sources are properly credited in comments

lindsay-stevens · 2022-03-15T18:56:54Z

Hi @lognaturel there may be some more to do on this wrt. tests but I think it is ready for a look. Thanks!

…SForm#495
- select1 choice test passes ODK Validate but generates an XForm with: <setvalue event="odk-instance-first-load" ref="/test/s1" value="a-2"/>
- text default test fails ODK Validate with error: Invalid XPath in value set action declaration: 'https://my-site.com' Problem found at nodeset: ${model}[@xforms-version=1.0.0]/setvalue With element <setvalue event="odk-instance-first-load" ref="${t3}" value="https://my-site.com"/>

lognaturel

Definitely more involved than I would have gone for but more correct, too!

Here are some cases I played around with:

            self.Case(True, "date", """concat('2022-03', '-14')""", ""),

            self.Case(False, "text", """f-4""", ""),
            self.Case(False, "text", """./f-4""", ""),
            self.Case(True, "text", """./f - 4""", ""),
            self.Case(False, "integer", """7-4""", ""),
            self.Case(True, "integer", """7 - 4""", ""),

The behavior with - is a change, so it could possibly break some form updates. I can't imagine these types of cases are common but it might be good to have the tests as explicit documentation. And as you said, also add something to user-facing docs.

tests/test_dynamic_default.py

lindsay-stevens · 2022-03-29T13:05:46Z

Thanks for the review! I will update the tests as mentioned above to complete this PR.

lindsay-stevens · 2022-04-14T12:13:52Z

Should be good to go now. New commit is all about tests, per commit message for ba7b196:

deleted existing tests where covered by case in TestDynamicDefaultSimpleInput
updated tests using string matchers to use xpath instead
added pyxform_test_case ability to escape literal pipe in values
clarified existing translation performance test description
added util function for coalesce
added tests suggested by @lognaturel and performance tests
updated test strategy for cases that initially or after rendering
contain single quotes, namely to use xpath_exact to compare outside
of xpath, instead of xpath_match.

The performance tests are:

test_dynamic_default_performance__time: processing 500, 1000, 2000, or 5000 dynamic default questions shouldn't take much longer than with a no-op dynamic defaults check (results were approx. 0 to 6% longer).
test_dynamic_default_performance__memory: a form with 2000 dynamic default questions shouldn't increase memory usage by more than 2x.
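For anyone wanting to reproduce this kind of comparison outside the test suite, a rough sketch using only the standard library follows. The helper names (`measure_time`, `measure_peak_memory`, `overhead_ratio`) and the stand-in workload are assumptions for illustration and are not the helpers used in the tests.

```python
import time
import tracemalloc


def measure_time(func, repeats=5):
    """Return the best wall-clock time in seconds over several runs of func()."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        func()
        best = min(best, time.perf_counter() - start)
    return best


def measure_peak_memory(func):
    """Return the peak number of bytes allocated while func() runs."""
    tracemalloc.start()
    try:
        func()
        _current, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return peak


def overhead_ratio(baseline, candidate):
    """Relative overhead of candidate over baseline, e.g. 0.06 means 6% more."""
    return (candidate - baseline) / baseline


def workload():
    """Stand-in for converting a form; replace with the real call being measured."""
    return [str(i) for i in range(1000)]


print(measure_time(workload) >= 0)
print(measure_peak_memory(workload) > 0)
```

Time and memory are measured in separate runs because tracemalloc slows execution, which would otherwise inflate the timing result.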

lognaturel · 2022-04-14T17:54:29Z

🥳 🥳 🥳

lindsay-stevens mentioned this pull request Jan 17, 2022

Default string values with dashes are mistakenly treated as dynamic defaults #495

Closed

lindsay-stevens force-pushed the pyxform-495 branch from da1e5a9 to bc45b1d March 12, 2022 06:23

lindsay-stevens marked this pull request as ready for review March 15, 2022 18:54

lindsay-stevens requested a review from lognaturel March 15, 2022 18:54

lindsay-stevens added 2 commits March 17, 2022 18:36

add: lexer to identify dynamic defaults in expressions vs. strings

72b1880

lindsay-stevens force-pushed the pyxform-495 branch from 52c2197 to 72b1880 March 17, 2022 07:36

lognaturel reviewed Mar 23, 2022

lindsay-stevens mentioned this pull request Mar 29, 2022

Move markdown to pyxform functionality out of tests and into main package #599

Closed

lindsay-stevens requested a review from lognaturel April 14, 2022 12:14

lognaturel approved these changes Apr 14, 2022

lognaturel merged commit b0ad3a7 into XLSForm:master Apr 14, 2022

lindsay-stevens deleted the pyxform-495 branch April 14, 2022 19:02
