Use consistent filepath pattern matching across the board #39

Merged · 7 commits · Mar 30, 2022

Conversation

mchaaler
Contributor

Fix issue #36

@mchaaler
Contributor Author

mchaaler commented Mar 19, 2022

Sorry, I'm not familiar enough with Git/GitHub processes.
Although I created a new branch to specifically address issue #36, it appears that I did not first 'exclude' the commit addressing issue #37 (97a4516), which was already proposed in PR #38.

Anyway...
The build is now failing on Windows as well as on Linux.

@smarie
Owner

smarie commented Mar 21, 2022

> Although I created a new branch to specifically address issue #36, it appears that I did not first 'exclude' the commit addressing issue #37 (97a4516), which was already proposed in PR #38.

Since I have now merged #38, you can simply update this branch (pull from original/main) and those commits should disappear from the diff displayed here.

@smarie
Owner

smarie commented Mar 21, 2022

> The build is now failing on Windows as well as on Linux.

Nice! I'll modify the GitHub workflow to

  • add the test site generation to the tests session for all runners (not only 3.7)
  • enable Windows

I'll do this in another PR, but once it is done we should be able to see it fail correctly :)

@smarie
Owner

smarie commented Mar 21, 2022

Done. Can you please update this branch, so that we can see whether the Windows workers now fail too?

@mchaaler
Contributor Author

Success! All tests are now failing! 🤪

@smarie
Owner

smarie commented Mar 22, 2022

Cool! Thanks @mchaaler.
Now the easy part :) fixing the issue. Do you want to give it a try?

@mchaaler
Contributor Author

> Do you want to give it a try?

Sure!

@mchaaler
Contributor Author

Since filepaths are sometimes compared to filepaths stored in the mkdocs/mkdocs-gallery config dict, I had to go back to the source and override the Dir and File classes from mkdocs.config.config_options so that they return pathlib objects (mchaaler@2ff91ab).

Then fixing the not-failing-although-expected-to-fail bug (#34) was just a matter of adding the src_py_file path to the gallery_conf['failing_examples'] dict...
... and removing the minimum (but necessary) number of as_posix() calls so that the tests and build pass.

Let's check that and see if the CI builds are successful.
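
For illustration, here is a minimal sketch of the kind of override described above, assuming subclasses that simply wrap the value validated by mkdocs' Dir and File options in a pathlib.Path (the class names are made up; the actual code is in 2ff91ab):

from pathlib import Path

from mkdocs.config import config_options


class PathDir(config_options.Dir):
    # Hypothetical sketch: behaves like mkdocs' Dir option, but yields a pathlib.Path.
    def run_validation(self, value):
        # Let the parent class perform the usual validation/normalization,
        # then wrap the resulting string in a Path object.
        return Path(super().run_validation(value))


class PathFile(config_options.File):
    # Hypothetical sketch: same idea for the File option.
    def run_validation(self, value):
        return Path(super().run_validation(value))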

Comment on lines 977 to 982
# If expected to fail, let's assume it did when executed previously
if script.src_py_file in script.gallery_conf['expected_failing_examples']:
    script.gallery_conf['failing_examples'][script.src_py_file] = (
        "Due to MD5 check, script has not been actually executed - "
        "Assumed it failed as expected during previous execution."
    )

Owner

I would be curious to see how this is done in sphinx-gallery. Maybe there is an opportunity to suggest a PR to them?

Contributor Author

Indeed I did not check sphinx-gallery before implementing the proposed solution.

https://github.com/sphinx-gallery/sphinx-gallery/blob/26fc508bac5459c3d6daf5db64cf5d45cda53b7d/sphinx_gallery/gen_rst.py#L961-L969

It seems they handle this by using the gallery_conf['stale_examples'] list instead of the gallery_conf['failing_examples'] dict. I'm not familiar enough with the overall process to evaluate which approach is best. It seemed natural to me to use the 'failing_examples' entry, but maybe we should give their approach a try.

Owner

mkdocs-gallery is really a copy of sphinx-gallery: I just modified a few design patterns (pathlib, objects) and made a few steps more readable, but I tried not to change the process at all.

The equivalent lines are here:

if not script.has_changed_wrt_persisted_md5():
    # A priori we can...
    skip_and_return = True
    # ...however for executables (not shared modules) we might need to run anyway because of config
    if script.is_executable_example():
        if script.gallery_conf['run_stale_examples']:
            # Run anyway because config says so.
            skip_and_return = False
        else:
            # Add the example to the "stale examples" before returning
            script.gallery_conf['stale_examples'].append(script.dwnld_py_file.as_posix())
    if skip_and_return:
        # Return with 0 exec time and mem usage, and the existing thumbnail
        thumb_source_path = script.get_thumbnail_source(file_conf)
        thumb_file = create_thumb_from_image(script, thumb_source_path)
        return GalleryScriptResults(script=script, intro=intro, exec_time=0., memory=0., thumb=thumb_file)

So it is a bit strange that, in our case, the unmodified failing example is not added to the stale examples.

Owner

Oh-oh! It seems that there is an "as_posix" here! :)

@mchaaler
Contributor Author

> src_file is used in many lines in this function body, in particular _set_reset_logging_tee, os.chdir and compile. I cannot guarantee that these behave similarly when they receive a path object.

Let's check that:

  • _set_reset_logging_tee - I first added the requested type hints in _check_reset_logging_tee and the _LoggingTee constructor. Then the src_filename attribute is used in _LoggingTee.write(), where the logger.verbose call redirects to Logger.debug. This seems safe.
  • os.chdir - Accepts a path-like object since 3.6 (https://docs.python.org/3.7/library/os.html#os.chdir)
  • compile - The documentation (https://docs.python.org/3.7/library/functions.html#compile) is not very detailed about the use of the 'filename' parameter. It doesn't seem to be used to access the file, but rather to display the filename in error messages (the builtins.compile() docstring says: "The filename will be used for run-time error messages."). To be safe we could pass str(src_file) instead of the path object, but my understanding is that, whatever string formatting method is used (f-strings, str.format or the legacy % operator), converting path objects to strings works fine. A small runnable check of these last two points is sketched below.
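
Here is that check, relying only on documented stdlib behaviour (the file name and content are made up for the example):

import os
import tempfile
from pathlib import Path

# Create a throwaway script so the snippet is self-contained.
tmp_dir = Path(tempfile.mkdtemp())
src_file = tmp_dir / "plot_demo.py"  # hypothetical example script
src_file.write_text("print('hello from the example script')\n")

# os.chdir accepts any path-like object (pathlib.Path included) since Python 3.6.
os.chdir(src_file.parent)

# compile() only uses `filename` for run-time error messages; passing
# str(src_file) keeps us on the safe side, as discussed above.
code = compile(src_file.read_text(), filename=str(src_file), mode="exec")
exec(code)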

@smarie
Owner

smarie commented Mar 25, 2022

Nice analysis @mchaaler! I resolved the comment on src_file.
The only remaining comment is the one on "stale_examples".

@mchaaler
Contributor Author

mchaaler commented Mar 26, 2022

> The only remaining comment is the one on "stale_examples".

I'm currently analyzing how it is done in sphinx-gallery. It seems (still pending deeper understanding...) that the 'expected-to-fail' examples are executed anyway.

Here's why:

  1. The handle_exception() method in gen_rst.py reverts the script_vars['execute_script'] value to False, even though the script is both executable and was executed (link to sphinx-gallery).

  2. This is done just after storing the exception in the gallery_conf['failing_examples'] dict.

  3. Then, back in execute_script(), the part dedicated to writing the md5 checksum file is simply skipped.

... thus preventing the md5 file from ever being written for failing or non-executed scripts...
... thus bypassing the if md5sum_is_current check in generate_file_rst.
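
To make that flow concrete, here is a heavily simplified, runnable paraphrase of it (the function and variable names below are mine, not sphinx-gallery's actual code):

# Simplified paraphrase of the flow described above; all names are illustrative.
def handle_exception(exc, src_file, script_vars, gallery_conf):
    # 1. + 2. Store the exception for the summary, then flip execute_script
    # back to False even though the script did run.
    gallery_conf['failing_examples'][src_file] = exc
    script_vars['execute_script'] = False

def maybe_write_md5(src_file, script_vars, md5_store):
    # 3. The checksum write is guarded by execute_script, so it is skipped for
    # failing scripts, and the md5sum_is_current check can never short-circuit
    # them on later builds.
    if script_vars['execute_script']:
        md5_store[src_file] = "<md5 of the source>"

gallery_conf = {'failing_examples': {}}
script_vars = {'execute_script': True}
md5_store = {}

handle_exception(RuntimeError("expected failure"), "plot_demo.py", script_vars, gallery_conf)
maybe_write_md5("plot_demo.py", script_vars, md5_store)
assert md5_store == {}  # no checksum stored: the example will always be re-run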

> I would be curious to see how this is done in sphinx-gallery. Maybe there is an opportunity to suggest a PR to them?

Indeed. We might at least ask whether this approach was chosen because they did not find a better way to handle expected-to-fail scripts. Since they set up the whole md5-check process to avoid re-executing scripts that did not change, it seems weird not to rely on it for all unmodified files.

In the end, this means that my previous comment

> It seems they handle this by using the gallery_conf['stale_examples'] list instead of the gallery_conf['failing_examples'] dict.

is not accurate: the expected-to-fail scripts are not handled the way we thought in sphinx-gallery.

@smarie
Owner

smarie commented Mar 27, 2022

Thanks a lot for this analysis @mchaaler! Well, if this is correct, sphinx-gallery's current behaviour definitely seems sub-optimal to me.

> still pending deeper understanding...

Let me know how you want to proceed. We can define "the ideal fix" in this PR (probably relying on 'stale_examples' then?) so as to move on and release, and share it with them afterwards; or we can open a discussion with them and find a common solution before implementing. On my side I see no hurry, so I am fine with either way.

@mchaaler
Contributor Author

I just pushed another commit (c61bec2) proposing an alternative to the previous fix.

With 4d7baa9 we had:

mkdocs-gallery successfully executed 0 out of 2 files subselected by:

    gallery_conf["filename_pattern"] = '\\\\plot'
    gallery_conf["ignore_pattern"]   = '__init__\\.py'

after excluding 15 files that had previously been run (based on MD5).

And with c61bec2 we have:

mkdocs-gallery successfully executed 0 out of 0 files subselected by:

    gallery_conf["filename_pattern"] = '\\\\plot'
    gallery_conf["ignore_pattern"]   = '__init__\\.py'

after excluding 15 files that had previously been run (based on MD5).

Note the 0 out of 0 files instead of 0 out of 2.
This prevents the expected-to-fail scripts from being counted in the total number of scripts that should have been run in gen_gallery.summarize_failing_examples.
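
As an aside, a small illustration of why the pattern shown in the log only matches Windows-style separators (my own sketch; it assumes the log prints the pattern in repr form, and the file name is made up):

import re

# The pattern from the log above, written as a Python literal (repr form):
# a regex matching a literal backslash (the Windows separator) followed by "plot".
filename_pattern = '\\\\plot'

windows_style = 'examples\\plot_demo.py'  # hypothetical path with a Windows separator
posix_style = 'examples/plot_demo.py'     # the same path after as_posix()

print(bool(re.search(filename_pattern, windows_style)))  # True
print(bool(re.search(filename_pattern, posix_style)))    # False: the separator no longer matches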

@smarie Let me know if you would qualify this approach as the "ideal fix"...

@mchaaler
Contributor Author

mchaaler commented Mar 28, 2022

Regarding sphinx-gallery, I implemented something similar to handle expected-to-fail examples optimally, and I might dare to submit a PR at some point. I'm still struggling with the tests, though.
By the way, I'm quite surprised by the number of assert statements in their test functions. I thought that the adopted practice was "1 test function = 1 assert statement", so that one knows exactly, from the failing test function's name alone, which expected behavior was broken. But I definitely lack experience in terms of test implementation.

@smarie May I add a comment to sphinx-gallery/sphinx-gallery#895, which you created, to start the discussion about the choice they made to handle the expected-to-fail examples?
Or would it be preferable to open a dedicated issue?

@smarie
Owner

smarie commented Mar 30, 2022

> Let me know if you would qualify this approach as the "ideal fix"...

Yes, it looks nice to me! Let's merge this.

> I thought that the adopted practice was "1 test function = 1 assert statement"

No, this is really not the case. It is rather "as many assert statements as needed"; the main point is that the test should focus on a single situation.
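
For instance, a generic illustration (not taken from either project's test suite) of a single focused situation checked with several asserts, reusing the Path-to-string point from earlier in this thread:

import os
from pathlib import Path

def test_path_to_string_conversion():
    # One focused situation: converting a Path to a string, checked from
    # several angles with as many asserts as needed.
    p = Path("examples") / "plot_demo.py"  # hypothetical path
    assert str(p) == os.path.join("examples", "plot_demo.py")
    assert f"{p}" == str(p)
    assert "{}".format(p) == str(p)
    assert "%s" % p == str(p)

test_path_to_string_conversion()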

> would it be preferable to open a dedicated issue?

Indeed, it is preferable.
In the issue you may refer to sphinx-gallery/sphinx-gallery#895 for the general intent of keeping the two projects aligned, and also to this PR (#39) for the discussion.

@smarie
Owner

smarie commented Mar 30, 2022

0.7.5 is now released, with the fix. Thanks a lot @mchaaler for all your work to solve these issues!

@mchaaler
Contributor Author

> 0.7.5 is now released, with the fix. Thanks a lot @mchaaler for all your work to solve these issues!

You're welcome! Proud to provide useful contributions and happy for the constructive feedback you gave.
