many: detect local source changes #2167

kyrofa · 2018-06-23T01:02:57Z

Have you followed the guidelines for contributing?
Have you signed the CLA?
If this is a bugfix. Have you checked that there is a bug report open for the issue you are trying to fix on bug reports?
If this is a new feature. Have you discussed the design on the forum?
Have you successfully run ./runtests.sh static?
Have you successfully run ./runtests.sh unit?

The snapcraft CLI doesn't notice changes to local sources once the parts have been pulled: one must clean and re-pull in order to take them into account. This PR resolves LP: #1583718 by comparing local source timestamps to the pull state timestamp, and updating the pull (and all subsequent) steps as necessary.

sergiusens · 2018-06-25T14:09:25Z

Hey, what a PR, it is quite big and lots of tests seem to be failing, so I'll leave it to you to fix them before moving forward with an actual review.

codecov-io · 2018-06-25T16:36:39Z

Codecov Report

Merging #2167 into master will decrease coverage by 0.03%.
The diff coverage is 87.63%.

@@            Coverage Diff             @@
##           master    #2167      +/-   ##
==========================================
- Coverage   91.32%   91.28%   -0.04%     
==========================================
  Files         201      202       +1     
  Lines       12617    12757     +140     
  Branches     1874     1900      +26     
==========================================
+ Hits        11522    11645     +123     
- Misses        741      752      +11     
- Partials      354      360       +6

Impacted Files	Coverage Δ
snapcraft/internal/pluginhandler/_dirty_report.py	`100% <ø> (ø)`	⬆️
snapcraft/file_utils.py	`97.56% <100%> (ø)`	⬆️
snapcraft/_baseplugin.py	`93.4% <100%> (+0.07%)`	⬆️
...apcraft/internal/pluginhandler/_outdated_report.py	`100% <100%> (ø)`
snapcraft/plugins/cmake.py	`89.28% <100%> (+2.18%)`	⬆️
snapcraft/internal/errors.py	`99.53% <100%> (ø)`	⬆️
snapcraft/internal/lifecycle/_status_cache.py	`100% <100%> (ø)`	⬆️
snapcraft/internal/sources/_base.py	`92.06% <69.23%> (-5.94%)`	⬇️
snapcraft/internal/sources/_local.py	`84.9% <78.94%> (-15.1%)`	⬇️
snapcraft/internal/pluginhandler/__init__.py	`91.87% <82.6%> (-0.42%)`	⬇️
... and 4 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 25043ab...6cc1d79. Read the comment docs.

kyrofa · 2018-06-25T17:49:11Z

[...] lots of tests seem to be failing, so I'll leave it to you to fix them before moving forward with an actual review.

Yeah I had to move the in-test XDG cache around, so integration tests are taking a few iterations. I'll ping you.

The snapcraft CLI doesn't notice changes to local sources once the parts have been pulled: one must clean and re-pull in order to take them into account. Start comparing local source timestamps to the pull state timestamp, updating the pull (and all subsequent) steps as necessary. LP: #1583718 Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

kyrofa · 2018-06-26T03:49:53Z

Alright @sergiusens, this one should be good, now.

sergiusens · 2018-06-27T16:51:26Z

snapcraft/file_utils.py



-def _link(source: str, destination: str, follow_symlinks: bool=False) -> None:
+def link(source: str, destination: str, follow_symlinks: bool=False) -> None:


now that they are public it would be nice to force kwargs and add some docstrings if you don't mind

This API needs to be compatible with copy2 and link_or_copy, so the only kwarg we can force is follow_symlinks, but I agree that it's an improvement.

Fixed, thank you for catching the lack of docs!

sergiusens · 2018-06-27T16:51:37Z

snapcraft/file_utils.py

@@ -137,7 +137,7 @@ def _link(source: str, destination: str, follow_symlinks: bool=False) -> None:
        raise SnapcraftCopyFileNotFoundError(source)


-def _copy(source: str, destination: str, follow_symlinks: bool=False) -> None:
+def copy(source: str, destination: str, follow_symlinks: bool=False) -> None:


sergiusens · 2018-06-27T16:54:22Z

snapcraft/internal/lifecycle/_runner.py

@@ -277,11 +285,18 @@ def _run_step(self, *, step: steps.Step, part, progress, hint=''):

        part = _replace_in_part(part)

+    def _run_step(self, *, step: steps.Step, part, progress, hint=''):
+        self._prepare_to_run(step=step, part=part)


maybe _prepare_to_run_step?

Went with _prepare_step so we have _{prepare,run,complete}_step.

sergiusens · 2018-06-27T16:54:37Z

snapcraft/internal/lifecycle/_runner.py

        notify_part_progress(part, progress, hint)
        getattr(part, step.name)()

        # We know we just ran this step, so rather than check, manually twiddle
        # the cache
+        self._step_complete(part, step)
+
+    def _step_complete(self, part, step):


_complete_step for parity?

Agreed, done.

sergiusens · 2018-06-27T17:13:56Z

snapcraft/internal/sources/_base.py

+            raise RuntimeError("source must be checked before it's updated")
+        self._update()
+
+    def _check(self, target: str):


why not just do the abc. thing here, you can leave the docstring, it will still be valid code.

Because then we'd need to add it to all sources, even those that don't actually support doing this. Since the vast majority of sources don't support it, it seemed to make sense to assume a lack of support in the API, and have only that source that does support it provide an implementation.

sergiusens · 2018-06-27T21:04:30Z

snapcraft/internal/lifecycle/_runner.py

+        else:
+            getattr(self, '_re{}'.format(step.name))(part, '({})'.format(
+                outdated_report.get_summary()))
+

 def notify_part_progress(part, progress, hint='', debug=False):


this is old, but what is the difference?

Just keep it green 😄

I'm afraid I'm not clear on what you're saying, here. Are you asking me to change something?

sergiusens · 2018-06-27T21:07:59Z

snapcraft/internal/lifecycle/_runner.py

@@ -316,6 +331,28 @@ def _handle_dirty(self, part, step, dirty_report, cli_config):
        getattr(self, '_re{}'.format(step.name))(part, hint='({})'.format(
            dirty_report.get_summary()))

+    def _handle_outdated(self, part, step, outdated_report, cli_config):


same as handle_dirty, can this take a dirty_action as a parameter instead of passing down the entire object?

With the introduction of the outdated concept, I think this is just another case of dirty. Can we add more precise qualifiers like handle_dirty_definitions and handle_dirty_[local]_source?

Let me give you my thought process. A step (for a specific part) can be in one of two states:

Not run (we call this "clean")

Run, which is further broken down into sub-states:
2.1. Complete
2.2. Needs to be cleaned and run again (we call this "dirty", i.e. it needs to be cleaned)
2.3. Needs to be run again, but does not need to be cleaned (in this PR we call this "outdated", i.e. it needs to be updated)

Both 2.2 and 2.3 are caused by something, which needs to be recorded somewhere so we can tell the user what it was. We do that with the {Dirty,Outdated}Report classes. This allows us to isolate concerns, separating the "what makes this {dirty,outdated}" from "oh, this is {dirty,outdated}, I need to take appropriate action." If we combine these reports into one, we have a few issues:

We would need to come up with a new name, since "dirty" no longer works (i.e. the solution is not to clean it anymore).

Concerns leak. Now lifecycle needs to know "Oh, the project properties changed, okay, I know I need to clean" or "Oh, an earlier step happened, I know I need to update". This spread of knowledge seems error-prone.

(1) isn't a big deal, we can figure that one out. The word "outdated" could be used to refer to both situations, I suppose. (2) we can probably solve by keeping things similar to how they are today, but have a master class composed of both a dirty and outdated report, and take action appropriately. That at least gives us a single report to deal with in the PluginHandler's API and thus the lifecycle, but other than that doesn't buy us much. Do you like that idea better?

I did not say combine them, I just meant that "dirty" and "outdated" are easier to confuse and that we should narrow the scope of what it means down.

Your thought process is from the point of view of the action wrt lifecycle and my mind is trending towards the state of things (why is it dirty/outdated) whilst the report comes from a component that does not determine the future actions.

These two terms lead to confusions, is this dirty because it is outdated (so far that has been our logic, right?). The what is outdated is what I am asking to more clearly state in the method calls, checks and class names.

But yeah, by no means, unless it gets converted into the state class merge these two.

If you are really stuck with the names, I have added a couple more comments that would satisfy me. My intention was that from the names of variables and classes it was crystal clear what it meant without having to read the code, I can compromise with a bit more documentation under the internal class names and methods called.

I agree that this should be cleaned up, but as we discussed on the call, let's see if we can get this PR in a decent state with docs and then take another pass (in another PR) at improving it further.

sergiusens · 2018-06-27T21:16:10Z

snapcraft/internal/lifecycle/_runner.py

-                                part,
-                                'Skipping {}'.format(current_step.name),
-                                '(already ran)')
+                            outdated_report = self._cache.get_outdated_report(


I have complications accepting this procedural flow, just the level of nesting.
How is dirty_report so different from outdated_report? How can we have a dirty_report if we haven't has_step_run? Do you think this can be shuffled a bit?

That's because it's the ugliest thing on Earth. I tried to avoid touching too much here, but let me see what I can do.

There, that's quite a bit better. Thoughts?

sergiusens · 2018-06-27T21:22:10Z

snapcraft/internal/pluginhandler/__init__.py

+            self.plugin.statedir, steps.PULL)
+
+        # Not all sources support checking for updates
+        with contextlib.suppress(NotImplementedError):


ok, I see the reason for raising now, still would be nice to have a different exception to avoid over catching (or check the payload) to not swallow exceptions like this.

Sure thing.

Fixed with a custom exception.

sergiusens · 2018-06-27T21:22:52Z

snapcraft/internal/pluginhandler/__init__.py

+            if os.path.exists(self.plugin.build_basedir):
+                shutil.rmtree(self.plugin.build_basedir)
+
+            # FIXME: It's not necessary to ignore here anymore since it's now


wow, this is so old 😅

sergiusens · 2018-06-27T21:27:48Z

snapcraft/internal/sources/_local.py

@@ -25,13 +26,79 @@

 class Local(Base):

+    def __init__(self, *args, copy_function=file_utils.link_or_copy, **kwargs):


Do we have a scenario for copy_function to be different?

Yeah, right here. Didn't want to duplicate functionality.

sergiusens · 2018-06-27T21:29:09Z

snapcraft/internal/sources/_local.py

+
+    def _update(self):
+        # First, copy the directories
+        for directory in self._updated_directories:


this feels a lot like the pluginhandlers "migration" code. Maybe we need some generalization there (in the future)

There are definitely some similarities in that they both operate on directories and files, but the pluginhandler does some special stuff as well.

sergiusens · 2018-06-27T21:30:22Z

tests/fixture_setup/_fixtures.py

@@ -116,6 +116,13 @@ def setUp(self):
        patcher_dirs.start()
        self.addCleanup(patcher_dirs.stop)

+        self.useFixture(fixtures.EnvironmentVariable(


They are back!

sergiusens · 2018-06-27T21:31:02Z

tests/integration/__init__.py

-        self.useFixture(fixtures.EnvironmentVariable(
-            'XDG_DATA_HOME', os.path.join(self.path, 'data')))
+        # Use a separate path for XDG dirs, or changes there may be detected as
+        # source changes.


kudos on the code comment, makes sense

Most annoying change I had to make in the PR! Touched so many things. I should have extracted it, in retrospect.

sergiusens

pretty good, scanned through sans tests which I will look at later. Just a couple of niggles here and there, nothing much.

…ect_source_changes

Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

sergiusens · 2018-06-28T12:50:19Z

snapcraft/internal/lifecycle/_runner.py

+        # If this step hasn't yet run, all we need to do is run it
+        if not self._cache.has_step_run(part, current_step):
+            getattr(self, '_run_{}'.format(current_step.name))(part)
+            return


instead of multiple returns here, it seems this could be handled very well with elif and a final else.

We're fetching reports before checking them, which is heavy to do upfront if we don't actually need to, so it doesn't fit particularly well within an if/elif unless we want to nest within elses.

sergiusens · 2018-06-28T12:51:32Z

snapcraft/internal/sources/errors.py

+
+    fmt = (
+        'Failed to update source: '
+        "{source!s} sources don't support updating"


Can you add a . to end the sentence please?

sergiusens · 2018-06-28T12:52:44Z

snapcraft/internal/pluginhandler/__init__.py

@@ -249,6 +250,32 @@ def is_clean(self, step):
        except errors.NoLatestStepError:
            return True

+    def is_outdated(self, step):
+        """Return true if the given step needs to be updated (no cleaning)."""


Can you clarify this doc string (what is in between parens).

Done, though mostly by directing the reader to get_outdated_report().

sergiusens · 2018-06-28T12:53:14Z

snapcraft/internal/pluginhandler/__init__.py

+        return self.get_outdated_report(step) is not None
+
+    def get_outdated_report(self, step: steps.Step):
+        """Return an OutdatedReport class describing why step is outdated.


Can you add a follow up paragraph on what it means to be outdated? Simil for what it means to be dirty in its counterpart.

Done, for both outdated and dirty.

sergiusens · 2018-06-28T12:54:47Z

snapcraft/internal/pluginhandler/_outdated_report.py

+
+
+class OutdatedReport:
+    """The OutdatedReport class explains why a given step is outdated."""


can you further expand in a paragraph what it means to be outdated (conditions that trigger it)? Simil for DirtyReport please.

Done, for both outdated and dirty.

Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

sergiusens · 2018-06-29T01:53:25Z

snapcraft/internal/lifecycle/_status_cache.py

@@ -34,8 +35,8 @@ def __init__(self, config: _config.Config) -> None:
        """
        self.config = config
        self._steps_run = dict()  # type: Dict[str, Set[steps.Step]]
-        self._outdated_reports = collections.defaultdict(dict)  # type: _Report
-        self._dirty_reports = collections.defaultdict(dict)  # type: _Report
+        self._outdated_reports = collections.defaultdict(dict)  # type: _OutdatedReport  # noqa


why noqa or why noqa with no qualifier, and if it is to prevent a line break, I'd rather change the line limit to 99 😉
Use your parens until then 😆

Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

sergiusens · 2018-06-29T20:03:13Z

atta boy 😄

kyrofa force-pushed the feature/1583718/detect_source_changes branch from 6b262c9 to 0c85aba Compare June 23, 2018 02:30

kyrofa force-pushed the feature/1583718/detect_source_changes branch from 0c85aba to 3abc332 Compare June 25, 2018 16:36

kyrofa force-pushed the feature/1583718/detect_source_changes branch from 3abc332 to 77481b3 Compare June 25, 2018 18:09

Merge branch 'master' into feature/1583718/detect_source_changes

d64fe62

sergiusens reviewed Jun 27, 2018

View reviewed changes

sergiusens requested changes Jun 27, 2018

View reviewed changes

kyrofa added 5 commits June 27, 2018 15:40

Merge remote-tracking branch 'origin/master' into feature/1583718/det…

f1779f5

…ect_source_changes

lifecycle: refactor 'run' to be more clear

46bfd96

Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

file_utils: document copy() and link()

f6946f6

Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

lifecycle: rename prepare and complete methods

5a43815

Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

sources: raise custom error instead of NotImplementedError

e95e7e1

Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

sergiusens reviewed Jun 28, 2018

View reviewed changes

kyrofa added 2 commits June 28, 2018 10:18

source errors: add period

3885d28

Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

pluginhandler: update dirty/outdated docs

841378c

Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

sergiusens reviewed Jun 29, 2018

View reviewed changes

Urgh

6cc1d79

Signed-off-by: Kyle Fazzari <kyrofa@ubuntu.com>

sergiusens approved these changes Jul 2, 2018

View reviewed changes

sergiusens merged commit 39c4c39 into canonical:master Jul 2, 2018



		def _link(source: str, destination: str, follow_symlinks: bool=False) -> None:
		def link(source: str, destination: str, follow_symlinks: bool=False) -> None:

		@@ -25,13 +26,79 @@

		class Local(Base):

		def __init__(self, args, copy_function=file_utils.link_or_copy, *kwargs):



		class OutdatedReport:
		"""The OutdatedReport class explains why a given step is outdated."""

many: detect local source changes #2167

many: detect local source changes #2167

Conversation

kyrofa commented Jun 23, 2018 • edited

sergiusens commented Jun 25, 2018

codecov-io commented Jun 25, 2018 • edited

Codecov Report

kyrofa commented Jun 25, 2018 • edited

kyrofa commented Jun 26, 2018

Choose a reason for hiding this comment

kyrofa Jun 27, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kyrofa Jun 27, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kyrofa Jun 27, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sergiusens left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sergiusens commented Jun 29, 2018

kyrofa commented Jun 23, 2018 •

edited

codecov-io commented Jun 25, 2018 •

edited

kyrofa commented Jun 25, 2018 •

edited

kyrofa Jun 27, 2018 •

edited

kyrofa Jun 27, 2018 •

edited

kyrofa Jun 27, 2018 •

edited