feat(core): make update work with new storage #2304

m-alisafaee · 2021-09-01T22:59:53Z

Description

renku update with the new storage/metadata. It has a PATHS parameter which limits the update to the specified paths.

Fixes #2257

renku/cli/update.py

renku/core/commands/update.py

renku/core/models/workflow/composite_plan.py

renku/core/commands/update.py

renku/core/metadata/gateway/activity_gateway.py

Panaetius

Looks really nice now. It's great seeing things finally come together!

renku/cli/update.py

renku/core/utils/git.py

Panaetius · 2021-09-10T13:53:40Z

renku/core/commands/update.py

+    all_activities = defaultdict(set)
+
+    def have_identical_inputs_and_outputs(activity1, activity2):
+        return sorted(u.entity.path for u in activity1.usages) == sorted(


I think set() instead of sorted() would work just as well. Not that it matters but there's not really a reason for sorting.

I believe there is not clear way of comparing activities. The idea here was to cover cases like cat A A B > C and cat A B B > C.

Panaetius · 2021-09-10T13:56:41Z

renku/core/commands/update.py

+        if paths:
+            # NOTE: Add the activity to check if it also matches the condition
+            downstream_chains.append((activity,))
+            downstream_chains = [c for c in downstream_chains if any(g.entity.path in paths for g in c[-1].generations)]


Can users ask Renku to update a whole folder? if so we'd need to check it here.

Yes, they can. I've fixed this and added a test for it.

renku/core/commands/update.py

Panaetius · 2021-09-10T14:03:12Z

renku/core/commands/update.py

+
+    if len(activities) > 1:
+        activity_collection = ActivityCollection(activities=activities)
+        activity_gateway.add_activity_collection(activity_collection)


It looks like the "activity-collections" only exists to enable some tests. it is a bit odd to me to store something in users' repositories that is only used for our tests, essentially littering the database with unneeded data.

I guess we could have a TestingActivityGateway(ActivityGateway) in tests/ and inject that for tests, to only have this index when testing code?

ActivityCollection is to mark that these activities have been executed together as a result of an update or a rerun. So, it's not just for testing. I was not sure what other metadata we need to include here (specifically if we need a link in the Activity to its ActivityCollection if any). What do you think?

a link from ActivityCollection to Activity makes sense to me. Best to discuss it with the KG team as well

Panaetius · 2021-09-10T14:06:39Z

renku/core/management/workflow/concrete_execution_graph.py

@@ -141,6 +141,9 @@ def workflow_graph(self):
                    workflow_graph.add_node(node)
                continue

+            if not next(self.graph.predecessors(node), None):


It might only make a difference on very large repositories with complex workflows, but

intermediate_predecessor = next(self.graph.predecessors(node), None) if not intermediate_predecessor: continue [...] source = next(self.graph.predecessors(intermediate_predecessor), None)

would only have to calculate the predecessor once.

It always helps with the code's readability.

Panaetius · 2021-09-10T14:15:07Z

renku/core/models/workflow/composite_plan.py

        name: str,
-        derived_from: str = None,
-        plans: List[Union["CompositePlan", Plan]] = None,
+        plans: List[Plan] = None,


Should it be AbstractPlan ? A CompositePlan created by a user could contain a CompositePlan

👍
I made it as it was before (Union["CompositePlan", Plan]) which should also help with ide type linter.

Panaetius

Thank you!

github-actions bot added the documentation:pending label Sep 1, 2021

m-alisafaee force-pushed the 2257-new-renku-update branch from 37cb9a7 to 33631be Compare September 1, 2021 23:52

m-alisafaee changed the base branch from 2130-workflow-execute to master September 1, 2021 23:52

m-alisafaee force-pushed the 2257-new-renku-update branch from 33631be to ec3bc7b Compare September 2, 2021 01:32

m-alisafaee removed the documentation:pending label Sep 2, 2021

m-alisafaee force-pushed the 2257-new-renku-update branch from ec3bc7b to 8ab311c Compare September 2, 2021 02:21

m-alisafaee marked this pull request as ready for review September 2, 2021 07:20

m-alisafaee requested a review from a team as a code owner September 2, 2021 07:20

Panaetius reviewed Sep 2, 2021

View reviewed changes

m-alisafaee marked this pull request as draft September 6, 2021 22:16

m-alisafaee force-pushed the 2257-new-renku-update branch from 8ab311c to cff3d7d Compare September 6, 2021 22:17

Panaetius reviewed Sep 7, 2021

View reviewed changes

renku/core/metadata/gateway/activity_gateway.py Show resolved Hide resolved

m-alisafaee force-pushed the 2257-new-renku-update branch from 86496f8 to 95be0ca Compare September 7, 2021 13:20

m-alisafaee marked this pull request as ready for review September 7, 2021 14:38

m-alisafaee marked this pull request as draft September 8, 2021 11:57

m-alisafaee force-pushed the 2257-new-renku-update branch from 95be0ca to 97146c1 Compare September 8, 2021 13:32

m-alisafaee marked this pull request as ready for review September 10, 2021 08:20

Panaetius requested changes Sep 10, 2021

View reviewed changes

m-alisafaee marked this pull request as draft September 13, 2021 12:49

m-alisafaee added 3 commits September 13, 2021 15:03

feat(core): make update work with new storage

2cf0a7f

Address review comments

2b10968

add ActivityCollection

d23ca23

m-alisafaee force-pushed the 2257-new-renku-update branch 6 times, most recently from 45f3998 to 7279445 Compare September 13, 2021 16:21

Address review comments

f57ccb2

m-alisafaee force-pushed the 2257-new-renku-update branch from 7279445 to f57ccb2 Compare September 13, 2021 16:27

m-alisafaee marked this pull request as ready for review September 13, 2021 17:46

Panaetius approved these changes Sep 14, 2021

View reviewed changes

m-alisafaee merged commit c047ed9 into master Sep 14, 2021

m-alisafaee deleted the 2257-new-renku-update branch September 14, 2021 16:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(core): make update work with new storage #2304

feat(core): make update work with new storage #2304

m-alisafaee commented Sep 1, 2021 •

edited

Panaetius left a comment

Panaetius Sep 10, 2021

m-alisafaee Sep 13, 2021

Panaetius Sep 10, 2021

m-alisafaee Sep 13, 2021

Panaetius Sep 10, 2021

m-alisafaee Sep 13, 2021

Panaetius Sep 13, 2021

Panaetius Sep 10, 2021

m-alisafaee Sep 13, 2021

Panaetius Sep 10, 2021

m-alisafaee Sep 13, 2021

Panaetius left a comment

feat(core): make update work with new storage #2304

feat(core): make update work with new storage #2304

Conversation

m-alisafaee commented Sep 1, 2021 • edited

Description

Panaetius left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Panaetius left a comment

Choose a reason for hiding this comment

m-alisafaee commented Sep 1, 2021 •

edited