Add `mutates_input=` and `returns_graph=` to `_dispatchable` #7191

eriknw · 2023-12-26T17:22:33Z

Knowing which functions mutate input graphs may be helpful when implementing (or testing) caching.

This changes a behavior: if a function is known to mutate an input graph, then this does not automatically convert an input graph to a backend graph. Conversion can still happen by using backend= keyword.

Some of the algorithms that now have mutates_input=True may be good candidates to add copy= arguments.

Adding mutates_input= to dispatch decorator is "best effort". It's possible (even likely) that some were missed, especially functions that add data to .graph. I think this is okay--we can fix them when found by backend implementers--but please share if you know of any other functions that mutate input graphs.

I noticed that negative_edge_cycle and held_karp_ascent temporarily mutate input graphs. I was surprised by this, and it means these are not thread-safe (and who knows if they may be permanently modified if there's an exception).

I also noticed that lukes_partitioning, is_kl_connected, and kl_connected_subgraph use deepcopy to copy the graph before mutating it. This is different than what is done elsewhere, and does not work if the input graphs have been frozen (side thought: why isn't there an unfreeze function?).

reverse function has copy= argument, but it does not mutate the graph. If copy=False, then a read-only view is returned.

Finally, I don't know what to do with minimum_cut. It can mutate residual if residual= was passed in via **kwargs. The dispatch machinery does not handle graphs passed in via **kwargs. We can punt on this until a backend wants to implement it.

CC @rlratzel

eriknw · 2023-12-26T17:25:36Z

networkx/utils/backends.py

-        if self._is_testing and self._automatic_backends and backend_name is None:
-            # Special path if we are running networkx tests with a backend.
-            return self._convert_and_call_for_tests(
-                self._automatic_backends[0],
-                args,
-                kwargs,
-                fallback_to_nx=self._fallback_to_nx,
-            )
-


Note that I moved this code block to further below so that iterators of input graphs are turned into lists. This allows the "fallback to networkx" functionality to work even for iterator of graphs inputs (otherwise, the iterators would be consumed by the first call).

eriknw · 2023-12-26T17:44:19Z

networkx/utils/backends.py

+                    not (
+                        args[arg_pos]
+                        if len(args) > arg_pos
+                        else kwargs.get(arg_name[4:], True)


Note that this implicitly assumes e.g. copy=True is the default everywhere (which is true today).

eriknw · 2024-01-29T18:38:34Z

I added returns_graph=True to @nx._dispatchable like we discussed in the weekly dispatching meeting (although we didn't really discuss the argument name, so suggestions for a better name are welcome). I still need to change the logic to not convert-and-dispatch by default if a function returns a graph.

Even if we don't include graph generators/constructors, there were more functions that return graphs than I was expecting!

eriknw · 2024-01-29T19:32:30Z

Aha, our tests worked and found a new function that returns a graph (and needs returns_graph=True):
https://github.com/networkx/networkx/actions/runs/7700821863/job/20985472423?pr=7191#step:5:696

E           RuntimeError: `returns_graph` is incorrect for modular_product

This PR has been updated and is ready for review.

rlratzel · 2024-02-06T15:35:37Z

Thanks! This is a good topic for dispatching, especially as we work toward getting caching in for NX 3.3

This changes a behavior: if a function is known to mutate an input graph, then this does not automatically convert an input graph to a backend graph. Conversion can still happen by using backend= keyword.

Can you explain the reason for the behavior change? My understanding is that a function that mutates the input graph - as a side effect or otherwise - will not result in those updates being propagated to the NX Graph if dispatched, and therefore dispatching the call will result in different behavior (ie. a difference that breaks user code). What I'm not clear on is what the expectation is if a user forces the conversion/dispatch using backend=. Is this just use-at-your-own-risk, or are backends expected to properly mutate the NX Graph in that case? I'm assuming it's not the latter otherwise we'd then allow the automatic conversion/dispatch.

Either way, I think if a user sets NETWORKX_AUTOMATIC_BACKENDS=... then this new behavior will be surprising. Here's a couple of options:

backends that can't mutate the NX graph should return False for can_run, which means being strict about treating Graph mutations as part of the contract for the function that backends have to honor. I suppose if that was decided, then we wouldn't need mutates_input= and we'd have to enforce the proper mutations with tests. (this feels like the most work, but the right decision IMO).
somehow tell users we're making this decision for them: warning, logging, throw an exception by default which can be overridden by a config option, something else.

rlratzel · 2024-02-06T16:16:24Z

Chatted offline with @eriknw (Erik, please correct me as needed):

The bigger goal is to not break user code, so the change here to not auto-dispatch will avoid that completely since backends are not currently assumed to mutate the input graph
There's still an element of surprise for users using NETWORK_AUTOMATIC_BACKENDS=, but this is lower priority than preventing the breakage of user code
- A future effort will be made to prevent this surprise too, and there's overlap with this and should_run
- Introspection features ("tell me what you're going to do and why"), logging, and config are all topics/upcoming features related to solving this problem
We want one of the overriding principles to be "if I tell you to do this then don't decide for me" and therefore backend= will still allow a user to force dispatch to a particular backend. This allows power users, backend developers, etc. to dispatch even if graph mutations aren't supported.

…x#7191) * Add `mutates_input=True` to `_dispatch` * Add `returns_graph=True` to `_dispatchable` * Don't auto-convert for functions that return Graphs; also, updates * more comments * Make `returns_graph` attribute private (for now)

Add mutates_input=True to _dispatch

1e773fb

eriknw commented Dec 26, 2023

View reviewed changes

eriknw added 2 commits January 12, 2024 19:34

Merge branch 'main' into mutates_input

fd5fbf0

Add returns_graph=True to _dispatchable

21ab9fa

eriknw changed the title ~~Add mutates_input=True to _dispatch~~ Add mutates_input= and returns_graph= to _dispatchable Jan 29, 2024

Merge branch 'main' into mutates_input

f506cd1

eriknw added 2 commits January 29, 2024 10:57

Don't auto-convert for functions that return Graphs; also, updates

37bd7d7

Merge branch 'main' into mutates_input

52596f2

more comments

936de87

dschult added the type: Maintenance label Jan 30, 2024

eriknw mentioned this pull request Jan 30, 2024

Test whether function should use Yields instead of Returns #7258

Open

rossbar added the Dispatching Related to dispatching and backend support label Jan 31, 2024

Merge branch 'main' into mutates_input

a946de8

eriknw mentioned this pull request Feb 10, 2024

Automatically cache results from simple functions #7283

Draft

rlratzel approved these changes Feb 12, 2024

View reviewed changes

eriknw added 3 commits February 16, 2024 21:02

Merge branch 'main' into mutates_input

2f373fc

Merge branch 'main' into mutates_input

007c727

Make returns_graph attribute private (for now)

17b96ea

MridulS merged commit 5b9aec5 into networkx:main Feb 28, 2024
41 checks passed

jarrodmillman added this to the 3.3 milestone Feb 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `mutates_input=` and `returns_graph=` to `_dispatchable` #7191

Add `mutates_input=` and `returns_graph=` to `_dispatchable` #7191

eriknw commented Dec 26, 2023

eriknw Dec 26, 2023

eriknw Dec 26, 2023

eriknw commented Jan 29, 2024

eriknw commented Jan 29, 2024

rlratzel commented Feb 6, 2024

rlratzel commented Feb 6, 2024

Add mutates_input= and returns_graph= to _dispatchable #7191

Add mutates_input= and returns_graph= to _dispatchable #7191

Conversation

eriknw commented Dec 26, 2023

eriknw Dec 26, 2023

Choose a reason for hiding this comment

eriknw Dec 26, 2023

Choose a reason for hiding this comment

eriknw commented Jan 29, 2024

eriknw commented Jan 29, 2024

rlratzel commented Feb 6, 2024

rlratzel commented Feb 6, 2024

Add `mutates_input=` and `returns_graph=` to `_dispatchable` #7191

Add `mutates_input=` and `returns_graph=` to `_dispatchable` #7191