feat: task graph html repr #99

jokasimr · 2024-01-11T16:44:01Z

Fixes #70

jokasimr · 2024-01-11T16:44:52Z

Here's a suggestion for how the task graph repr could look:

The entries under "based on" lists only the inputs to the graph, not the intermediate results.

jokasimr · 2024-01-11T17:07:26Z

Case with ParamTables:

SimonHeybrock · 2024-01-12T07:30:19Z

src/sciline/task_graph.py

+                '<p>Task graph computes</p>\n',
+                _list_items(leafs),
+                '<p>based on</p>\n',
+                _list_max_n_then_hide(roots),
+                f'<p>using the scheduler {scheduler}.</p>',


I must say not a fan of the "narrative" formatting. How about:

Output keys: A, B, C Scheduler: DaskScheduler Providers: ....

For the dask scheduler, it would be very helpful if we can include which underlying dask scheduler (get function) is used, see here: https://scipp.github.io/sciline/generated/classes/sciline.scheduler.DaskScheduler.html.

It was a bit of an experiment ;) I'll change it.

However I don't think the last column should be "Providers" but rather something like "Input parameters"?
Or do you think it would be better to list all providers in the task graph rather than only the inputs?

Once the user has selected what output they are interested in I think the intermediate providers are kind of uninteresting, while the inputs that contribute to a particular output might be interesting for debugging purposes / understanding what is needed to compute a particular quantitiy.

Once the user has selected what output they are interested in I think the intermediate providers are kind of uninteresting

Maybe? We can only list the inputs for now.

jokasimr · 2024-01-12T09:47:45Z

New style

SimonHeybrock · 2024-01-12T09:55:22Z

src/sciline/task_graph.py

+    )
+
+
+def _list_max_n_then_hide(items: Sequence[str], n: int = 5, header: str = '') -> str:


Good function name 👍

SimonHeybrock · 2024-01-12T09:57:48Z

src/sciline/task_graph.py

+            {
+                escape(keyname(key))
+                for key, (_, requires) in self._graph.items()
+                if len(requires) == 0


This is not a sufficient check for an input param. One can define providers without inputs. We should use the same check as in other places (Pipeline repr, graph visualization).

Hm I'm not sure I understand. Is there a conceptual difference between a parameter and a provider without inputs?

Yes, a parameter is typically something a user sets, i.e., it would typically be different every time the pipeline is used (and therefore seeing its value is valuable). A provider is more static and would typically not change (and we cannot get its value without calling it, which may be expensive).

SimonHeybrock · 2024-01-12T09:59:33Z

src/sciline/task_graph.py

+                if len(requires) == 0
+            }
+        )
+        scheduler = escape(self._scheduler.__class__.__name__)


Can you include the underlying dask scheduler, this is important information. Add a method to the schedulers? Not sure if __str__ or __repr__ would make sense for them, maybe?

SimonHeybrock · 2024-01-12T10:00:34Z

src/sciline/utils.py

+def keyname(key: Key) -> str:
+    if isinstance(key, Item):
+        return f'{keyname(key.tp)}({keyname(key.label[0].tp)})'
+    args = get_args(key)
+    if len(args):
+        parameters = ', '.join(map(keyname, args))
+        return f'{qualname(key)}[{parameters}]'
+    return qualname(key)


How does this compare to what is used in visualize? Can we use the same code?

Yes I think we can.

jokasimr · 2024-01-12T10:07:52Z

New version displaying some more details about the scheduler

jokasimr requested a review from SimonHeybrock January 11, 2024 16:44

SimonHeybrock reviewed Jan 12, 2024

View reviewed changes

SimonHeybrock approved these changes Jan 15, 2024

View reviewed changes

jokasimr added 2 commits January 15, 2024 11:36

feat: task graph html repr

9c5f801

refactor: clean up and reuse some of the logic in pipline.__repr__

805eeb3

jokasimr force-pushed the html-repr-for-taskgraph branch from c699e1c to 805eeb3 Compare January 15, 2024 10:37

jokasimr enabled auto-merge January 15, 2024 10:38

jokasimr merged commit 7951e09 into main Jan 15, 2024
5 checks passed

jokasimr deleted the html-repr-for-taskgraph branch January 15, 2024 10:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: task graph html repr #99

feat: task graph html repr #99

jokasimr commented Jan 11, 2024

jokasimr commented Jan 11, 2024 •

edited

Loading

jokasimr commented Jan 11, 2024

SimonHeybrock Jan 12, 2024

jokasimr Jan 12, 2024 •

edited

Loading

SimonHeybrock Jan 12, 2024

jokasimr commented Jan 12, 2024

SimonHeybrock Jan 12, 2024

SimonHeybrock Jan 12, 2024

jokasimr Jan 12, 2024

SimonHeybrock Jan 12, 2024

SimonHeybrock Jan 12, 2024

SimonHeybrock Jan 12, 2024

jokasimr Jan 12, 2024

jokasimr commented Jan 12, 2024

		)


		def _list_max_n_then_hide(items: Sequence[str], n: int = 5, header: str = '') -> str:

feat: task graph html repr #99

feat: task graph html repr #99

Conversation

jokasimr commented Jan 11, 2024

jokasimr commented Jan 11, 2024 • edited Loading

jokasimr commented Jan 11, 2024

Choose a reason for hiding this comment

jokasimr Jan 12, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jokasimr commented Jan 12, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jokasimr commented Jan 12, 2024

jokasimr commented Jan 11, 2024 •

edited

Loading

jokasimr Jan 12, 2024 •

edited

Loading