Combined run_cell and execute_cell #12

MSeal · 2020-02-04T05:51:17Z

Addresses #11
Changed the default for store_history to False (was intended to be changed with nbconvert 6.0). In doing so I found an issue with ipython kernel execution, where execution counts were not tracked when history is disabled. I've made nbclient manage execution_count itself by default now with the ability to control the behavior in execute_cell.
Also removed extra debug prints left in tests

MSeal · 2020-02-04T08:16:55Z

I would like to break execute_cell into smaller parts, but given it's complexity from nbconvert and all the tracking it does I wait to tackle that later on.

MSeal · 2020-02-05T20:57:30Z

@choldgraf if you wouldn't mind taking a look at this one, I'd like to merge it before the initial release and before more PRs get made that'd be in conflict.

choldgraf · 2020-02-05T21:15:59Z

Hey, will try to get to it soon. Sorry, I just finished a flight to Australia 🇦🇺

MSeal · 2020-02-05T21:17:18Z

No worries, I really appreciate all the reviews you've been doing. If you can't get to it for a bit that's fine too.

choldgraf

This all looks pretty reasonable to me - what was the differing purposes of run_cell and execute_cell in the first place? I had one question about execute_cells args, but other than that this looks like a nice simplification to me

choldgraf · 2020-02-10T11:25:01Z

nbclient/execute.py

+        if execution_count:
+            cell['execution_count'] = execution_count
+        self._check_raise_for_error(cell, exec_reply)
+        self.nb['cells'][cell_index] = cell


since we've already got access to the notebook in self.nb, couldn't this method be simplified by simply providing a cell_index rather than cell and cell_index? Then the first thing that could be done is running

cell = self.nb['cells'][cell_index]

since we've already got access to the notebook in self.nb

I think that's actually a bad thing.
The notebook should not be stored in the executor!

This mistake was made in nbconvert, where the notebook was passed to the preprocess() method and subsequently stored in the executor:

https://github.com/jupyter/nbconvert/blob/8c9bffc8deb65ced99b20684ddd148fd885ad9e8/nbconvert/preprocessors/execute.py#L404

A later preprocess() call could overwrite the locally stored notebook.
Therefore, the preprocess() call was not re-entrant.

I've already mentioned this problem in nbconvert: jupyter/nbconvert#886

During the move to nbclient, the situation seems to have gotten worse!

Now the notebook seems to be passed to the constructor of the executor, which means one executor instance can only ever execute a single notebook.
Is that really what you want?

To me, it would make much more sense if one executor instance could execute multiple notebooks in a row. Ideally also concurrently.

[UPDATE: I've created a separate issue for this: #18]

Now the notebook seems to be passed to the constructor of the executor, which means one executor instance can only ever execute a single notebook.
Is that really what you want?

Yes. There's a lot of state it holds for this particular execution and all the configuration for the execution means that having a owning object makes it a lot easier to manage this state and object manipulation. Otherwise we would have to pass many many arguments to every function, making the function signatures unwieldy.

A later preprocess() call could overwrite the locally stored notebook.
Therefore, the preprocess() call was not re-entrant.
...

During the move to nbclient, the situation seems to have gotten worse!

The issue in nbconvert was that it was half-way between abstractions. There were some functions that were reusable and some that were not without a new object. This class should now be re-entrant on execution methods at the top. .execute called twice will execute the whole notebook twice without having prior run state cause issues (I should add a test for this). I don't think it's "worse", it just more an object oriented approach than a functional one. in this case with lots of state and configuration object oriented has a lot of benefits over functional.

The rename that merged for changing to NotebookClient should help with the abstraction of concerns. Specifically it's denoting it acts on behalf of a notebook, meaning it should hold the state of the said notebook.

To me, it would make much more sense if one executor instance could execute multiple notebooks in a row. Ideally also concurrently.

I didn't design for this pattern. We could have designed for that but I choose to keep it closer to how the code was before to reduce the number of changes -- also because I'm not sure that's strictly a better model. You do have the execute method for repeating calls to the client object to execute everything.

I didn't design for this pattern. We could have designed for that but I choose to keep it closer to how the code was before to reduce the number of changes -- also because I'm not sure that's strictly a better model.

This is pragmatic. I'd like to see the code evolve as well to be more functional, but at the same time we're still in this annoying mess of very class oriented dynamic traitlets for configuration in Jupyterland.

MSeal · 2020-02-10T17:56:01Z

@choldgraf

This all looks pretty reasonable to me - what was the differing purposes of run_cell and execute_cell in the first place? I had one question about execute_cells args, but other than that this looks like a nice simplification to me

Mostly an artifact of organic growth of the code in nbconvert I think. There were competing thoughts of how the interface should operate that weren't reconciled. One of the two was doing some state prep before it did an almost stateless execution call. Not necessary at this point imo.

rgbkrk · 2020-02-10T18:09:49Z

I think after the conflict is cleaned up you should go ahead with this simplification before taking it even further for a refactor. 👍

choldgraf · 2020-02-10T20:03:41Z

I agree w/ @rgbkrk - we can tidy-up and extend the package over time, but I don't think any of that should block on an initial release, and this is a clear improvement to me!

choldgraf · 2020-02-11T06:52:06Z

Woo 🎉

MSeal requested a review from choldgraf February 4, 2020 05:51

MSeal mentioned this pull request Feb 10, 2020

Replace NBConvert with NBClient nteract/papermill#472

Merged

choldgraf reviewed Feb 10, 2020

View reviewed changes

mgeier mentioned this pull request Feb 10, 2020

"class-instead-of-function" anti-pattern in Executor class #18

Closed

rgbkrk approved these changes Feb 10, 2020

View reviewed changes

Combined run_cell and execute_cell

c81e1a9

MSeal force-pushed the execute_cell branch from cd1c363 to c81e1a9 Compare February 11, 2020 06:41

MSeal merged commit 8d5fc8c into jupyter:master Feb 11, 2020

MSeal deleted the execute_cell branch February 11, 2020 08:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Combined run_cell and execute_cell #12

Combined run_cell and execute_cell #12

MSeal commented Feb 4, 2020

MSeal commented Feb 4, 2020

MSeal commented Feb 5, 2020

choldgraf commented Feb 5, 2020

MSeal commented Feb 5, 2020 •

edited

Loading

choldgraf left a comment

choldgraf Feb 10, 2020 •

edited

Loading

mgeier Feb 10, 2020 •

edited

Loading

MSeal Feb 10, 2020

rgbkrk Feb 10, 2020

MSeal commented Feb 10, 2020

rgbkrk commented Feb 10, 2020

choldgraf commented Feb 10, 2020

choldgraf commented Feb 11, 2020 •

edited

Loading

Combined run_cell and execute_cell #12

Combined run_cell and execute_cell #12

Conversation

MSeal commented Feb 4, 2020

MSeal commented Feb 4, 2020

MSeal commented Feb 5, 2020

choldgraf commented Feb 5, 2020

MSeal commented Feb 5, 2020 • edited Loading

choldgraf left a comment

Choose a reason for hiding this comment

choldgraf Feb 10, 2020 • edited Loading

Choose a reason for hiding this comment

mgeier Feb 10, 2020 • edited Loading

Choose a reason for hiding this comment

MSeal Feb 10, 2020

Choose a reason for hiding this comment

rgbkrk Feb 10, 2020

Choose a reason for hiding this comment

MSeal commented Feb 10, 2020

rgbkrk commented Feb 10, 2020

choldgraf commented Feb 10, 2020

choldgraf commented Feb 11, 2020 • edited Loading

MSeal commented Feb 5, 2020 •

edited

Loading

choldgraf Feb 10, 2020 •

edited

Loading

mgeier Feb 10, 2020 •

edited

Loading

choldgraf commented Feb 11, 2020 •

edited

Loading