Reference implementation for EEP 56 #4638

juhlig · 2021-03-17T14:13:09Z

This PR reflects the current state of EEP 56 and will be updated along with it.

As the EEP has not settled down fully, this PR contains no additional tests and documentation (yet).

juhlig · 2021-03-23T14:27:13Z

Last commit adds test cases:

auto-shutdown with transient significant children
auto-shutdown with temporary significant children
(no) auto-shutdown when a significant child is a bystander, i.e. is restarted because of a sibling's death
escalation of auto-shutdown; a (top) supervisor has another supervisor as a significant child, and this (child) supervisor has a significant worker as a child; the test ensures that the exit of the worker child causes an auto-shutdown of the (child) supervisor, and this in turn causes an auto-shutdown of the (top) supervisor
checks for the map form of child specs to ensure that the new significant flag is validated
check for the new auto_shutdown sup flag to ensure that it is validated

Checks for nonsensical combinations, like having significant => true in the child specs of an auto_shutdown => never supervisor, or having significant => true in combination with restart => permanent, are currently not tested. In our opinion it is not fully settled whether those should be allowed and ignored, or forbidden altogether. A question posted by @Maria-12648430 on the EEPs mailing list yielded only a single reply, which was in favor of forbidding them.
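
As a minimal sketch of the scenario the first test case above covers, the module below defines a supervisor with auto_shutdown => any_significant and a single transient, significant child. The module name `eep56_demo_sup` and the helper `start_worker/0` are made up for illustration and are not part of the PR; only the two flags come from EEP 56.

```erlang
-module(eep56_demo_sup).
-behaviour(supervisor).

-export([start_link/0, start_worker/0]).
-export([init/1]).

start_link() ->
    supervisor:start_link(?MODULE, []).

%% Hypothetical worker: waits for a `stop` message, then exits normally.
start_worker() ->
    Pid = spawn_link(fun() -> receive stop -> ok end end),
    {ok, Pid}.

init([]) ->
    %% New supervisor flag from EEP 56.
    SupFlags = #{strategy => one_for_one,
                 auto_shutdown => any_significant},
    %% New child spec flag from EEP 56; the child is transient,
    %% so a normal exit does not trigger a restart.
    Worker = #{id => worker,
               start => {?MODULE, start_worker, []},
               restart => transient,
               significant => true},
    {ok, {SupFlags, [Worker]}}.
```

Assuming the EEP 56 semantics, sending `stop` to the worker lets it exit normally, and the supervisor should then shut itself down. A caller that is linked to the supervisor needs to trap exits if it wants to observe this rather than be taken down with it.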

juhlig · 2021-03-24T14:31:56Z

Latest commit refines and augments test cases.

IngelaAndin · 2021-03-26T07:57:42Z

I and @HansN feel positive about the way this is turning out. As we are waiting for OTP-24 RC2 we cannot test it in the builds just yet. However, I have run the tests myself with cover and I got some uncovered lines.

734 |   | do_auto_shutdown(_Child, State=#state{auto_shutdown = never}) ->
735 | 4020 | {ok, State};
736 |   | do_auto_shutdown(Child, State) when not ?is_significant(Child)->
737 | 1 | {ok, State};
738 |   | do_auto_shutdown(_Child, State=#state{auto_shutdown = any_significant}) ->
739 | 7 | {shutdown, State};
740 |   | do_auto_shutdown(_Child, State=#state{auto_shutdown = all_significant})
741 |   | when ?is_simple(State) ->
742 | :-( | case dyn_size(State) of
743 |   | 0 ->
744 | :-( | {shutdown, State};
745 |   | _ ->
746 | :-( | {ok, State}
747 |   | end;
748 |   | do_auto_shutdown(_Child, State=#state{auto_shutdown = all_significant}) ->
749 | 4 | case
750 |   | children_any(
751 |   | fun
752 |   | (_, #child{pid = undefined}) ->
753 | 3 | false;
754 |   | (_, #child{significant = true}) ->
755 | 2 | true;
756 |   | (_, _) ->
757 | :-( | false
758 |   | end,
759 |   | State#state.children
760 |   | )
761 |   | of
762 |   | true ->
763 | 2 | {ok, State};
764 |   | false ->
765 | 2 | {shutdown, State}
766 |   | end.
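
Read as plain logic, the clauses in the listing above boil down to the following. This is a simplified, stand-alone model and not the code from the PR: the module name `auto_shutdown_model`, the function `decide/3`, and the `{Pid, Significant}` tuples are assumptions made for illustration, and the simple_one_for_one branch (lines 740-747) is left out.

```erlang
-module(auto_shutdown_model).
-export([decide/3]).

%% decide(AutoShutdown, ExitedChildIsSignificant, Children) -> ok | shutdown
%% Children :: [{pid() | undefined, Significant :: boolean()}]
%% where `undefined` stands for a child that is not running.

%% Auto-shutdown disabled: never shut down.
decide(never, _ExitedIsSignificant, _Children) ->
    ok;
%% A non-significant child exited: nothing to do.
decide(_AutoShutdown, false, _Children) ->
    ok;
%% Any significant child exiting is enough.
decide(any_significant, true, _Children) ->
    shutdown;
%% Shut down only when no significant child is still running.
decide(all_significant, true, Children) ->
    StillRunning = fun({Pid, Significant}) ->
                           Pid =/= undefined andalso Significant
                   end,
    case lists:any(StillRunning, Children) of
        true -> ok;
        false -> shutdown
    end.
```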

juhlig · 2021-03-26T09:18:04Z

Hi @IngelaAndin,

I and @HansN feel positive about the way this is turning out.

Great =D

However, I have run the tests myself with cover and I got some uncovered lines.

Yes, the test suite is not 100% finished yet. I'm extending it and filling the gaps as time permits.

Maria-12648430 · 2021-03-26T09:36:07Z

@IngelaAndin

I and @HansN feel positive about the way this is turning out.

Nice :) I'm quite pleased about how this is rolling along, too.

I have been told that an upcoming OTB meeting will look into this. Other things aside, we would appreciate a decision (or preference or something) about the issue of allowing (ignoring) or forbidding option combinations that make no sense, like restart => permanent and significant => true, or auto_shutdown => never and significant => true.
The current state of this PR takes the simpler approach of allowing those. A question posted on the eeps list yielded only one answer (by @okeuday, I believe), and that one was in favor of forbidding them. If they are to be forbidden, we will need more tests, and it needs to be mentioned in the documentation (which we haven't started on yet).

IngelaAndin · 2021-03-26T10:02:01Z

@Maria-12648430 we will take it up at the meeting (next week).

Maria-12648430 · 2021-03-26T10:11:41Z

@IngelaAndin thanks :)

juhlig · 2021-03-26T14:18:28Z

@IngelaAndin I augmented the existing tests and added a new one, and the lines you pointed out are now covered. Depending on the decision concerning pointless option combinations, more tests will be added later to ensure the desired behavior. Also, there should be tests for upgrading, but that also depends, to a degree, on how pointless combinations are to be treated, so these will be added later, too.

IngelaAndin · 2021-03-29T09:13:51Z

Hi, thanks for the update. I will enable the tests again. (They are automatically disabled if you push something new.)

I can also report that your branch was breaking the test case upgrade_supervisor in the release_handler_SUITE of the sasl application and supervisor_incorrect_return in behaviour_SUITE of the dialyzer application. So you need to look into that.

juhlig · 2021-03-29T14:31:36Z

Last commit fixes the two failing tests pointed out by @IngelaAndin, and adds tests for upgrading auto_shutdown sup flags and significant child spec flags.

juhlig · 2021-03-30T16:09:06Z

Last commit handles invalid combinations of auto_shutdown and significant values.

HansN · 2021-03-31T13:57:01Z

There was an OTB yesterday that decided that:

We are positive about EEP-0056 and want to try to get it into OTP 24 in May and the release candidate OTP-24rc3 (mid April).

We want supported combinations of parameters to be as limited as possible. I.e. only support the obvious use cases (no corner cases).

Attempts to use unsupported combinations should result in error during setup.

Documentation of the supported combinations is important to have before introduction.

Please also change relevant parts in the supervisor documentation in "Design Principles". The file path is
$ERL_TOP/system/doc/design_principles/sup_princ.xml
and the URI at erlang.org is:
https://erlang.org/doc/design_principles/sup_princ.html
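
One way to see the "error during setup" behaviour without starting any processes is supervisor:check_childspecs/2, which this PR adds (see the commit list). The sketch below assumes the OTP 24 semantics. The module name `eep56_check` is invented, and `my_worker` is a hypothetical module that is never called, since only the shape of the specs is checked. The exact error terms are deliberately not shown, because they were still under discussion at this point.

```erlang
-module(eep56_check).
-export([valid/0, permanent_significant/0, never_significant/0]).

%% Child spec with the given restart type and `significant => true`.
%% `my_worker` is a hypothetical module used only as a placeholder.
spec(Restart) ->
    #{id => worker,
      start => {my_worker, start_link, []},
      restart => Restart,
      significant => true}.

%% Supported: a transient significant child under an auto-shutdown supervisor.
valid() ->
    supervisor:check_childspecs([spec(transient)], any_significant).

%% Unsupported: restart => permanent together with significant => true.
permanent_significant() ->
    supervisor:check_childspecs([spec(permanent)], any_significant).

%% Unsupported: significant => true under auto_shutdown => never.
never_significant() ->
    supervisor:check_childspecs([spec(transient)], never).
```

With the decision above implemented, the first call is expected to return ok and the other two an {error, Reason} tuple.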

Maria-12648430 · 2021-03-31T14:46:23Z

We are positive about EEP-0056 and want to try to get it into OTP 24 in May

Great 🥰

and the release candidate OTP-24rc3 (mid April).

That should be ample time, because this...

We want supported combinations of parameters to be as limited as possible. I.e. only support the obvious use cases (no corner cases).

Attempts to use unsupported combinations should result in error during setup.

... has already been done in @juhlig's last commit from yesterday, and this...

Documentation of the supported combinations is important to have before introduction.

Please also change relevant parts in the supervisor documentation in "Design Principles". The file path is
$ERL_TOP/system/doc/design_principles/sup_princ.xml
and the URI at erlang.org is:
https://erlang.org/doc/design_principles/sup_princ.html

... is what I'm starting on right now :)
I'll probably finish the documentation early next week. The implementation itself is ready for a review.

Maria-12648430 · 2021-04-15T09:15:48Z

@garazdawi my last commit addresses all the documentation changes you requested. Or rather, I did my best to address them ;)

The only thing I didn't provide is examples of how an automatic shutdown takes place. That would require some illustration with images, I think, so I'll leave that for later. Is there some sort of guideline for images in the documentation that I should know of?

garazdawi · 2021-04-15T09:22:09Z

Is there some sort of guideline for images in the documentation that I should know of?

We don't really have a guideline as such, but recently we have used Dia to draw things and then set up a makefile rule that can create the .png. The .png needs to be committed into git as we don't want to depend on ImageMagick in order to create the docs.

Maria-12648430 · 2021-04-15T09:58:35Z

We don't really have a guideline as such, but recently we have used Dia to draw things and then set up a makefile rule that can create the .png. The .png needs to be committed into git as we don't want to depend on ImageMagick in order to create the docs.

I can look into that next week I think.

Anyway, unless there are new change requests, we're now pretty much done here, right?

HansN · 2021-04-15T10:56:59Z

I have added 912d1fe to the nightly tests now. If there are not any errors and no one complains, I'll merge this PR tomorrow and we could open a new one with the items left:

The error message in supervisor:validSignificant/3 (needs more thinking/discussions)
Examples for how an automatic shutdown takes place.

Have I forgotten anything?
Is @garazdawi satisfied with the fixes committed so far?

garazdawi

Yes, we can merge this. I do however think that we can do a better job in the documentation, especially the documentation in the "OTP Design Principles".

In the "Stopping a Child Process" section I think it would be beneficial to include a small description about who can terminate a child (i.e. an external entity or the child itself) with a description of how you go about it for both scenarios. Maybe there should be two subsections here?

The "Supervisor Behaviour" chapter is not meant to be a restatement of what you can find in the supervisor reference manual, but rather a guide about how to use the features, when you should use them, and things to keep in mind.

Maria-12648430 · 2021-04-15T12:16:56Z

Yes, we can merge this. I do however think that we can do a better job in the documentation, especially the documentation in the "OTP Design Principles".

In the "Stopping a Child Process" section I think it would be beneficial to include a small description about who can terminate a child (i.e. an external entity or the child itself) with a description of how you go about it for both scenarios. Maybe there should be two subsections here?

The "Supervisor Behaviour" chapter is not meant to be a restatement of what you can find in the supervisor reference manual, but rather a guide about how to use the features, when you should use them, and things to keep in mind.

Sorryyyy, I'm doing the best I can 😢😢😢
j/k 😉

But seriously, I find it surprisingly difficult to fit new things like this into the existing document :( In the end, it probably boils down to a bigger restructuring/rewrite project for the chapter, something that requires quite some time and effort and many review cycles.

HansN · 2021-04-15T13:00:58Z

I think that rewriting documentation parts that are more of a general kind is out of scope for a "simple" add-a-feature PR like this. I agree that the supervisor Design Principles would benefit from a general brush-up, but that is a separate task in my opinion.

Maria-12648430 · 2021-04-15T13:36:55Z

@HansN my point exactly :)

IngelaAndin · 2021-04-15T13:39:07Z

Just had time to catch up on this. I agree with @HansN that we should merge this and make sure to make a new PR that enhances the error messages in time for OTP-24, but I would really like to see the implementation tested in RC3. I think that clear error messages are way more important than backwards compatible error messages. Error reasons are many times specified as term() for this purpose. In the ssl application we have some errors that are specified as {alert_atom(), term()}, as in those cases we know that the error will always result in a TLS alert that you might want to match on. But normal error reasons are for debugging and should not be matched on.

While I agree with @garazdawi that good docs are important, I also agree with @HansN that we cannot really expect @Maria-12648430 to solve all our legacy problems with the existing documentation as part of this PR. Hopefully @Maria-12648430 would be interested to collaborate to make further improvements, as this will also benefit the new feature added here.

garazdawi · 2021-04-15T14:06:37Z

It was not my intention to say that I expected a rewrite of the entire "Supervisor Behaviour" chapter. My wording was apparently poor as all three of you seem to have interpreted it as such. My apologies.

This PR adds new functionality around how supervisors terminate and how child processes can trigger such termination. As such I think that it would be reasonable to take a larger look at those specific sections in the "Supervisor Behaviour" chapter.

garazdawi · 2021-04-15T14:13:32Z

An example of what I am missing right now is a discussion under "Stopping a Child Process" of why it is a bad idea to call the supervisor:terminate_child/2 function from within the child to be terminated.
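
A sketch of the alternative, on the assumption that the usual explanation applies: supervisor:terminate_child/2 is a synchronous call, so a child calling it on itself blocks waiting for the supervisor while the supervisor waits for that same child to exit. The gen_server below instead stops itself by returning a stop tuple. The module name `eep56_self_stop` and the `finish/1` function are invented for this example.

```erlang
-module(eep56_self_stop).
-behaviour(gen_server).

-export([start_link/0, finish/1]).
-export([init/1, handle_call/3, handle_cast/2]).

start_link() ->
    gen_server:start_link(?MODULE, [], []).

%% Ask the server to finish its work and stop on its own.
finish(Pid) ->
    gen_server:cast(Pid, finish).

init([]) ->
    {ok, #{}}.

handle_call(_Request, _From, State) ->
    {reply, ok, State}.

%% Stop from within by returning a stop tuple. The process exits with
%% reason `normal`, and its supervisor (if any) sees an ordinary exit.
handle_cast(finish, State) ->
    {stop, normal, State};
handle_cast(_Msg, State) ->
    {noreply, State}.
```

Run as a transient, significant child, such a process exiting normally is the kind of event that can trigger an automatic shutdown of its supervisor.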

IngelaAndin · 2021-04-15T14:15:48Z

@garazdawi I then think that those smaller document enhancements could come with the improved error messages for OTP-24. But I still would like us to merge the code before RC3 so that we get as much regression testing as possible ;)

garazdawi · 2021-04-15T15:16:41Z

@garazdawi I then think that those smaller document enhancements could come with the improved error messages for OTP-24. But I still would like us to merge the code before RC3 so that we get as much regression testing as possible ;)

Yes, I agree.

HansN · 2021-04-16T08:17:03Z

A big thank you for the PR!

lhoguin · 2021-04-16T08:34:56Z

Congratulations to everyone involved! It's a big one!

Maria-12648430 · 2021-04-16T08:43:08Z

Good morning everyone :)

It was not my intention to say that I expected a rewrite of the entire "Supervisor Behaviour" chapter. My wording was apparently poor as all three of you seem to have interpreted it as such. My apologies.

@garazdawi or maybe my wording was bad first ^^; When I brought up rewriting, it was not because I assumed that you wanted me to do it right here and now. What I really wanted to say is that I can't make my additions fit nicely into the existing chapter and a rewrite (or brush-up, as @HansN put it) might be the best way to make it nice and consistent again.

Hopefully @Maria-12648430 would be interested to collaborate to make further improvements, as this will also benefit the new feature added here.

@IngelaAndin Sure, I'll do my best :) Deadline for documentation is end of April, I believe?

Maria-12648430 · 2021-04-16T08:44:04Z

Oh, yay, merged =D

HansN reviewed Mar 18, 2021

HansN added the enhancement, priority:high, team:PS, team:VM, and testing labels Mar 18, 2021

juhlig force-pushed the eep56 branch from 7149a80 to 480bd19 Compare March 18, 2021 13:59

juhlig mentioned this pull request Mar 18, 2021

supervisor: add restart type "intrinsic" #4521

Closed

lhoguin approved these changes Mar 18, 2021

HansN removed the testing label Mar 19, 2021

juhlig force-pushed the eep56 branch from ef19da8 to abfa09b Compare March 24, 2021 13:58

IngelaAndin added the testing label Mar 26, 2021

juhlig force-pushed the eep56 branch from abfa09b to ac796a8 Compare March 26, 2021 14:06

juhlig force-pushed the eep56 branch from ac796a8 to 794b4c7 Compare March 29, 2021 08:57

juhlig force-pushed the eep56 branch from 794b4c7 to 5ee6cfb Compare March 29, 2021 14:29

HansN self-assigned this Mar 31, 2021

HansN assigned IngelaAndin Mar 31, 2021

juhlig and others added 7 commits April 14, 2021 16:09

Disallow invalid auto_shutdown and significant combinations

3c86822

Tests for terminations of non-significant children

55aba30

Add supervisor:check_childspecs/2

405656e

Docs for supervisor auto-shutdown

39e3e5a

Add more sections to supervisor introduction

bf422e6

Add support for more section nesting depth in chunks

d54dcee

Extend supervisor significant_bystander test case

04ddf2c

juhlig force-pushed the eep56 branch from 63f36cc to 04ddf2c Compare April 14, 2021 14:40

Improve docs for supervisor auto-shutdown

912d1fe

garazdawi approved these changes Apr 15, 2021

Update lib/stdlib/doc/src/supervisor.xml

0a52f11

Co-authored-by: Lukas Larsson <garazdawi@gmail.com>

HansN merged commit 07cced1 into erlang:master Apr 16, 2021

juhlig mentioned this pull request Apr 19, 2021

supervisor: refine error reasons for bad combinations of options #4746

Merged

Maria-12648430 mentioned this pull request Apr 20, 2021

supervisor: improve and augment docs for automatic shutdown #4751

Merged
