PoC library simplification #6

samuelallan72 · 2022-08-09T23:09:00Z

Here is an aggressive PoC rewrite to simplify the library,
while retaining most of the functionality.

Some specific points:

Remove the special MasterStatus class. This design was awkward,
because it was a subclass of Status, but provided functionality
that should have been provided by the StatusPool,
resulting in a lot of edge case code.
- Merge required functionality into the StatusPool.
Remove unnecessary classes; the clobberer classes could be pure functions.
Rename clobberer to summarizer
Leverage more existing code:
- use UnknownStatus as-is
- use StatusBase.from_name
Remove some magic, but retain just enough
to still support pool.status_name get/set access shortcuts.
Remove the static subclass based approach to defining the pool
(it's cool, but feels too magical, and I think in practice
the procedural method of building the pool will be more practical).
Consolidate status tag and attr into a single name.
Remove the externally managed private variables from Status class.
Remove static class variables from Status class (originally for tracking ids for ordering)

Fixes #7
Fixes #8
Fixes #9
Fixes #10
Fixes #11
Fixes #12
Fixes #13
Fixes #14
Fixes #15
Fixes #16
Fixes #17
Fixes #19
Fixes #20
Fixes #21
Fixes #22

Here is an aggressive PoC rewrite to simplify the library, while retaining most of the functionality. Some specific points: - Remove the special `MasterStatus` class. This design was awkward, because it was a subclass of Status, but provided functionality that should have been provided by the StatusPool, resulting in a lot of edge case code. - Merge required functionality into the StatusPool. - Remove unnecessary classes; the clobberer classes could be pure functions. - Rename clobberer to summarizer - Leverage more existing code: - use UnknownStatus as-is - use StatusBase.from_name - Remove some magic, but retain just enough to still support `pool.status_name` get/set access shortcuts. - Remove the static subclass based approach to defining the pool (it's cool, but feels too magical, and I think in practice the procedural method of building the pool will be more practical). - Consolidate status tag and attr into a single name. - Remove the externally managed private variables from Status class. - Remove static class variables from Status class (originally for tracking ids for ordering)

PietroPasotti · 2022-08-10T07:56:34Z

The main reason behind the static pool definition with Statuses as descriptors was the code completion/typing it enabled.
One solution to this would be to add() a status on __setattr__ if not present already.
This would mean __init__ looks like:

self.status = StatusPool(self, skip_unknown=True, auto_commit=self.AUTO_COMMIT)
self.status.workload = Status("workload")
self.status.relation_1 = Status("relation_1")
self.status.relation_2 = Status("relation_2")

While I see that dynamically adding things works just as well, I'm not convinced it's more ergonomic. A downside of doing it this way is that it clutters __init__. Too many lines. But that's just my very personal take on style 😬
Of course, we could late-bind the Pool like:

I'm going to give it some thought.
But thanks for the input! I'll definitely use most of this and publish a new major release.
@rwcarlsen also had some ideas about how to strip this library down, I'll ask him to give this some thought as well.

PietroPasotti · 2022-08-10T08:01:31Z

Another thought on the dynamic-adding versus statically defined:
do we validate the name/tag, to raise if it's not a valid python descriptor? Otherwise we have a weird asymmetry where

self.status.add(Status('foo'))
self.status.foo # works
self.status.add(Status('foo-*123124sdXX []')) 
self.status. ??? # borks

samuelallan72 · 2022-08-10T08:12:55Z

The main reason behind the static pool definition with Statuses as descriptors was the code completion/typing it enabled.

I'm not a fan of designing hacky or magic code to optimise for editor code completion. Firstly because this is python; if we wanted real typing we would use a language with real typing, so we just need to grimace and bear it (ie. write nice pythonic code and deal with the lack of perfect typing). Secondly, I've seen this kind of thing before - it seems really nice to have great autocompletion, until you look back at the unmaintainable code that made it possible. Gotta make a trade off here.

While I see that dynamically adding things works just as well, I'm not convinced it's more ergonomic. A downside of doing it this way is that it clutters init. Too many lines.

How about this - doesn't clutter init:

def build_status_pool(charm):
  pool = StatusPool(charm, skip_unknown=True, auto_commit=True)
  pool.workload = Status("workload")
  pool.relation_1 = Status("relation_1")
  pool.relation_2 = Status("relation_2")
  return pool

class Charm(...):
  def __init__(self, ...):
    self.status_pool = build_status_pool(self)

Of course, we could late-bind the Pool like:

Huh I've never seen that before in python. What's going on there? If this is simply so the pool can be dynamically built at top-level code, does a build_status_pool(charm) -> StatusPool function acheive the same thing without this kind of binding?

do we validate the name/tag, to raise if it's not a valid python descriptor? Otherwise we have a weird asymmetry ...

I say no. There's still the non-syntax sugar method of accessing the status from the pool, and if a dev wants to name a status foo-*123124sdXX [], then that's on them. :P

samuelallan72 · 2022-08-10T08:31:44Z

do we validate the name/tag, to raise if it's not a valid python descriptor? Otherwise we have a weird asymmetry ...

I say no. There's still the non-syntax sugar method of accessing the status from the pool, and if a dev wants to name a status foo-*123124sdXX [], then that's on them. :P

To add more about why this is fine...

Also note that there are two options for setting a name here:

static string
dynamically

In case 1, static string, the dev has full control and can ensure they set it to something that can be used as an attribute name if they want to use that.

In case 2, dynamically, you're not going to be using the attribute shorthand anyway. eg.

name = some_function_that_returns_a_string()
pool.add(Status(name))
pool.... # cannot access this way because you don't know what `name` is at edit time
getattr(pool, name) # works regardless if `name` is not valid syntax for an attribute name
pool.get_status(name) # also works

rwcarlsen

In general, I think there needs to be some pondering, discussion, and agreement on how exactly we want the user experience/api to evolve before engaging in major implementation mods. Otherwise we'll just be code thrashing unnecessarily and will just settle on something less than optimal because we got worn out writing/reviewing code.

rwcarlsen · 2022-08-10T13:49:58Z

README.md

+        status_pool = StatusPool(self)
+        status_pool.add(Status("workload"))  # this tracks workload status
+        status_pool.add(Status("relation_1"))  # this tracks my integration #1
+        status_pool.add(Status("relation_2"))  # this tracks my integration #2


What about something like:

status_pool = StatusPool(self, 'workload', 'relation_1', 'relation_2')

rwcarlsen · 2022-08-10T16:04:56Z

README.md

+        status_pool.set_status("relation_1", ActiveStatus('✅'))
+        status_pool.relation_2 = ActiveStatus('✅')
+
+        workload_status = status_pool.workload
+        workload_status.status = ActiveStatus('✅')


Seems like trying to accommodate everything and the kitchen sink here is a bit much. I'd prefer we settle on one canonical way to set/retrieve status. I'd almost be inclined to just do status_pool['relation_1'] = ActiveStatus()

rwcarlsen · 2022-08-10T16:29:14Z

compound_status.py

+        self,
+        charm: CharmBase,
+        key: str = "status_pool",
+        summarizer: Callable[[List[Status], bool], str] = summarize_worst_only,


The issue here would be if the library becomes widely used, we'd get divergence between what gets reported in juju status - since everyone will be tempted to customize it differently. I think I'd prefer we just have one canonical presentation of overall status - unchangeable.

Yeah I have to agree, although I do like the customisability. I left some thoughts on the internal design doc about this too.

rwcarlsen · 2022-08-10T16:34:49Z

As a general comment though, I do agree that auto-completion should not be the primary driver for implementation design - although it is certainly an important consideration.

samuelallan72 · 2022-08-11T00:08:22Z

In general, I think there needs to be some pondering, discussion, and agreement on how exactly we want the user experience/api to evolve before engaging in major implementation mods. Otherwise we'll just be code thrashing unnecessarily and will just settle on something less than optimal because we got worn out writing/reviewing code.

I think at this stage it will benefit from actually using this library in some charms, and iterating on the api based on usage patterns and painpoints that emerge. This early stage of development is really the only opportunity we'll have to make many breaking changes, as nothing is depending on it yet.

There's an adage that says one should write client code first. As in, write PoC code in a client (ie. a charm) how it most makes sense in context, then write the library/api code based on how you found you needed to use it. This is useful because it avoids writing an api that you imagine to be useful and ergonomic, but then discovering it's not so much when actually using it in a real world charm.

This has kind of been done with the example charm, but it's still difficult to simulate real-world use cases from a small example. For example, with the experimenting I've been doing the last couple of days to integrate into a sunbeam charm, it's become apparent that pretty much all statuses will need to be dynamically added to the pool via add_status, so the static class-based declaration of the pool may not actually be that useful perhaps (I could be wrong, but that's just an example).

This is why I'm in favour of drastically reducing the complexity and featureset here, then gradually adding features back in as we find a need (which will include iterating on the api to ensure the new features form a coherent set).

compound_status.py

samuelallan72 · 2022-08-15T00:28:07Z

fyi I'm now using this branch to iterate as I experiment with using it. See related patches:

charm-glance-k8s: https://review.opendev.org/c/openstack/charm-glance-k8s/+/852688
charm-ops-sunbeam: https://review.opendev.org/c/openstack/charm-ops-sunbeam/+/852796

samuelallan72 · 2022-08-16T22:49:44Z

Closing because we're going to avoid a major rewrite here, in favour of smaller iterations and prototyping elsewhere.

rwcarlsen reviewed Aug 10, 2022

View reviewed changes

samuelallan72 commented Aug 12, 2022

View reviewed changes

compound_status.py Show resolved Hide resolved

Samuel Walladge added 5 commits August 12, 2022 13:31

add pyproject so can install with pip

905299f

minor api change

34fb847

more api updates

dabdfe3

remove unused method

f2c99e8

ensure never set unknown status

406c68f

samuelallan72 closed this Aug 16, 2022

PietroPasotti mentioned this pull request Aug 19, 2022

Experiment with PoolableStatuses #25

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PoC library simplification #6

PoC library simplification #6

samuelallan72 commented Aug 9, 2022 •

edited

PietroPasotti commented Aug 10, 2022

PietroPasotti commented Aug 10, 2022

samuelallan72 commented Aug 10, 2022

samuelallan72 commented Aug 10, 2022 •

edited

rwcarlsen left a comment •

edited

rwcarlsen Aug 10, 2022

rwcarlsen Aug 10, 2022

rwcarlsen Aug 10, 2022

samuelallan72 Aug 10, 2022

rwcarlsen commented Aug 10, 2022

samuelallan72 commented Aug 11, 2022 •

edited

samuelallan72 commented Aug 15, 2022

samuelallan72 commented Aug 16, 2022

PoC library simplification #6

PoC library simplification #6

Conversation

samuelallan72 commented Aug 9, 2022 • edited

PietroPasotti commented Aug 10, 2022

PietroPasotti commented Aug 10, 2022

samuelallan72 commented Aug 10, 2022

samuelallan72 commented Aug 10, 2022 • edited

rwcarlsen left a comment • edited

Choose a reason for hiding this comment

rwcarlsen Aug 10, 2022

Choose a reason for hiding this comment

rwcarlsen Aug 10, 2022

Choose a reason for hiding this comment

rwcarlsen Aug 10, 2022

Choose a reason for hiding this comment

samuelallan72 Aug 10, 2022

Choose a reason for hiding this comment

rwcarlsen commented Aug 10, 2022

samuelallan72 commented Aug 11, 2022 • edited

samuelallan72 commented Aug 15, 2022

samuelallan72 commented Aug 16, 2022

samuelallan72 commented Aug 9, 2022 •

edited

samuelallan72 commented Aug 10, 2022 •

edited

rwcarlsen left a comment •

edited

samuelallan72 commented Aug 11, 2022 •

edited