Add implicit model generation for `load_all_yaml` and `from_dict` #34

ca-scribner · 2022-04-28T19:23:24Z

Foreword

This PR proposes a convenience for working with batches of YAML. I'm not married to the API, and the code does not yet have a full test suite; I'm mainly curious whether you're interested in a feature like this and, if so, what suggestions or requests you have for the API. If you are, I'll flesh it out further.

Proposed feature

Sometimes, code might want to apply a YAML file that contains CustomResources it does not know about (or does not otherwise need to know about). Currently, we need to instantiate generic resources for these objects before passing their definitions to the cluster for action (say, to create an instance of a CustomResource). This proposal tries to make using CustomResources from YAML a bit more convenient by adding an optional client argument to codecs.from_dict and codecs.load_all_yaml. If from_dict or load_all_yaml is passed a Client via this argument, then whenever it finds an unknown generic resource it tries to implicitly load that resource's definition from the cluster. If this does not work, it fails as before. This allows YAML to be applied without knowing ahead of time which CRs might be defined in it.

Implementation details

We call client.list(CustomResourceDefinition) to look for the resource we are currently missing, creating a GenericResource for it if found. If a Client is not provided to codecs.from_dict or codecs.load_all_yaml, the existing behaviour is unchanged.

When trying to create an unknown resource (a resource not in the core Kubernetes set and not previously defined as a GenericResource) using codecs.from_dict or codecs.load_all_yaml, this change attempts to generate a generic model for the unknown resource. Using a provided, initialized lightkube Client, we call client.list(CustomResourceDefinition) to look for the resource we are currently missing, creating a GenericResource for it if found. If a Client is not provided to codecs.from_dict or codecs.load_all_yaml, the behaviour is unchanged.
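The lookup-on-miss behaviour described above can be sketched with stand-in objects. This is only an illustration under stated assumptions: `FakeClient`, `REGISTRY` and `resolve_model` are hypothetical names that are not part of lightkube, and CRDs are represented as plain dicts rather than lightkube resources.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical stand-in for the table of already-known generic resources,
# keyed by (apiVersion, kind).
REGISTRY: Dict[Tuple[str, str], str] = {}


class FakeClient:
    """Stand-in for a cluster client that can list CRDs (as plain dicts)."""

    def __init__(self, crds: List[dict]):
        self._crds = crds

    def list_crds(self) -> List[dict]:
        return list(self._crds)


def resolve_model(api_version: str, kind: str,
                  client: Optional[FakeClient] = None) -> str:
    """Return the registered model name, consulting the cluster only on a miss."""
    key = (api_version, kind)
    if key in REGISTRY:
        return REGISTRY[key]
    if client is not None:
        for crd in client.list_crds():
            spec = crd["spec"]
            for version in spec["versions"]:
                candidate = (f"{spec['group']}/{version['name']}",
                             spec["names"]["kind"])
                if candidate == key:
                    # Found a matching CRD: register a generic model for it.
                    REGISTRY[key] = f"Generic{kind}"
                    return REGISTRY[key]
    # Without a client, or if no CRD matches, fail as before.
    raise LookupError(f"no model found for {api_version} {kind}")


crontab_crd = {
    "spec": {
        "group": "stable.example.com",
        "names": {"kind": "CronTab", "plural": "crontabs"},
        "versions": [{"name": "v1"}],
    }
}
client = FakeClient([crontab_crd])
model = resolve_model("stable.example.com/v1", "CronTab", client)
```

The cluster is only queried when the local lookup fails, so callers that pass no client see no change in behaviour.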

…ployment) Previously, the code would try to create a generic model for native objects like apps/v1/deployment. This change delays attempting to infer a model until after we've tried to import it as a resource that is fully modeled by lightkube (i.e., not generic).

gtsystem · 2022-04-29T20:12:51Z

Interesting idea. I was thinking we can make this more reusable and explicit by just adding a function to load CRDs from the cluster.

Usage example:

from lightkube import Client, codecs
from lightkube import generic_resource as gr

client = Client()
gr.load_in_cluster_generic_resources(client)   # here we explicitly load all CRDs, we can also return them 

...
with open('deploy.yaml') as f:
    codecs.load_all_yaml(f)

Same can be used to get a resource to be used for read/write:

from lightkube import Client
from lightkube import generic_resource as gr

client = Client()
gr.load_in_cluster_generic_resources(client)
Job = gr.get_generic_resource("stable.example.com/v1", "Job")  # get the class representing this resource
client.get(Job, name="my-job")  # "my-job" is a placeholder name; pass namespace=... for namespaced resources

ca-scribner · 2022-05-10T12:29:50Z

Sorry for the slow response.

I like the idea of a function specific for this purpose of loading the cluster's resources, that sounds like a nice helper. And it integrates well with get_generic_resource as you show.

What about the case where YAML both defines a CRD and then creates a CR for that definition? In that situation, pre-loading the generics will not be enough. The only thing I can think of for that situation is catching the error in the load_all_yaml and trying to get the CRD from the cluster at that time

gtsystem · 2022-05-21T09:27:04Z

Right, for that we can have a parameter, as you mention, that just creates the generic resource for any CRD encountered while loading the YAML stream.
Something like:

# load_in_cluster_generic_resources should be called first for resources not defined in the file ...

with open('deploy.yaml') as f:
    codecs.load_all_yaml(f, create_missing_resources=True)  # whenever a CRD is encountered in the file, it's created if missing.

generic_resource.load_in_cluster_generic_resources automates creating generic resources for all custom resources in a cluster. This is done by listing all CRDs available from a provided Client, creating a new generic resource for each version of each resource.
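As a rough illustration of "one generic resource per version of each resource", the sketch below expands a CRD into one entry per served version. It works on a plain dict shaped like an `apiextensions.k8s.io/v1` CRD; the helper name `resource_keys` is hypothetical and this is not the code added by the PR.

```python
from typing import Iterator, Tuple


def resource_keys(crd: dict) -> Iterator[Tuple[str, str, str, bool]]:
    """Yield (apiVersion, kind, plural, namespaced) for each version of a CRD."""
    spec = crd["spec"]
    namespaced = spec["scope"] == "Namespaced"
    for version in spec["versions"]:
        yield (
            f"{spec['group']}/{version['name']}",
            spec["names"]["kind"],
            spec["names"]["plural"],
            namespaced,
        )


multi_version_crd = {
    "spec": {
        "group": "stable.example.com",
        "scope": "Namespaced",
        "names": {"kind": "CronTab", "plural": "crontabs"},
        "versions": [{"name": "v1beta1"}, {"name": "v1"}],
    }
}
keys = list(resource_keys(multi_version_crd))
```

Each tuple carries what is needed to decide between a namespaced and a global generic resource and to name it.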

Adds an optional create_resources_for_crds boolean parameter to codecs.load_all_yaml. If False (default), there is no change in behaviour of load_all_yaml. If create_resources_for_crds==True, the load_all_yaml will create generic resources for any CRDs it loads.
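The ordering problem this flag addresses (a YAML stream that defines a CRD and then uses it) can be sketched as follows. Only the parameter name `create_resources_for_crds` comes from the description above; `load_documents` and the `known` set are hypothetical, and the documents are plain dicts rather than parsed YAML.

```python
from typing import Iterable, List, Set, Tuple


def load_documents(docs: Iterable[dict],
                   create_resources_for_crds: bool = False) -> List[Tuple[str, str]]:
    """Walk documents in order, optionally registering kinds defined by CRDs."""
    known: Set[Tuple[str, str]] = {
        ("apiextensions.k8s.io/v1", "CustomResourceDefinition"),
    }
    loaded = []
    for doc in docs:
        key = (doc["apiVersion"], doc["kind"])
        if key not in known:
            raise LookupError(f"unknown resource {key}")
        loaded.append(key)
        if create_resources_for_crds and doc["kind"] == "CustomResourceDefinition":
            # Register every version of the kind this CRD defines, so later
            # documents in the same stream can use it.
            spec = doc["spec"]
            for version in spec["versions"]:
                known.add((f"{spec['group']}/{version['name']}",
                           spec["names"]["kind"]))
    return loaded


stream = [
    {
        "apiVersion": "apiextensions.k8s.io/v1",
        "kind": "CustomResourceDefinition",
        "spec": {
            "group": "stable.example.com",
            "names": {"kind": "CronTab", "plural": "crontabs"},
            "versions": [{"name": "v1"}],
        },
    },
    {"apiVersion": "stable.example.com/v1", "kind": "CronTab"},
]
loaded = load_documents(stream, create_resources_for_crds=True)
```

With the flag left at its default, the second document in this stream would fail to load, which matches the unchanged default behaviour described above.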

ca-scribner · 2022-06-07T19:39:14Z

I've tried to refactor based on your suggestions. What do you think?

gtsystem

Nice work! I added a few comments. Apart from that, we also need to document the new functions in generic-resources.md.

ca-scribner · 2022-06-14T20:56:22Z

All sounds reasonable! I've focused my spare time on the other PR the past few days and will be out of town this week, but will address these suggestions next week. Thanks!

ca-scribner · 2022-06-24T21:17:05Z

Docs not forgotten but ran out of time today. Will update in the next commit

gtsystem · 2022-06-30T15:35:58Z

lightkube/core/internal_models.py

@@ -1,7 +1,7 @@
 import sys

 try:
-    from ..models import meta_v1, autoscaling_v1
+    from ..models import meta_v1, autoscaling_v1, apiextensions_v1


In case of import failure we need to define a mock for each method/class used in apiextensions_v1. See bottom of this same file for examples.

gtsystem · 2022-07-08T07:34:33Z

Good progress!
We have a few things left:

* One comment regarding the mocks for mkdocs
* The documentation for the new public functions exposed in codecs and in generic_resources

I was also thinking that we may need an integration test for load_in_cluster_generic_resources() as it uses the client: create a CRD, wait for it to become available, and call load_in_cluster_generic_resources().

gtsystem · 2022-07-08T07:20:32Z

lightkube/core/internal_models.py

+    try:
+        from ..models import apiextensions_v1 as apiextensions
+    except:
+        from ..models import apiextensions_v1beta1 as apiextensions


In case of import failure we need to define a mock for each method/class used in apiextensions_v1. See bottom of this same file for examples (copied over comment as it was not visible any longer).

Just FYI, you may get a conflict on this file as #37 was merged.

Ok I misunderstood this. I think now I've got it
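For readers unfamiliar with the pattern under discussion, here is a minimal sketch of an import fallback that ends in a stub. It is an assumption-laden illustration, not the contents of internal_models.py: the package name `hypothetical_models` does not exist (so the stub path is taken), and `load_apiextensions` is a made-up helper.

```python
import importlib
from types import SimpleNamespace


def load_apiextensions():
    """Try the newer API module, then the older one, then fall back to a stub."""
    candidates = (
        "hypothetical_models.apiextensions_v1",
        "hypothetical_models.apiextensions_v1beta1",
    )
    for name in candidates:
        try:
            return importlib.import_module(name)
        except ImportError:
            continue

    # Neither module is importable: expose a placeholder for every name the
    # rest of the code refers to, so attribute access still works.
    class CustomResourceDefinition:
        pass

    return SimpleNamespace(CustomResourceDefinition=CustomResourceDefinition)


apiextensions = load_apiextensions()
```

The point of the review comment is the last step: every class or method used from the module needs a placeholder, otherwise the failure just moves from import time to first use.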

Previously, test_load_in_cluster_generic_resources used a CRD for CronTab that was shared with another test. If that CRD was applied and not deleted in a previous test, test_load_in_cluster_generic_resources fails because it asserts the CRD does not exist at the start of the test. This change makes a new (very similar) CRD so we cannot have this collision.

Previously, cleanup of resources in test_load_in_cluster_generic_resources_* did not wait for their resources to be cleaned up before completion, sometimes resulting in test_load_in_cluster_generic_resources_async starting too fast and failing because it sees a leftover resource from the previous test. This adds a small persistence to the tests to avoid short, transient deletion delays

ca-scribner · 2022-07-11T20:33:42Z

Added e2e tests for both regular and async clients and glad you asked because both had bugs :D I was using models when I should have been using resources for the CRDs, but I think that is good now. I also noticed that the AsyncClient.list() method is not async - is that on purpose or a bug? If a bug and it should be async, async_load_in_cluster_generic_resources will need a minor tweak

Ran into a few problems in the tests where I hit race conditions between the client.delete(crd) calls of one test and the start of another test, but I think I have those resolved now by just making randomly named CRDs for the tests.
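The two mitigations mentioned here (random CRD names and waiting for deletion to finish) might look roughly like the sketch below. The helper names `random_crd_names` and `wait_until_gone` are hypothetical; the PR's own test helper is called `delete_and_wait`, but its implementation is not shown in this conversation.

```python
import time
import uuid
from typing import Callable, Dict


def random_crd_names(group: str = "example.com") -> Dict[str, str]:
    """Build a unique kind/plural/name triple so tests cannot collide."""
    suffix = uuid.uuid4().hex[:8]
    plural = f"testresources{suffix}"
    return {
        "kind": f"TestResource{suffix}",
        "plural": plural,
        "name": f"{plural}.{group}",  # CRD names take the form <plural>.<group>
    }


def wait_until_gone(exists: Callable[[], bool],
                    timeout: float = 5.0, interval: float = 0.05) -> None:
    """Poll until exists() returns False, or raise TimeoutError."""
    deadline = time.monotonic() + timeout
    while exists():
        if time.monotonic() >= deadline:
            raise TimeoutError("resource still present after timeout")
        time.sleep(interval)


# Simulate a resource that is still visible for two polls after deletion.
remaining = iter([True, True, False])
wait_until_gone(lambda: next(remaining), timeout=1.0, interval=0.01)
```

Random names avoid collisions between tests outright, while polling only narrows the window, which is presumably why the final fix relies on the former.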

ca-scribner · 2022-07-11T21:19:49Z

Docs are added now as well. Thanks for all the great feedback!

gtsystem · 2022-07-12T15:58:10Z

Changes merged, great work!

AsyncClient.list() returns an async generator, which only starts executing once it is iterated. So iteration requires await, but the call to list() itself doesn't.
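The same behaviour can be reproduced with plain asyncio. `FakeAsyncClient` below is a hypothetical stand-in, not lightkube's AsyncClient; it only mirrors the shape of a regular method that returns an async generator.

```python
import asyncio
from typing import AsyncIterator, List


class FakeAsyncClient:
    def list(self, kind: str) -> AsyncIterator[str]:
        """Plain (non-async) method that returns an async generator."""

        async def _generate() -> AsyncIterator[str]:
            for i in range(3):
                await asyncio.sleep(0)  # stands in for a network round trip
                yield f"{kind}-{i}"

        return _generate()


async def collect(client: FakeAsyncClient) -> List[str]:
    items = client.list("crd")  # no await here: nothing has executed yet
    return [item async for item in items]  # the awaiting happens during iteration


names = asyncio.run(collect(FakeAsyncClient()))
```

Writing `await client.list(...)` in this sketch would raise a TypeError, because an async generator object is not awaitable.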

ca-scribner · 2022-07-12T17:05:22Z

ty for the helpful reviews! and makes sense for AsyncClient.list(), thanks!

ca-scribner added 2 commits April 8, 2022 13:59

ca-scribner added 8 commits June 6, 2022 16:41

Revert "fix: _load_model improperly handles some k8s objects (like ap…

9420108

…ps/v1/deployment)" This reverts commit e5841cc.

Revert "feat: add implicit generic model creation to from_dict and lo…

4446c40

…ad_all_yaml" This reverts commit 2e7e562.

fix: incorrect ValueError message for _create_resources_from_crd

b2f05bf

make generic_resource.create_resources_from_crd() public

504d4a8

Cherry pick fix for load_all_yaml that handles empty yaml documents

ee3eb3a

Merges in code from commit 0328fdf

Merge branch 'master' into add-implicit-model-generation

f06297e

ca-scribner marked this pull request as ready for review June 7, 2022 19:38

fix: add missing test data example-multi-version-crd.yaml

c91eb33

gtsystem reviewed Jun 10, 2022

lightkube/generic_resource.py

Address review feedback:

70c9252

* Improve docstrings
* Remove some dependencies on apiextensions_v1
* expose new public generic_resource functions
* add async_load_in_cluster_generic_resources and test
* remove direct load of lightkube.models

gtsystem reviewed Jun 30, 2022

ca-scribner added 4 commits July 7, 2022 16:54

Handle that k8s 1.15 does not have apiextensions_v1

4c544ae

Add asyncmock dependency to support tests in python < 3.8

724fadb

Add asyncmock dependency to python-package.yml tests

174c149

fix: async generic_resources tests

e8306a9

The asyncmock package (needed for python <3.8) works differently from the builtin in 3.8. Commented out part of the test to handle that.

ca-scribner commented Jul 7, 2022

tests/test_generic_resource.py

gtsystem reviewed Jul 8, 2022

Merge branch 'master' into add-implicit-model-generation

9703dc6

# Conflicts: # lightkube/core/internal_models.py

ca-scribner added 8 commits July 11, 2022 12:05

fix: definition of mock method/class for apiextensions

f56a11d

Addresses review feedback about missing mocks.

feat: add e2e tests for load_in_cluster_generic_resources

ebe8414

Also fixes bugs found by the e2e tests

fix: test_async_load_in_cluster_generic_resources

953a68f

Correctly handles mocking for the async for loops and non-async AsyncClient.list() method

fix: debugging test helper delete_and_wait

f09c320

fix: use randomly named CRDs in test_load_in_cluster_generic_resource…

61bdeed

…s_async Avoids resource collision by randomly naming the CRDs for testing.

fix: use randomly named CRDs in test_load_in_cluster_generic_resource…

45156a4

…s_async Avoids resource collision by randomly naming the CRDs for testing.

docs: adds documentation for new generic resources and codecs

30fe8ad

gtsystem approved these changes Jul 12, 2022

gtsystem merged commit 9318652 into gtsystem:master Jul 12, 2022
