Calculate CEL cost totals #108612

DangerOnTheRanger · 2022-03-09T18:11:25Z

What type of PR is this?

/kind feature

What this PR does / why we need it:

This PR is a part of #107573, and adds support at the CRD level for CEL expression cost calculation as per the KEP, and emits an error message if the CRD CEL cost limit is exceeded. This PR builds off of #108419 by taking into account maxLength and associated fields when calculating the total maximum cost for a CRD's CEL expressions.

Which issue(s) this PR fixes:

Special notes for your reviewer:

Does this PR introduce a user-facing change?

NONE

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

fedebongio · 2022-03-10T17:39:41Z

/triage accepted

jpbetz · 2022-03-12T01:43:12Z

@DangerOnTheRanger Just to make sure we're on the same page, I'm expecting that:

per-expression estimated cost is: *

per-CRD estimated cost: sum(per-expression estimated costs)

I wanted to point this out because this PR is titled "per-CRD" but includes logic to calculate number of times an expression can be evaluated.
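
A rough sketch of the breakdown being discussed, assuming the per-expression estimate is the cost of one evaluation multiplied by the maximum number of times the expression can be evaluated. The `expressionCost` type, its fields, and `totalCost` are hypothetical names for illustration and are not taken from the PR; saturating arithmetic is an added assumption so that very large estimates cannot wrap around.

```go
package main

import (
	"fmt"
	"math"
)

// expressionCost is a hypothetical record for one CEL rule: the estimated
// cost of a single evaluation and the maximum number of evaluations.
type expressionCost struct {
	perCall        uint64
	maxEvaluations uint64
}

// mulSaturating multiplies two values and clamps to MaxUint64 on overflow.
func mulSaturating(a, b uint64) uint64 {
	if a != 0 && b > math.MaxUint64/a {
		return math.MaxUint64
	}
	return a * b
}

// addSaturating adds two values and clamps to MaxUint64 on overflow.
func addSaturating(a, b uint64) uint64 {
	if a > math.MaxUint64-b {
		return math.MaxUint64
	}
	return a + b
}

// totalCost sums the per-expression estimates to get a per-CRD estimate.
func totalCost(exprs []expressionCost) uint64 {
	var total uint64
	for _, e := range exprs {
		total = addSaturating(total, mulSaturating(e.perCall, e.maxEvaluations))
	}
	return total
}

func main() {
	exprs := []expressionCost{
		{perCall: 10, maxEvaluations: 100},
		{perCall: 5, maxEvaluations: 1},
	}
	fmt.Println(totalCost(exprs)) // prints 1005
}
```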

DangerOnTheRanger · 2022-03-14T16:11:35Z

> @DangerOnTheRanger Just to make sure we're on the same page, I'm expecting that:
>
> per-expression estimated cost is: *
>
> per-CRD estimated cost: sum(per-expression estimated costs)
>
> I wanted to point this out because this PR is titled "per-CRD" but includes logic to calculate number of times an expression can be evaluated.

Yeah, I think there was unfortunately some ambiguity there. I've renamed the PR so the changes make a bit more sense/have a bit more context, hopefully.

cici37 · 2022-03-17T23:31:28Z

Hi @DangerOnTheRanger, when you rebase, please remove TODO in compilation_test and update the cost limit with const PerCallLimit. Thanks

jpbetz · 2022-03-21T15:11:03Z

staging/src/k8s.io/apiextensions-apiserver/pkg/apis/apiextensions/validation/validation.go

+	}
+}
+
+func getCRDCost(baseCost uint64, schemaNode *schemaTree) uint64 {


Maybe use a name different than "CRD cost"? This computes the cost of a single CEL expression, not the cost of all the CEL expressions in a CRD.

DangerOnTheRanger · 2022-03-21

Yes, I think it could be more descriptive. I've renamed it to getExpressionCost. How does that sound?

DangerOnTheRanger · 2022-03-25T19:43:27Z

/retest

liggitt · 2022-03-26T04:17:48Z

staging/src/k8s.io/apiextensions-apiserver/pkg/apis/apiextensions/validation/validation.go

@@ -981,6 +1026,61 @@ func ValidateCustomResourceDefinitionOpenAPISchema(schema *apiextensions.JSONSch
 	return allErrs
 }

+func extractMaxElements(schema *apiextensions.JSONSchemaProps) *uint64 {


what does a nil return mean?

jpbetz · 2022-03-27

godoc:

extractMaxElements returns the factor by which the schema increases the number of possible data elements for its children. If the schema is a map with MaxProperties or an array with MaxItems, the int pointer of the max value is returned. If the schema is a map or array and does not have MaxProperties or MaxItems, nil is returned to indicate that the current schema imposes no limit on the possible number of data elements. If the schema is an object, 1 is returned to indicate that there is no increase to the number of possible data elements for its children. Primitives do not have children, but 1 is returned for simplicity in this case.
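
A minimal sketch of the behavior this godoc describes, not the code from the PR. `schemaProps` is a hypothetical, simplified stand-in for `apiextensions.JSONSchemaProps`, and the `IsMap` flag is an assumption used here to tell maps apart from plain objects. Negative limits are clamped to 0, following the title of the follow-up change to this branch.

```go
package main

import "fmt"

// schemaProps is a hypothetical, simplified stand-in for JSONSchemaProps.
type schemaProps struct {
	Type          string // "array", "object", "string", ...
	IsMap         bool   // assumption: true when an object is used as a map
	MaxItems      *int64
	MaxProperties *int64
}

// clampLimit converts an optional limit to *uint64, treating negative
// values as 0 and passing nil (no limit) through unchanged.
func clampLimit(v *int64) *uint64 {
	if v == nil {
		return nil
	}
	var u uint64
	if *v > 0 {
		u = uint64(*v)
	}
	return &u
}

// maxElements returns the factor by which s multiplies the number of
// possible data elements of its children; nil means "unbounded".
func maxElements(s *schemaProps) *uint64 {
	switch {
	case s.Type == "array":
		return clampLimit(s.MaxItems)
	case s.Type == "object" && s.IsMap:
		return clampLimit(s.MaxProperties)
	default:
		// Plain objects and primitives do not multiply their children.
		one := uint64(1)
		return &one
	}
}

func main() {
	five := int64(5)
	if n := maxElements(&schemaProps{Type: "array", MaxItems: &five}); n != nil {
		fmt.Println(*n) // prints 5
	}
	// An array without MaxItems has no bound, so the result is nil.
	fmt.Println(maxElements(&schemaProps{Type: "array"}) == nil) // prints true
}
```

Returning a pointer keeps "no limit" distinct from a limit of 0, which is the distinction the question about a nil return is asking about.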

liggitt · 2022-03-26T05:31:02Z

staging/src/k8s.io/apiextensions-apiserver/third_party/forked/celopenapi/model/schemas.go

+// Note that this only assumes a single comma between data elements, so if the schema is contained under only maps,
+// this estimates a higher cardinality that would be possible.
+func MaxCardinality(s *schema.Structural) uint64 {
+	sz := estimateMinSizeJSON(s) + 1 // assume at least one comma between elements


is this being called in ways that will make us repeatedly recursively evaluate the size of a schema?

if I have a schema 40 nesting levels deep, and have a cel rule at each level, does this call:

estimateMinSizeJSON(root) (traversing all child schemas to compute the min size of the root)

estimateMinSizeJSON(level 1) (re-traversing all child schemas to compute the min size of level 1)

...
?

beyond the scope of this PR, but a similar question exists for other callers of estimateMinSizeJSON via SchemaDeclType / estimateMaxArrayItemsPerRequest / estimateMaxAdditionalPropertiesPerRequest

we want to make sure a deep schema with cel rules at the root and other levels isn't going to be super expensive to compute cost on

jpbetz · 2022-03-26

This general problem is fairly pervasive in the CRD validation side of things. While I was looking at how we could do more work as a post-traversal step to make the min calculations cheap, I noticed that we construct a new structural schema whenever we compile CEL programs, which is another case of us doing a recursive traversal at every level (in the worst case). We also convert those schemas to the "decl" format that CEL accepts in compile, which is another recursive traversal.

How would you feel about a beta task where we construct a benchmark that reproduces this problem well and then optimize it away? A traversal that starts at the first branch in the tree where a CEL rule is encountered, accumulates some base facts (like min sizes), and prepares the structural and "decls" schemas should allow for a lot of reuse. But it's a larger change.

liggitt · 2022-03-27

sounds ok for beta, as long as the calculations that do recursive traversals in this PR and #108990 are behind the cel validation feature gate
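
To illustrate the concern in this thread, here is a hedged sketch of caching per-subtree minimum sizes so that a deep schema is traversed once rather than once per level. The `node` type, `minSizes`, and `chain` are hypothetical and are not the `estimateMinSizeJSON` implementation; the visit counter exists only to make the saving observable.

```go
package main

import "fmt"

// node is a hypothetical schema node; selfSize stands in for the minimum
// serialized size contributed by the node itself.
type node struct {
	selfSize int64
	children []*node
}

// minSizes returns the minimum size of the subtree rooted at n, storing
// each result in cache so that no subtree is traversed more than once.
func minSizes(n *node, cache map[*node]int64, visits *int) int64 {
	if v, ok := cache[n]; ok {
		return v
	}
	*visits++
	total := n.selfSize
	for _, c := range n.children {
		total += minSizes(c, cache, visits)
	}
	cache[n] = total
	return total
}

// chain builds a schema nested depth levels deep and returns every level,
// root first.
func chain(depth int) []*node {
	nodes := make([]*node, depth)
	for i := depth - 1; i >= 0; i-- {
		nodes[i] = &node{selfSize: 2}
		if i+1 < depth {
			nodes[i].children = []*node{nodes[i+1]}
		}
	}
	return nodes
}

func main() {
	levels := chain(40)
	cache := map[*node]int64{}
	visits := 0
	// Simulate a CEL rule at every level asking for its subtree's min size.
	for _, n := range levels {
		minSizes(n, cache, &visits)
	}
	fmt.Println(visits, cache[levels[0]]) // prints 40 80
}
```

Without the cache, the same 40 queries would visit 40 + 39 + ... + 1 = 820 nodes, which is the quadratic growth the question about a 40-level schema points at.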

jpbetz · 2022-03-27T03:58:32Z

Feedback applied on new commits.

liggitt · 2022-03-27T18:23:11Z

needs squash, and has an unused import compilation error:

staging/src/k8s.io/apiextensions-apiserver/pkg/apiserver/schema/cel/compilation_test.go:21:2: "math" imported but not used (typecheck)
	"math"
	^

liggitt · 2022-03-27T18:24:14Z

lgtm otherwise

dims · 2022-03-28T12:28:24Z

@DangerOnTheRanger please visit the red CI jobs!

liggitt · 2022-03-28T17:42:01Z

/approve
/retest

@jpbetz has final lgtm

k8s-ci-robot · 2022-03-28T17:43:42Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: DangerOnTheRanger, liggitt

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~staging/src/k8s.io/apiextensions-apiserver/OWNERS~~ [liggitt]
~~staging/src/k8s.io/apiextensions-apiserver/pkg/apis/OWNERS~~ [liggitt]
~~test/integration/apiserver/OWNERS~~ [liggitt]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

jpbetz · 2022-03-28T17:58:52Z

/lgtm

jpbetz · 2022-03-28T18:37:33Z

/hold cancel

k8s-triage-robot · 2022-03-28T19:33:01Z

This PR may require API review.

If so, when the changes are ready, complete the pre-review checklist and request an API review.

Status of requested reviews is tracked in the API Review project.

k8s-ci-robot requested review from deads2k and mikedanese March 9, 2022 18:12

jpbetz reviewed Mar 9, 2022

staging/src/k8s.io/apiextensions-apiserver/pkg/apis/apiextensions/validation/validation.go (outdated)

jpbetz reviewed Mar 9, 2022

staging/src/k8s.io/apiextensions-apiserver/pkg/apis/apiextensions/validation/validation.go (outdated)

jpbetz reviewed Mar 9, 2022

staging/src/k8s.io/apiextensions-apiserver/pkg/apis/apiextensions/validation/validation.go (outdated)

k8s-ci-robot added triage/accepted and removed needs-triage labels Mar 10, 2022

DangerOnTheRanger changed the title ~~[WIP] Calculate per-CRD CEL cost~~ [WIP] Calculate CEL cost totals Mar 14, 2022

k8s-ci-robot added the needs-rebase label Mar 14, 2022

DangerOnTheRanger mentioned this pull request Mar 15, 2022

CEL MaxLength integration #108419

Merged

DangerOnTheRanger force-pushed the cel-crd-maxlength branch from ac86a88 to f5b6d34 Compare March 18, 2022 13:19

k8s-ci-robot added size/L and removed size/XL, needs-rebase labels Mar 18, 2022

jpbetz reviewed Mar 21, 2022

DangerOnTheRanger force-pushed the cel-crd-maxlength branch 2 times, most recently from 37762dc to dbc87e3 Compare March 25, 2022 18:32

cici37 reviewed Mar 25, 2022

staging/src/k8s.io/apiextensions-apiserver/pkg/apis/apiextensions/validation/validation.go (outdated)

liggitt reviewed Mar 26, 2022

liggitt added this to the v1.24 milestone Mar 26, 2022

jpbetz mentioned this pull request Mar 27, 2022

Add godoc to cardinality, treat negative MaxItems/Properties/Lengths as 0 DangerOnTheRanger/kubernetes#2

Merged

k8s-ci-robot added size/XL and removed size/L labels Mar 27, 2022

DangerOnTheRanger force-pushed the cel-crd-maxlength branch from d47025b to 78b4326 Compare March 27, 2022 04:51

Add per-CRD cost evaluation.

7e66bd2

DangerOnTheRanger force-pushed the cel-crd-maxlength branch from 78b4326 to 7e66bd2 Compare March 28, 2022 16:19

k8s-ci-robot added release-note-none and removed do-not-merge/release-note-label-needed labels Mar 28, 2022

k8s-ci-robot added the approved label Mar 28, 2022

k8s-ci-robot added the lgtm label Mar 28, 2022

k8s-ci-robot removed the do-not-merge/hold label Mar 28, 2022

k8s-ci-robot merged commit e413507 into kubernetes:master Mar 28, 2022

This was referenced Jul 26, 2022

Promote feature CustomResourceValidationExpressions to beta #111158

Closed

Promote feature CustomResourceValidationExpressions to beta #111524

Merged
