
cmd/cue: serious performance regression #2243

Open
nxcc opened this issue Feb 3, 2023 · 4 comments
Labels: NeedsInvestigation, v0.5 hangover (Serious bugs that did not get fixed in v0.5.0)

Comments

nxcc commented Feb 3, 2023

What version of CUE are you using (cue version)?

cue version v0.5.0-beta.5

go version go1.19.3
       -compiler gc
       -trimpath true
     CGO_ENABLED 0
          GOARCH amd64
            GOOS linux
         GOAMD64 v1

Does this issue reproduce with the latest release?

yes

What did you do?

cue eval ./testcase (see testcase.zip)

What did you expect to see?

the expected output after a fraction of a second (<0.2s), like with cue v0.5.0-beta.2

What did you see instead?

the expected output after more than 12 seconds

nxcc added the NeedsInvestigation and Triage labels Feb 3, 2023
nxcc changed the title from "massive performance regression using the api" to "massive performance regression" Feb 3, 2023
nxcc changed the title from "massive performance regression" to "cmd/cue: massive performance regression" Feb 3, 2023
nxcc changed the title from "cmd/cue: massive performance regression" to "cmd/cue: serious performance regression" Feb 3, 2023
mvdan (Member) commented Feb 3, 2023

Thanks for reporting. v0.5.0-beta.2 is roughly as fast as v0.4.3, so it must have been a recent change that was included in v0.5.0-beta.5.

I bisected between the two, which pointed me at https://review.gerrithub.io/c/cue-lang/cue/+/549247. Not entirely surprising:

This change more aggressively early evaluates conjuncts to prevent them from being missed if a node is in finalization mode. This fix most likely will not cover all cases, but hopefully enough to hold over until v0.6.

I imagine the more aggressive early evaluation might be causing worse performance in your case.
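(For context, and only roughly speaking: a conjunct is one operand of a unification, and a node is finalized once all of its conjuncts have been processed. A purely hypothetical illustration of a node receiving multiple conjuncts, one from a struct literal and one from a comprehension; this is not a reproducer for this issue:)

a: {x: 1}
a: {
	if true {
		y: 2
	}
}
// a evaluates to {x: 1, y: 2}; the struct literal and the comprehension
// each contribute a conjunct that must be unified before a is finalized.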

mvdan removed the Triage label Feb 3, 2023
mvdan added this to the v0.5.0 comprehension rework milestone Feb 3, 2023
mvdan (Member) commented Feb 3, 2023

Reduced it down by quite a bit:

import (
	"strings"
	"crypto/sha512"
	"crypto/sha256"
	"crypto/md5"
	"encoding/base64"
	"encoding/hex"
	"regexp"
	"list"
)

secret: this={
	(#metadataName & {data: this.data}).out
}

#metadataName: {
	DATA=data: {[string]: string | bytes, ...}
	out: metadata: name: "-" + (#shortHash & {data: DATA}).out
}

#shortHash: {
	data: {[string]: string | bytes, ...}

	_hashLen:       6
	_sortedKeys:    list.SortStrings([ for k, _ in data {k}])
	_joinedKV:      strings.Join([ for _, k in _sortedKeys {data[k]}], "\n")
	_joinedKVBytes: '\(_joinedKV)'
	_md5:           md5.Sum(_joinedKVBytes)
	_sha256:        sha256.Sum256(_joinedKVBytes)
	_sha512:        sha512.Sum512(_joinedKVBytes)
	_base64:        base64.Encode(null, _sha512+_sha256+_md5)
	_base36:        regexp.ReplaceAll("[^a-z0-9]", _base64, "")
	_fragments:     strings.SplitN( _base36, "", _hashLen+1)
	_truncated:     strings.Join(_fragments[0:_hashLen], "")

	out: _truncated
}

secret: data: {
	d1: _
	d2: _
	d3: _
	d4: _
	d5: _
}

With this shorter config, v0.5.0-beta.5 takes over a second, and v0.4.3 barely takes ten milliseconds.

$ cue version
cue version v0.0.0-20230202180031-576d0e461a99

go version devel go1.21-88a36c9e9a Thu Feb 2 20:23:27 2023 +0000
      -buildmode exe
       -compiler gc
     CGO_ENABLED 1
          GOARCH amd64
            GOOS linux
         GOAMD64 v3
             vcs git
    vcs.revision 576d0e461a990ddcdab9da7a10375a1c6d87a865
        vcs.time 2023-02-02T18:00:31Z
    vcs.modified false
$ time cue eval new.cue >stdout

real	0m1.281s
user	0m1.606s
sys	0m0.040s
$ time cue-v0.4.3 eval new.cue >stdout-stable

real	0m0.018s
user	0m0.018s
sys	0m0.009s
$ diff -u stdout-stable stdout
$

mvdan (Member) commented Feb 7, 2023

I believe @tmm1 is running into the same performance regression; cue export test.cue in https://github.com/tmm1/taxes.cue takes about five seconds on v0.4.3, but appears to spin CPU for a very long time on v0.5.0-beta.5. I ran it for multiple minutes and gave up; it uses one full CPU while it runs.

Worth noting that beta.2 fails, partly because of #2246, and it's unclear whether the other errors are caused by #2246 as well. But at least the export finishes in about ten seconds there, rather than spinning seemingly indefinitely.

Small reproducer:

import "list"

out: #qualifiedDividendsAndCapitalGainTax | *"incomplete"

#qualifiedDividendsAndCapitalGainTax: {
	_in: {
		_f1040: _
		_filingStatus: _f1040.filingStatus
		_form1040: {
			l3a: _f1040.qualifiedDividends
			l15: _f1040.taxableIncome
			l7:  _f1040.capitalGainOrLoss
		}
	}
	out: _sheet.l17
	_sheet: {
		l1:  _in._form1040.l15
		l2:  _in._form1040.l3a
		l3:  _in._form1040.l7
		l4:  l2 + l3
		l5:  list.Max([0, l1 - l4])
		l6:  40_400
		l7:  list.Min([l1, l6])
		l8:  list.Min([l5, l7])
		l9:  l7 - l8
		l10: list.Min([l1, l4])
		l11: l9
		l12: l10 - l11
		l13: 445_850
		l14: list.Min([l1, l13])
		l15: l5 + l9
		l16: list.Max([0, l14 - l15])
		l17: list.Min([l12, l16])
	}
}

That smaller reproducer takes 0.08s on v0.4.3, 0.25s on v0.5.0-beta.1, 0.30s on v0.5.0-beta.2, and 4.63s on v0.5.0-beta.5. CUE master (02e19c8) also takes nearly five seconds. I believe it is the same performance regression as the one reported here, because both regressed between beta.2 and beta.5, and both involve multiple levels of fields which do computation on top of each other, e.g. strings.Join or list.Min.
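Both reproducers share roughly this shape: a definition whose hidden fields each compute on top of the previous one via builtins, instantiated by unification. A minimal, hypothetical sketch of that shape (field names made up; not verified to trigger the slowdown on its own):

import "list"

#chain: {
	input: [...number]
	_a:  list.Max(input) // each hidden field computes on top of the previous one
	_b:  list.Min([_a, 100])
	_c:  _a + _b
	out: _c
}

result: (#chain & {input: [1, 2, 3]}).out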

I bisected between beta.2 and beta.5, and the wall time jumps from 0.3s to 4.5s with https://review.gerrithub.io/c/cue-lang/cue/+/549247, which further convinces me that the two users ran into the same performance regression: both reproducers bisect to the same CUE change.

myitcv (Member) commented Jun 14, 2023

Marking this as v0.7.0. If we were to address this by backing out the change that caused the regression, we would likely need an additional fix as part of v0.6.0. As things stand, we don't want to delay v0.6.0 if we can avoid it, and we prefer to address this along with other performance issues in v0.7.0. That has the added benefit that the fix will likely be considerably easier, thanks to refactors of the evaluation engine planned for v0.7.0 (which are not part of v0.6.0, where the cost of making such changes is higher).
