Improve performance of GET requests #271

moio · 2024-01-22T12:10:21Z

👋 hamilton maintainers!

I am opening this to ask whether it would make sense to include some new code, or patch existing code, in order to improve resource consumption of the Get method - particularly memory usage but also CPU.

I came to this problem as part of my day work at SUSE for Rancher, which uses hamilton to implement Entra ID (formerly Azure AD) integration. In some high-scale use tests we have users with principals in the thousands, groups in the millions and group assignments per principal in the hundreds - and enough traffic to need multiple (tens of) goroutines to periodically refresh all these objects in parallel. In extreme cases we have seen hundreds of MBs to GiBs worth of heap objects, which tend to cause out-of-memory situations.

Here is an example of a similar flame graph:

Of course our usage pattern isn't typical, and we are working at ways to change the way we interact with the library. Nevertheless according to my analysis Get (and code called by it) does quite a bit of needless computation:

OData json.Unmarshal calls are amplified by a factor of 2 due to the error handling in OData.UnmarshalJSON
FromResponse calls, which drive OData.UnmarshalJSON calls, are amplified by a factor of 2 by performRequest
Get needlessly re-unmarshal responses once after FasterResponse (again with the amplification factors above), and especially
Get un-marshals and subsequently re-marshals responses if they are paginated. This unmarshaling-marshaling extra round is repeated by the number of pages

This PR cuts down unmarshaling to one pass only (divided in sub-passes via json.RawMessage) and eliminates remarshaling altogether.

I could not find ways to achieve this without altering the OData struct and structs/methods which use it (ConsistencyFailureFunc, ValidStatusFunc, GetHttpRequestInput), therefore, I opted for a completely new code path in fasterclient.go. The approach can be changed if API changes are acceptable.

For now and for the sake of discussion, I only patched the one endpoint that really hurts my use case: ListGroupMemberships. Changes are minimal other than using the FasterGet method in place of Get - ideally this could be extended to all other (internal) usages of Get.

I am also contributing code to exercise ListGroupMemberships more thoroughly so that the effects of the patch are evident. Furthermore I added a utility method that will output CPU and memory usage during the test run.

My results are as follows:

	main	faster_get
Run 1	610 ms	273 ms
Run 2	817 ms	122 ms
Run 3	643 ms	353 ms
Average	690 ms	249 ms

CPU load is ~36.14% of the original

	main	faster_get
Run 1	21 MiB	3 MiB
Run 2	22 MiB	3 MiB
Run 3	23 MiB	3 MiB
Average	22 MiB	3 MiB

Memory consumption is ~13.64% of the original

Is there any interest to merge such an improvement? What would be needed to have a similar PR accepted?

Thanks in advance, I hope this helps

Signed-off-by: Silvio Moioli <silvio@moioli.net>

Cascades to GetHttpRequestInput and related functions Signed-off-by: Silvio Moioli <silvio@moioli.net>

Use json.RawMessage to unmarshal OData values separately from the rest of the OData structure, and directly into the a desired result slice. Signed-off-by: Silvio Moioli <silvio@moioli.net>

Unmarshal errors separately from the rest so that unmarshaling with two different types can be avoided. Signed-off-by: Silvio Moioli <silvio@moioli.net>

Signed-off-by: Silvio Moioli <silvio@moioli.net>

Replace the hamilton library with a 0.46 version with this PR merged: manicminer/hamilton#271 Signed-off-by: Silvio Moioli <silvio@moioli.net>

Signed-off-by: Silvio Moioli <silvio@moioli.net>

… re-run if response middlewares are set Signed-off-by: Silvio Moioli <silvio@moioli.net>

moio · 2024-02-23T09:38:35Z

@manicminer do you have any impressions to share about this work?

includes manicminer/hamilton#271 Signed-off-by: Silvio Moioli <silvio@moioli.net>

moio added 10 commits January 22, 2024 12:00

Add utility method to output metrics in test results

495757a

Signed-off-by: Silvio Moioli <silvio@moioli.net>

users_test: make number of groups configurable, boost it

5dd90bb

Signed-off-by: Silvio Moioli <silvio@moioli.net>

users_test: add metrics to ListGroupMemberships

610c9a4

Signed-off-by: Silvio Moioli <silvio@moioli.net>

fasterclient: create an exact copy of the Get method

d89d373

Signed-off-by: Silvio Moioli <silvio@moioli.net>

fasterclient: copy performRequest as well

231aa2a

Signed-off-by: Silvio Moioli <silvio@moioli.net>

fasterclient: copy FromResponse as well

eee4616

Signed-off-by: Silvio Moioli <silvio@moioli.net>

fasterclient: copy OData as well

767c0e7

Cascades to GetHttpRequestInput and related functions Signed-off-by: Silvio Moioli <silvio@moioli.net>

fasterclient: use two-step JSON unmarshaling

3f5a297

Use json.RawMessage to unmarshal OData values separately from the rest of the OData structure, and directly into the a desired result slice. Signed-off-by: Silvio Moioli <silvio@moioli.net>

fasterclient: avoid double unmarshaling in OData.UnmarshalJSON

d76f26c

Unmarshal errors separately from the rest so that unmarshaling with two different types can be avoided. Signed-off-by: Silvio Moioli <silvio@moioli.net>

fasterclient: avoid duplicated call to FasterFromResponse

519ba30

Signed-off-by: Silvio Moioli <silvio@moioli.net>

fasterclient: bugfix: guard against nil case

0e3b1a2

Signed-off-by: Silvio Moioli <silvio@moioli.net>

moio force-pushed the faster_get branch from 7a66d8c to 0e3b1a2 Compare January 25, 2024 13:17

moio added 2 commits January 26, 2024 08:27

fasterclient: bugfix: guard against nil case

d807b65

Signed-off-by: Silvio Moioli <silvio@moioli.net>

fasterclient: bugfix: run FromResponse once if DisableRetries is set,…

3ca9a86

… re-run if response middlewares are set Signed-off-by: Silvio Moioli <silvio@moioli.net>

moio added a commit to rancher/rancher that referenced this pull request Mar 1, 2024

hamilton: use patched version

2b16ee6

includes manicminer/hamilton#271 Signed-off-by: Silvio Moioli <silvio@moioli.net>

This was referenced May 21, 2024

[SURE-7372] [2.8] Reduce resource usage from Azure AD provider rancher/rancher#45543

Open

[2.8] Use patched hamilton library rancher/rancher#45544

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of GET requests #271

Improve performance of GET requests #271

moio commented Jan 22, 2024 •

edited

Loading

moio commented Feb 23, 2024

Improve performance of GET requests #271

Are you sure you want to change the base?

Improve performance of GET requests #271

Conversation

moio commented Jan 22, 2024 • edited Loading

moio commented Feb 23, 2024

moio commented Jan 22, 2024 •

edited

Loading