refactor(utils): improve performance of `copyEmptyArrayProps` function by qmateub · Pull Request #1816 · commercetools/nodejs

qmateub · 2022-11-09T14:40:39Z

Summary

Slightly refactors the implementation of `copyEmptyArrayProps` to improve performance, especially for arrays with a very large number of items.

Description

Context: We (pangolins) make heavy use of this library in Audit Log. We observed some customer comparisons taking a very long time, and debugging led us to the sync-actions library as the cause of the delay.

After some investigation, we found that the library did not perform well when customers with a huge number of addresses were compared between versions. In our case it was a customer with almost 13k addresses in both the old and the new version.

In this PR we propose a solution that changes the implementation to remove the spread operators inside a reduce in favour of attribute assignments, and to use a map instead of array methods, which usually lead to the O(n^2) problem. More info
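To illustrate the cost described above, here is a minimal sketch contrasting the two reducer styles. It is not the library's code: `copyWithSpread` and `copyWithAssignment` are hypothetical names, and the input data is made up. Spreading the accumulator copies every key collected so far on each iteration, so building n keys costs on the order of n^2 property copies, whereas assignment does a constant amount of work per key.

```javascript
// Hypothetical helpers for illustration only; not part of sync-actions.

// Spreads the accumulator on each step: every iteration copies all keys
// collected so far, so the total work grows quadratically with the input.
function copyWithSpread(entries) {
  return entries.reduce((acc, [key, value]) => ({ ...acc, [key]: value }), {})
}

// Assigns onto the accumulator and returns it: constant work per entry.
function copyWithAssignment(entries) {
  return entries.reduce((acc, [key, value]) => {
    acc[key] = value
    return acc
  }, {})
}

// Made-up input: 2000 key/value pairs.
const entries = Array.from({ length: 2000 }, (_, i) => [`key${i}`, i])
const a = copyWithSpread(entries)
const b = copyWithAssignment(entries)
console.log(Object.keys(a).length === Object.keys(b).length)
```

Both functions produce the same object; only the amount of copying differs.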

Todo

changeset-bot · 2022-11-09T14:40:42Z

🦋 Changeset detected

Latest commit: 85daa2e

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package

Name	Type
@commercetools/sync-actions	Minor

qmateub · 2022-11-09T14:41:41Z

-        const merged = {
-          ...newObj,
-          ...nextObject,
-        }


This combined object was not needed, as it is exactly what the accumulator in the reduce already provides.

qmateub · 2022-11-09T14:42:54Z

+          const hashMapValue = value.reduce((acc, val) => {
+            acc[val.id] = val
+            return acc
+          }, {})


This is a small trick to avoid using find when dealing with arrays of arrays (of arrays). The difference between the two approaches was remarkable when debugging this locally:

With the hashmap: less than 100 ms
With .find: around 6 seconds
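A minimal sketch of the two lookup strategies, assuming every array element carries a unique `id`. The data and the function names (`matchWithFind`, `matchWithIndex`) are made up for illustration and do not mirror the library's internals.

```javascript
// Illustrative data: two versions of a list whose items carry a unique `id`.
const oldItems = Array.from({ length: 5000 }, (_, i) => ({ id: `a${i}`, tags: [] }))
const newItems = oldItems.map((item) => ({ id: item.id })).reverse()

// Nested scan: one `.find` per item, so roughly n * n comparisons overall.
function matchWithFind(items, candidates) {
  return items.map((item) => candidates.find((c) => c.id === item.id))
}

// Index first, then look up: one pass to build the index, one pass to match.
function matchWithIndex(items, candidates) {
  const byId = candidates.reduce((acc, c) => {
    acc[c.id] = c
    return acc
  }, {})
  return items.map((item) => byId[item.id])
}

const viaFind = matchWithFind(newItems, oldItems)
const viaIndex = matchWithIndex(newItems, oldItems)
console.log(viaFind.every((match, i) => match === viaIndex[i]))
```

A `Map` keyed by `id` would work just as well and avoids surprises with special keys such as `__proto__` on plain objects.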

As a pattern, I would call this trick a "dictionary"... you could even make it a little trickier:

const valueDict = Object.assign({}, ...value.map( x=>( { [x.id]: x}) ) )

FWIW, the Object.assign method is a bit slower, probably because you iterate over all of the values three times (once for the map, once for the spread, once for the assignment).

If it's a large array (100k elements) the reduce approach takes ~2ms whereas the object assign approach takes ~50ms.

// Create an array for testing.
const testArray = new Array(100_000).fill(null).map((_, index) => ({ id: index }));

// Store individual iteration times.
const reduceTimes = [];
const objectAssignTimes = [];
const testIterations = 1_000;

for (let i = 0; i < testIterations; i++) {
  // Run reduce method.
  const startTimeReduce = Date.now();
  testArray.reduce((acc, element) => {
    acc[element.id] = element;
    return acc;
  }, {});
  reduceTimes.push(Date.now() - startTimeReduce);

  // Run object assign method.
  const startTimeObjectAssign = Date.now();
  Object.assign({}, ...testArray.map((element) => ({ [element.id]: element })));
  objectAssignTimes.push(Date.now() - startTimeObjectAssign);
}

// Log results.
console.log({ totalTimeReduce: reduceTimes.reduce((acc, val) => acc + val) / testIterations });
console.log({
  totalTimeObjectAssign: objectAssignTimes.reduce((acc, val) => acc + val) / testIterations,
});

Well... I just said trickier :-)

Thanks for the testing!

Another interesting approach would become available if the Array.prototype.group proposal gets approved. It won't return exactly a dictionary, because the values will be arrays...

But it will read nicely:

array.group( x=>x.id )
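To make the point about array values concrete, the sketch below uses a hand-written helper (`groupBy`, a hypothetical name, not the proposed built-in) on made-up address data. Grouping by `id` yields one array per key, even when each `id` occurs only once, so a lookup needs an extra `[0]` compared with the dictionary built by the reduce.

```javascript
// Hand-written stand-in for illustration; not the proposed built-in method.
function groupBy(items, keyFn) {
  return items.reduce((acc, item) => {
    const key = keyFn(item)
    if (!Object.prototype.hasOwnProperty.call(acc, key)) acc[key] = []
    acc[key].push(item)
    return acc
  }, {})
}

// Made-up sample data.
const addresses = [
  { id: 'a1', city: 'Berlin' },
  { id: 'a2', city: 'Munich' },
]
const grouped = groupBy(addresses, (x) => x.id)
console.log(Array.isArray(grouped.a1), grouped.a1.length)
```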

Thanks both for the input and the testing around this. It was a really nice learning experience, and I think this improves the library for the cases where we see big numbers.

Just for the record, the case we were checking was a customer from cimpress with almost 13k addresses defined on it; the size of the entity was almost 9MB 😨 💥

qmateub · 2022-11-09T14:43:16Z

                )
                /* eslint-disable no-param-reassign */
-                newObj[key][i] = nestedObject
+                merged[key][i] = nestedObject


That means we can simply assign this onto the accumulator.

qmateub · 2022-11-09T14:43:59Z

-            [key]: isNil(newObj[key]) ? [] : newObj[key],
-          }
+          merged[key] = isNil(newObj[key]) ? [] : newObj[key]
+          return merged


Instead of spreading and then assigning, we assign first and then return the merged object, which avoids the spread operator inside the .reduce.

qmateub · 2022-11-09T14:44:08Z

-            [key]: nestedObject,
-          }
+          merged[key] = nestedObject
+          return merged


Same here 👍

qmateub · 2022-11-09T14:45:14Z

+  expect(old).toEqual(oldObj)
+  expect(fixedNewObj).toEqual(newObj)
+  expect(end - start).toBeLessThan(100)
+})


Quick test to verify this. It uses performance from Node to time the function.

You can run this test on main to confirm that this scenario had performance issues 👍

(or checkout 97f3638 and run it there)
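The timing pattern used by that test can be sketched as follows. This is a standalone approximation, not the PR's test file: `buildLargeCustomer` and `indexById` are hypothetical stand-ins for the fixture and the function under test, and only `performance.now()` is a real Node API here.

```javascript
// `performance` is a global in Node 16+ (it is also exported by `perf_hooks`).

// Hypothetical fixture: a customer-like object with many addresses.
function buildLargeCustomer(size) {
  return {
    addresses: Array.from({ length: size }, (_, i) => ({ id: `addr-${i}` })),
  }
}

// Hypothetical function under test; stands in for the real utility.
function indexById(items) {
  return items.reduce((acc, item) => {
    acc[item.id] = item
    return acc
  }, {})
}

const customer = buildLargeCustomer(13000)
const start = performance.now()
const index = indexById(customer.addresses)
const end = performance.now()

// In a Jest test this would be `expect(end - start).toBeLessThan(...)`.
const elapsedMs = end - start
console.log(`indexed ${Object.keys(index).length} addresses in ${elapsedMs.toFixed(2)} ms`)
```

Wall-clock thresholds depend on the machine running the test; the commit list of this PR includes a follow-up that validates the test with 200 ms.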

codecov · 2022-11-09T15:04:08Z

Codecov Report

Merging #1816 (85daa2e) into master (b07dbae) will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##           master    #1816   +/-   ##
=======================================
  Coverage   94.64%   94.64%           
=======================================
  Files         141      141           
  Lines        4875     4877    +2     
  Branches     1332     1332           
=======================================
+ Hits         4614     4616    +2     
  Misses        258      258           
  Partials        3        3

Impacted Files	Coverage Δ
...s/sync-actions/src/utils/copy-empty-array-props.js	`100.00% <100.00%> (ø)`

danrleyt

Thanks for the brilliant contribution, that will also help us immensely.

qmateub added Status: Review Type: Enhancement labels Nov 9, 2022

qmateub requested review from emmenko, markus-azer, taylor-knapp and tdeekens November 9, 2022 14:40

qmateub self-assigned this Nov 9, 2022

qmateub commented Nov 9, 2022

wrsenn approved these changes Nov 9, 2022

taylor-knapp approved these changes Nov 9, 2022

qmateub requested a review from ajimae November 9, 2022 15:00

ajimae approved these changes Nov 9, 2022

qmateub requested review from a team and removed request for emmenko and tdeekens November 9, 2022 15:26

qmateub added 3 commits November 10, 2022 09:33

test(utils): add test for performance with large arrays

30681b0

fix(utils): improve performance of copyEmptyArrayProps fn

d87fb9e

fix(test): validate test with 200 ms

582fbb5

qmateub force-pushed the PANGOLIN-2073-sync-actions-improve-performance branch from f33aa97 to 582fbb5 Compare November 10, 2022 08:33

docs(release): add changeset

85daa2e

danrleyt approved these changes Nov 14, 2022

danrleyt merged commit cad54c4 into master Nov 14, 2022

danrleyt deleted the PANGOLIN-2073-sync-actions-improve-performance branch November 14, 2022 15:10

6 participants