Problems with bytea performance #1286

Closed

mramato opened this issue May 10, 2017 · 22 comments

@mramato

mramato commented May 10, 2017

I originally opened this as part of a knex issue: knex/knex#2052 but was directed here instead.

I have a simple query:

SELECT glb FROM models where assets_id = 16 LIMIT 1;

glb is a bytea column and the value I'm retrieving is 28630077 bytes (~27MB) (models contains a single row in this example). The query takes 13305 ms to run, and the Node process (not the DB process) maxes out the CPU while the query is running. If I query for the assets_id column instead of the glb column, it only takes 2 ms.
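For reference, this is roughly how the measurement is taken with the pg client directly; it's a minimal sketch, and the connection string is a placeholder:

// Minimal reproduction sketch; connection settings are placeholders.
const { Client } = require('pg');

async function main() {
  const client = new Client({ connectionString: 'postgres://postgres@localhost/master' });
  await client.connect();

  console.time('bytea query');
  const res = await client.query('SELECT glb FROM models WHERE assets_id = $1 LIMIT 1', [16]);
  console.timeEnd('bytea query');
  console.log('received %d bytes', res.rows[0].glb.length);

  await client.end();
}

main().catch(console.error);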

Running the same query with the same data from the psql command line completes almost immediately:

time psql -A -c "SELECT glb FROM models where assets_id = 16 LIMIT 1;" master postgres > out.glb

real    0m0.679s
user    0m0.000s
sys     0m0.031s

I also tested the same query in pg-native and it completed in ~450ms, but using pg-native isn't an option for me at this time (though I might have to re-evaluate that depending on where this issue goes).
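For comparison, the native test is just a different client import (assuming the optional pg-native package is installed); the rest of the reproduction script above stays the same:

// Same script as above, but backed by libpq via the optional pg-native package.
const { Client } = require('pg').native;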

Here's the table definition for completeness.

CREATE TABLE public.models
(
  assets_id integer NOT NULL,
  glb bytea NOT NULL,
  CONSTRAINT models_pkey PRIMARY KEY (assets_id),
  CONSTRAINT models_assets_id_foreign FOREIGN KEY (assets_id)
      REFERENCES public.assets (id) MATCH SIMPLE
      ON UPDATE NO ACTION ON DELETE CASCADE
)
WITH (
  OIDS=FALSE
);
ALTER TABLE public.models
  OWNER TO postgres;

Finally, I thought maybe it was a performance issue in the type parser, but all of the time is taken up by the query and then the typeParser completes almost instantly at the end.
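One rough way to check that, sketched here using the pg-types parsers that pg exposes (OID 17 for bytea, text format assumed), is to wrap the parser with a timer:

// Rough sketch: time only the bytea parser to confirm parsing isn't the bottleneck.
const { types } = require('pg');

const parseBytea = types.getTypeParser(17, 'text'); // 17 is the bytea OID
types.setTypeParser(17, 'text', (value) => {
  const start = process.hrtime();
  const result = parseBytea(value);
  const [s, ns] = process.hrtime(start);
  console.log(`bytea parse took ${(s * 1e3 + ns / 1e6).toFixed(1)} ms`);
  return result;
});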

Am I doing something wrong? Or is there a performance issue with bytea columns? I'd be happy to debug this further myself if someone can point me in the correct direction. Thanks in advance.

@vitaly-t
Contributor

vitaly-t commented May 10, 2017

I'd be happy to debug this further myself if someone can point me in the correct direction

Please do, many people would want to know 😉 Your own test is the best direction so far: add logs within the connection and pg-types and see where the main delay happens ;)

UPDATE

On second thought though, since using pg-native made such a huge difference, it means this is not a pg-types issue, but more likely something inside the Connection object. There may be something wrong with the read operation.

@vitaly-t
Contributor

vitaly-t commented May 10, 2017

This caught my interest, so I did some quick local testing, and I can confirm the following results...

Reading a single bytea field that contains a 16MB file:

  • With JavaScript bindings: 4s
  • With Native bindings: 200ms

That's a 20x difference - huge!!!!

@mramato
Author

mramato commented May 10, 2017

On second thought though, since using pg-native made such a huge difference, it means this is not a pg-types issue, but more likely something inside the Connection object.

Thanks, I'll start there and see what I can find.

@mramato
Author

mramato commented May 10, 2017

Reading is faster for you in JavaScript? Or is that a typo?

Just saw your edit. Glad you can reproduce. I'll keep digging but let me know if you find something on your end.

@mramato
Author

mramato commented May 10, 2017

I did some profiling and a huge amount of time (~10 seconds of the 13 for my test case) is spent in Reader.addChunk from packet-reader: https://github.com/brianc/node-packet-reader/blob/master/index.js#L18

More specifically, it looks like lastChunk is set to a buffer instead of false most of the time, causing tons of Buffer.concat calls (874 in my case), which kills performance and probably creates a ton of memory pressure as well. I'm not exactly sure why this is happening, but I'm guessing that refactoring the code to avoid all of the extra allocations would go a long way toward fixing this. In my own memory streams, I usually keep an array of buffers and only concat once at the end, but I haven't learned enough about the code here yet to know if that's a possible solution.
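For illustration only (this isn't the packet-reader code), the difference between the two accumulation strategies looks roughly like this:

// Illustration only, not the packet-reader implementation.

// Re-concatenating on every chunk copies all previously accumulated data
// each time, which is roughly O(n^2) in total bytes received.
function collectSlow(stream, done) {
  let acc = Buffer.alloc(0);
  stream.on('data', (chunk) => { acc = Buffer.concat([acc, chunk]); });
  stream.on('end', () => done(acc));
}

// Keeping an array of chunks and concatenating once at the end copies
// every byte only once.
function collectFast(stream, done) {
  const chunks = [];
  stream.on('data', (chunk) => chunks.push(chunk));
  stream.on('end', () => done(Buffer.concat(chunks)));
}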

Thoughts?

@vitaly-t
Contributor

vitaly-t commented May 10, 2017

I'm not sure yet. All I can see is that all the time is being spent inside the on data handler:

https://github.com/brianc/node-postgres/blob/master/lib/connection.js#L128

Which line in there causes the longest delay is harder to tell.

@vitaly-t
Contributor

vitaly-t commented May 10, 2017

Yeah, most of the time is being spent on this line: https://github.com/brianc/node-postgres/blob/master/lib/connection.js#L129

Of my total 4s, it eats about 3.2s; that's definitely bad!!!

@mramato
Author

mramato commented May 10, 2017

Yep, that's exactly what I'm seeing. I think I know what the problem is and I'm trying to rewrite addChunk to be more performant.

@charmander charmander added the bug label May 10, 2017
@vitaly-t
Contributor

vitaly-t commented May 10, 2017

@brianc It seems that this line kills the performance when dealing with large bytea columns: https://github.com/brianc/node-packet-reader/blob/master/index.js#L22

From my tests, it slows the library down 20x compared to the Native Bindings.

And it may be worthwhile revisiting the entire on-data handler: https://github.com/brianc/node-postgres/blob/master/lib/connection.js#L128

Unfortunately, I cannot be more specific at present; these things require a closer look.

@mramato
Author

mramato commented May 10, 2017

@vitaly-t, I have a fix for the packet-reader problem that I mentioned in #1286 (comment).

I've gone from ~13 seconds to ~450ms, matching the native time!

@vitaly-t
Contributor

vitaly-t commented May 10, 2017

@mramato Wow, magical! I want to see that! Will you do a PR? ;)

In the meantime, I asked a question here: http://stackoverflow.com/questions/43900530/slow-buffer-concat

@mramato If you are doing a PR, keep an eye on that StackOverflow question; it might offer an even better idea, as there is already one answer there ;)

mramato added a commit to mramato/node-packet-reader that referenced this issue May 10, 2017
`Reader.prototype.addChunk` was calling `Buffer.concat` constantly, which
increased garbage collection and just all-around killed performance. The
exact implications of this are documented in
brianc/node-postgres#1286, which has a test case
showing how performance is affected.

Rather than constantly concatenating buffers to the new size, this
change uses a growth strategy that doubles the size of the buffer each
time and tracks the functional length in a separate `chunkLength` variable.
This significantly reduces the number of allocations and provides a 25x
performance improvement in my test cases; the larger the amount of data
the query returns, the greater the improvement.

Since this uses a doubling buffer, it was important to avoid growing
forever, so I also added a reclamation strategy which reduces the size
of the buffer whenever more than half of the data has been read.
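A simplified sketch of the strategy that commit message describes; the names here are illustrative, and the real change lives in node-packet-reader itself:

// Simplified, illustrative sketch of the doubling-buffer strategy described above.
class ChunkBuffer {
  constructor() {
    this.buffer = Buffer.alloc(0);
    this.chunkLength = 0; // bytes of real data currently in `buffer`
    this.offset = 0;      // bytes already consumed by the reader
  }

  addChunk(chunk) {
    const needed = this.chunkLength + chunk.length;
    if (needed > this.buffer.length) {
      // Double the allocation instead of reallocating to the exact size on every chunk.
      let newSize = Math.max(this.buffer.length * 2, 1024);
      while (newSize < needed) newSize *= 2;
      const bigger = Buffer.alloc(newSize);
      this.buffer.copy(bigger, 0, 0, this.chunkLength);
      this.buffer = bigger;
    }
    chunk.copy(this.buffer, this.chunkLength);
    this.chunkLength = needed;
  }

  // Reclaim space once more than half of the buffered data has been consumed,
  // so the buffer does not grow forever.
  compact() {
    if (this.offset > this.chunkLength / 2) {
      this.buffer.copy(this.buffer, 0, this.offset, this.chunkLength);
      this.chunkLength -= this.offset;
      this.offset = 0;
    }
  }
}
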
@mramato
Author

mramato commented May 10, 2017

@vitaly-t see brianc/node-packet-reader#3, should be easy enough for you to patch your local copy and verify my claims.

Not sure of the full extent of this improvement, did I just make every node-based Postgres app in the world significantly faster? 😄

@vitaly-t
Contributor

@mramato Well done! I can confirm that the change in brianc/node-packet-reader#3 does indeed improve performance substantially, close to what we get with the Native Bindings.

I cannot, however, confirm that the change guarantees data integrity. New tests covering that are a must-have for such a change.

@mramato
Author

mramato commented May 10, 2017

I cannot, however, confirm that the change guarantees data integrity. New tests covering that are a must-have for such a change.

Thanks. Can you be more specific regarding exactly what you would like to see that isn't covered by the existing tests? Are you talking about adding tests in node-packet-reader or tests in node-postgres?

@vitaly-t
Contributor

I'm saying that we need to make sure that the node-packet-reader tests cover this change. And I'm not saying they don't already; I haven't checked them yet.

@mramato
Author

mramato commented May 10, 2017

OK, thanks. I'll do a pass myself and see if there is anything obvious I can add. Thanks for the help on this by the way.

@mramato
Author

mramato commented May 10, 2017

@vitaly-t I used istanbul to verify coverage, and the new buffer compaction was not tested at all (but the rest of the changes were adequately covered). I added a new test to verify the behavior is as expected, and the file is back to 100% coverage. Hopefully that PR is now good to go, but please let me know if there's anything else you need me to do.
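To give a rough idea of the kind of check such a test performs, here is a sketch written against the simplified ChunkBuffer example from earlier in this thread (assumed to be in scope), not against the real Reader or its test suite:

// Rough compaction/integrity check, using the illustrative ChunkBuffer sketch above.
const assert = require('assert');

const buf = new ChunkBuffer();
const payload = Buffer.from('a'.repeat(100000)); // big enough to force several doublings

// Feed the payload in small slices, as a socket would.
for (let i = 0; i < payload.length; i += 1024) {
  buf.addChunk(payload.slice(i, i + 1024));
}
assert.strictEqual(buf.chunkLength, payload.length);

// Simulate the reader having consumed most of the data, then compact.
buf.offset = 80000;
buf.compact();
assert.ok(buf.buffer.slice(0, buf.chunkLength).equals(payload.slice(80000)));
console.log('data survived growth and compaction');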

@vitaly-t
Contributor

vitaly-t commented May 10, 2017

@mramato Good by me! But I'm not the one to approve the PR 😉

@brianc
Owner

brianc commented May 15, 2017

Thanks for your help y'all! Much appreciated!

@pdkovacs

pdkovacs commented Apr 23, 2018

Couldn't a similar bug potentially affect the text type as well? I haven't tried it or experienced any problems; I'm just wondering, because reading a text value (also of "infinite" length) likely involves mechanisms similar to those used for bytea (heavy copying across buffers or the like).

@charmander
Collaborator

@pdkovacs The packet-reader package that was patched reads entire messages; parsing individual fields wasn’t the problem. The fix applies to all types.

@pdkovacs

Ah, I see, thanks!
