
chore(postgres): Optimize the database. #1842

Closed
2 tasks done
Tracked by #1888
Ivansete-status opened this issue Jul 5, 2023 · 6 comments
Assignees
Labels
E:PostgreSQL See https://github.com/waku-org/pm/issues/84 for details

Comments

@Ivansete-status
Collaborator

Ivansete-status commented Jul 5, 2023

Background

When nwaku has the "store/archive" protocol mounted, it can store and retrieve historical messages. All of this information is kept in a single table, messages, which we need to optimize.

Details

We need to get the maximum performance possible for insert/select operations.
We need a rapid response when duplicate messages arrive. For that, we may need to adapt how the message ID is generated so that we achieve high selectivity.
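As an illustration of the "high selectivity" point above, here is a minimal Python sketch (not nwaku code; the field set and encoding are assumptions for demonstration) of a deterministic message ID: hashing the fields that identify a message means a duplicate always maps to the same key, so the database can reject it in a single index probe.

```python
import hashlib

def deterministic_message_hash(pubsub_topic: bytes, payload: bytes,
                               content_topic: bytes, timestamp: int) -> bytes:
    """Illustrative sketch: hash the identifying fields so that the same
    message always yields the same ID (field choice is an assumption here)."""
    h = hashlib.sha256()
    for field in (pubsub_topic, payload, content_topic,
                  timestamp.to_bytes(8, "big", signed=True)):
        h.update(field)
    return h.digest()

# Duplicate messages collide on the same 32-byte key, so a UNIQUE index on
# this column detects them quickly.
a = deterministic_message_hash(b"/waku/2/default-waku/proto", b"hi", b"/app/1", 1700000000)
b = deterministic_message_hash(b"/waku/2/default-waku/proto", b"hi", b"/app/1", 1700000000)
assert a == b and len(a) == 32
```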

Tasks

  • [ ] Rename the messages table to MESSAGE (that renaming caused issues in the currently existing shards.test fleet).
    - [ ] Apply @Menduist 's enhancement suggestions for a more appropriate asynchronous handling: feat(common): added postgress async pool wrapper #1631 (comment)
  • Perform tests in a standalone database. This is a DB analyst task where operations are checked in a table with hundreds of millions of rows.
  • Apply "integration" performance tests.
    ℹ️ We consider query performance acceptable when the "Waku Archive Query Duration" panel in Grafana shows <50 ms.
    ℹ️ For that, we will use the following repo: https://github.com/waku-org/test-waku-query (cc @richard-ramos)

Related issue

#1604

@jm-clius
Contributor

jm-clius commented Jul 15, 2023

  • add an index column with the consistent, deterministic message hash as message ID.

edited: being considered in #2112
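To make the suggestion concrete: a UNIQUE index on the hash column is what turns the deterministic message ID into fast duplicate rejection. A minimal sketch below uses sqlite3 only so it is runnable standalone; the Postgres DDL would be analogous (e.g. `CREATE UNIQUE INDEX ... ON messages (messagehash)` — exact table/column names are assumptions, not the actual nwaku schema).

```python
import sqlite3

# In-memory database standing in for Postgres in this sketch.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE messages (messagehash BLOB, payload BLOB)")
db.execute("CREATE UNIQUE INDEX idx_messagehash ON messages (messagehash)")

db.execute("INSERT INTO messages VALUES (x'ab', x'01')")
try:
    # Same hash, different payload: the unique index rejects it immediately.
    db.execute("INSERT INTO messages VALUES (x'ab', x'02')")
    duplicate_inserted = True
except sqlite3.IntegrityError:
    duplicate_inserted = False
assert not duplicate_inserted
```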

@Ivansete-status
Collaborator Author

> Apply "integration" performance tests.

This is done in the https://github.com/waku-org/test-waku-query repo

@Ivansete-status
Collaborator Author

I'm reluctant to apply @Menduist's enhancement. We might still have a bottleneck in the `while db.pqisBusy() == 1:` loop for concurrent Store queries (see the TODO comment next to that loop).
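For context on why that loop is a bottleneck: spinning on a busy flag burns CPU and serializes concurrent queries, whereas blocking on the connection's file descriptor until the server has data wakes the caller exactly when a result is ready. A hypothetical Python sketch (not nwaku/libpq code; the socket pair simulates the database connection):

```python
import select
import socket

def wait_for_result(conn: socket.socket, timeout: float = 1.0) -> bytes:
    """Block on the fd until it is readable, then drain it -- no busy-wait."""
    readable, _, _ = select.select([conn], [], [], timeout)
    if not readable:
        raise TimeoutError("no result within timeout")
    return conn.recv(4096)

# Simulate the server side of a DB connection with a socket pair.
server, client = socket.socketpair()
server.sendall(b"query result")
assert wait_for_result(client) == b"query result"
```

With libpq specifically, the same idea is usually expressed by `select()`-ing on `PQsocket()` and calling `PQconsumeInput()` when readable, instead of looping on `PQisBusy()`.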

@Ivansete-status
Collaborator Author

Ivansete-status commented Nov 7, 2023

Tests have been performed on the database itself with ~12 million rows. On the other hand, the bottleneck is within db.pqisBusy(), as mentioned above. See also https://www.notion.so/Postgres-e33d8e64fa204c4b9dcb1514baf9c582

@Ivansete-status
Collaborator Author

  • add an index column with the consistent, deterministic message hash as message ID.

Thanks for adding this point @jm-clius!
I'll check it off in this issue because it is being properly tackled in #2112 by @ABresting.
Cheers

@Ivansete-status
Collaborator Author

We conclude the Postgres optimization for now.
Find more details in https://github.com/waku-org/nwaku/wiki/Adoption-of-Postgres-as-an-archive-system
