
(PDB-4420) partition of resource_events #3027

Merged (16 commits) Nov 13, 2019
Conversation

@robdaemon (Contributor) commented Jul 18, 2019

Prototype of partitioning the resource_events table. This creates a base
table (with the same schema as the existing resource_events table) and
creates 8 partitions in total during migration (this ISO week ± 4).

Partitions are based on ISO 8601 weeks. Inserts into the partitions are
routed by a trigger that directs each row to the appropriate partition.

Partitions are created dynamically by scanning the events to be inserted
and performing a "CREATE TABLE IF NOT EXISTS" for each required
partition. This is akin to the (ensure-certname) code that already
exists.

The inheritance-based implementation was chosen to maintain backwards
compatibility with PostgreSQL 9.x, since the other components of PE are
not ready for PostgreSQL 11.
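
For illustration, here is a minimal sketch of the dynamic partition creation under PostgreSQL 9.x inheritance (table and function names are assumptions for illustration, not the PR's actual identifiers; the insert-routing trigger is omitted):

    (ns example.partitioning
      (:require [clojure.java.jdbc :as jdbc])
      (:import [java.time DayOfWeek ZonedDateTime]
               [java.time.temporal ChronoUnit IsoFields TemporalAdjusters]))

    (defn- iso-week-start
      "Truncates a timestamp to the Monday 00:00 that begins its ISO week."
      ^ZonedDateTime [^ZonedDateTime ts]
      (-> ts
          (.with (TemporalAdjusters/previousOrSame DayOfWeek/MONDAY))
          (.truncatedTo ChronoUnit/DAYS)))

    (defn ensure-week-partition!
      "Creates the ISO-week child table for ts if it does not already exist.
       The CHECK constraint is what lets constraint exclusion skip this
       partition for out-of-range queries."
      [db ^ZonedDateTime ts]
      (let [start (iso-week-start ts)
            end   (.plusWeeks start 1)
            table (format "resource_events_%dw%02d"
                          (.get ts IsoFields/WEEK_BASED_YEAR)
                          (.get ts IsoFields/WEEK_OF_WEEK_BASED_YEAR))]
        (jdbc/execute! db
          [(format (str "CREATE TABLE IF NOT EXISTS %s ("
                        " CHECK (\"timestamp\" >= '%s' AND \"timestamp\" < '%s')"
                        ") INHERITS (resource_events)")
                   table (.toInstant start) (.toInstant end))])))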

@robdaemon robdaemon added the work in progress (...and please don't merge) label Jul 18, 2019
@puppetcla

CLA signed by all contributors.

@robdaemon robdaemon added please review and removed work in progress (...and please don't merge) labels Aug 1, 2019
@austb austb self-requested a review August 5, 2019 16:06
@austb (Contributor) left a comment

This looks good to me so far. I do suspect that when we ask around, they'll want us to partition on the day boundary. The most common setting I've seen for reports-ttl in production is 3d, and if I've wrapped my head around this properly, this change, while making GC cheaper, would significantly increase the size of their reports and resource_events tables.

@robdaemon (Contributor, Author)

I chose ISO weeks because it's the most straightforward approach. Even if we want finer granularity for queries, I think the storage granularity has to stay at the week; otherwise we risk making this either too complicated or a performance problem due to the number of partitions we create (imagine 365.25 tables per year; that's a lot).

@npwalker (Contributor)

I agree that partitioning on week seems like it would miss most of the benefit of partitioning. If you partition on day, then a query that looks for reports in the last 6 hours (anything less than a day is probably the most common report query) only has to look at one smaller partition. If you partition on week, you can still look at less data if your report-ttl is large enough, but the performance profile of your query will change day over day: the day after you make a new partition things will be speedy, and they will get slower as the week progresses.

Why would creating more tables create a performance impact?

@robdaemon (Contributor, Author)

If the average report TTL is < 7 days, then partitioning by day would be better. Do we have details on the actual use of report-ttl in the field?

See: https://www.postgresql.org/docs/9.6/ddl-partitioning.html

All constraints on all partitions of the master table are examined during constraint exclusion, so large numbers of partitions are likely to increase query planning time considerably. Partitioning using these techniques will work well with up to perhaps a hundred partitions; don't try to use many thousands of partitions.

This is where the impact on the number of partitions comes into play, and one of the driving reasons for using ISO weeks.
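
To make the trade-off concrete, a back-of-the-envelope count (the TTL and headroom figures here are illustrative assumptions): a short TTL with daily partitions stays well under the ~100-partition guidance quoted above; only retention approaching a year gets near 365 tables.

    ;; Rough count of live partitions given a TTL, some pre-created future
    ;; partitions (headroom), and a partition granularity, all in days.
    (defn live-partition-count [ttl-days headroom-days granularity-days]
      (long (Math/ceil (/ (+ ttl-days headroom-days)
                          (double granularity-days)))))

    (live-partition-count 14 4 1) ;=> 18 daily partitions
    (live-partition-count 14 4 7) ;=> 3 weekly partitions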

@robdaemon robdaemon requested a review from a team as a code owner September 4, 2019 21:13
@puppetlabs-jenkins

Tests Failed \o/

@rbrw (Contributor) left a comment

Overall seems pretty close.

    (doall
     (map (fn [week-offset]
            (partitioning/create-resource-events-partition (.plusWeeks now week-offset)))
          weeks))))

If we don't need to keep/return the seq, we might change this to dorun, or just collapse it to a doseq.
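
For reference, the doseq version would look roughly like this (assuming now is a ZonedDateTime and weeks a seq of integer offsets, as in the snippet above):

    (doseq [week-offset weeks]
      (partitioning/create-resource-events-partition (.plusWeeks now week-offset)))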

@austb (Contributor) commented Sep 25, 2019

Looks like Jenkins might actually be lying to us here. The acceptance tests are failing: https://cinext-jenkinsmaster-enterprise-prod-1.delivery.puppetlabs.net/view/puppetdb/view/master/job/enterprise_puppetdb_integration-system-puppetdb_full-master/1947/

I think it's because only the ezbake job is nested in the PR Kickoff job. Might take a bit of cjc wizardry to fix that without duplicating jobs... I'll make a ticket.

@@ -0,0 +1,49 @@
# Partitioning in PuppetDB

I've added some stuff in dev-docs/ related to how to reconcile git commits/Jira tickets for release. This could probably end up there.

@austb austb removed the don't merge label Oct 24, 2019
@austb (Contributor) left a comment

👍

Robert Roland added 10 commits November 13, 2019 10:55
Prototype of partitioning the resource_events table (same description as
the PR body above).

TODO: Migrate existing events to partitioned tables - they are currently
dropped.

TODO: GC by dropping expired partitions.

(PDB-4468) migration test for partitioned tables

Add a migration test for creating the new partitioned tables

Migrate existing resource events data to the new table. Rolls up the
previous migration that creates the event_hash column into this new
migration.

This query returns a different order in pg11 than it does in pg9.6, so
adding this ORDER BY clause makes it behave the same in both.
* Adds a config parameter: database.resource-events-ttl with a default of 14d
* Adds two admin metrics: resource-events-purges and resource-events-purge-time
* Adds documentation for this new feature

This will allow the resource_events table to be cleaned up at a different
interval than the reports table.

Uses calendar year and day of year (1-366) for the partition names.
Perform a GC by dropping tables past our expiration date. Rounded to the
nearest day.
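
A minimal sketch of that naming-plus-GC scheme (names here are illustrative assumptions, not the PR's actual identifiers):

    (ns example.partition-gc
      (:require [clojure.java.jdbc :as jdbc])
      (:import [java.time LocalDate]))

    (defn partition-name
      "Daily partition named by calendar year and day of year (1-366),
       e.g. resource_events_2019_317 for 2019-11-13."
      [^LocalDate day]
      (format "resource_events_%d_%03d" (.getYear day) (.getDayOfYear day)))

    (defn drop-expired-partitions!
      "Drops every known daily partition strictly older than cutoff.
       known-days (a seq of LocalDate) would in practice come from listing
       the base table's children in the system catalogs."
      [db known-days ^LocalDate cutoff]
      (doseq [^LocalDate day known-days
              :when (.isBefore day cutoff)]
        (jdbc/execute! db [(str "DROP TABLE IF EXISTS " (partition-name day))])))
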
Change use of LocalDate/LocalDateTime to ZonedDateTime and use UTC for
all partitioning operations.

Fixed issues found in code review.

Port of #3074 to partitioning work.
Use the formatting string that matches PostgreSQL for outputting UTC
offsets as +00 instead of Z
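
For reference, a small sketch of the offset-format difference (the pattern is an assumption about the approach, not the PR's exact code): java.time's ISO formatters render a zero offset as Z, while PostgreSQL renders timestamptz output with +00, which the pattern letter x matches:

    (import '[java.time ZonedDateTime ZoneOffset]
            '[java.time.format DateTimeFormatter])

    ;; A single lowercase 'x' prints a zero UTC offset as "+00", matching
    ;; PostgreSQL's output; the default ISO formatters would print "Z".
    (def pg-style-formatter
      (DateTimeFormatter/ofPattern "yyyy-MM-dd HH:mm:ss.SSSx"))

    (.format (ZonedDateTime/now ZoneOffset/UTC) pg-style-formatter)
    ;; => e.g. "2019-11-13 18:55:02.037+00"
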
Robert Roland added 6 commits November 13, 2019 10:55
A previous commit that added batched inserts removed the creation of
partitions based on row dates; fixed.

Ensure that new tables are created in the active transaction,
otherwise you get an error:

    org.postgresql.util.PSQLException: ERROR: portal "C_8" does not exist
Remove the fk constraint to reports. Creating it repeatedly will cause us
to use up all available locks in a migration transaction (the default
limit is 64). This fk was to be removed when the reports table is
partitioned anyhow.
Group inserts by day, so they insert directly into the partition
instead of routing through the trigger during migration

Enable rewriteBatchedInserts to additionally speed up bulk inserts

Runtime on the customer dataset on n2:

    2019-10-11 11:35:02,037 INFO  [main] [p.t.internal] Finished shutdown sequence

    real    19m13.106s
    user    14m44.906s
    sys     1m28.030s
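
A minimal sketch of the batching idea (the column set and names are illustrative assumptions, not PuppetDB's actual code); reWriteBatchedInserts is the PgJDBC connection property that rewrites executeBatch calls into multi-row INSERTs:

    (ns example.batch-insert
      (:require [clojure.java.jdbc :as jdbc])
      (:import [java.time Instant ZoneOffset]
               [java.time.temporal ChronoUnit]))

    (def db-spec
      {:connection-uri (str "jdbc:postgresql://localhost/puppetdb"
                            "?reWriteBatchedInserts=true")})

    (defn insert-events-by-day!
      "Groups events by the UTC day of their :timestamp (an Instant) and
       inserts each group as one batch, so every batch lands in exactly one
       daily partition instead of routing row-by-row through the trigger."
      [db events]
      (doseq [[_day group] (group-by (fn [e]
                                       (-> ^Instant (:timestamp e)
                                           (.atZone ZoneOffset/UTC)
                                           (.truncatedTo ChronoUnit/DAYS)))
                                     events)]
        (jdbc/insert-multi! db :resource_events
                            [:certname :timestamp :status]
                            (mapv (juxt :certname :timestamp :status) group))))
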
Applies a floor of one day to the resource-events-ttl: since we partition
based on days, the TTL of resource-events cannot be any less than a day.
Allow rewriteBatchedInserts to be set per connection pool
Moving a doc to dev-docs/

Replacing a use of str with trs for a log message