call partman.partition_data_proc during partman.run_maintenance #655

Closed
hvanderland opened this issue Apr 19, 2024 · 7 comments

@hvanderland

This is a very useful set of procedures.
One problem we have is that the run_maintenance task fails when there is data in the default partition that falls into the range of the new partition.

Could partman.run_maintenance call partman.partition_data_proc instead of generating a P001 error?

The code could call partman.check_default to see whether there is data in the default partition that needs to be moved for the new partition being added. It could then call partman.partition_data_proc and, if needed, partman.partition_gap_fill to generate the missing partitions.

After that, run the normal maintenance.
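
A rough sketch of that manual flow, with a placeholder parent table name (exact calls and ordering may vary by pg_partman version):

-- Check whether any managed default partitions currently contain rows
SELECT * FROM partman.check_default();

-- Move the rows out of the default into properly ranged child tables
CALL partman.partition_data_proc('myschema.mytable');

-- If needed, create any child tables still missing from the sequence
SELECT partman.partition_gap_fill('myschema.mytable');

-- Then run normal maintenance as usual
SELECT partman.run_maintenance('myschema.mytable');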

@keithf4
Collaborator

keithf4 commented Apr 19, 2024

The problem with this is that when the default gets data, it can often be A LOT of data. If that's the case, it could cause an extremely expensive write operation to kick off during normal maintenance.

I instead recommend setting up whatever monitoring application you have in your environment to run check_default() and alert if it detects any data. That way, if default data does appear, it can be handled appropriately for the situation.
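
For example, a minimal periodic check could be as simple as the call below (assuming the check_default() function in recent pg_partman releases, which returns one row per default table that contains data):

-- Alert if this returns any rows; the count column shows how many rows each default table holds
SELECT * FROM partman.check_default(true);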

@hvanderland
Author

Thank you for the quick response.

Understood; yes, we were planning to do these calls in our own scheduler.
I was thinking of adding a flag to run_maintenance to enable this, since it would only be used in an exception scenario, but I understand the reasoning for not doing it.

@keithf4
Collaborator

keithf4 commented Apr 19, 2024

Yeah, I've thought of doing this as well with a flag. But then I've given people a flag for something that could potentially be very disruptive. I'd rather provide the means to monitor for it and have users go out of their way to fix this situation properly.

The other thing is that if you're seeing data go into the default frequently enough that you feel this needs to be automated, I'd say there are likely other problems that need to be fixed:

  1. If the data is just slightly out of the normal time window, you may just need to adjust the premake value to make sure the necessary tables are there. There's nothing wrong with a premake of 20, 30, or more, since partman usually only makes one child table per partition set during maintenance.
  2. If the data is frequently far out of the normal range, it may be a bug that needs to be fixed to avoid that situation.
  3. If it's not a bug, it's pretty far out of the normal scenario for range partitioning, IMO. In that case you'd be better off writing your own procedure to monitor the default and take appropriate actions depending on how much data is there.

@hvanderland
Copy link
Author

Yes, but creating a high number of future partitions has an impact on performance. Queries like "give me all the rows newer than yesterday" will scan all of these partitions. For our implementations, rows ending up in the default will be an exception and, as you correctly remark, should be low volume, otherwise there is a design issue.

Thank you for the clarification

@keithf4
Collaborator

keithf4 commented Apr 19, 2024

But those tables are empty for the most part, and as of PG12+ the performance impact of having a higher number of partitions (1000+) is negligible until you start getting into REALLY high numbers. And I'd say if your partition numbers are getting that high, you may want to re-evaluate your partitioning interval and seriously consider retention options to remove unneeded data from that partition set.

I'd encourage you to test and see what that performance impact is. If it's not negligible, I'd write up the scenario and share it on the developer mailing lists so they can see what the problem is.

@hvanderland
Author

Adding more partitions increases the planning time. We prevent the use of generic plans for these tables to allow partition pruning.
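
(One common way to prevent generic plans, shown here only as an illustrative assumption, is the PostgreSQL 12+ setting below; it forces custom plans so the planner can prune partitions using the actual parameter values.)

-- PostgreSQL 12+: disable generic plans so plan-time partition pruning can use real parameter values
SET plan_cache_mode = force_custom_plan;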

Using the partman code, this can be tested simply:
-- Raise premake so the next maintenance run creates many future child tables
update partman.part_config set premake = 100 where parent_table = 'xxx.time_taptest_table';
select partman.run_maintenance('xxx.time_taptest_table', true);
-- Compare the planning time reported by EXPLAIN ANALYZE before and after
explain analyze select * from xxx.time_taptest_table where col3 > current_date;
explain analyze select * from xxx.time_taptest_table where col3 between current_date and current_date + interval '1 day';

It shows that the planning time goes up as we increase the number of partitions.
Postgres checks every partition to see whether the data falls in its range.
Yes, this is milliseconds, but at high volume it makes a difference.

@keithf4
Collaborator

keithf4 commented Apr 19, 2024

If that millisecond difference is demonstrably affecting your application, I can certainly understand that. In most cases I've seen myself, that difference didn't really matter versus the overhead of having to deal with cleaning up the default table.

One thing I may suggest, if you're really getting down to that level of performance being important, is to take advantage of pg_partman's predictable naming pattern and query the child tables directly. If you know the time condition you're asking for at the application level, dynamically generate the query there to hit the exact child tables you're targeting. That completely bypasses the need for partition pruning.
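
A rough sketch of that idea (the child table suffix shown is an assumption; the exact naming depends on the partition interval and the pg_partman version):

-- Query the daily child table for 2024-04-19 directly instead of the parent table
SELECT * FROM xxx.time_taptest_table_p20240419 WHERE col3 >= DATE '2024-04-19';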
