
Don't generate monitoring snapshots for statements which only reference MON$DATABASE #7567

Open · mrotteveel opened this issue May 5, 2023 · 18 comments

@mrotteveel (Member)

Currently, referencing any monitoring table will generate a monitoring snapshot. This is a relatively costly operation, and it should be unnecessary for queries which only reference the MON$DATABASE monitoring table.

Excluding MON$DATABASE from triggering a monitoring snapshot should make it cheaper to use that table.
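For context, the kind of statement this is about is a plain lookup against MON$DATABASE, which today still materializes a full monitoring snapshot for the transaction. A minimal sketch (standard MON$DATABASE columns):

```sql
-- Reads only MON$DATABASE, but currently still triggers a full
-- monitoring snapshot covering all MON$ tables.
SELECT MON$DATABASE_NAME, MON$ODS_MAJOR, MON$NEXT_TRANSACTION
FROM MON$DATABASE;
```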

@AlexPeshkoff (Member)

That's not as good as it seems at first glance. First of all, the snapshot is created not per statement but per transaction, and nobody can guarantee that such a transaction does not reference other monitoring tables later. I.e. we could end up with an inconsistency between MON$DATABASE and the other monitoring tables.
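A sketch of the scenario being described, assuming MON$DATABASE were excluded from snapshot creation (standard MON$ columns):

```sql
-- Same transaction:
SELECT MON$NEXT_TRANSACTION FROM MON$DATABASE;   -- would no longer create a snapshot
-- ... some time passes, new transactions start ...
SELECT MON$TRANSACTION_ID FROM MON$TRANSACTIONS; -- the snapshot is created only now, so it
                                                 -- may already list transactions newer than
                                                 -- the value read above
```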

@hvlad (Member) commented May 5, 2023

Agree with @AlexPeshkoff.

Instead of creating "short paths" for some very special cases, we need to rethink the whole monitoring architecture, IMHO.

@mrotteveel (Member, Author)

> That's not as good as it seems at first glance. First of all, the snapshot is created not per statement but per transaction, and nobody can guarantee that such a transaction does not reference other monitoring tables later. I.e. we could end up with an inconsistency between MON$DATABASE and the other monitoring tables.

What inconsistencies are you thinking of? I don't see which data in MON$DATABASE could become inconsistent with other monitoring tables.

@AlexPeshkoff (Member) commented May 5, 2023 via email

@aafemt (Contributor) commented May 5, 2023

Transaction and attachment numbers.

+1 for rethinking the whole system. IMHO, monitoring tables should use dirty reads (maybe with cursor stability), i.e. be formed at the moment the request accesses them.

@mrotteveel (Member, Author)

> For example, a transaction with the next transaction number may arrive in mon$transactions.

I can't come up with a scenario where that would result in an inconsistency or a problem. I think it would be an acceptable risk.

@livius2 commented May 5, 2023

Maybe it would be better to add an alias instead: e.g. a TMP$DATABASE table that does not take the whole snapshot but otherwise exposes the same data as MON$DATABASE.

@AlexPeshkoff (Member) commented May 5, 2023 via email

@dyemanov (Member) commented May 5, 2023

I don't mind rethinking the original idea. I just want to mention two points that caused the snapshot to be transaction-level.

  1. Different calls to the MON$ tables inside one transaction should return consistent results. Yes, usually only statement-level consistency is enough. But sometimes you may want to query MON$ATTACHMENTS and only later query e.g. MON$IO_STATS for some particular attachment. And you expect IDs to be consistent between these two queries.

  2. Different calls to the MON$ tables inside one transaction should be fast. Monitoring was never lightning fast, and while there have been improvements in the performance area in v3 and also recently in v5, caching the snapshot could still be a good idea.

That said, I don't mind having both consistency options (transaction-level and statement-level) available, let's just define how it should be controlled by users.
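A sketch of the two-step usage from point 1, where both statements run in the same transaction and rely on seeing the same snapshot (standard MON$ columns; the parameter is a placeholder):

```sql
-- Step 1: list attachments and pick one of interest.
SELECT MON$ATTACHMENT_ID, MON$USER, MON$STAT_ID
FROM MON$ATTACHMENTS;

-- Step 2, later in the same transaction: drill down into its I/O counters.
SELECT MON$PAGE_READS, MON$PAGE_WRITES, MON$PAGE_FETCHES
FROM MON$IO_STATS
WHERE MON$STAT_ID = :chosen_stat_id;  -- placeholder for the value picked in step 1
```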

@hvlad (Member) commented May 5, 2023

I also consider new session control statements to be the way to go.

For a start, things that could be managed (a hypothetical sketch follows the list):

  • snapshot scope: all tables, or just those required by the query;
  • snapshot lifetime: per query, until transaction end, or explicitly defined by the user.
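Purely to illustrate the idea, a hypothetical sketch of what such statements might look like; this syntax does not exist in Firebird:

```sql
-- Hypothetical syntax, for illustration only:
SET MONITORING SNAPSHOT SCOPE QUERY;          -- or ALL: load only the tables the query needs
SET MONITORING SNAPSHOT LIFETIME STATEMENT;   -- or TRANSACTION, or USER (explicit release)
```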

@aafemt (Contributor) commented May 5, 2023

> But sometimes you may want to query MON$ATTACHMENTS and only later query e.g. MON$IO_STATS for some particular attachment. And you expect IDs to be consistent between these two queries.

And aren't they? I was sure that the ID is the attachment_id and that it is stable during the attachment's lifetime. The same for transactions and statements.

Also, I cannot speak for everybody, but if the attachment has disappeared by the moment I query IO_STATS, it is fine for me to get nothing.

@livius2 commented May 5, 2023

> And aren't they?

You misunderstood the ticket. It is about changing MON$DATABASE so that it does not create the whole monitoring snapshot.
The comment above warns that if this is changed, it may produce such problems.
Currently it is consistent throughout the whole transaction.

@aafemt (Contributor) commented May 5, 2023

Yes, and I would like to hear how this inconsistency in IDs can appear if these IDs are stable during the whole object's lifetime, not just for the monitoring snapshot.

@dyemanov (Member) commented May 6, 2023

> But sometimes you may want to query MON$ATTACHMENTS and only later query e.g. MON$IO_STATS for some particular attachment. And you expect IDs to be consistent between these two queries.
>
> And aren't they? I was sure that the ID is the attachment_id and that it is stable during the attachment's lifetime. The same for transactions and statements.

Attachment/transaction/statement IDs are stable. But the MON$*_STATS tables have an artificial primary key which is globally unique in the shared memory but remapped to snapshot-level artificial IDs when the snapshot is created, so two different snapshots may have two different IDs for the same object.
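One way to avoid relying on cross-statement stability of MON$STAT_ID is to resolve it and read the statistics within a single statement, i.e. within a single snapshot; a small sketch using the standard MON$ columns:

```sql
-- Join within one statement, so both tables come from the same snapshot.
SELECT a.MON$ATTACHMENT_ID, io.MON$PAGE_READS, io.MON$PAGE_WRITES
FROM MON$ATTACHMENTS a
JOIN MON$IO_STATS io ON io.MON$STAT_ID = a.MON$STAT_ID
WHERE a.MON$ATTACHMENT_ID = CURRENT_CONNECTION;
```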

@dyemanov (Member) commented May 6, 2023

> I also consider new session control statements to be the way to go.
>
> For a start, things that could be managed:
>
> * snapshot scope: all tables, or just those required by the query;
> * snapshot lifetime: per query, until transaction end, or explicitly defined by the user.

Snapshot scope is meaningless for statement-level snapshots, as we already know all the tables accessed by the statement. And while I agree that we could control that scope for transaction-level snapshots, I'm not really sure this is needed. If we accept that a dynamically extended snapshot (with tables loaded on demand) may be inconsistent between its tables, then the user may just as well use a statement-level snapshot with the same side effects. The only useful use case that comes to mind is when the user wants to get a small snapshot ASAP (without loading the huge mon$compiled_statements, for example) and will query this snapshot later in the same transaction, which also must be fast. I dunno how common this is in practice.

@sim1984 commented May 6, 2023

Why not simply link the scope of the snapshot to the isolation level of the transaction? For READ COMMITTED, take the snapshot at the statement level; for SNAPSHOT, at the transaction level.
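Under that proposal the snapshot behaviour would follow the existing transaction options; the SET TRANSACTION syntax below is real, but the snapshot behaviour described in the comments is the proposed one, not the current one:

```sql
SET TRANSACTION ISOLATION LEVEL READ COMMITTED; -- proposed: MON$ snapshot per statement
SET TRANSACTION ISOLATION LEVEL SNAPSHOT;       -- proposed: MON$ snapshot per transaction
```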

@dyemanov (Member) commented May 6, 2023

It's also an option, but what if someone needs to use a different snapshot level than their transaction's (which is generally unknown inside a procedure)? Using an autonomous transaction could help, but only if it allowed overriding the parent transaction's options, which AFAIK is currently impossible (although IMO it should be supported).
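For reference, the autonomous transaction construct in question; a minimal PSQL sketch, noting that the autonomous transaction currently inherits the parent transaction's isolation level and other options:

```sql
EXECUTE BLOCK RETURNS (page_reads BIGINT)
AS
BEGIN
  -- Starts a separate transaction (and thus presumably its own monitoring
  -- snapshot), but with the same isolation and options as the parent transaction.
  IN AUTONOMOUS TRANSACTION DO
    SELECT io.MON$PAGE_READS
      FROM MON$ATTACHMENTS a
      JOIN MON$IO_STATS io ON io.MON$STAT_ID = a.MON$STAT_ID
     WHERE a.MON$ATTACHMENT_ID = CURRENT_CONNECTION
      INTO :page_reads;
  SUSPEND;
END
```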

@AlexPeshkoff (Member) commented May 7, 2023 via email
