
ADBDEV-4251: Fix use by ORCA of a newer index with HOT-chain in an older transaction. #619

Merged
merged 45 commits into from
Dec 15, 2023

Conversation


@KnightMurloc KnightMurloc commented Sep 21, 2023

When a heap tuple is updated by the legacy planner and the updated tuple is
placed on the same page (heap-only tuple, HOT), an update chain is created.
It's a chain of updated tuples in which each tuple's ctid points to the next
tuple in the chain.

HOT chains allow storing only one index entry, which points to the first tuple
in the chain. During an Index Scan we traverse the chain and take the first
tuple visible to the current transaction (for more information, see
src/backend/access/heap/README.HOT).

If we create a second index on a column that has been updated, it will store the
ctid of the beginning of the existing HOT chain. If a repeatable read
transaction started before the transaction in which the second index was
created, this index could still be used in that transaction's query plan. A
search using this index could then return a tuple that does not satisfy the
search condition (the index entry matches a new value that is not visible to
the transaction).

In the case of the legacy planner, this problem is solved in the following way
(quoting README.HOT):

"To address this issue, regular (non-concurrent) CREATE INDEX makes the
new index usable only by new transactions and transactions that don't
have snapshots older than the CREATE INDEX command. This prevents
queries that can see the inconsistent HOT chains from trying to use the
new index and getting incorrect results. Queries that can see the index
can only see the rows that were visible after the index was created,
hence the HOT chains are consistent for them."

But ORCA does not handle this case and can use an index with a broken HOT chain.

This patch resolves the issue for ORCA in the same way as the legacy planner
does: during planning we ignore newly created indexes based on their xmin.
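The filter mirrors what the planner does for indexes flagged with indcheckxmin. As a rough standalone model (the function name and the simplified xid comparison, which ignores wraparound, are illustrative only, not the actual ORCA or PostgreSQL code):

```cpp
#include <cassert>
#include <cstdint>

// Simplified stand-in for PostgreSQL transaction ids (no wraparound handling).
using TransactionId = uint32_t;

// Model of the xmin-based filter: an index whose pg_index row carries the
// indcheckxmin flag is usable only if that row's xmin precedes the oldest
// xmin of the current transaction (TransactionXmin). Otherwise the current
// transaction holds a snapshot that predates the CREATE INDEX, and the
// index may point at HOT chains that are inconsistent for that snapshot.
bool IndexIsUsable(bool indcheckxmin,
                   TransactionId indexXmin,
                   TransactionId transactionXmin)
{
    if (!indcheckxmin)
        return true;  // index never had potentially broken HOT chains
    // Simplified TransactionIdPrecedes: plain comparison, no xid wraparound.
    return indexXmin < transactionXmin;
}
```

During planning, any index for which this check fails is simply skipped, as if it did not exist; it becomes visible again once the transaction's snapshots are new enough.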

Additionally, ORCA has another related problem. Since ORCA has its own cache
(MD Cache), it can cache a relation object without an index that cannot be used
under the current snapshot (because the MDCacheSetTransientState function
returns true), so we would not be able to use the index even after the
problematic snapshot goes away. Therefore, we need to reset the cache after the
snapshot changes in order to use the index.

This patch solves the problem in the following way: during index filtering, if
we encounter an index that we cannot use, we save TransactionXmin in the
mdcache_transaction_xmin variable. On subsequent queries we check the saved
xmin, and if it is valid and differs from the current one, we reset the cache.
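The reset decision can be sketched as follows. mdcache_transaction_xmin is the variable the patch introduces; the surrounding helper functions and types here are a hypothetical standalone model, not the patch's actual code:

```cpp
#include <cassert>
#include <cstdint>

using TransactionId = uint32_t;
constexpr TransactionId InvalidTransactionId = 0;

// Set when planning had to skip an index because of its xmin; records the
// TransactionXmin under which the skip happened.
static TransactionId mdcache_transaction_xmin = InvalidTransactionId;

// Hypothetical helper: remember the snapshot xmin under which an index
// was filtered out.
void RememberSkippedIndex(TransactionId currentXmin)
{
    mdcache_transaction_xmin = currentXmin;
}

// Hypothetical helper: the MD Cache must be reset when a previous query
// skipped an index under a different snapshot xmin -- the skipped index may
// be usable now, so the cached relation object (without it) is stale.
bool MDCacheNeedsReset(TransactionId currentXmin)
{
    return mdcache_transaction_xmin != InvalidTransactionId &&
           mdcache_transaction_xmin != currentXmin;
}
```

While TransactionXmin stays the same, the index is still unusable and the cached relation can be kept; only a changed xmin forces a reset.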

The create_index_hot test has also been changed: the optimizer is now turned
off before the update, since ORCA always uses Split Update, in which case HOT
chains are not created and the problem does not reproduce. This is why ORCA
wasn't actually being tested before.

@KnightMurloc KnightMurloc force-pushed the ADBDEV-4251 branch 2 times, most recently from 5d29ce0 to a3af2f5 Compare September 21, 2023 05:50
@BenderArenadata

Allure report https://allure-ee.adsw.io/launch/54747

@KnightMurloc KnightMurloc changed the title ADBDEV-4251: Fix isolation violation in case of index creation in another session. ADBDEV-4251: Fix use by ORCA of a newer index with HOT-chain in an older transaction. Sep 22, 2023
@BenderArenadata

Allure report https://allure-ee.adsw.io/launch/54823

@KnightMurloc KnightMurloc marked this pull request as ready for review September 22, 2023 09:52
…e current transaction as the postgres optimizer does

When updating a row in heap tables, a HOT (Heap-Only Tuple) chain is created
in which the old version of the row points to the new version. When an index
is then created on such a table (and another index already exists), the new
index's entries will point to the beginning of the chain (the old row version)
while being keyed on the new value. If a parallel transaction reads this table
using the new index, it may find the old row (for more info, see
src/backend/access/heap/README.HOT).

Example:
In the initial state, we have a tuple that the index points to.

In another session, we open a transaction with a repeatable read isolation level
and do a select.

In session 1, we update the tuple, thereby creating a HOT chain, and create an
index on the updated column. The new index will reference the first version of
the tuple.

If session 2 now searches the new index for the new value, it will find the
old tuple, since for that session the first version is still alive.

The PostgreSQL optimizer handles this situation correctly and ignores the new
index in the old transaction. ORCA does not.

This patch fixes this by adding a visibility check for the index in ORCA.
Also, in such a case, we mark the cache and the plan as transient, so that the
ORCA mdcache, and the cached plan in the case of a PREPARE, are reset at the
beginning of the next transaction.
@BenderArenadata

Allure report https://allure-ee.adsw.io/launch/55104


@BenderArenadata

Allure report https://allure-ee.adsw.io/launch/57898

@BenderArenadata

Failed job Regression tests with Postgres on ppc64le: https://gitlab.adsw.io/arenadata/github_mirroring/gpdb/-/jobs/760819

@BenderArenadata

Failed job Regression tests with Postgres on x86_64: https://gitlab.adsw.io/arenadata/github_mirroring/gpdb/-/jobs/760818

@BenderArenadata

Failed job Regression tests with ORCA on x86_64: https://gitlab.adsw.io/arenadata/github_mirroring/gpdb/-/jobs/760820

@BenderArenadata

Failed job Regression tests with ORCA on ppc64le: https://gitlab.adsw.io/arenadata/github_mirroring/gpdb/-/jobs/760821

…ly calling functions from gdb. Added cache invalidation test.
@BenderArenadata

Allure report https://allure-ee.adsw.io/launch/57910

@BenderArenadata

Failed job Regression tests with Postgres on ppc64le: https://gitlab.adsw.io/arenadata/github_mirroring/gpdb/-/jobs/761870

@BenderArenadata

Failed job Regression tests with Postgres on x86_64: https://gitlab.adsw.io/arenadata/github_mirroring/gpdb/-/jobs/761869

@KnightMurloc KnightMurloc force-pushed the ADBDEV-4251 branch 2 times, most recently from 4042701 to 3fb85e0 Compare December 14, 2023 08:29
@BenderArenadata

Allure report https://allure-ee.adsw.io/launch/60413

@BenderArenadata

Failed job Resource group isolation tests on x86_64: https://gitlab.adsw.io/arenadata/github_mirroring/gpdb/-/jobs/887142

@BenderArenadata

Failed job Resource group isolation tests on ppc64le: https://gitlab.adsw.io/arenadata/github_mirroring/gpdb/-/jobs/887143

@BenderArenadata

Allure report https://allure-ee.adsw.io/launch/60460

@BenderArenadata

Failed job Resource group isolation tests on x86_64: https://gitlab.adsw.io/arenadata/github_mirroring/gpdb/-/jobs/888541

@BenderArenadata

Failed job Resource group isolation tests on ppc64le: https://gitlab.adsw.io/arenadata/github_mirroring/gpdb/-/jobs/888542

@KnightMurloc KnightMurloc merged commit 5894018 into adb-6.x-dev Dec 15, 2023
3 of 5 checks passed
@KnightMurloc KnightMurloc deleted the ADBDEV-4251 branch December 15, 2023 05:47
@Stolb27 Stolb27 mentioned this pull request Mar 13, 2024
bandetto added a commit that referenced this pull request Mar 21, 2024
bandetto pushed a commit that referenced this pull request Mar 21, 2024
bandetto pushed a commit that referenced this pull request Mar 21, 2024
bandetto pushed a commit that referenced this pull request Mar 22, 2024
bandetto pushed a commit that referenced this pull request Mar 25, 2024
bandetto added a commit that referenced this pull request Apr 1, 2024
bandetto pushed a commit that referenced this pull request Apr 1, 2024
bandetto pushed a commit that referenced this pull request Apr 1, 2024
bandetto pushed a commit that referenced this pull request Apr 2, 2024
bandetto pushed a commit that referenced this pull request Apr 2, 2024
bandetto pushed a commit that referenced this pull request Apr 2, 2024
andr-sokolov pushed a commit that referenced this pull request Apr 4, 2024