feat: Fixing and refactoring transaction retry logic in dbapi. Also adding interceptors support for testing #1056

ankiaga · 2023-12-15T01:32:51Z

We found couple of bugs in the existing retry logic, the major one being b/315807641
Other one being assuming tuple to be a list (line 316 in old connection.py) and comparing which would never work.
These issues were not caught as there are no proper tests for these.

So this PR is fixing those issues. Also the current code was not very easy to understand so this PR also refactor the retry logic from connection.py to transaction_helper.py along with the fixes. Tests from test_connection.py have also moved to test_transaction_helper.py along with many new tests (mainly on batch statements) added

Also added interceptors support so we can test retry logic in system tests. Added system tests for the same

olavloite

Would you mind adding a description to the PR what the faulty behavior was that you found, and how you fixed it? The PR is relatively big and complex, so it would be good to have some context.

Also: After an initial pass, it is very clear that this feature does not have a good enough test coverage. There are some big problems with the current implementation (probably much of it was already there), and the fact that none of the tests have caught that is a clear sign that it is not tested well enough.

tests/system/test_dbapi.py

tests/unit/spanner_dbapi/test_connection.py

google/cloud/spanner_dbapi/transaction_helper.py

google/cloud/spanner_dbapi/batch_dml_executor.py

google/cloud/spanner_dbapi/connection.py

google/cloud/spanner_dbapi/cursor.py

google/cloud/spanner_dbapi/transaction_helper.py

google/cloud/spanner_dbapi/cursor.py

google/cloud/spanner_dbapi/transaction_helper.py

tests/unit/spanner_dbapi/test_connection.py

google/cloud/spanner_v1/testing/database_test.py

tests/system/test_dbapi.py

google/cloud/spanner_dbapi/transaction_helper.py

olavloite · 2024-01-09T11:57:26Z

My general impression is that we need a more extensive test suite for transaction retries. Tests that I would like to see are:

Tests that show that a retry fails if the underlying data has changed.
Tests that show that a retry succeeds if the underlying data has remained unchanged. The test should include a query that selects data from a table that uses all data types that are supported by Cloud Spanner. This will show that the checksum calculation works correctly for all data types.
Tests that verify that the same retry as in test 2 fails if one of the columns in the result has changed. This also verifies that the checksum calculation is 'correct' for all data types in the sense that it produces a different outcome if a value has changed.
(Potentially in a separate PR) A test that shows that a Batch DML that contains N elements and that successfully executes M=N-X of the DML statements and returns an error for statement number M+1, is correctly handled. That is: During a retry, both the M update counts + the error must be the same for the retry to be successful, and otherwise it should fail.
(Potentially in a separate PR) A test that shows that a fetchxxx call that returns an Aborted error is correctly handled. 'Correctly' in this case means either:
5.1. Propagates the Aborted error to the user without retrying
5.2. OR: Internally retries the transaction and continues the cursor from the same point.

ankiaga · 2024-01-10T11:40:41Z

My general impression is that we need a more extensive test suite for transaction retries. Tests that I would like to see are:

Tests that show that a retry fails if the underlying data has changed.

Tests that show that a retry succeeds if the underlying data has remained unchanged. The test should include a query that selects data from a table that uses all data types that are supported by Cloud Spanner. This will show that the checksum calculation works correctly for all data types.

Tests that verify that the same retry as in test 2 fails if one of the columns in the result has changed. This also verifies that the checksum calculation is 'correct' for all data types in the sense that it produces a different outcome if a value has changed.

(Potentially in a separate PR) A test that shows that a Batch DML that contains N elements and that successfully executes M=N-X of the DML statements and returns an error for statement number M+1, is correctly handled. That is: During a retry, both the M update counts + the error must be the same for the retry to be successful, and otherwise it should fail.

(Potentially in a separate PR) A test that shows that a fetchxxx call that returns an Aborted error is correctly handled. 'Correctly' in this case means either:
5.1. Propagates the Aborted error to the user without retrying
5.2. OR: Internally retries the transaction and continues the cursor from the same point.

Thanks for spending time and figuring out all tests missing here. As discussed created a bug https://buganizer.corp.google.com/issues/319404848 for this

…dding interceptors support for testing

… the statements details added for retry

…ws update count for batch dml in cursor

…helper.py

…terceptor

olavloite · 2024-01-16T09:38:13Z

Thanks for your patience and perseverance on this 👍

ankiaga requested review from a team as code owners December 15, 2023 01:32

product-auto-label bot added size: xl Pull request size is extra large. api: spanner Issues related to the googleapis/python-spanner API. labels Dec 15, 2023

ankiaga requested review from olavloite, aseering and manu2 December 15, 2023 02:42

olavloite reviewed Dec 15, 2023

View reviewed changes

ankiaga added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Dec 16, 2023

yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Dec 16, 2023

ankiaga changed the title ~~fix: Fixing and refactoring transaction retry logic in dbapi~~ feat: Fixing and refactoring transaction retry logic in dbapi. Also adding interceptors support for testing Dec 20, 2023

ankiaga force-pushed the retry branch 3 times, most recently from 72ec835 to 5e013ca Compare December 27, 2023 10:53

olavloite reviewed Jan 2, 2024

View reviewed changes

ankiaga requested a review from pratickchokhani January 9, 2024 07:26

olavloite reviewed Jan 9, 2024

View reviewed changes

ankiaga added 7 commits January 10, 2024 18:11

feat: Fixing and refactoring transaction retry logic in dbapi. Also a…

454e7b9

…dding interceptors support for testing

Comments incorporated and changes for also storing Cursor object with…

0f86433

… the statements details added for retry

Some refactoring of transaction_helper.py and maintaining state of ro…

42e6e57

…ws update count for batch dml in cursor

Small fix

d7ef54f

Maintaining a map from cursor to last statement added in transaction_…

721a9ae

…helper.py

Rolling back the transaction when Aborted exception is thrown from in…

86db58a

…terceptor

Small change

bad9abe

ankiaga force-pushed the retry branch from c04bf2c to bad9abe Compare January 10, 2024 12:43

ankiaga and others added 3 commits January 10, 2024 19:50

Disabling a test for emulator run

9fda104

Reformatting

a971d33

Merge branch 'main' into retry

7e22622

olavloite approved these changes Jan 16, 2024

View reviewed changes

ankiaga added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jan 16, 2024

yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jan 16, 2024

ankiaga enabled auto-merge (squash) January 16, 2024 10:32

Merge branch 'main' into retry

b05d87b

ankiaga added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jan 17, 2024

yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Jan 17, 2024

ankiaga merged commit 6640888 into googleapis:main Jan 17, 2024
12 of 13 checks passed

release-please bot mentioned this pull request Jan 17, 2024

chore(main): release 3.42.0 #1079

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Fixing and refactoring transaction retry logic in dbapi. Also adding interceptors support for testing #1056

feat: Fixing and refactoring transaction retry logic in dbapi. Also adding interceptors support for testing #1056

ankiaga commented Dec 15, 2023 •

edited

olavloite left a comment

olavloite commented Jan 9, 2024

ankiaga commented Jan 10, 2024

olavloite commented Jan 16, 2024

feat: Fixing and refactoring transaction retry logic in dbapi. Also adding interceptors support for testing #1056

feat: Fixing and refactoring transaction retry logic in dbapi. Also adding interceptors support for testing #1056

Conversation

ankiaga commented Dec 15, 2023 • edited

olavloite left a comment

Choose a reason for hiding this comment

olavloite commented Jan 9, 2024

ankiaga commented Jan 10, 2024

olavloite commented Jan 16, 2024

ankiaga commented Dec 15, 2023 •

edited