Clean up datastore emulator workarounds by gaborfeher · Pull Request #585 · googleapis/python-ndb

gaborfeher · 2020-12-12T00:18:23Z

This my proposed fix for: #584

The idea is to store a boolean to remember the type of the backend (emulator or real) during the whole life of the client. I am not super happy about this but I think it's still an improvement over the current way things work. There is already branching in the code based on the type of the backend, but it's basically a guesswork based on the behavior of the backend. And that seems to be fragile, just like my issues demonstrate. After this PR, the branching would be made explicit, based on the actual backend type.

An interesting side-effect of this change is that the change in line 181 actually also fixes #575 on my data, because the backend in my case doesn't send back an empty batch if the correct cursor is used in the query. However, based on #386 the backend might still send empty batches in some legit cases, and anyways, better safe than sorry, so I've kept the change in line 172.

Unfortunately this breaks some tests, I think mostly because conftest.in_context always creates a non-emulator context, but in some cases we'd need an emulator context.

So if you like this PR, please give me some pointers on how to parameterize conftest.in_context, or just feel free to take it over.

In any case, I think it would be a good idea to test the final PR on my data.

crwilcox · 2020-12-12T00:38:02Z

I am generally alright with this fix. Is there a reason to prefer adding the emulator or not to the context, and not to also lookup the os.environ in query? (just wondering if we need to store that context or not)

To be clear, I think we could leave it as is, just curious if there is a reason to prefer one over the other.

gaborfeher · 2020-12-12T01:41:14Z

Yeah, actually I like the idea of doing the environ lookup in the query because I think it would make testing simpler.

But I just had an additional idea: would it just make sense to fall back to the _count_brute_force implementation in case of the emulator?

crwilcox · 2020-12-12T01:50:47Z

would it just make sense to fall back to the _count_brute_force implementation in case of the emulator?

That might work (just looked at the code) 👍. I suppose the concern there might be other subtle things that might come along?

I think for now I would prefer moving to the os.environ based approach, just to avoid bringing context and additional state into things. Once you have pushed that I can kick off CI :)

gaborfeher · 2020-12-12T04:18:18Z

That might work (just looked at the code) +1. I suppose the concern there might be other subtle things that might come along?

I was more like thinking of performance issues, but on second thought, that really shouldn't be an issue.

…n up related hacks

crwilcox · 2020-12-15T20:58:38Z

                    more_results=_datastore_query.NOT_FINISHED,
                    skipped_results=1000,
                    entity_results=[],
-                    end_cursor=b"dontlookatme",


Looking at this change it seemed reasonable, but I am a bit surprised to see test changes. Can you explain why?

Short version: the buggy behavior of #575 & #584 was being guarded by tests.

Extended: in this PR I am basically restoring the behavior before #528 for the non-emulator case so I am more or less reverting the test changes of that PR as well.

More extended: these local line changes here and the other 1-2 line changes are there because of the following behavior was changed: between #528 and this PR, if an empty batch came from the datastore, then it was assumed that it's a datastore emulator and therefore skipped_cursor was used instead of end_cursor. I don't think that this is a good behavior and always the end_cursor was used before #528 so I think this should be a safe change.

And of course test_count_by_skipping_emulator had to be rewritten because the emulator behavior was changed entirely.

crwilcox · 2020-12-15T22:37:35Z

Thanks @gaborfeher. I would like if @chrisrossi or @andrewsg could weigh in as well.

andrewsg

Looks great. Thank you very much!

gaborfeher requested a review from andrewsg as a code owner December 12, 2020 00:18

google-cla Bot added the cla: yes This human has signed the Contributor License Agreement. label Dec 12, 2020

product-auto-label Bot added the api: datastore Issues related to the googleapis/python-ndb API. label Dec 12, 2020

gaborfeher mentioned this pull request Dec 12, 2020

fix: support empty not_finished messages that cause query.count() to return early #580

Merged

gaborfeher changed the title ~~Clean up the datastore emulator workarounds~~ Clean up datastore emulator workarounds Dec 12, 2020

crwilcox added the kokoro:run Add this label to force Kokoro to re-run the tests. label Dec 12, 2020

yoshi-kokoro removed the kokoro:run Add this label to force Kokoro to re-run the tests. label Dec 12, 2020

fix: always use brute-force counting with Datastore emulator and clea…

811aa9d

…n up related hacks

crwilcox added the kokoro:run Add this label to force Kokoro to re-run the tests. label Dec 13, 2020

yoshi-kokoro removed the kokoro:run Add this label to force Kokoro to re-run the tests. label Dec 13, 2020

crwilcox reviewed Dec 15, 2020

View reviewed changes

crwilcox approved these changes Dec 15, 2020

View reviewed changes

andrewsg approved these changes Dec 16, 2020

View reviewed changes

andrewsg merged commit 8480a8b into googleapis:master Dec 16, 2020

gaborfeher mentioned this pull request Dec 16, 2020

Infinite loop in _count_by_skipping #584

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clean up datastore emulator workarounds#585

Clean up datastore emulator workarounds#585
andrewsg merged 1 commit into
googleapis:masterfrom
gaborfeher:issue-575

gaborfeher commented Dec 12, 2020

Uh oh!

crwilcox commented Dec 12, 2020 •

edited

Loading

Uh oh!

gaborfeher commented Dec 12, 2020

Uh oh!

crwilcox commented Dec 12, 2020

Uh oh!

gaborfeher commented Dec 12, 2020

Uh oh!

crwilcox Dec 15, 2020

Uh oh!

gaborfeher Dec 15, 2020 •

edited

Loading

Uh oh!

crwilcox commented Dec 15, 2020

Uh oh!

andrewsg left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

gaborfeher commented Dec 12, 2020

Uh oh!

crwilcox commented Dec 12, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gaborfeher commented Dec 12, 2020

Uh oh!

crwilcox commented Dec 12, 2020

Uh oh!

gaborfeher commented Dec 12, 2020

Uh oh!

crwilcox Dec 15, 2020

Choose a reason for hiding this comment

Uh oh!

gaborfeher Dec 15, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

crwilcox commented Dec 15, 2020

Uh oh!

andrewsg left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

crwilcox commented Dec 12, 2020 •

edited

Loading

gaborfeher Dec 15, 2020 •

edited

Loading