Apply "unify bucket and key" before "provide bucket" #28710

dstandish · 2023-01-03T22:41:36Z

Previously if user provided full key it may be overwritten by conn bucket but now we fix ordering of decorators and this won't happen.

This fix doesn't seem to break backcompat because previously you'd get call(Bucket='bucket', Key='s3://other-bucket/file.txt') which should fail anyway.

depends on #28707

airflow/providers/amazon/aws/exceptions.py

feluelle

LGTM, but to me the tests are looking quite complex because of all the branches.

airflow/providers/amazon/aws/hooks/s3.py

tests/providers/amazon/aws/hooks/test_s3.py

dstandish · 2023-01-04T21:29:53Z

ok @o-nikolas and @feluelle i have updated these tests to move the "tokens" to individual params. it's not quite "putting the test values in the tuple", which, if i understood correctly, is what yall were thinking of (and what i more or less strenuously object to doing, myself anyway) but hopefully it's "orthodox enough" to not raise anyone's hackles (i kid) while still being relatively compact and readable. 🥳

o-nikolas · 2023-01-04T23:01:03Z

tests/providers/amazon/aws/hooks/test_s3.py

+def test_s3_head_object_decorated_behavior(mock_conn, has_conn, has_bucket, key_kind, expected):
+    if has_conn == "with_conn":


Thanks for making these changes @dstandish! It's very much appreciated 🙏 and is indeed what I was describing before. The only other improvement is that you can actually just make many of the params here simply booleans instead of strings. This would allow you to get rid of the string comparisons in the conditionals of the test body.

But of course feel free to do this or not :)

P.S. I tried, but unfortunately cannot, add a full suggested change for the above because the snippet includes deleted lines 927/928 and the GitHub UI does not allow that, and I didn't want to include a partial one so as to avoid confusion.

o-nikolas · 2023-01-04T23:13:39Z

ok @o-nikolas and @feluelle i have updated these tests to move the "tokens" to individual params. it's not quite "putting the test values in the tuple", which, if i understood correctly, is what yall were thinking of (and what i more or less strenuously object to doing, myself anyway)

Thanks @dstandish! And this was actually exactly what I was intending in the previous CR, apologies if we miscommunicated there 🙏
I only left one comment for a way to optimize it further (using booleans as the param values instead of strings for everything but expected). Feel free to do so or not 😃

feluelle · 2023-01-05T11:07:47Z

Thanks @dstandish. I appreciate that. :)
What I actually meant is 729e275. Feel free to revert if you don't like it, but this is how I would do it.

I think writing tests is not about being DRY. It much more important that they are readable and maintainable.

potiuk · 2023-01-05T11:11:39Z

Glad to see we got to common conclusions even after some initial strugles in communication :)

BTW.

I think writing tests is not about being DRY. It much more important that they are readable and maintainable.

Yep. 100% agree. DRY is important but for tests DAMP is importanter :D :D

https://enterprisecraftsmanship.com/posts/dry-damp-unit-tests

dstandish · 2023-01-05T23:20:15Z

Thanks @dstandish. I appreciate that. :)
What I actually meant is 729e275. Feel free to revert if you don't like it, but this is how I would do it.

By all means, gold plate this to your hearts content 😜

Now if we can figure out why tests failing....

Meanwhile I'm in the mountains and checked out for a few days 🌧️

Co-authored-by: Felix Uellendall <feluelle@users.noreply.github.com>

- resolve "no conn", "no bucket", "rel key" args before running the test

…arate tests" This reverts commit d696222.

This reverts commit e3e3db6.

dstandish · 2023-01-06T10:04:01Z

ok i think i have gotten the test issue fixed now... it was the result of reloading the s3 module. after that, mocks don't work.

separately i had a chance to look at the change @feluelle and i reverted, in small part cus it was failing :) but mainly because...

this is sort of why i objected to doing it that way.
with your way, we see a bunch of test values but we don't really know what they are there for. sure you could use param class and add an ID but that is a lot more noise. and it risks that the params differ from the description (id).
meanwhile, my params, to borrow a phrase, are "descriptive and meaningful phrases", e.g.
"unify", "no_conn", "no_bucket", "full_key"
this scenario covers the case unify first, with no connection (or no schema in connection), and no bucket provided as kwarg, and with a full key.
meanwhile, with yours we see
None, {"key": "s3://key_bucket/key.txt"}
if we look at this, how are we to know what it's testing? and when we look at all the params together, how do we know if we have missed a case?
with mine, we just need to verify that we all combinations of the 4 binary choices. when using the values directly, they don't quite combine that way.
and, this is also related to why i didn't want to make them booleans, which @o-nikolas suggested. by using the string values, we get good test names. test names that clearly indicate the scenario tested.
anyway, not that these other approaches aren't fine, just explaining my choices here.

thanks

feluelle · 2023-01-06T13:40:28Z

sure you could use param class and add an ID but that is a lot more noise.

Not to me, but okay.

Instead of

# full key
# no conn - no bucket - full key
(None, {"key": "s3://key_bucket/key.txt"}, ["key_bucket", "key.txt"]),

you would do:

param(None, {"key": "s3://key_bucket/key.txt"}, ["key_bucket", "key.txt"], id="unify-no_conn-no_bucket-full_key"),

and you get more descriptive output in pytest.

and it risks that the params differ from the description (id).

Isn't this the same as with comments (we have above each line)? And comments also can be valuable. You just need to make sure to update them accordingly.

dstandish · 2023-01-06T19:01:10Z

Isn't this the same as with comments (we have above each line)? And comments also can be valuable. You just need to make sure to update them accordingly.

yes absolutely it is, and this is a good reminder to remove them, which i've now done. i also rearranged things so that the truth table is built in the normal, sane way, and unify vs provide remain innermost (since those are what we are actually comparing here). now it's much easier to see that we have everything covered, and we no longer need the comments.

dstandish requested a review from eladkal as a code owner January 3, 2023 22:41

boring-cyborg bot added area:providers provider:amazon-aws AWS/Amazon - related issues labels Jan 3, 2023

dstandish force-pushed the actually-fix-aws-s3-dec-pref branch from 395045e to 9739f4d Compare January 4, 2023 00:29

dstandish requested review from Taragolis, feluelle and o-nikolas January 4, 2023 05:31

uranusjr reviewed Jan 4, 2023

View reviewed changes

airflow/providers/amazon/aws/exceptions.py Show resolved Hide resolved

dstandish force-pushed the actually-fix-aws-s3-dec-pref branch from b1f5d0e to eead6c3 Compare January 4, 2023 07:37

feluelle reviewed Jan 4, 2023

View reviewed changes

airflow/providers/amazon/aws/hooks/s3.py Outdated Show resolved Hide resolved

tests/providers/amazon/aws/hooks/test_s3.py Outdated Show resolved Hide resolved

tests/providers/amazon/aws/hooks/test_s3.py Outdated Show resolved Hide resolved

feluelle reviewed Jan 4, 2023

View reviewed changes

tests/providers/amazon/aws/hooks/test_s3.py Outdated Show resolved Hide resolved

dstandish force-pushed the actually-fix-aws-s3-dec-pref branch from 5a61f84 to 322f18e Compare January 4, 2023 21:15

o-nikolas approved these changes Jan 4, 2023

View reviewed changes

dstandish and others added 12 commits January 5, 2023 23:22

Add tests documenting behavior when s3 decorators combined

c2f394c

consolidate to one test

9edd5b5

Add test for actual s3 method

12b770e

fix name

90f3cff

docstring

42e38fb

Apply unify bucket and key before provide bucket

fe83218

fix test

9eab4c7

fix tests

8283470

touchup

b3d611a

Update airflow/providers/amazon/aws/hooks/s3.py

264a6b5

Co-authored-by: Felix Uellendall <feluelle@users.noreply.github.com>

cleanup tests

4bf7fed

add default msg

f4dbce5

Split test_unify_and_provide_bucket_name_combination into separate tests

d696222

- resolve "no conn", "no bucket", "rel key" args before running the test

dstandish force-pushed the actually-fix-aws-s3-dec-pref branch from 729e275 to d696222 Compare January 6, 2023 07:22

dstandish added 4 commits January 5, 2023 23:32

Revert "Split test_unify_and_provide_bucket_name_combination into sep…

b94aee9

…arate tests" This reverts commit d696222.

fix mock

e3e3db6

don't reload in test

7373a56

Revert "fix mock"

bb26130

This reverts commit e3e3db6.

dstandish added 2 commits January 6, 2023 10:37

remove unnec comments and order the truth table in more standard way

1eef449

add documentation and truth table ordedring fixup

14ab709

dstandish merged commit 3eee33a into apache:main Jan 6, 2023

dstandish deleted the actually-fix-aws-s3-dec-pref branch January 6, 2023 19:31

eladkal mentioned this pull request Jan 14, 2023

Status of testing Providers that were prepared on January 14, 2023 #28938

Closed

37 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Apply "unify bucket and key" before "provide bucket" #28710

Apply "unify bucket and key" before "provide bucket" #28710

dstandish commented Jan 3, 2023 •

edited

feluelle left a comment

dstandish commented Jan 4, 2023

o-nikolas Jan 4, 2023

o-nikolas commented Jan 4, 2023

feluelle commented Jan 5, 2023

potiuk commented Jan 5, 2023

dstandish commented Jan 5, 2023

dstandish commented Jan 6, 2023 •

edited

feluelle commented Jan 6, 2023

dstandish commented Jan 6, 2023

		def test_s3_head_object_decorated_behavior(mock_conn, has_conn, has_bucket, key_kind, expected):
		if has_conn == "with_conn":

Apply "unify bucket and key" before "provide bucket" #28710

Apply "unify bucket and key" before "provide bucket" #28710

Conversation

dstandish commented Jan 3, 2023 • edited

feluelle left a comment

Choose a reason for hiding this comment

dstandish commented Jan 4, 2023

o-nikolas Jan 4, 2023

Choose a reason for hiding this comment

o-nikolas commented Jan 4, 2023

feluelle commented Jan 5, 2023

potiuk commented Jan 5, 2023

dstandish commented Jan 5, 2023

dstandish commented Jan 6, 2023 • edited

feluelle commented Jan 6, 2023

dstandish commented Jan 6, 2023

dstandish commented Jan 3, 2023 •

edited

dstandish commented Jan 6, 2023 •

edited