Add ObjectName class for representing myschema.myobject references #42

zcmarine · 2018-07-02T03:42:37Z

This is the worst PR I've ever made. To try to justify this ugly mess so I don't feel so lousy: the choice was between one huge, incomprehensible PR (this) and 48 tiny PRs that were comprehensible and would leave the repo in a good place. My reasons for going this route were:

This PR represents the "atomic" change we're looking for: moving from not using ObjectName instances to using them everywhere. Each of the 48 intermediate commits would all leave the repo in a weird half-and-half state, though admittedly the tests pass at each commit.
48 PRs would have resulted in a LOT of back-and-forth and a ton of time. Given that I leave for several months of vacation in a few days, I was pushing for speed over proper process.
Everything in this PR is changing plumbing. Basically, where we were using strings before use an instance of a class. While on the one hand it may have been a way to get people to know the plumbing of pgbedrock, it's also a way to burn a lot of goodwill and kill people's interest in reviewing PRs. Rather than having people review plumbing, I'd prefer to talk about features and non-trivial changes.
Although it's possible that I may have screwed something up with my changes to the plumbing, we have 96% test coverage and I've tested this against Squarespace's relatively complex use case, so I feel pretty good about the fact that the changes I made didn't break anything. And if they do (which would suck), we have prior docker images and pypi versions available.

With the above given as an attempt to exonerate myself from guilt (which I still feel), I think it'd be useful to discuss this PR in person and see if there's anything else that would be useful to do for due diligence. Actually reviewing the medley of changes seems like a bad use of time.

This includes getting the following functions to all work with DBObjects: * determine_nonschema_privileges_for_schema * collapse_personal_schemas * add_privileges

…bjects

Doing schema."table" makes sense for real objects, but * isn't a real object. As a result, doing foo."*" is confusing: it seems like we're talking about a real object name '*', but we're not. To avoid this confusion, we output foo."bar" for everything except when the object_name is a *, in which case we do foo.*

DBObject is used in many spots, so it makes sense to expose this in common.py Also, originally this was two commits: * move DBObject to common.py * Rename DBObject to ObjectName However, during an interactive rebase I accidentally squashed this into one commit and am gonna just leave it that way.

ObjectName.object_name is confusing (it should be redundant but isn't: we're referring to the object name without a schema). Unfortunately, Postgres refers to entities as "objects", so we don't want to just called things ObjectName.object either. As a result, the clearest name is probably just ObjectName.unqualified_name, so we switch to that.

When a test fails it is useful to see what the ObjectName instance represents. Having a repr facilitates that.

It is better practice for us to amend our custom Dumper class rather than amend the default one, which may introduce hard-to-trace issues in other tests or downstream in the codebase.

All lists within the owns and privileges subdicts will be converted to ObjectNames. If there are any empty entries, e.g.: roleA: owns: tables: ... Then these empty entries get converted to None by PyYAML. Rather than override this, I'm leaving this behavior as: 1) It adds a some complexity to try to guard against Nones in a bunch of spots. 2) More than the complexity, it makes the code less readable 3) And most importantly, the point of pgbedrock is to make a spec simply to read and follow. Those empty entries are unnecessary cruft. As a result, it is both easier on our end and more correct based on this tool's philosophical purpose to not try to convert entries from None to an empty list. Originally, I had tried to use a customer PyYAML constructor to automatically convert these sublists from strings to ObjectName instances, but I found PyYAML confusing and complicated enough that whoever would have to maintain this code if it breaks would probably end up quite confused. Because of that it seemed wiser to just let PyYAML do its thing and then we come in with regular Python and convert the relevant parts of the loaded spec into ObjectName instances.

ensure_quoted_identifier() was necessary before we had ObjectName instances. Now it is unused and can be removed.

…stances

This shouldn't make a difference but is technically more correct. In addition, it is a good way to verify that we are indeed passing in ObjectName instances that represent schemas and not non-schema objects.

coveralls · 2018-07-02T03:46:08Z

Pull Request Test Coverage Report for Build 163

0 of 0 changed or added relevant lines in 0 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+1.3%) to 97.235%

Totals
Change from base Build 158:	1.3%
Covered Lines:	1266
Relevant Lines:	1302

💛 - Coveralls

coveralls · 2018-07-02T03:46:08Z

Pull Request Test Coverage Report for Build 163

0 of 0 changed or added relevant lines in 0 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+1.3%) to 97.235%

Totals
Change from base Build 158:	1.3%
Covered Lines:	1266
Relevant Lines:	1302

💛 - Coveralls

cpdean

This is quite large.

It might have been better to try to have a transition period between the two internal API's (string based vs objectname based), but I guess sometimes you gotta pull that bandaid off fast.

I've traced a few codepaths and nothing seems wrong -- wherever there used to be a string there is now an ObjectName. I thought I almost caught a few things you missed, but I was wrong. Like -- you started using ObjectNames as keys in a dict, which my reflex is that this would be a bug, but I see you implemented __hash__ so that's fine, etc.

Some day we'll have static types and big PR's like this will feel trivial ❤️.

Glad to see how far this project has come.

Best,
Conrad

Zach Marine and others added 30 commits June 9, 2018 17:31

Fix misnamed variable (default --> nondefault)

f4ab393

Add ObjectName class and tests for it

f7d39db

Temporarily do schema."table" for DBObjects to ease migration to it

aa31631

Return schema and object name in Q_GET_ALL_CURRENT_NONDEFAULTS

6afb541

Use NamedRow as convention for namedtuples from Postgres

e88fa48

Return schema and object name in Q_GET_ALL_RAW_OBJECT_ATTRIBUTES

e2ab302

Have dbcontext.get_all_current_nondefaults() return DBObjects

1542c96

Have dbcontext.get_role_current_nondefaults() return DBObjects

71069eb

Have dbcontext.get_role_objects_with_access() return DBObjects

cf44e17

Replace for loop with comprehension for readability

fb1de6c

Have dbcontext.get_all_raw_object_attributes() return DBObjects

a562521

Rename ObjectAttributes field to show it is a DBObject

3bda1d3

Have dbcontext.get_all_nonschema_objects_and_owners() return DBObjects

d6d98ee

Move 2 lines closer to where they are used

6daccae

Have ownerships.SchemaAnalyzer fully work via DBObjects

15f0ef1

Have core_generate use DBObjects

a7b6c5e

This includes getting the following functions to all work with DBObjects: * determine_nonschema_privileges_for_schema * collapse_personal_schemas * add_privileges

Have dbcontext.get_all_personal_schemas() return DBObjects

c6ad65b

Have get_all_object_attributes use DBObjects

37a5d21

Have NonschemaAnalyzer use DBObjects

90791c8

Have rest of ownerships.py use DBObjects

681aaa6

Have dbcontext.get_all_schemas_and_owners use DBObjects

aac978f

Have core_generate.determine_nonschema_privileges_for_schema used DBO…

0ef7138

…bjects

Have determine_schema_privileges() use DBObjects

87e0bf3

Have determine_schema_privileges() return DBObjects

8962955

Make DBObjects sortable

28f2046

Natively convert DBObjects during yaml.dump

3558d87

Remove obsolete if-else crutch

5eefa3e

zcmarine added 19 commits June 12, 2018 15:53

Add ObjectName.__repr__ for easier debugging

b24cf25

When a test fails it is useful to see what the ObjectName instance represents. Having a repr facilitates that.

Add representers to FormattedDumper

db3ed92

It is better practice for us to amend our custom Dumper class rather than amend the default one, which may introduce hard-to-trace issues in other tests or downstream in the codebase.

Use ObjectNames from yaml.load through whole tool

7758bc6

Remove obsolete function

3d31538

ensure_quoted_identifier() was necessary before we had ObjectName instances. Now it is unused and can be removed.

Have dbcontext.get_schema_objects() take an ObjectName

b3ba125

Have dbcontext.get_all_nonschema_objects_and_owners use ObjectName in…

727d84e

…stances

Use qualified_name for schema objects

9626e7a

This shouldn't make a difference but is technically more correct. In addition, it is a good way to verify that we are indeed passing in ObjectName instances that represent schemas and not non-schema objects.

Have determine_schema_writers use ObjectNames

aecdd19

Have determine_desired_defaults use ObjectNames

0d41fb1

Have analyze_defaults diff ObjectNames

1857146

Have get_role_current_defaults use ObjectNames

4695dbb

Have grant/revoke_default() use ObjectNames

2d849e5

Have has_default_privilege use ObjectNames

de2a388

Remove non-ObjectName references in identify_desired_objects()

007c469

Have grant/revoke_nondefault use ObjectNames

4b13744

Compare ObjectNames instead of strings

4fdcbb7

Have get_role_objects_with_access use ObjectNames

b4677ce

Fix bug with str and ObjectNames both in list

d2a10af

zcmarine requested a review from cpdean July 2, 2018 03:42

zcmarine changed the title ~~Zcm/class for object quoting~~ Add ObjectName class for representing myschema.myobject references Jul 2, 2018

cpdean approved these changes Jul 2, 2018

View reviewed changes

zcmarine merged commit 713f015 into Squarespace:master Jul 2, 2018

zcmarine deleted the zcm/class_for_object_quoting branch July 2, 2018 17:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ObjectName class for representing myschema.myobject references #42

Add ObjectName class for representing myschema.myobject references #42

zcmarine commented Jul 2, 2018

coveralls commented Jul 2, 2018

coveralls commented Jul 2, 2018 •

edited

Loading

cpdean left a comment

Add ObjectName class for representing myschema.myobject references #42

Add ObjectName class for representing myschema.myobject references #42

Conversation

zcmarine commented Jul 2, 2018

coveralls commented Jul 2, 2018

Pull Request Test Coverage Report for Build 163

💛 - Coveralls

coveralls commented Jul 2, 2018 • edited Loading

Pull Request Test Coverage Report for Build 163

💛 - Coveralls

cpdean left a comment

Choose a reason for hiding this comment

coveralls commented Jul 2, 2018 •

edited

Loading