Allow use of schema argument for project and dataset #63

vinceatbluelabs · 2020-08-12T19:05:12Z

This is a follow-up to #53. It implements the expanded project and dataset parsing behavior requested during that review.

Tests have been updated and pass.

vinceatbluelabs · 2020-08-12T19:07:15Z

README.rst

+    # If neither dataset nor project are the default
+    sample_table_1 = Table('natality', schema='bigquery-public-data.samples')
+    # If just dataset is not the default
+    sample_table_2 = Table('natality', schema='bigquery-public-data')


As discussed in #53, I believe it's more in the spirit of SQLAlchemy, and of the common SQL database notion of a 'schema name', to specify the location (in BigQuery-speak, the project and dataset) in the schema argument, rather than encouraging people to put it in the table name.

As per previous discussion, the code will continue to accept dataset and project specified in the table name to avoid breaking existing users, and there are regression tests added for that usage.

vinceatbluelabs · 2020-08-12T19:08:33Z

pybigquery/sqlalchemy_bigquery.py

        elif len(table_name_split) == 3:
            project, dataset, table_name = table_name_split
+        else:
+            raise ValueError("Did not understand table_name: {}".format(full_table_name))


I used .format() instead of f-strings since I assume you want to support older Python versions. The underlying BigQuery libraries seemed to use ValueError for SQL parsing issues, so I assumed this was a good exception type.
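For readers following along, here is a minimal sketch of the table-name splitting that the `else` branch above guards. Only the three-part branch and the `ValueError` appear in the diff; the function name `split_table_name` and the one- and two-part branches are assumptions added for illustration, not necessarily the merged code.

```python
def split_table_name(full_table_name):
    """Split 'table', 'dataset.table', or 'project.dataset.table' into parts.

    Hypothetical helper: the 1- and 2-part branches are assumed here;
    only the 3-part branch and the error are shown in the diff.
    """
    project = None
    dataset = None
    table_name_split = full_table_name.split(".")
    if len(table_name_split) == 1:
        # Bare table name: project and dataset come from defaults elsewhere.
        table_name = full_table_name
    elif len(table_name_split) == 2:
        dataset, table_name = table_name_split
    elif len(table_name_split) == 3:
        project, dataset, table_name = table_name_split
    else:
        # More than three dotted parts cannot be mapped onto
        # project.dataset.table, so fail loudly.
        raise ValueError(
            "Did not understand table_name: {}".format(full_table_name)
        )
    return project, dataset, table_name
```

Calling `split_table_name("a.b.c.d")` raises `ValueError`, which is the case the new branch covers.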

vinceatbluelabs · 2020-08-12T19:10:12Z

pybigquery/sqlalchemy_bigquery.py

+        table_ref = TableReference.from_string("{}.{}.{}".format(
+            project_id, dataset_id, table_id
+        ))
+        return table_ref


I broke out this function so it could be easily unit tested with the various valid combinations of specifying project/dataset/table in the schema and table arguments. I test-drove this, so every line was added in response to a corresponding test case.
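As a rough illustration of the kind of resolution being tested, the sketch below builds the dotted `project.dataset.table` string that would be handed to `TableReference.from_string`. The function name `resolve_table_path`, its parameters, and the fallback rules are hypothetical simplifications: it assumes a bare table name and treats a one-part schema as a dataset. The merged code may resolve some combinations differently.

```python
def resolve_table_path(schema, table_name, default_project, default_dataset=None):
    """Return 'project.dataset.table' for a schema argument and a bare table name.

    Hypothetical sketch, not the dialect's actual method. Assumed rules:
    schema 'project.dataset' sets both, schema 'dataset' sets only the
    dataset, and anything missing falls back to the defaults.
    """
    project = default_project
    dataset = default_dataset
    if schema:
        parts = schema.split(".")
        if len(parts) == 1:
            dataset = parts[0]
        elif len(parts) == 2:
            project, dataset = parts
        else:
            raise ValueError("Did not understand schema: {}".format(schema))
    if dataset is None:
        # Neither the schema nor the defaults supplied a dataset.
        raise ValueError("No dataset given for table: {}".format(table_name))
    return "{}.{}.{}".format(project, dataset, table_name)
```

Keeping this logic in a function that takes plain strings means the combinations can be exercised without a live BigQuery client.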

vinceatbluelabs · 2020-08-12T19:10:42Z

test/test_sqlalchemy_bigquery.py

+    with pytest.raises(ValueError):
+        dialect._table_reference(provided_schema_name,
+                                 provided_table_name,
+                                 client_project)


Above are the test cases! Let me know if you think of any other scenarios we need to cover.

vinceatbluelabs · 2020-08-20T14:48:10Z

Hi @tswast! Let me know what you think.

tswast · 2020-08-27T14:18:33Z

Thanks for the contribution! It'll probably be another week or two before I have time to review this. Thanks for your patience.

tswast

Wonderful! Thanks so much for your contribution.

vinceatbluelabs added 10 commits August 11, 2020 17:30

Refactor and add test of table reference determination

579ab81

Add dialect fixture

5823b69

Add initial test cases and logic

75e6b6b

Refactor

a6631e1

Refactor

1319c1a

Remove redundant example

908fc5e

Support project.dataset in schema argument

51b5a53

Add error validation test cases

c372917

Drop f-strings for compatibility

545d0eb

Update README to match current preferred behavior

12fb2f3

tswast self-requested a review September 16, 2020 16:24

tswast approved these changes Nov 18, 2020

tswast merged commit 72fbe2b into googleapis:master Nov 18, 2020
