Overhaul the table identification system #93

RasmusSkytte · 2024-01-29T13:09:23Z

Intent

This PR introduces a number of changes -- including breaking -- to the systems using id().

I was experiencing problems with using a number of the SCDB functions programmatically over in diseasystore.
Which also led to me creating #92, since, in my opinion, table_exists() did not function intuitively as I expected.

When investigating this issue, I found several "design issues" as I would call them that this PR attempts to rectify.

The overall issue stems, in my view, from our table identification being too ambiguous. Specifically, I do not believe we live up to the mantra: Explicit is better than implicit

This PR therefore attempts to remove all ambiguities related to table identification.

Fixes #92

Approach

A lot of things are being done in this PR.

Here I list the changes as written in the NEWS:

BREAKING CHANGES:

Table identification is now more specific (#??):

Most SCDB functions allow for tables to be specified by a character representation of "schema.table".

Before, if no schema was implied in this context, SCDB would attempt to match the table among both
permanent and temporary tables.

Now, it will always assume that a lack of schema means the default schema should be used.
This is also the case if DBI::Id() is used without a schema specification.
The show_temporary argument of get_tables() is now a simple logical (#??).

In addition, schema is always returned in the list of table (no longer NA for default schema).
Tables created with create_table() will now be temporary or permanent dependent on the default value of
DBI::dbCreateTable() (#??).

If you wish to overwrite this, use ... arguments which are passed to DBI::dbCreateTable().
If a SQLiteConnection is passed to get_schema(), the returned schema will always be "main" (#??).

Features

The S3 method as.character.Id() is added which converts DBI::Id() to character (#??).

Improvements and Fixes

Improvements for create_table() (#??):
- now writes the table if a remote connection is given. Before, it would only create the
  table with corresponding columns.
- can now create temporary tables for Microsoft SQL Server.
get_tables() now supports temporary tables for Microsoft SQL Server (#??).

Testing

Added missing tests for create_logs_if_missing() (#??).

Improved tests for get_tables(), table_exists(), and create_table() (#??).

Known issues

This PR introduces breaking changes, since we change the behaviour of the table identification.

However, when testing these changes in diseasystore, they should more fairly just be called "fixes".
For the testing suite in diseasystore it means that I can now use table_exists() and create_table().
It also means I can simplify a lot of other code with the new id() behaviour.

Checklist

The PR passes all local unit tests
I have documented any new features introduced
If the PR adds a new feature, please add an entry in NEWS.md
A reviewer is assigned to this PR

BREAKING CHANGE: any missing schema information will now assume the default schema should be used.

BREAKING CHANGE

BREAKING CHANGE: `show_temporary` argument is now a simple logical and schema is always returned (no longer NA for default schema)

BREAKING CHANGE: This changes the default value to FALSE at present

This commit reverts parts of 6bbba53

NAMESPACE

marcusmunch

Looks good to me!

R/get_table.R

marcusmunch · 2024-01-31T12:36:55Z

R/get_table.R

+      )
+    }
+  } else {
+    rlang::abort("Only character or DBI::Id inputs to table_exists is allowed!")


Technically, a tbl_dbi input is also allowed (see the generic function), which could (for readability) also go into its own S3 method, but this works as-is.

tests/testthat/test-get_table.R

Co-authored-by: Marcus Munch <marcus@marcusmunch.dk>

RasmusSkytte and others added 30 commits January 24, 2024 08:32

fix: make table identification specific

0132538

BREAKING CHANGE: any missing schema information will now assume the default schema should be used.

fix: update assertions and checks

aec76e7

feat(table_exists): simplify function

16a0bdd

fix(table_exists): use default schema if none implied

a203762

feat(get_schema): default schema is always "main"

97b9909

BREAKING CHANGE

feat(get_tables): simplify the returned list of tables

d0fb205

BREAKING CHANGE: `show_temporary` argument is now a simple logical and schema is always returned (no longer NA for default schema)

test(id): update test to the new specific behavior

55dd157

feat(create_table): temporary follows dbCreateTable

4f6deb8

BREAKING CHANGE: This changes the default value to FALSE at present

fix(create_table): use dbWriteTable instead of dbCreateTable

cb5057b

test(create_table): add tests for created tables

39b8cf9

fix(create_table): add # to temporary tables (MSSQL)

4323346

feat: add support for temporary tables (MSSQL)

bdaa594

test(create_table): fix test clean up

8260d4d

test(create_table): adjust regex in error detection

364992f

test(table_exists): improve test skips when missing schemas

045298c

docs: update documentation

ef87496

test(get_tables): add test for temporary tables

3b0eec4

fix(id): handle unspecific tbl_dbi arguments

6bbba53

fix(get_tables): remove trailing underscores (MSSQL)

ee83cc2

fix(get_tables): keep dbplyr_### tables (PostgreSQL)

cbad0a0

fix(get_tables): get temporary tables (PostgreSQL)

b5eb465

feat(get_tables): extract temporary tables only once (MSSQL)

874beb8

test(get_tables): test for uniqueness

729075d

fix(create_table): use dbWriteTable's warning for existing tables

bff0678

test(create_logs_if_missing): add tests

18d479b

feat(create_logs_if_missing): use DBI::dbCreateTable

37cf1c8

fix(table_exists): improve heuristic for default schema

6192692

feat(id): add an as.character method for Id

926e700

docs(NEWS): update NEWS with changes

3d79e6f

docs(NEWS): add PR number for changes

f1905e6

chore: linting

c152495

RasmusSkytte self-assigned this Jan 29, 2024

RasmusSkytte added the enhancement New feature or request label Jan 29, 2024

RasmusSkytte mentioned this pull request Jan 29, 2024

Improve table identification ssi-dk/diseasystore#121

Merged

5 tasks

RasmusSkytte added 3 commits January 29, 2024 14:29

chore(Logger): revert changes to Logger

0bee928

This commit reverts parts of 6bbba53

chore(create_logs_if_missing): revert un-needed changes

6d71b82

docs(NEWS:) refer to dbCreateTable instead of dbWriteTable

fec923e

RasmusSkytte requested a review from marcusmunch January 29, 2024 15:16

marcusmunch reviewed Jan 31, 2024

View reviewed changes

NAMESPACE Show resolved Hide resolved

marcusmunch approved these changes Jan 31, 2024

View reviewed changes

Apply suggestions from code review

e56cc57

Co-authored-by: Marcus Munch <marcus@marcusmunch.dk>

marcusmunch approved these changes Jan 31, 2024

View reviewed changes

RasmusSkytte merged commit f22e842 into main Jan 31, 2024
24 checks passed

RasmusSkytte deleted the fix/id_consistency branch January 31, 2024 13:38

RasmusSkytte mentioned this pull request Feb 6, 2024

Implement consistent handling of temporary tables #76

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Overhaul the table identification system #93

Overhaul the table identification system #93

RasmusSkytte commented Jan 29, 2024 •

edited

marcusmunch left a comment

marcusmunch Jan 31, 2024

Overhaul the table identification system #93

Overhaul the table identification system #93

Conversation

RasmusSkytte commented Jan 29, 2024 • edited

Intent

Approach

BREAKING CHANGES:

Features

Improvements and Fixes

Testing

Known issues

Checklist

marcusmunch left a comment

Choose a reason for hiding this comment

marcusmunch Jan 31, 2024

Choose a reason for hiding this comment

RasmusSkytte commented Jan 29, 2024 •

edited