Skip to content

Airbyte ingestion improvements#513

Merged
mildbyte merged 1 commit intomasterfrom
bugfix/airbyte-image-commit-CU-pvg55y
Aug 3, 2021
Merged

Airbyte ingestion improvements#513
mildbyte merged 1 commit intomasterfrom
bugfix/airbyte-image-commit-CU-pvg55y

Conversation

@mildbyte
Copy link
Copy Markdown
Contributor

@mildbyte mildbyte commented Aug 3, 2021

When storing Airbyte raw tables, change their PK to be (ingestion timestamp, row UUID) to make storage/querying more efficient.

This stops Splitgraph from thinking that the chunks we produce are overlapping because the UUIDs are effectively random (the ideal solution would be changing the storage format and somehow noting this during commit -- 0b139a7 has more background on why this was disabled, but here, as we know the chunks aren't overlapping, we could manually mark them?)

…estamp, row UUID) to make storage/querying more efficient.

This stops Splitgraph from thinking that the chunks we produce are overlapping because the UUIDs are effectively random (the ideal solution would be changing the storage format and somehow noting this during commit -- 0b139a7 has more background on why this was disabled, but here, as we know the chunks aren't overlapping, we could manually mark them?)
@mildbyte mildbyte merged commit 3720669 into master Aug 3, 2021
mildbyte added a commit that referenced this pull request Aug 18, 2021
  * Various Airbyte ingestion improvements and support for different normalization modes, including a custom dbt model (#510, #513, #514)
  * Fix mount for data source with empty credentials schema (#515)
  * Fix `sgr cloud load`/`dump` (#520)

Full set of changes: [`v0.2.15...v0.2.16`](v0.2.15...v0.2.16)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant