Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow migrating blob/clob when it is known that the column size is relatively small #385

Open
amit-yb opened this issue Oct 3, 2022 · 3 comments
Assignees
Labels
enhancement New feature or request

Comments

@amit-yb
Copy link
Contributor

amit-yb commented Oct 3, 2022

No description provided.

@github-actions github-actions bot added the triage Needs to be triaged label Oct 3, 2022
@rahulb-yb rahulb-yb self-assigned this Oct 4, 2022
@amit-yb
Copy link
Contributor Author

amit-yb commented Oct 7, 2022

@rahulb-yb Please update this ticket with whatever decision we took about blob/clob support.

@rahulb-yb rahulb-yb added the documentation Improvements or additions to documentation label Oct 11, 2022
@rahulb-yb
Copy link
Collaborator

For the documentation:
BLOBs and CLOBs are not native data types in PG- they are closest represented by the native bytea and text datatypes respectively.
While BLOBs and CLOBs have better support for storage of data, bytea and text only allow upto 1 GB of data per cell, meaning that if the data rows we attempt to import contain >1GB of data, it would fail

As an alternative, PG provides support for LOB which are capable of holding BLOB/CLOB datatypes. This, however, is not yet supported in YB, making this alternative infeasible.

Finally, BLOB and CLOB data types can contain a lot of more data per cell as compared to most other native data types, which can cause timeout errors when combined with our logic for parallelism of import data- the size can potentially be greater by an order of upto 2^25 compared to most other data types, making it infeasible to attempt to migrate it alongside the rest of the data. Our current build already ignores importing the BLOB and CLOB data, but keeps the schema preserved, allowing the user to import their data onto the preserved schema by any means they wish

@rahulb-yb rahulb-yb added enhancement New feature or request and removed documentation Improvements or additions to documentation labels Jan 5, 2023
@shubham-yb
Copy link
Contributor

From a recent experiment it was observed that CLOB datatype row values are migrated but the upper limit of the datafile was seen to be 243 MB.
Any greater than that and the migration errors out.
Further details are in this sheet

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants