You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I noticed there was a C-API for querying results as an arrow format (added in #1978), but I don't see any support currently in the c-api or the C or C++ appenders for bulk inserting arrow/columnar data into the database.
Currently I have several billion rows of data I would like to bulk insert that is already in columnar form in memory and the only interface that I'm aware of are the row-wise appenders. The performance of inserting into a single table is around 50,000 rows per second (on older hardware) - I'm assuming that this translation back-and-forth is likely a bottleneck.
It doesn't appear that the duckdb_data_chunk has support for this either.
The text was updated successfully, but these errors were encountered:
You should be able to use duckdb_append_data_chunk to do batch/vectorized appends, which should indeed be much more efficient than the scalar functions.
The documentation for DataChunk isn't entirely clear that data can be written to it, the free functions documented all appear to be used for reading data (with the exception of duckdb_vector_assign_string_element).
Presumably the methods on the DataChunk object itself can be used to write various data types
I noticed there was a C-API for querying results as an arrow format (added in #1978), but I don't see any support currently in the c-api or the C or C++ appenders for bulk inserting arrow/columnar data into the database.
Currently I have several billion rows of data I would like to bulk insert that is already in columnar form in memory and the only interface that I'm aware of are the row-wise appenders. The performance of inserting into a single table is around 50,000 rows per second (on older hardware) - I'm assuming that this translation back-and-forth is likely a bottleneck.
It doesn't appear that the duckdb_data_chunk has support for this either.
The text was updated successfully, but these errors were encountered: