Skip to content

executing pragma statements on duckdb.query() #2223

Answered by Mytherin
deshpand asked this question in Q&A
Discussion options

You must be logged in to vote

Upon reading https://duckdb.org/2021/05/14/sql-on-pandas.html I'm very interested in running queries, specifically replacing pd.merge() with join sql (for now) and hoping for both multi threaded execution and less usage of memory. Currently, I'm seeing neither improvement. CPU usage during query seems to be 100% or lower indicating single thread use, the query takes more memory and runs longer than pd.merge().

The current implementation is limited in what can and can't be parallelized, as we are incrementally adding parallelism support to different operators in the engine. While joins can be fully parallelized, materializing to a Pandas DataFrame cannot be parallelized yet, and as such …

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@deshpand
Comment options

Answer selected by deshpand
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants