Fix for #8440 #8879
Conversation
I never thought that this caused a bug, but I did think it was dirty to preserve the Arrow data if direct conversion could not be performed 👍 Actually, can we only populate this map inside of DirectConversion? That would solve both issues.
Strings don't really go through the |
Currently we create the AuxiliaryData to preserve the Arrow array before we scan the column. We only need to preserve this Arrow array if DirectConversion was used, though; otherwise we will have made a copy of the input anyway. In cases where we don't need to preserve the Arrow array, we currently still do |
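To make the point above concrete, here is a minimal sketch of the idea: hold a reference to the Arrow chunk only when the column borrows its memory (direct conversion), and hold nothing when the values were copied. This is not the actual DuckDB code; `ArrowChunk`, `Column`, and `ConvertColumn` are hypothetical names made up for illustration, and the `direct` flag stands in for whatever decides whether direct conversion applies.

```cpp
#include <cassert>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical stand-in for an Arrow chunk that owns its buffers.
struct ArrowChunk {
    std::vector<int32_t> values;
};

// Hypothetical stand-in for a converted column.
struct Column {
    std::vector<int32_t> owned;             // filled only when the conversion copies
    std::shared_ptr<ArrowChunk> keep_alive; // set only for zero-copy (direct) conversion

    // Returns the borrowed buffer if we hold the chunk, else our own copy.
    const int32_t *Data() const {
        return keep_alive ? keep_alive->values.data() : owned.data();
    }
};

// Assumption: `direct` tells us whether the column can alias the Arrow buffer.
Column ConvertColumn(const std::shared_ptr<ArrowChunk> &chunk, bool direct) {
    assert(chunk);
    Column col;
    if (direct) {
        // Zero-copy: the column aliases the chunk, so it must keep it alive.
        col.keep_alive = chunk;
    } else {
        // Copying conversion: the chunk is no longer needed afterwards.
        col.owned = chunk->values;
    }
    return col;
}
```

With this shape, a copied column never extends the lifetime of the chunk, while a directly converted column keeps it alive for exactly as long as the column itself exists.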
That's not true for strings. I could move it to direct conversion and string conversion, but I think that's dirtier, no?
Hmm, true. It's not hurting anyone, but it is a little wasteful to keep this memory alive needlessly.
Thanks!
The more I look at it, the more I'm afraid this is a bandaid for the actual problem.
Creating the
The only difference this makes is that the first chunk (ONLY the first chunk) is kept alive longer, for the duration that the
That would indicate that the
Another issue with this change is that we give ownership over the chunk to the first column, for the other columns:
I haven't been able to produce a query that triggers this issue, but I suspect it could happen because that's definitely not intended behavior.
No description provided.