-
Notifications
You must be signed in to change notification settings - Fork 589
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG-REPORT] open_many and concat does not handle missing columns correctly #1491
Comments
Thank you for the report. I can confirm this is a bug. For a quick and dirty solution you can try:
I will try to push a test for this soon! |
hi @JovanVeljanoski thanks for the quick reply. As a workaround I figured out which columns were breaking and removed them for now (it turned out they mostly held None data anyway which I didnt know). This fixed my problem for now. |
Always good to hear :) Thanks! |
well you guys test suite is good I think. If I read it correctly this: https://github.com/vaexio/vaex/pull/1493/checks?check_run_id=3242308011#step:18:274 is my exact bug (I am running python3.8.8 on windows 10). So it is at least reproducable. |
nice work @JovanVeljanoski ! |
Hi I have a bunch of hdf5 files which are similar but sometimes some of them miss a column.
I can open them with vaex.open_many with no error, or load them one by one with vaex.open and then stitch them together with vaex.concat. However when I try to write them out to one big hdf5 file with df.export_hdf5 then I get the error:
I expected that vaex would insert na values in the missing columns for the files, as per #156 however it does not seem to do that, am I looking at it the wrong way?
I am running vaex 4.3.0
The text was updated successfully, but these errors were encountered: