-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Some tricky data for Arrow #10
Comments
It's helpful to post backtraces as that can often quickly show where the error is |
It prints out alot of stuff. It's like 11pm. I will try to get a MWE together. |
Alright, so the error here is because it doesn't know how to map |
I guess the biggest question is if we're ok not having round-tripping in these cases; i.e. we could allow writing columns of |
Not sure. Maybe see what Feather.jl does? Also of interest is what RCall should do with |
I think the priority is to allow saving things with as little problems as possible. Given your comment I understand that we have a choice:
I would prefer the option 2. maybe with a |
I mean, Feather.jl will just be a wrapper around Arrow.jl pretty soon, so they'll do whatever we do. I also think I like the idea of option 2 w/ a warning. I think people should understand that Arrow.jl isn't meant to be a serialization package, even though it can handle a wide range of types of objects. But even still, I think doing some automatic conversions will be convenient. |
Agreed |
There is a way to store arbitrary metadata right? We should remember them as |
Ah, I forgot about that; yes, that's a pretty good idea. I'll have to see how we can work the plumbing in for that. |
Fixes #10. This PR is large mainly because it adds fairly generic support for custom field metadata. We then use that support to add custom type extension for Symbol/Char vectors. We could perhaps factor this a bit more to make it even more generic for custom types, but I think this gets us pretty darn far.
Ok, took a bit of work, but I think I've got it all working for Symbol/Char. #11 |
* Allow initial support for custom extension types Fixes #10. This PR is large mainly because it adds fairly generic support for custom field metadata. We then use that support to add custom type extension for Symbol/Char vectors. We could perhaps factor this a bit more to make it even more generic for custom types, but I think this gets us pretty darn far. * Fix dict encoded
* Allow initial support for custom extension types Fixes #10. This PR is large mainly because it adds fairly generic support for custom field metadata. We then use that support to add custom type extension for Symbol/Char vectors. We could perhaps factor this a bit more to make it even more generic for custom types, but I think this gets us pretty darn far. * Fix dict encoded
I have this data in the JDF.jl tests. Arrow is currently failing to write it. We could incorporate something like this in the tests.
error msg
4,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Int64,Union{Missing, UInt64},Missing,String,String,Union{Missing, String},Union{Missing, String},Union{Missing, Symbol},Char,Union{Missing, Char}}}, ::DataFrame, ::Dict{Int64,Tuple{Int64,Type,Any}}) at C:\Users\RTX2080\.julia\packages\Arrow\FjbLX\src\write.jl:258 [9] macro expansion at C:\Users\RTX2080\.julia\packages\Arrow\FjbLX\src\write.jl:71 [inlined] [10] macro expansion at .\task.jl:332 [inlined] [11] write(::IOStream, ::DataFrame, ::Bool, ::Bool) at C:\Users\RTX2080\.julia\packages\Arrow\FjbLX\src\write.jl:58 [12] #29 at C:\Users\RTX2080\.julia\packages\Arrow\FjbLX\src\write.jl:21 [inlined] [13] open(::Arrow.var"#29#30"{Bool,DataFrame}, ::String, ::Vararg{String,N} where N; kwargs::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}) at .\io.jl:325 [14] open at .\io.jl:323 [inlined] [15] #write#28 at C:\Users\RTX2080\.julia\packages\Arrow\FjbLX\src\write.jl:20 [inlined] [16] write(::String, ::DataFrame) at C:\Users\RTX2080\.julia\packages\Arrow\FjbLX\src\write.jl:20 [17] top-level scope at REPL[44]:1 [18] include_string(::Function, ::Module, ::String, ::String) at .\loading.jl:1088The text was updated successfully, but these errors were encountered: