You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
TMD will break if fed an arrow dataset with a column of type ArrowType.Null:
(tmd-arrow/stream->dataset"withnullcol.arrow")
;; java.lang.Exception: Failed to datafy datatype class org.apache.arrow.vector.types.pojo.ArrowType$Null;; at tech.v3.libs.arrow$read_schema$fn__47204.invoke(arrow.clj:711)
If one extends the protocol to this type, for example
Release 6.100 fixes this. The arrow docs state the null schema type is for columns with no physical data so a column of all mising entries is reasonably null. Whether this is broadly useful or not is a different question...
TMD will break if fed an arrow dataset with a column of type
ArrowType.Null
:If one extends the protocol to this type, for example
It will then throw:
From looking at the code, It seems to me that TMD's assumption that datatypes will have at least 2 buffers, does not hold for this odd type.
Python used to create the arrow dataset:
The text was updated successfully, but these errors were encountered: