You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Dewey Dunnington / @paleolimbot:
I'm not sure if this is covered by one of the subtasks, but really huge binary arrays take forever to print...I am guessing because it tries to convert the entire binary array to a string before selecting the few characters that will actually be shown:
library(arrow)
#> Some features are not enabled in this build of Arrow. Run `arrow_info()` for more information.#> #> Attaching package: 'arrow'#> The following object is masked from 'package:utils':#> #> timestampreally_big_raw<- raw(1e9)
really_big_binary<-Array$create(list(really_big_raw), type= binary())
system.time(really_big_binary$ToString())
#> user system elapsed #> 12.396 1.660 14.269
(I ran into that one because the current encoding for geospatial data in Parquet files is binary() and the elements can be huge)
We have some pretty printing capabilities, but we may want to think at a high level about the design first.
Reporter: Will Jones / @wjones127
Related issues:
Note: This issue was originally created as ARROW-18359. Please see the migration documentation for further details.
The text was updated successfully, but these errors were encountered: