
Default sparse read via tiledb_array to UNORDERED #488

Merged
eddelbuettel merged 7 commits into master from
de/sc-23685/default_sparse_read_to_unordered
Nov 29, 2022


@eddelbuettel

This PR changes the default query layout for read access via tiledb_array() and the [ accessor to 'UNORDERED' for sparse matrices (and 'ROW_MAJOR' for dense). The read setting now matches the [<- write access implementation.

UNORDERED reads generally perform better, which motivates the change. This is a change in behavior, as seen in the handful of tests that needed updating.
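For readers who depend on a particular result order, a minimal sketch of setting the layout explicitly may help. It assumes the `query_layout` and `return_as` arguments of `tiledb_array()` and uses a hypothetical toy array written to a temporary directory with `fromDataFrame()`; the column names and values are invented for illustration and are not part of this PR.

```r
library(tiledb)

# Hypothetical toy data: a small sparse array in a temporary directory
uri <- file.path(tempdir(), "sparse_layout_demo")
if (dir.exists(uri)) unlink(uri, recursive = TRUE)
df <- data.frame(rows = c(3L, 1L, 2L),
                 cols = c(1L, 2L, 3L),
                 val  = c(30, 10, 20))
fromDataFrame(df, uri, col_index = c("rows", "cols"), sparse = TRUE)

# Request a layout explicitly instead of relying on the default
arrUnordered <- tiledb_array(uri, return_as = "data.frame",
                             query_layout = "UNORDERED")
arrRowMajor  <- tiledb_array(uri, return_as = "data.frame",
                             query_layout = "ROW_MAJOR")

resU <- arrUnordered[]   # cell order is not guaranteed
resR <- arrRowMajor[]    # cells are returned in row-major order

# Both reads return the same cells; only the ordering guarantee differs
stopifnot(nrow(resU) == nrow(resR))
```

Code that relied on the earlier ordering of sparse reads can pass `query_layout = "ROW_MAJOR"` (or sort the result) to keep a deterministic order.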

As an illustration, here is an approximately seven-fold gain with a (local disk) array using the Deutsche Boerse (csv to sparse array) data set from a tutorial from last year:

$ ./exampleDBoerse.R        
                 test replications elapsed relative                                                      
1      a <- resNone[]           10  10.663    7.451                                                      
2 b <- resUnordered[]           10   1.431    1.000                                                      
$ 

@shortcut-integration

This pull request has been linked to Shortcut Story #23685: Performance issue running R query.

@eddelbuettel eddelbuettel merged commit 0f317f3 into master Nov 29, 2022
@eddelbuettel eddelbuettel deleted the de/sc-23685/default_sparse_read_to_unordered branch November 29, 2022 13:32
@eddelbuettel eddelbuettel mentioned this pull request Dec 1, 2022