Option to disable result caching in bench::mark() #58

MarcusKlik · 2019-09-10T14:06:01Z

Thanks for providing an excellent package; bench's extended set of measurements, compared to other benchmarking packages, is extremely useful.

I have a feature request for bench::mark(). When using it to time a number of expressions that each require a significant amount of RAM, it would be convenient to have an option to disable the caching of results. Caching a potentially large number of large objects quickly eats the available memory, which limits benchmarking of e.g. large vectors:

nr_of_ints <- 1e8

res <- bench::mark(
  integer(nr_of_ints),
  integer(nr_of_ints),
  max_iterations = 10)

# 2 times 400 MB
object.size(res)
#> 800014016 bytes
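For readers who want to check the "2 times 400 MB" figure, here is a rough sketch of the arithmetic in base R. It assumes R's usual 4-byte integer storage, and it uses a smaller vector than the 1e8 in the report so it runs quickly; the variable names are illustrative only.

```r
# Back-of-the-envelope check of the "2 times 400 MB" figure.
# Assumption: integers are stored as 4 bytes each (the usual case in R).
n <- 1e6                    # reduced from 1e8 so the check is cheap to run
bytes_per_int <- 4

expected_payload <- n * bytes_per_int            # bytes of raw integer data
actual <- as.numeric(object.size(integer(n)))    # payload plus a small vector header
overhead <- actual - expected_payload            # header size, a few dozen bytes

# Scaling up to the example in the report:
# two cached results of 1e8 integers each.
cached_total <- 2 * 1e8 * bytes_per_int          # 8e8 bytes, i.e. about 800 MB
```

The small remainder in the reported 800014016 bytes is the overhead of the result object itself, not of the cached vectors.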

I realize that there is a workaround: wrap the expression in a custom function that returns a small result:

nr_of_ints <- 1e8

fx <- function(nr_of_ints) {
  integer(nr_of_ints)
  TRUE
}

res <- bench::mark(
  fx(nr_of_ints),
  fx(nr_of_ints),
  max_iterations = 10)
#> Warning: Some expressions had a GC in every iteration; so filtering is
#> disabled.

# small
object.size(res)
#> 99464 bytes

With this workaround, the chance of introducing additional garbage collections during benchmarking increases (and these are also measured), so it seems like a less elegant solution :-)

Would it be an idea to skip caching the results when check = FALSE?

thanks and all the best!

jimhester · 2019-09-10T19:29:49Z

The garbage collections are happening at the function boundary; you can avoid them by not creating a function.

res <- bench::mark(
  a = { integer(nr_of_ints); NULL },
  b = { integer(nr_of_ints); NULL },
  max_iterations = 10)

That said, not keeping the results when check = FALSE might also be OK.

MarcusKlik · 2019-09-11T09:01:34Z

Hi @jimhester,

great, thanks for the workaround!

The option to disable result caching would be convenient, but from your earlier comment I understand that the focus of bench::mark() is on smaller datasets, and for those such an option is not really relevant. So I guess there is something to be said for both approaches...

jimhester · 2020-01-08T16:12:10Z

As of fce1e23, setting check = FALSE also disables the storage of the results.
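A minimal sketch of how this might be used, assuming a bench version that includes the commit above. The vector size is reduced from the 1e8 in the original report so the example runs quickly, and `stored_bytes` is just an illustrative name; how small it turns out depends on the installed bench version.

```r
library(bench)

n <- 1e6  # smaller than the 1e8 in the report, purely to keep the run short

res <- bench::mark(
  a = integer(n),
  b = integer(n),
  check = FALSE,        # skip result comparison; per the comment above, also skips storing results
  max_iterations = 10
)

# Size of the stored results column. With result storage disabled this
# should be far below the roughly 2 * 4 * n bytes of the raw vectors.
stored_bytes <- as.numeric(object.size(res$result))
```

Note that check = FALSE also turns off the comparison of results between expressions, so it only fits benchmarks where the outputs are not expected to be checked against each other.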

MarcusKlik · 2020-01-08T19:22:12Z

great, thanks for adding the feature!

MarcusKlik mentioned this issue Sep 10, 2019

Explore package bench fstpackage/synthetic#19

Open

jimhester added the feature a feature request or enhancement label Sep 10, 2019

jimhester closed this as completed Jan 8, 2020
