You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
bincount-ed sum is 6x slower than sum, and bincount-ed count is 2x slower than count so there are definitely cases where it makes sense to split the dataset early instead of using flox. I'm seeing this on Pangeo Cloud using the GODAS dataset.
%timeit b, a = flox.core._prepare_for_flox(by, array); flox.aggregate_flox.nanlen(b, a, fill_value=0)
%timeit b, a = flox.core._prepare_for_flox(by, array); flox.aggregate_flox.sum(b, a, fill_value=0)
237 µs ± 33.6 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)
180 µs ± 6.06 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)
bincount
-ed sum is 6x slower thansum
, andbincount
-edcount
is 2x slower thancount
so there are definitely cases where it makes sense to split the dataset early instead of usingflox
. I'm seeing this on Pangeo Cloud using the GODAS dataset.The text was updated successfully, but these errors were encountered: