round reduction size up to nearest power of two to avoid overloading cache #65
Hi, in `reduction.py`, `ReductionKernel._get_basic_kernel` is cached according to its arguments `maxls` and `nd`, where `maxls` is the smaller of the reduction size and `self.init_local_size` (= 1024 on my machine). If there are a lot of small reductions, there will be many different versions of the basic kernel, too many to fit in the cache. However, one of the first things the basic kernel does is round `maxls` up to the nearest power of two. This patch does the rounding up before caching, so that there are at most lg(`self.init_local_size`) different versions of the basic kernel for each value of `nd`. (It also does the rounding again afterwards, to avoid breaking anything.) I think this fixes the problem with small reductions without costing anything for larger reductions.
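
For illustration, here is a minimal sketch of the idea. The helper name `_round_up_to_pow2` and the loop variables are hypothetical stand-ins, not the actual code from the patch:

```python
def _round_up_to_pow2(n):
    # Smallest power of two >= n, for n >= 1.
    return 1 << (n - 1).bit_length()

# With init_local_size = 1024, every reduction size in 513..1024 now maps
# to the same cache key (1024) instead of up to 512 distinct keys, so the
# number of compiled kernel variants stays logarithmic in init_local_size.
init_local_size = 1024
for reduction_size in (3, 100, 513, 4096):
    maxls = min(reduction_size, init_local_size)
    maxls = _round_up_to_pow2(maxls)  # round *before* it enters the cache key
    print(reduction_size, "->", maxls)
```

Since the basic kernel already performed this rounding internally, doing it earlier only collapses cache keys that would have produced identical kernels anyway.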