GitHub - dask/cachey: Caching based on computation time and storage space

Caching for Analytic Computations

Humans repeat stuff. Caching helps.

Normal caching policies like LRU aren't well suited for analytic computations where both the cost of recomputation and the cost of storage routinely vary by one million or more. Consider the following computations

# Want this
np.std(x)        # tiny result, costly to recompute

# Don't want this
np.transpose(x)  # huge result, cheap to recompute

Cachey tries to hold on to values that have the following characteristics

Expensive to recompute (in seconds)
Cheap to store (in bytes)
Frequently used
Recenty used

It accomplishes this by adding the following to each items score on each access

score += compute_time / num_bytes * (1 + eps) ** tick_time

For some small value of epsilon (which determines the memory halflife.) This has units of inverse bandwidth, has exponential decay of old results and roughly linear amplification of repeated results.

Example

>>> from cachey import Cache
>>> c = Cache(1e9, 1)  # 1 GB, cut off anything with cost 1 or less

>>> c.put('x', 'some value', cost=3)
>>> c.put('y', 'other value', cost=2)

>>> c.get('x')
'some value'

This also has a memoize method

>>> memo_f = c.memoize(f)

Install

Cachey is on PyPI and Conda-forge:

$ pip install cachey  # option 1
$ conda install cachey -c conda-forge  # option 2

Or install from source

$ python setup.py install  # option 1
$ pip install -e .  # option 2 (best for development)

Status

Cachey is new and not robust.

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
.github/workflows		.github/workflows
cachey		cachey
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.txt		LICENSE.txt
MANIFEST.in		MANIFEST.in
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Caching for Analytic Computations

Example

Install

Status

About

Releases

Sponsor this project

Packages

Contributors 12

Languages

License

dask/cachey

Folders and files

Latest commit

History

Repository files navigation

Caching for Analytic Computations

Example

Install

Status

About

Resources

License

Security policy

Stars

Watchers

Forks

Releases

Sponsor this project

Packages 0

Contributors 12

Languages

Packages