
ENH: on_full policy #17

Closed · wants to merge 9 commits

Conversation

@llllllllll (Member)

Allows a user to set a maximum disk usage, plus a policy defining what to do when that maximum is reached and a new entry must be added.

The two options are currently:

  1. raise_: raise an OSError indicating you ran out of space.
  2. pop_lru: rotate the least recently used element out of the chest.

Users may pass any callable here; however, these two are defined for them.
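
To make the hook's contract concrete, here is a minimal sketch of what the two built-in policies could look like, assuming (as the diff excerpt later in this thread shows via self.on_full(self)) that the policy is called with the chest itself. The least_recently_used_key accessor and the constructor keywords in the usage comment are illustrative assumptions, not the exact chest API.

def raise_(chest):
    """Refuse to grow: signal that the disk budget is exhausted."""
    raise OSError("chest is full: maximum disk usage exceeded")

def pop_lru(chest):
    """Reclaim disk space by evicting the least recently used entry."""
    key = chest.least_recently_used_key()  # hypothetical accessor
    del chest[key]

# Illustrative usage, with assumed constructor keywords:
# c = Chest(available_disk=100 * 2**20, on_full=pop_lru)  # 100 MiB cap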

@llllllllll (Member Author)

Hey, this feature has been working pretty well for me. Do you think we could merge this when you get a chance to look it over?

self._dump(data, tmp)
bs = tmp.getvalue()
# Invoke the user's policy repeatedly until the pending write
# fits within the disk budget.
while self.disk_usage + len(bs) > self.available_disk:
    self.on_full(self)
Member
What is the overhead of this like when there are many keys on disk? This seems like potentially a lot of file system access.

Member

Should we maintain a total instead?

Member Author

The issue is that I couldn't find a general way to account for the filesystem overhead. I wasn't positive how big all of the files would actually be on disk, so I figured it was safest to ask the OS.
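
For context, asking the OS amounts to something like the following sketch (chest's actual key-to-path layout is not shown; this only illustrates the per-key cost):

import os

def disk_usage(directory):
    # One getsize (stat) call per spilled file, so the cost grows
    # linearly with the number of keys on disk.
    return sum(os.path.getsize(os.path.join(directory, name))
               for name in os.listdir(directory))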

Member

Perhaps we could maintain a total, as we do for memory_usage, adding to or subtracting from it as we add or remove files from disk.

Alternatively, can you measure the overhead of this approach when we have one million files? I expect it to be non-negligible at that scale.
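
A sketch of that bookkeeping approach, with hypothetical internal method names (chest's real write/remove paths will differ):

import os

class DiskTotalSketch:
    """Keep a running disk_usage total instead of re-stat-ing every file."""

    def __init__(self):
        self.disk_usage = 0  # bytes currently written to disk

    def _write(self, path, payload):
        # Hypothetical internal write path, for illustration only.
        with open(path, 'wb') as f:
            f.write(payload)
        self.disk_usage += len(payload)  # O(1) bookkeeping per write

    def _remove(self, path):
        self.disk_usage -= os.path.getsize(path)
        os.remove(path)

Note that this counts logical bytes rather than actual on-disk blocks, which is exactly the filesystem-overhead caveat raised above.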

@mrocklin (Member)

I apologize for letting this linger for so long. Thanks for the ping.

@llllllllll (Member Author)

No worries. This isn't blocking me because I can always deploy from my branch.

@llllllllll closed this Sep 13, 2018