Partitioned file storage #35

cute-the-niini · 2023-12-25T16:14:36Z

The idea is that this will replace all current uses of IndexedDB as a file storage, consequently reducing the amount of blobs the browser needs to move around when iterating over the tables (and also reduce some of the awkward normalisation needs currently employed). This does have the nice effect of making the interference between different cartridges' data a bit less of a pain.

This patch in particular only introduces the partitioned file storage and changes the cartridge data storage to use it. The file storage is based around reference-counted buckets that hold arbitrary numbers of objects, similar to what you would get with cloud object storages (other than the reference-counted part). Being reference-counted, there's a separate GC process that will now reclaim storage space whenever a bucket's reference counter reaches 0. This works similarly for temporary buckets and persistent ones --- a persistent bucket just has references held by a long-lived entity, such as a row in the database, instead of a JS memory reference. For tracking JS references it holds a set of weak references and finalisers to clean up the set.

Cartridge installation is no longer fully-transactional as a result of this, because OPFS is not transactional. Instead, the process first creates a new bucket that holds all of the files in the cartridge, persists it, then attempts to create a database entry for the cartridge metadata (pointing to the persisted bucket). If it fails, the GC will validate that the persistent reference is still in effect next time it runs, and will decrease the counter accordingly if the row that's supposed to point back to the bucket is not in the database, consequently allowing the bucket's resources to be freed.

The idea is that this will replace all current uses of IndexedDB as a file storage, consequently reducing the amount of blobs the browser needs to move around when iterating over the tables (and also reduce some of the awkward normalisation needs currently employed). This does have the nice effect of making the interference between different cartridges' data a bit less of a pain. This patch in particular only introduces the partitioned file storage and changes the cartridge data storage to use it. The file storage is based around reference-counted buckets that hold arbitrary numbers of objects, similar to what you would get with cloud object storages (other than the reference-counted part). Being reference-counted, there's a separate GC process that will now reclaim storage space whenever a bucket's reference counter reaches 0. This works similarly for temporary buckets and persistent ones --- a persistent bucket just has references held by a long-lived entity, such as a row in the database, instead of a JS memory reference. For tracking JS references it holds a set of weak references and finalisers to clean up the set. Cartridge installation is no longer fully-transactional as a result of this, because OPFS is not transactional. Instead, the process first creates a new bucket that holds all of the files in the cartridge, persists it, then attempts to create a database entry for the cartridge metadata (pointing to the persisted bucket). If it fails, the GC will validate that the persistent reference is still in effect next time it runs, and will decrease the counter accordingly if the row that's supposed to point back to the bucket is not in the database, consequently allowing the bucket's resources to be freed.

cute-the-niini added 13 commits December 25, 2023 15:31

wip: initial work on file storage

486abf0

wip: locks

6c09d6b

wip: formatting

7649328

wip

ba8a03e

wip: more migration

f200afb

wip: reading files

5fa29ad

wip: Handle ref gc

48ad9c7

wip: proper release

e4adcaf

wip: async parsing

e91b9cc

wip: minor fixes

92ff110

wip: more fixes

d05d1a0

wip: fix parsing

89abf5e

wip: more docs

a04bb5a

cute-the-niini added enhancement New feature or request c:kernel Changes to the Kate emulator kernel (requires strict audits!) labels Dec 25, 2023

cute-the-niini merged commit eb0d70d into main Dec 25, 2023
1 check passed

cute-the-niini deleted the patch/file-store branch December 25, 2023 16:21

cute-the-niini restored the patch/file-store branch December 25, 2023 16:24

cute-the-niini deleted the patch/file-store branch December 25, 2023 16:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Partitioned file storage #35

Partitioned file storage #35

cute-the-niini commented Dec 25, 2023

Partitioned file storage #35

Partitioned file storage #35

Conversation

cute-the-niini commented Dec 25, 2023