Add support for deletes #18
New design: deletes will be maintained in a separate file that references the data file and file offset (possibly row number). We'll need to perform this lookup for each and every read. When a delete transaction ends, we will mark the file on disk as "done" and start a new file for all new rows. The idea is to limit the size of files once we know we have deleted rows. A secondary daemon process will come back later and compact files; thus we can safely remove old rows asynchronously without having to store the deletes forever. We will create n files until the compaction process can start. When we compact, all non-active files will be read and merged into a single larger file. We need to add support for reading from multiple files, scanning across each one. This also lays the foundation for online schema upgrades, where we could map an old schema to a new schema and convert the on-disk data asynchronously.
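A minimal Go sketch of the per-read lookup described above, using an in-memory index keyed by (file, offset). All names here (`DeleteKey`, `DeleteIndex`, `MarkDeleted`, `IsDeleted`) are hypothetical illustrations, not taken from this repository:

```go
package storage

// DeleteKey identifies a deleted row by the data file it lives in and the
// row's byte offset within that file (hypothetical names, not from this repo).
type DeleteKey struct {
	FileID int64
	Offset int64
}

// DeleteIndex is an in-memory view of the on-disk delete file. Every read
// consults it before returning a row; a compaction daemon would drop entries
// once the deleted rows have been physically removed from the data files.
type DeleteIndex struct {
	deleted map[DeleteKey]struct{}
}

func NewDeleteIndex() *DeleteIndex {
	return &DeleteIndex{deleted: make(map[DeleteKey]struct{})}
}

// MarkDeleted records a deletion; the same record would also be appended
// to the delete file on disk for durability.
func (d *DeleteIndex) MarkDeleted(fileID, offset int64) {
	d.deleted[DeleteKey{fileID, offset}] = struct{}{}
}

// IsDeleted is the per-read lookup: an O(1) map probe per row.
func (d *DeleteIndex) IsDeleted(fileID, offset int64) bool {
	_, ok := d.deleted[DeleteKey{fileID, offset}]
	return ok
}
```

With a map, the cost added to each read stays constant regardless of how many deletes have accumulated; the compaction daemon bounds the map's growth by rewriting files and retiring entries.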
This implements deletes in a single file. Initial groundwork is laid for multiple delete files and multiple data files.
Delete support is needed. Deletes can be done in multiple ways:

1) Mark the row as deleted inside the serialized message itself.
2) Zero out the row's space in the data file.
3) Keep a separate record of deleted rows (for example, by file offset).
"1)" or "2) "are pretty equivalent. The advantage to 2 is during a table scan one does not have to parse the message only to find it has been deleted. It might also be that option 1 makes roll back easier. However with 1 or 2 we still need to maintain a list of the ongoing rows touched in the transactions.
"3)" Does not seem to have a large benefit. If we keep the rows separate, then we just have to read that into memory and still do a comparison. The only upside compared to 1, is we don't have to parse capnp proto message to see if it is deleted or not, we can store the file offset and skip that way.
With option 2) we can also have a daemon process that periodically reorganizes a table that is closed, so zeroed-out space that no longer holds a message is removed, truncating the file.
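A rough sketch of what that reorganization pass could look like. It assumes an illustrative on-disk framing that is not taken from this project: each record is a uint32 little-endian length prefix followed by the message body, and deleting a row zeroes the body in place while leaving the prefix intact:

```go
package storage

import (
	"bytes"
	"encoding/binary"
	"io"
	"os"
	"path/filepath"
)

// CompactClosedFile rewrites a closed table file, dropping zeroed-out rows
// so the reclaimed space shrinks the file. The record framing is an
// assumption for this sketch, not the project's actual format.
func CompactClosedFile(path string) error {
	src, err := os.Open(path)
	if err != nil {
		return err
	}
	defer src.Close()

	// Build the compacted copy next to the original so the final rename
	// stays on the same filesystem.
	tmp, err := os.CreateTemp(filepath.Dir(path), "compact-*")
	if err != nil {
		return err
	}
	defer tmp.Close()

	var lenBuf [4]byte
	for {
		if _, err := io.ReadFull(src, lenBuf[:]); err == io.EOF {
			break // clean end of file
		} else if err != nil {
			return err
		}
		body := make([]byte, binary.LittleEndian.Uint32(lenBuf[:]))
		if _, err := io.ReadFull(src, body); err != nil {
			return err
		}
		// An all-zero body marks a deleted row; skip it entirely.
		if bytes.Equal(body, make([]byte, len(body))) {
			continue
		}
		if _, err := tmp.Write(lenBuf[:]); err != nil {
			return err
		}
		if _, err := tmp.Write(body); err != nil {
			return err
		}
	}
	// Replace the original with the compacted copy.
	return os.Rename(tmp.Name(), path)
}
```

Because the pass only runs on closed files, it never races with writers, and readers can be switched to the compacted copy after the rename.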