
Delete Throttling #1609

Open · wojons opened this issue Nov 5, 2013 · 9 comments

@wojons (Contributor) commented Nov 5, 2013

I am hoping there is a way for a feature to throttle deletes. I am partitioning my data by week, and overall I don't care if it takes a week to delete the previous week's data. What happens is that when I delete the previous week, it locks up the servers since they are racing to delete. Maybe some sort of flag for a slow delete, or a delete that yields.

@coffeemug (Contributor)

I think there are two open questions here.

  • I think we might want to give range queries a lower priority than point queries in general. @danielmewes -- can you comment on this?
  • We might want to add a priority flag to run. This is an interesting idea we haven't explored yet (see the hypothetical sketch after this list). What do others think? Should we do it? Would it be hard to do?
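
For concreteness, here is a purely hypothetical sketch of what such a flag could look like from the client side, using the Python driver. The priority optarg does not exist in any RethinkDB driver, and the table and index names are made up for illustration.

```python
# Hypothetical only: "priority" is NOT a real optarg in the RethinkDB drivers.
# This merely illustrates the flag proposed above, as a client might use it.
import rethinkdb as r

conn = r.connect(host="localhost", port=28015)

# A long-running bulk delete the application is happy to have run slowly,
# scheduled behind latency-sensitive point queries.
(r.table("events")
  .between("2013-W44", "2013-W45", index="week")
  .delete()
  .run(conn, priority="low"))  # hypothetical flag, not part of the real API
```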

@jdoliner (Contributor) commented Nov 5, 2013

There's some really low-hanging fruit for optimizing deletes. I suspect this won't be an issue once we fix them.

@danielmewes (Member)

I fear that everything we can do with respect to priorities might not have the desired effects when it comes to write queries.
I haven't actually looked at the implementation of delete, but I could imagine that this is actually a locking issue. It seems that yielding the locks within a delete might be difficult. I'll have to think about that a bit more.

@jdoliner: Do you have something specific in mind?

@wojons: What size is your database? Do you know if it fits in the cache or if there is disk i/o involved?

@wojons (Contributor, Author) commented Nov 5, 2013

@coffeemug a priority flag for everything would be SUPER useful, I think, even on queries, because some queries I expect to take a long time and don't care if they do, but there are some things that you want to be super fast.

@danielmewes the database could fit in cache if I allocated more space to the VM. The tables are normally 4-16 GB in size, but soon I will keep more than two weeks of data and it won't fit, so it's all disk-based as well.

@danielmewes (Member)

Regarding the priority flag: I'm not certain it would work as expected. First of all, there are two ways in which we can influence the priority of a given query:

  • by changing the scheduler priority of the involved coroutines, or
  • by changing the i/o priority of the transaction.

Both of these options have the desired effect for some kinds of queries (e.g. backfilling). They have basically no effect for other queries (especially short-running ones). In yet other cases, reducing the priority of a given query can have negative effects on the overall cluster performance, because it might end up holding locks for longer than necessary or interacting badly with the i/o requirements of other queries [1].

We should keep the idea in mind, but it requires a lot of testing and probably a number of careful changes to actually become a useful and reliable feature.

Edit:
Sorry, forgot the [1]: What I mean is that if two queries request the same block from disk, one with a high i/o priority and the other with a low one, and the low-priority query requests the block slightly earlier than the high-priority one, then the block is actually going to be loaded at the lower priority. The high-priority query will be slowed down by the low-priority one. This seems like it would be rare, but it's just one of the things that we have to keep in mind and that require testing.

@danielmewes (Member)

@wojons: As a workaround, would it be an option to divide the delete into smaller batches from the application side? For example, you could run something like r.table(...).between(...).limit(50).delete() repeatedly until the deleted field of the result is 0 (sketched below). We will hopefully find a proper solution to the problem eventually, but my impression right now is that it could take a little while for us to completely fix this.
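
A minimal sketch of that batched workaround with the Python driver, assuming a table named events with a secondary index week that identifies the partition being dropped; the table name, index name, bounds, batch size, and sleep interval are illustrative assumptions, not part of the suggestion above.

```python
import time
import rethinkdb as r

conn = r.connect(host="localhost", port=28015, db="mydb")

# Delete last week's partition in small batches so other queries can
# interleave, instead of issuing one huge range delete.
while True:
    result = (r.table("events")
              .between("2013-W44", "2013-W45", index="week")  # assumed index
              .limit(50)
              .delete()
              .run(conn))
    if result["deleted"] == 0:
        break  # nothing left in the range
    time.sleep(0.1)  # brief pause to yield i/o to other queries
```

The batch size and the pause between batches are the knobs for trading total deletion time against the impact on other queries.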

@jdoliner (Contributor) commented Nov 5, 2013

Well, one optimization is that we could turn r.table(...).between(...).delete() into a range delete, which would be fast as blazes. Another, more general optimization is to not transfer the entire row over the network to do a delete.

@danielmewes (Member)

@jdoliner: That of course would make a lot of sense.
Mind, though, that the "fast as blazes" part will only hold for in-memory data sets, right?

@coffeemug (Contributor)

Moving to backlog. We'll look into this after #1762 is fixed, but this is generally outside the scope of the LTS release.
