Feature/delete service stats worker jobs#73

Merged
eguzki merged 19 commits into feature/delete-service-stats-integration from
feature/delete-service-stats-worker-jobs
Mar 1, 2019

Conversation

@eguzki
Member

@eguzki eguzki commented Feb 6, 2019

  • Stats::PartitionEraserJob
    Background job to delete stats keys

  • Stats::PartitionGeneratorJob
    Background job that generates one Stats::PartitionEraserJob for each subset of stats keys
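A minimal, self-contained sketch of how the two jobs fit together. The real jobs are Resque jobs; KeyGenerator, the key format, and every parameter below are illustrative stand-ins, not the actual 3scale backend code.

```ruby
# Stand-ins for the generator/eraser split described above. Only the
# (offset, length) contract mirrors the PR discussion; names are made up.

PARTITION_BATCH_SIZE = 3

# stand-in for Stats::KeyGenerator: enumerates every stats key to delete
def stats_keys(service_id)
  (0...8).map { |i| "stats/{service:#{service_id}}/key#{i}" }
end

# Stats::PartitionGeneratorJob: computes (offset, length) partitions and
# would enqueue one Stats::PartitionEraserJob per partition
def partition_generator(service_id)
  total = stats_keys(service_id).size
  (0...total).step(PARTITION_BATCH_SIZE).map do |offset|
    [offset, [PARTITION_BATCH_SIZE, total - offset].min]
  end
end

# Stats::PartitionEraserJob: regenerates the keys and deletes only its slice
def partition_eraser(service_id, offset, length)
  stats_keys(service_id).drop(offset).take(length) # would be DELeted in Redis
end

partition_generator("42") # => [[0, 3], [3, 3], [6, 2]]
```

Note that each eraser regenerates the full key list and keeps only its slice — the duplication debated in the review below.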

@eguzki eguzki mentioned this pull request Feb 6, 2019
3 tasks
Comment thread lib/3scale/backend/stats/partition_eraser_job.rb Outdated
Comment thread lib/3scale/backend/stats/partition_eraser_job.rb Outdated
Comment thread lib/3scale/backend/stats/partition_generator_job.rb Outdated
Comment thread lib/3scale/backend/stats/partition_eraser_job.rb Outdated
@davidor
Contributor

davidor commented Feb 18, 2019

There are two main aspects of this PR that I think should be addressed before merging.

The first is that there are no tests. The second is that, in my opinion, the responsibilities of the two job types (PartitionGeneratorJob and PartitionEraserJob) are not well defined, and they do a lot of duplicated work.

PartitionGeneratorJob generates all the keys, but it discards them and passes each PartitionEraserJob only an offset. Then each PartitionEraserJob generates all the keys again and discards all of them except a few (it just picks N starting from the received offset). That is a lot of duplicated work.

Wouldn't it be better to generate all the keys in the generator job and just pass X of them to each of the delete jobs? That way, the jobs would not perform any duplicated work. I also think it would be conceptually easier to understand, because one job type would be responsible for generating all the keys, and the other just for deleting the keys it received.
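A sketch of this alternative (not what the PR implements): the generator materializes all keys once and hands each eraser job its slice directly, so nothing is regenerated. The key format and batch size are illustrative stand-ins.

```ruby
# Alternative proposed in the review: generate once, pass slices in job bodies.

all_keys = (0...10).map { |i| "stats/key#{i}" } # generated exactly once

batch_size = 4
# each slice would be serialized into one eraser job's body
eraser_payloads = all_keys.each_slice(batch_size).to_a

eraser_payloads.map(&:size) # => [4, 4, 2]
```

The cost, raised in the reply below, is that every key travels through the job body instead of being recomputed by the worker.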

@eguzki
Member Author

eguzki commented Feb 21, 2019

The design was made to avoid serializing/deserializing all the keys to be deleted to/from Redis. You are proposing to do exactly that.

Besides, the Resque job size (basically the JSON body size) can get big. The body size is controlled by the PARTITION_BATCH_SIZE configuration parameter. With PARTITION_BATCH_SIZE ~ 1000 keys, a PartitionEraserJob Resque job is roughly 1000 * (50 bytes per key) = 50 KB.

The trade-off is:

  • small PARTITION_BATCH_SIZE => small job bodies and lots of PartitionEraserJob jobs
  • big PARTITION_BATCH_SIZE => big job bodies and few PartitionEraserJob jobs. If a job fails, a new worker will process it again and the big partition will be erased again (many keys may already have been deleted)

Another implementation would pass indexes along with the jobs, so the eraser worker would only have to generate the required keys, hence no duplicated work. Resque jobs would stay small and no keys would be serialized to Redis. However, implementing indexes increases the complexity of the key generator, so I decided to leave that for now. Maybe a future improvement.
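A hypothetical sketch of what the index-based variant could look like: when the key space is a cross product (metrics × periods here, purely illustrative), the i-th key can be computed directly, so each eraser generates only its own slice instead of everything up to offset + length. None of these names exist in the PR.

```ruby
# Hypothetical index-based key generation: random access into the key space.

METRICS = %w[hits transfers errors].freeze
PERIODS = %w[day week month year].freeze

# compute the i-th key without generating the keys before it
def key_at(i)
  "stats/metric:#{METRICS[i / PERIODS.size]}/period:#{PERIODS[i % PERIODS.size]}"
end

def keys_for_partition(offset, length)
  (offset...offset + length).map { |i| key_at(i) } # only `length` keys built
end

keys_for_partition(5, 3)
# => ["stats/metric:transfers/period:week",
#     "stats/metric:transfers/period:month",
#     "stats/metric:transfers/period:year"]
```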

WDYT @davidor ? Can we afford big jobs and serializing all the keys including them in job bodies?

@eguzki eguzki force-pushed the feature/delete-service-stats-worker-jobs branch from 80526e4 to f6e960c Compare February 21, 2019 17:22
@eguzki
Member Author

eguzki commented Feb 21, 2019

Implemented PARTITION_BATCH_SIZE and DELETE_BATCH_SIZE as configuration params with default values.

@davidor
Contributor

davidor commented Feb 22, 2019

Serializing/deserializing could be a problem too. This is the kind of thing that needs to be measured.

What if Stats::PartitionGeneratorJob generated from/to timestamps instead of indexes? Would that be a valid option? That way, each job would be responsible for generating and deleting part of the keys without duplication and without needing to serialize/deserialize the keys.

For example, if we needed to delete a whole year of stats, we could generate one job for each month, or for each day. I'm not sure what the most appropriate granularity would be. Or maybe split by apps or metrics?
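A tiny sketch of this time-based partitioning idea (not in the PR): one eraser job per day of the deletion interval. The granularity and payload shape are assumptions.

```ruby
require 'date'

# Split a deletion interval into daily from/to partitions, one per job.
def time_partitions(from, to)
  (Date.parse(from)...Date.parse(to)).map do |day|
    { from: day.iso8601, to: (day + 1).iso8601 }
  end
end

time_partitions('2019-01-01', '2020-01-01').size # => 365 daily jobs for 2019
```

As noted in the reply below, the catch is that the number of keys per partition is not bounded by configuration this way.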

@eguzki
Member Author

eguzki commented Feb 22, 2019

Partitioning by time, applications, metrics or users is a coarse way of partitioning: you cannot control the number of keys each worker receives for deletion. With the current implementation, the partition size is controlled by a configuration parameter.

Anyway, yes, it is a valid partitioning.

We could leave it as it is now and open an issue to implement indexes. WDYT @davidor ?

@eguzki eguzki force-pushed the feature/delete-service-stats-worker-jobs branch from f6e960c to 5f642f1 Compare February 24, 2019 17:40
Comment thread lib/3scale/backend/stats/partition_eraser_job.rb Outdated
@eguzki eguzki force-pushed the feature/delete-service-stats-worker-jobs branch from 6b0dbd7 to 800b26c Compare February 28, 2019 11:24
Comment thread spec/integration/job_queues_spec.rb Outdated
@eguzki eguzki force-pushed the feature/delete-service-stats-worker-jobs branch from 29cb976 to 80d8627 Compare February 28, 2019 18:29
@eguzki eguzki requested a review from davidor February 28, 2019 18:32
@eguzki
Member Author

eguzki commented Feb 28, 2019

@davidor ready for a review

Comment thread openshift/3scale_backend.conf
Comment thread lib/3scale/backend/stats/key_generator.rb
Comment thread lib/3scale/backend/stats/partition_eraser_job.rb
Comment thread lib/3scale/backend/stats/partition_generator_job.rb

stats_key_gen = KeyGenerator.new(job.to_hash)

stats_key_gen.keys.drop(offset).take(length).each_slice(configuration.stats.delete_batch_size) do |slice|
Contributor

I'm still not convinced about the idea of generating the whole set of keys on every PartitionEraserJob. It'd be good to hear other opinions on this. @unleashed ? @miguelsorianod ?

This might be good enough for a first version, but we'll need to closely measure the impact on CPU time consumed by this kind of worker with something like stackprof. It's difficult to tell now because many factors can have an impact, like the number of jobs of this kind generated, the average from-to interval, etc.

At least we should add a TODO here explaining what this does and mention that this is probably something that could be improved.

Contributor

Isn't just a subset of the keys generated/removed (length)?

Member Author

It is generating all the keys up to offset + length.

I will open an issue about improving the algorithm to use indexes.

Deleting service stats is not a job that will be created often.

Contributor

Only a subset is deleted, but all of them are generated by stats_key_gen.keys.

Contributor

👍 , I was thinking of the old implementation, where enumerators were used and not all the keys were generated.
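An illustration of the difference being discussed (key names are made up): with a plain Array, every key is generated before drop/take selects the slice; with a lazy enumerator, keys before the offset are still produced one by one and discarded, but generation stops right after the slice.

```ruby
# Eager vs lazy key generation for a drop(offset).take(length) slice.

eager_calls = 0
eager_slice = (0...100).map { |i| eager_calls += 1; "key#{i}" }
                       .drop(90).take(5)   # all 100 keys were built first

lazy_calls = 0
lazy_slice = (0...100).lazy
                      .map { |i| lazy_calls += 1; "key#{i}" }
                      .drop(90).first(5)   # stops after offset + length keys

[eager_calls, lazy_calls] # => [100, 95]
```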

Comment thread spec/integration/stats_partition_generator_job_spec.rb Outdated
Comment thread spec/integration/stats_partition_generator_job_spec.rb
Comment thread spec/integration/stats_partition_eraser_job_spec.rb Outdated
Comment thread spec/integration/stats_partition_eraser_job_spec.rb Outdated
@davidor
Contributor

davidor commented Mar 1, 2019

Good job @eguzki 👍
I think this can be merged to the integration branch.

@eguzki eguzki merged commit c1fec4d into feature/delete-service-stats-integration Mar 1, 2019
@bors bors Bot deleted the feature/delete-service-stats-worker-jobs branch March 1, 2019 14:35