
read in arguments for benchmark tool #55

Closed · spacejam opened this issue Aug 9, 2017 · 9 comments
spacejam (Owner) commented Aug 9, 2017

the benchmark tool should accept arguments for these parameters:

  1. number of threads
  2. number of total operations
  3. proportion of reads, writes, cas, del, and scan operations among the operations
  4. freshness bias (prefer recent/likely to be in cache, prefer old/not likely to be in cache, no preference)
  5. non-present-key chance (the chance that a request for cas/get/set/del may be sent for a key that does not exist)
  6. key size min, max, median
  7. value size min, max, median
  8. scan iterations min, max, median
pmuens (Collaborator) commented Aug 10, 2017

Thanks for the writeup @spacejam 👍

I jumped into this today. Here are some quick questions and notes from my side that came up during the implementation.

I decided to use the clap crate, which is quite popular for such use cases, to implement the CLI application. Any objections to that choice? Is it maybe too bloated? There are some other crates we could consider.

So far, though, I'm pretty happy with clap since it's really easy to use and quite powerful as well.

Could you go into more detail regarding the options we want to support here?

  • Which options are required?
  • Which options from "option groups" can be used together, and which ones are mutually exclusive (e.g. for "key size min, max, median", should only one of the three arguments be accepted, or can the user pass all of them)?

Here's what I came up with so far (a rough sketch of how a couple of these could be declared with clap follows the list):

  • number of threads
    • --num-threads - u64 - (defaults to X (TBD))
  • number of total operations
    • --num-operations - u64 - (defaults to X (TBD))
  • proportion of reads, writes, cas, del, and scan operations among the operations
    • --prop-reads - u64
    • --prop-writes - u64
    • --prop-cas - u64
    • --prop-del - u64
    • --prop-scan - u64
  • freshness bias (prefer recent/likely to be in cache, prefer old/not likely to be in cache, no preference)
    • --freshness-bias - String - (defaults to "no preference")
  • non-present-key chance (the chance that a request for cas/get/set/del may be sent for a key that does not exist)
    • --no-present-key-chance - bool
  • key size min, max, median
    • --key-size-min - u64
    • --key-size-max - u64
    • --key-size-median - u64
  • value size min, max, median
    • --value-size-min - u64
    • --value-size-max - u64
    • --value-size-median - u64
  • scan iterations min, max, median
    • --scan-iter-min - u64
    • --scan-iter-max - u64
    • --scan-iter-median - u64
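
For illustration, a minimal sketch of two of these flags with clap's builder API, assuming clap 2.x; the fallback of 1 thread is a placeholder, since the real default is settled below:

extern crate clap;

use clap::{App, Arg};

fn main() {
    let matches = App::new("bench")
        // every flag is optional and takes a single value
        .arg(Arg::with_name("num-threads")
            .long("num-threads")
            .takes_value(true)
            .help("number of benchmark threads"))
        .arg(Arg::with_name("num-operations")
            .long("num-operations")
            .takes_value(true)
            .help("total number of operations"))
        .get_matches();

    // parse the string value into a u64, falling back to a placeholder
    let threads: u64 = matches
        .value_of("num-threads")
        .map(|s| s.parse().expect("--num-threads must be an integer"))
        .unwrap_or(1);

    println!("running with {} threads", threads);
}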

Thanks in advance!

spacejam (Owner) commented

  • clap: it looks awesome! I haven't used it, but if you like it, let's stick with it!
  • required options: none
  • option groups: none, but warn when !(min <= median && median <= max) (the log + env_logger crates are a nice first pick for outputting stuff, and we can decorate the logs later with things like machine resource utilization stats via a custom logger etc.; see the sketch after this list)
  • default threads: number of cpu cores
  • operations: 1 million (doctor evil.jpg)
  • reads -> get, writes -> set (cas is a write, scan is a read, so we should be specific to the operation on the tree)
  • freshness-bias: valid values: old, new, random
  • --no-present-key-chance maybe should be --non-present-key-chance
  • defaults: 80% get + 15% set + 4% scan + 1% cas. 64 byte keys, 512 byte values, with min/max/median all being the same. Number of CPU cores on the machine for thread count. 50 scan iterations.
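
A minimal sketch of that min/median/max warning, assuming the log + env_logger crates mentioned above (the function and the values fed to it are illustrative, not part of the issue):

#[macro_use]
extern crate log;
extern crate env_logger;

// warn when !(min <= median && median <= max)
fn warn_if_unordered(name: &str, min: u64, median: u64, max: u64) {
    if !(min <= median && median <= max) {
        warn!(
            "{}: expected min <= median <= max, got min={} median={} max={}",
            name, min, median, max
        );
    }
}

fn main() {
    env_logger::init().unwrap();
    warn_if_unordered("key-size", 8, 64, 32); // hypothetical parsed values
}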

spacejam (Owner) commented

For the number of CPU cores, I already included the num_cpus crate, so we can use that to get the number.
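
That is, the default thread count could come straight from num_cpus (a one-line sketch; the flag name is the one proposed above):

extern crate num_cpus;

fn main() {
    // default for --num-threads: number of logical CPU cores
    println!("defaulting --num-threads to {}", num_cpus::get());
}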

pmuens (Collaborator) commented Aug 12, 2017

@spacejam could you please go into more detail about the following comments?

defaults: 80% get + 15% set + 4% scan + 1% cas.

Could you maybe provide an example CLI input with the get, set, scan, delete, and cas arguments and how they're translated to percentages? I understand it in a way that you'd say, e.g., --get 10 --set 20 --scan 50 .... But how would we be able to translate that to percentages like the ones described above, since the user provides arbitrary numbers (or am I missing something obvious here)?

64 byte keys, 512 byte values, with min/max/median all being the same.

Does this mean that the key we use should be exactly 64 bytes long and the value always 512 bytes?

Thanks in advance!

spacejam (Owner) commented

@pmuens no need for percentages; we can just sum all of the proportions together and use that as the upper bound for a random number generator (in your example, 80 is the max). Say it spits back 22. We see get is 10, which is less than 22, so we chop off 10 and go to the next. set is 20, and now we're at 12, so we decide that this operation will be a set.

yeah, exactly 64 / 512
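
With min, max, and median all equal, the size distribution collapses to a constant; a key under those defaults might be generated like this (a sketch, not from the thread):

extern crate rand;

use rand::Rng;

fn random_key(len: usize) -> Vec<u8> {
    // with min == max == median == 64, every key is exactly 64 bytes
    let mut rng = rand::thread_rng();
    (0..len).map(|_| rng.gen::<u8>()).collect()
}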

pmuens (Collaborator) commented Aug 13, 2017

@spacejam thanks for the comment 👍

That makes sense! I'll link the outcome of the conversation here in #56 where this will be implemented!

pmuens (Collaborator) commented Aug 15, 2017

@spacejam just one quick question regarding this:

@pmuens no need for percentages; we can just sum all of the proportions together and use that as the upper bound for a random number generator (in your example, 80 is the max). Say it spits back 22. We see get is 10, which is less than 22, so we chop off 10 and go to the next. set is 20, and now we're at 12, so we decide that this operation will be a set.

Unfortunately I'm stuck on how the random number generation is used here.

Could you provide a quick example how the proportions for a set of given Tree operations would be calculated using this? Thanks in advance! 👍

spacejam (Owner) commented

So, if any of the tree op types are provided, the defaults for all of the others should become 0.

bench --set=5 --del=2

for each iteration of each thread that is running commands:

// assumes `use rand::Rng;`, an Op enum, and a surrounding fn returning Op
let sum = 5 + 2;
let ops = vec![(Op::Set, 5), (Op::Del, 2)];

// pick a point in [0, sum) and walk the weight buckets
let mut choice = rand::thread_rng().gen_range(0, sum);

for (op, weight) in ops {
    // Op::Set owns choices 0..5, Op::Del owns 5..7
    if choice < weight {
        return op;
    }
    choice -= weight;
}
unreachable!()
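
With --set=5 --del=2, choice is uniform over 0..7, so Set comes back with probability 5/7 and Del with 2/7, matching the given weights.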

pmuens (Collaborator) commented Aug 16, 2017

Thanks for the explanation and the code-snippet @spacejam 👍
