
Prefer local disks when fetching data blocks. #9563

Merged
merged 7 commits into minio:master from klauspost:prefer-local-disks on May 26, 2020

Conversation

klauspost
Contributor

Motivation and Context

Prefer local disks and reconstruct more.

This is an experimental patch that prefers local disks over remote ones.

If the requested server is part of the erasure set, this will always read from the local disk, even if that disk holds a parity shard. In the default setup there is a 50% chance that at least one shard that would otherwise have been fetched remotely is read locally instead.

It basically trades RPC call overhead for Reed-Solomon reconstruction. On distributed localhost this seems to be fairly break-even, with a very small gain in throughput and latency. However, on networked servers the gain should be bigger.
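For illustration, here is a minimal, self-contained Go sketch of the idea (not the PR's actual code). It uses the klauspost/reedsolomon library that MinIO's erasure coding builds on; the shard counts and the `isLocal` layout are assumptions made up for this example. Instead of fetching remote shards over RPC, it drops them and reconstructs the missing ones from locally available data and parity shards.

```go
// Sketch only: prefer locally available shards and reconstruct the rest,
// rather than fetching remote shards over RPC.
package main

import (
	"fmt"

	"github.com/klauspost/reedsolomon"
)

func main() {
	const dataShards, parityShards = 2, 2 // e.g. a 4-disk set: 2 data + 2 parity

	enc, err := reedsolomon.New(dataShards, parityShards)
	if err != nil {
		panic(err)
	}

	// Split an object into data shards and compute the parity shards.
	shards, _ := enc.Split([]byte("example object payload, padded as needed by Split"))
	if err := enc.Encode(shards); err != nil {
		panic(err)
	}

	// Pretend shards 0 and 3 live on local disks (one data, one parity),
	// while shards 1 and 2 would require RPC calls to remote servers.
	isLocal := []bool{true, false, false, true}

	// Count local shards; only skip remote fetches if enough shards remain
	// (at least dataShards) to reconstruct the object.
	available := 0
	for _, l := range isLocal {
		if l {
			available++
		}
	}
	if available >= dataShards {
		for i, l := range isLocal {
			if !l {
				shards[i] = nil // simulate "not fetched remotely"
			}
		}
	}

	// Reed-Solomon reconstruction replaces the RPC round-trips.
	if err := enc.Reconstruct(shards); err != nil {
		panic(err)
	}
	ok, _ := enc.Verify(shards)
	fmt.Println("reconstructed from local shards only:", ok)
}
```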

1MB objects, before:

```
Operation: GET. Concurrency: 32. Hosts: 4.

Requests considered: 76257:
 * Avg: 25ms 50%: 24ms 90%: 32ms 99%: 42ms Fastest: 7ms Slowest: 67ms
 * First Byte: Average: 23ms, Median: 22ms, Best: 5ms, Worst: 65ms

Throughput:
* Average: 1213.68 MiB/s, 1272.63 obj/s (59.948s, starting 14:45:44 CEST)
```

After:

```
Operation: GET. Concurrency: 32. Hosts: 4.

Requests considered: 78845:
 * Avg: 24ms 50%: 24ms 90%: 31ms 99%: 39ms Fastest: 8ms Slowest: 62ms
 * First Byte: Average: 22ms, Median: 21ms, Best: 6ms, Worst: 57ms

Throughput:
* Average: 1255.11 MiB/s, 1316.08 obj/s (59.938s, starting 14:43:58 CEST)
```

Bonus fix: only request a heal once per object.

If no disks are local, performance should be unaffected.
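As a rough illustration of the "heal once" bonus fix above, the idea is simply to remember that a heal has already been requested for the object being read. This is a hypothetical sketch, not the PR's actual code; `erasureReader` and `queueHeal` are made-up names.

```go
// Hypothetical sketch: request a heal at most once per object read.
package main

import "fmt"

type erasureReader struct {
	healRequested bool // set after the first heal request for this object
}

func (r *erasureReader) maybeQueueHeal(bucket, object string) {
	if r.healRequested {
		return // a heal was already requested during this read
	}
	r.healRequested = true
	queueHeal(bucket, object)
}

func queueHeal(bucket, object string) {
	// Placeholder: a real server would enqueue a background heal task here.
	fmt.Println("heal requested for", bucket+"/"+object)
}

func main() {
	r := &erasureReader{}
	r.maybeQueueHeal("bucket", "object") // triggers a heal request
	r.maybeQueueHeal("bucket", "object") // no-op: already requested
}
```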

How to test this PR?

Run the server.

Types of changes

  • New feature (non-breaking change which adds functionality)

klauspost and others added 3 commits May 9, 2020 15:47
Prefer local disks and reconstruct more.
@klauspost
Contributor Author

> Etag mismatch for multipart 0 byte object

This could be legit. Though fetching 0-byte objects seems like a waste of IO.

@harshavardhana
Member
LGTM nice stuff

@minio-trusted
Contributor

Mint Automation

Test | Result
--- | ---
mint-xl.sh | ✔️
mint-large-bucket.sh | ✔️
mint-fs.sh | ✔️
mint-dist-xl.sh | ✔️
mint-gateway-s3.sh | ✔️
mint-gateway-azure.sh | ✔️
mint-gateway-nas.sh | ✔️
Deleting image on docker hub
Deleting image locally

@klauspost changed the title from "Experiment: Prefer local disks" to "Prefer local disks when fetching data blocks." on May 19, 2020
@harshavardhana
Member

PTAL @krishnasrinivas

@harshavardhana
Member

@krishnasrinivas PTAL

@harshavardhana harshavardhana merged commit 4a007e3 into minio:master May 26, 2020
@klauspost klauspost deleted the prefer-local-disks branch May 27, 2020 10:34
blaenk pushed a commit to blaenk/minio that referenced this pull request Aug 26, 2020