How did you improve the performance? #10

krowling · 2020-06-16T09:38:05Z

krowling
Jun 16, 2020

I have 2 questions about HSE Performance.

MongoDB Run Phase
MongoDB/HSE improve 8x more throughput than MongoDB/WiredTiger.
I think it improved performance by reducing read/write I/O.
Is this right? If this is correct, How can you reduce I/O?

As I know, YCSB does not support multi-segment key.
Therefore, I think the multi-segmnet key did not affect the performance improvement...

MongoDB Load Phase
MongoDB/HSE delivered more than 6x throughput compared to MongoDB/WiredTiger.
How did you improve performance in load phase?

Please explain compared to wiredtiger.

Thanks,
sohyun

smoyergh · 2020-06-17T21:20:47Z

smoyergh
Jun 17, 2020
Collaborator

Question-1: Many factors contribute to the performance gains. However the primary factors are 1) reduced write/read amplification (as you mention), and 2) a focus on highly concurrent data structures, including those based on RCU where applicable.

Question-2: HSE plugs into the MongoDB Storage Engine API framework, and in that context we do in fact make use of HSE's multi-segment keys. The YCSB application is a MongoDB client, not a direct HSE client.

As to load phase performance, it's the same answer as Question-1.

0 replies

krowling · 2020-06-18T06:16:33Z

krowling
Jun 18, 2020
Author

Thanks :)

How did you reduce write/read amplification compared to MongoDB/Wiredtiger?

0 replies

smoyergh · 2020-06-18T14:22:02Z

smoyergh
Jun 18, 2020
Collaborator

HSE uses a fundamentally different data layout. WiredTiger uses a btree by default (it has an LSM mode, but I don't know if it is supported). HSE uses a data layout of our own design that combines some characteristics of a trie with some characteristics of an LSM, along with novel compaction methods.

We anticipate publishing a whitepaper on the design when our schedule permits.

0 replies

krowling · 2020-06-22T06:54:47Z

krowling
Jun 22, 2020
Author

Thanks,

Are you using only the capacity class in mongoDB/HSE?
There is only nvme SSDs, but capacity class in not optional.

[configuration]

256GB DRAM / 4x Micron® 9300 NVMe SSDs 3.2TB in an LVM striped logical volume

0 replies

smoyergh · 2020-06-22T16:47:13Z

smoyergh
Jun 22, 2020
Collaborator

In the MongoDB results on the HSE Wiki we are using only the capacity media class. We configured 4x Micron 9300 SSDs as a single LVM striped logical volume, and used that logical volume as the capacity media class.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How did you improve the performance? #10

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 5 comments

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

How did you improve the performance? #10

krowling Jun 16, 2020

Replies: 5 comments

smoyergh Jun 17, 2020 Collaborator

krowling Jun 18, 2020 Author

smoyergh Jun 18, 2020 Collaborator

krowling Jun 22, 2020 Author

[configuration]

256GB DRAM / 4x Micron® 9300 NVMe SSDs 3.2TB in an LVM striped logical volume

smoyergh Jun 22, 2020 Collaborator

krowling
Jun 16, 2020

smoyergh
Jun 17, 2020
Collaborator

krowling
Jun 18, 2020
Author

smoyergh
Jun 18, 2020
Collaborator

krowling
Jun 22, 2020
Author

smoyergh
Jun 22, 2020
Collaborator