cache the entire bolt backend into memory before restoring index #3786

xiang90 · 2015-10-31T05:47:18Z

etcd has an in-mem secondary index for the entire db. Reading the entire boltdb without moving the db file into page cache is slow. It is OK for etcd to scan the entire file first and let os move all file into page cache. It increase the throughput of read from 6K/second to 2M/second with very little overhead (pre read ingthe whole db has a throughput around 300MB/second on SSD)

@benbjohnson Do you have any suggestion? If boltdb just supports warm-up, it would be great. But I still wonder why doing a seq read on a seq written db without warm-up is so slow.

The text was updated successfully, but these errors were encountered:

benbjohnson · 2015-10-31T15:16:58Z

@xiang90 Even though the keys are written sequentially, bolt may write them randomly since it reuses old pages when available. One easy fix, if you're targeting Linux 2.6.23+, is to add an DB.MmapFlags parameter that would allow you to set MAP_POPULATE which should do the readahead. Would that work?

xiang90 · 2015-10-31T15:19:16Z

@benbjohnson

Even though the keys are written sequentially, bolt may write them randomly since

Actually, this is a newly created db with all seq writes.

One easy fix, if you're targeting Linux 2.6.23+, is to add an DB.MmapFlags parameter that would allow you to set MAP_POPULATE which should do the readahead. Would that work?

I will give it a try.

xiang90 · 2015-10-31T15:21:59Z

@benbjohnson

Thanks btw. I will let you know the result soon!

benbjohnson · 2015-10-31T15:25:01Z

@xiang90

this is a newly created db with all seq writes

Even with sequential writes, bolt will still have pages it will reuse because of the freelist being written and partially filled data pages. Those will get filled on the next transaction and the old half filled page will be available for reuse.

I will give it a try.

You'll need to add the MmapFlags as a field on bolt.Options since it's needed during bolt.Open(). Then copy that over to a DB.MmapFlags so it can be OR'd on each mmap() call. Let me know if you need any help with it.

https://github.com/boltdb/bolt/blob/master/db.go#L662-L675

xiang90 · 2015-11-16T00:53:46Z

Fixed by #3865

This adds MmapFlags to DB.Options in case we need syscall.MAP_POPULATE flag in Linux 2.6.23+ to do the sequential read-ahead, as discussed in [1]. --- [1]: etcd-io/etcd#3786

gyuho mentioned this issue Nov 6, 2015

Add MmapFlags option for MAP_POPULATE (unix) boltdb/bolt#455

Merged

jonboulle added area/performance labels Nov 13, 2015

jonboulle assigned yifan-gu and gyuho and unassigned yifan-gu Nov 13, 2015

jonboulle added this to the v2.3.0 milestone Nov 13, 2015

gyuho mentioned this issue Nov 14, 2015

storage/backend: support MAP_POPULATE for unix #3865

Merged

xiang90 closed this as completed Nov 16, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cache the entire bolt backend into memory before restoring index #3786

cache the entire bolt backend into memory before restoring index #3786

xiang90 commented Oct 31, 2015

benbjohnson commented Oct 31, 2015

xiang90 commented Oct 31, 2015

xiang90 commented Oct 31, 2015

benbjohnson commented Oct 31, 2015

xiang90 commented Nov 16, 2015

cache the entire bolt backend into memory before restoring index #3786

cache the entire bolt backend into memory before restoring index #3786

Comments

xiang90 commented Oct 31, 2015

benbjohnson commented Oct 31, 2015

xiang90 commented Oct 31, 2015

xiang90 commented Oct 31, 2015

benbjohnson commented Oct 31, 2015

xiang90 commented Nov 16, 2015