
use RWMutex replace Mutex in segment #16

Conversation

@hengfeiyang (Contributor) commented May 27, 2022

The overhead will be slight, but it could cause problems when loading very many segments; we need a mechanism to release unused cache entries.

@mschoch (Member) commented May 27, 2022

I have 2 concerns with this approach:

  1. We reuse the buffer, but we keep re-decompressing the same bytes over and over again. For example, suppose I match 1000 documents and they are all in the same chunk of the same segment. Simply to load the _id field of each, we decompress that same compressed stored-doc chunk 1000 times. Do you agree this will happen and that it is bad?
  2. We now have additional lock contention, and all stored-field access will bottleneck on this single shared buffer.

I don't see how this can work either.
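Concern 1 can be illustrated with a small sketch (the `decompressChunk` closure is a hypothetical stand-in for zap's snappy decompression, not code from this PR):

```go
package main

import "fmt"

func main() {
	// Count how often we decompress when nothing is cached.
	decompressions := 0
	decompressChunk := func(chunk int) []byte {
		decompressions++
		return []byte("stored doc chunk") // stand-in for the real snappy.Decode
	}

	// 1000 matching documents, all living in the same stored-field chunk.
	const matches = 1000
	const sameChunk = 7
	for i := 0; i < matches; i++ {
		_ = decompressChunk(sameChunk) // load the _id field of each match
	}
	fmt.Println(decompressions) // prints 1000: the same bytes were decompressed 1000 times
}
```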

@mschoch (Member) commented May 27, 2022

To me, compressing chunks of stored documents is great for compression, but terrible for our existing search API; without changes to that API, it cannot perform well.

@hengfeiyang (Contributor Author)

We can reduce the scope of the lock: when loading a segment, we can initialize one mutex per chunk, without eagerly caching decompressed data. We still cache the decompressed data on first use.

That way, lock contention is split down to 1/N (N = chunk count).

Pseudocode:

    decompressedStoredFieldChunks map[uint64]cacheData

    type cacheData struct {
        data []byte
        m    sync.RWMutex
    }

@mschoch (Member) commented May 27, 2022

I guess I misunderstood what you were doing here.

Don't you still think this is going to cache too much data?

@hengfeiyang (Contributor Author) commented May 27, 2022

The plan is to add a cache manager to keep memory usage at a proper size.

@mschoch (Member) commented May 27, 2022

> The plan is to add a cache manager to keep memory usage at a proper size.

You're saying in order for this to work well, we need some other component that doesn't exist yet?

@hengfeiyang (Contributor Author) commented May 27, 2022

Yes, you are right. Let me think about it more and try to find another solution.

Reviewed code:

    var storedFieldDecompressed []byte
    var ok bool
    if storedFieldDecompressed, ok = s.decompressedStoredFieldChunks[chunkI]; !ok {
    storedFieldDecompressed := s.decompressedStoredFieldChunks[chunkI]
@mschoch (Member):

I don't think we ever actually succeed in updating items inside the map. Although you mutate storedFieldDecompressed, I believe this is a copy, not the item in the map. See this example I think is analogous: https://go.dev/play/p/s0UrS7nD_fg
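The copy semantics behind this point can be demonstrated in isolation (a minimal sketch with an illustrative `cacheData` type, independent of the PR's actual types):

```go
package main

import "fmt"

type cacheData struct {
	data []byte
}

func main() {
	m := map[uint64]cacheData{42: {}}

	// Indexing a map of struct values yields a copy of the entry;
	// mutating the copy does not change the entry in the map.
	entry := m[42]
	entry.data = []byte("decompressed")

	fmt.Println(len(entry.data), len(m[42].data)) // prints: 12 0
	// Possible fixes: store *cacheData pointers in the map, or write
	// the modified value back with m[42] = entry.
}
```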

@hengfeiyang (Contributor Author):

Yeah, you are right. I changed it. 34f03d6

@hengfeiyang hengfeiyang closed this Jul 4, 2022