With Write-Once Buckets, Riak Search Is Asynchronously Written To [JIRA: RIAK-1904] #512

wbrown-lg · 2015-06-25T13:39:03Z

Symptom
Data written to Write-Once buckets that are indexed by Riak Search sometimes takes a very long time to show up in search results. Write speeds are very fast, but there is potential for missing index data because the index operation seems to be happening completely asynchronously.

Reproduce
Sequentially load a large number of keys into a Write-Once bucket over some period of time, and query the index for that bucket both during and after the load. Note how the total record count continues to grow well after data has stopped flowing into the cluster.
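A minimal sketch of the "watch the count" half of the reproduction, in Python. It assumes the Riak Search HTTP query endpoint at `/search/query/<index>` with a Solr-style JSON response carrying `response.numFound`; the base URL, the index name, and all function names here are hypothetical and not taken from this report.

```python
import json
import time
from urllib.parse import urlencode
from urllib.request import urlopen


def search_count_url(base_url, index):
    """Build a match-all query URL that asks for the hit count only (rows=0)."""
    params = urlencode({"wt": "json", "q": "*:*", "rows": 0})
    return f"{base_url}/search/query/{index}?{params}"


def num_found(body):
    """Extract the total hit count from a Solr-style JSON response body."""
    return json.loads(body)["response"]["numFound"]


def fetch_count(base_url, index):
    """Query a live node (assumed endpoint) and return the indexed record count."""
    with urlopen(search_count_url(base_url, index)) as resp:
        return num_found(resp.read())


def watch_count(get_count, polls, interval_s=1.0, sleep=time.sleep):
    """Sample get_count() `polls` times, sleeping between samples."""
    samples = []
    for i in range(polls):
        samples.append(get_count())
        if i < polls - 1:
            sleep(interval_s)
    return samples


def still_growing(samples):
    """True if the last sample is larger than the first one."""
    return len(samples) >= 2 and samples[-1] > samples[0]
```

Run after the loader has stopped, something like `still_growing(watch_count(lambda: fetch_count("http://localhost:8098", "my_index"), polls=10))` returning `True` would match the symptom described above.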

Assessment
Write-Once buckets combined with Riak Search cannot be used in a high-throughput environment due to the lack of backpressure exerted by asynchronous PUTs to Riak Search. This was discovered with @drewkerrigan, and further details may be obtained from him and me as needed.
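To illustrate why missing backpressure matters, here is a toy model in Python. It is not Riak's actual write or indexing path; the function name, the rates, and the backlog cap are all made-up numbers for illustration. Writes arrive faster than the indexer can drain them, and the only difference between the two cases is whether the backlog is allowed to grow without bound.

```python
def simulate(write_rate, index_rate, seconds, max_backlog=None):
    """Return (accepted_writes, index_backlog) after `seconds` one-second ticks.

    max_backlog=None models indexing with no backpressure: every write is
    accepted and the backlog grows freely. An integer cap models backpressure:
    writes are only accepted while there is room in the backlog.
    """
    backlog = 0
    accepted = 0
    for _ in range(seconds):
        incoming = write_rate
        if max_backlog is not None:
            # Backpressure: the writer is slowed to what the backlog can hold.
            incoming = min(incoming, max_backlog - backlog)
        backlog += incoming
        accepted += incoming
        # The indexer drains at most index_rate entries per tick.
        backlog -= min(backlog, index_rate)
    return accepted, backlog


# Hypothetical rates: 1000 writes/s offered, 400 index operations/s.
unbounded = simulate(1000, 400, 60)
bounded = simulate(1000, 400, 60, max_backlog=2000)
print(unbounded)  # (60000, 36000)
print(bounded)    # (25600, 1600)
```

In the unbounded case the 36000 unindexed entries would take another 90 seconds to drain at 400 per second, which is the same shape as the record count continuing to grow after writes stop. In the bounded case the writer is held to roughly the indexer's rate and the backlog stays small.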

System Configuration

32-node cluster, n=3 replication

zeeshanlakhani · 2015-06-25T15:56:16Z

@fadushin guessing you may want to take a look at this when you have some time going forward.

wbrown-lg · 2015-07-09T02:56:47Z

@zeeshanlakhani @fadushin @drewkerrigan Any updates on this? Any guesses as to when this will be fixed?

The lack of backpressure exerted by Riak Search against a write-once bucket will cause the cluster to crash under load, and this is a fairly serious bug that precludes the use of write-once buckets in our environment.

zeeshanlakhani · 2015-07-09T03:07:50Z

@wbrown-lg no updates currently. @fadushin will take a look after he finishes some other write-once work that he's currently on.

fadushin · 2015-07-10T15:27:38Z

@wbrown-lg I think we have a handle on what's wrong. The fix looks pretty straightforward: make sure we are indexing in the write-once path. We believe the only reason anything is being indexed now is yokozuna AAE, which explains the lag and overload (which I have verified on a local setup).

For a preliminary fix, have a look at:

2.1...bugfix/fd/RIAK-1937
basho/riak_kv@2.1...bugfix/fd/RIAK-1937

With the fix, I have observed about a 3x increase in put latency when indexing (and a corresponding reduction in throughput), which is what you would expect, given your above suggestions. (Note: not a "real" test environment -- just a couple of VMs)
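As a rough check on why a latency increase shows up as a matching throughput drop: with a fixed number of in-flight requests, throughput is approximately concurrency divided by latency (Little's law). The concurrency and latency figures below are hypothetical, chosen only to show the 3x relationship, and are not measurements from this setup.

```python
def throughput(concurrency, latency_s):
    """Steady-state requests per second for a closed loop of `concurrency` clients."""
    return concurrency / latency_s


# Hypothetical figures: 64 concurrent writers, 2 ms puts before the fix, 6 ms after.
before = throughput(64, 0.002)
after = throughput(64, 0.006)
print(round(before / after, 6))  # 3.0
```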

I do not have a timeline for inclusion in a release yet, but I hope this gets the ball rolling.

zeeshanlakhani · 2015-07-29T15:57:20Z

#529 is currently in review and will fix this issue.

zeeshanlakhani · 2015-08-06T17:16:19Z

@wbrown-lg I'm closing this, as #529 has been completed and reviewed. It will be in 2.1.2. https://github.com/basho/yokozuna/wiki/WriteOnceBucketIndexingBug may be helpful as well.

wbrown-lg · 2015-08-25T00:09:24Z

@zeeshanlakhani Appreciate the fix, and I look forward to 2.1.2.

Basho-JIRA changed the title ~~With Write-Once Buckets, Riak Search Is Asynchronously Written To~~ With Write-Once Buckets, Riak Search Is Asynchronously Written To [JIRA: RIAK-1904] Jun 25, 2015

Basho-JIRA added the JIRA: To Do label Jun 25, 2015

zeeshanlakhani closed this as completed Aug 6, 2015

Basho-JIRA added JIRA: In Progress JIRA: Done and removed JIRA: To Do JIRA: In Progress labels Oct 13, 2015
