
Question about the use of crdb.Execute() function. #70

Closed
georgysavva opened this issue May 20, 2020 · 5 comments
georgysavva commented May 20, 2020

The crdb package has this function:

cockroach-go/crdb/tx.go

Lines 95 to 102 in 73ffeee

func Execute(fn func() error) (err error) {
    for {
        err = fn()
        if err == nil || !errIsRetryable(err) {
            return err
        }
    }
}

From the docs it's clear that it should be used to retry single-statement (implicit transaction) operations.
But I don't quite understand whether my app should use it if I can't predict whether the 16K result buffer is enough for me in all situations. Let me explain:
My application will extract a reasonable number of rows (up to 100, e.g. via limits), and it's not going to stream the data somewhere else. So it scans the data from all rows into an array locally and returns it as a whole.
But I can't be sure that some batch won't exceed the 16K limit (for example, because of long texts in some column).
To protect myself from transaction contention errors, I see two solutions here:

  1. I could wrap all my single-statement calls to CockroachDB in crdb.Execute() (see the sketch after this list).
  2. Or it's better to increase the result buffer so the results always fit within the limit and CockroachDB never starts to stream. If I then see transaction contention errors in the logs, it would mean I should either increase the limit again or investigate why my app extracts that much data and restrict it.
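
For reference, a minimal sketch of option 1, assuming the v1 import path and database/sql; the profiles table, its columns, and the getProfiles helper are hypothetical:

package storage

import (
    "context"
    "database/sql"

    "github.com/cockroachdb/cockroach-go/crdb"
)

// getProfiles runs a single SELECT (an implicit transaction) and retries it
// via crdb.Execute if CockroachDB returns a retryable error.
func getProfiles(ctx context.Context, db *sql.DB, limit int) ([]string, error) {
    var names []string
    err := crdb.Execute(func() error {
        names = names[:0] // reset in case the statement is retried
        rows, err := db.QueryContext(ctx,
            "SELECT name FROM profiles ORDER BY id LIMIT $1", limit)
        if err != nil {
            return err
        }
        defer rows.Close()
        for rows.Next() {
            var name string
            if err := rows.Scan(&name); err != nil {
                return err
            }
            names = append(names, name)
        }
        return rows.Err()
    })
    return names, err
}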

This might be unrelated to this repository and perhaps should be asked in the main Slack channel instead.

georgysavva commented
Hey. Any update on this?)

rafiss commented Jun 29, 2020

I think it would be best to always make sure that the batch won't exceed the 16K limit. What kind of data are you working with? Is there a max size for each row? If so, the safest thing would be to only load as many rows as will fit, assuming each row has the max size.
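
For example, a tiny sketch of that calculation (the buffer and row sizes are placeholders; it assumes you know a hard upper bound on row size):

package main

import "fmt"

// maxRowsPerBatch returns how many rows of at most maxRowSize bytes
// fit into a result buffer of bufferSize bytes.
func maxRowsPerBatch(bufferSize, maxRowSize int) int {
    if maxRowSize <= 0 {
        return 0
    }
    return bufferSize / maxRowSize
}

func main() {
    const bufferSize = 16 * 1024 // default 16 KiB result buffer
    const maxRowSize = 500       // assumed hard cap per row, in bytes
    fmt.Println(maxRowsPerBatch(bufferSize, maxRowSize)) // 32 rows per LIMIT
}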

Asking in the Slack channel might be a good idea too -- dealing with a limited-size buffer is probably something that has come up for others as well.

georgysavva commented
The type of data that I am working with is something like user profiles in a social app. Rows contain a bunch of text columns whose size can be limited, and maybe a few JSON columns with unstructured data that also shouldn't grow large. So I guess, yes, it's possible to calculate the max size of each row and use pagination limits with the buffer size in mind. For example, if I know that the max row size is 500B and I need to select up to 50 rows, the result size will be 25KB, which exceeds the default buffer limit of 16KB, so I would need to increase it to 32KB, right?
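
If it helps, a minimal sketch of that option, assuming the results_buffer_size connection parameter and the sql.defaults.results_buffer_size cluster setting described in the CockroachDB docs (the DSN and the 32768-byte value are just examples and worth double-checking):

package main

import (
    "database/sql"
    "log"

    _ "github.com/lib/pq" // Postgres-wire driver commonly used with CockroachDB
)

func main() {
    // Assumption: the buffer can be raised per connection via the
    // results_buffer_size connection parameter; 32768 bytes covers
    // 50 rows * 500 B with some headroom.
    dsn := "postgresql://root@localhost:26257/app?sslmode=disable&results_buffer_size=32768"
    db, err := sql.Open("postgres", dsn)
    if err != nil {
        log.Fatal(err)
    }
    defer db.Close()

    // Alternatively, raise the default for all new connections in the
    // cluster (assumed setting name; requires admin privileges).
    if _, err := db.Exec(
        "SET CLUSTER SETTING sql.defaults.results_buffer_size = 32768",
    ); err != nil {
        log.Fatal(err)
    }
}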

rafiss commented Jul 8, 2020

That math sounds good to me. :)

georgysavva commented
Thanks for helping me to figure this out!
