Gemini may report success even when a write fails #265

kbr- · 2021-03-18T10:01:26Z

When attempting a write, Gemini performs a retry loop:

func (cs *cqlStore) mutate(ctx context.Context, builder qb.Builder, ts time.Time, values ...interface{}) (err error) {
	var i int
	for i = 0; i < cs.maxRetriesMutate; i++ {
		err = cs.doMutate(ctx, builder, ts, values...)
		if err == nil {
			break
		}
		select {
		case <-ctx.Done():
			return ctx.Err()
		case <-time.After(cs.maxRetriesMutateSleep):
		}
	}
	if err != nil {
		if w := cs.logger.Check(zap.InfoLevel, "failed to apply mutation"); w != nil {
			w.Write(zap.Int("attempts", i), zap.Error(err))
		}
		return
	}

	cs.ops.WithLabelValues(cs.system, opType(builder)).Inc()
	return
}

We can see that if it fails maxRetriesMutate times, it reports an error. This will cause Gemini to return non-zero error code as well.

However there is one case where Gemini can exit without any errors even if a write fails. This is happens when we enter

		case <-ctx.Done():
			return ctx.Err()

which happens e.g. when we send an interrupt to Gemini. In this case we exit the loop in the middle of it and there is no guarantee that this particular mutate was successful (all previous retries failed, and the retry in which we exit might have failed as well).

Perhaps a mechanism should be introduced where Gemini finishes all in-progress mutate loops when it is interrupted before it exits so we can know if any write has failed (including the last batch of writes). If the user really wants Gemini to stop, the mechanism could wait for a second interrupt, in which case it would forcibly abort all loops like it does now.

The text was updated successfully, but these errors were encountered:

dkropachev · 2023-05-02T18:51:04Z

This particular code servers mutation operation, not whole cycle.

Additionally, when ctx.Done() returns anything ctx.Err() will return error, most likely context.Canceled, altenrativelly it could be context.DeadlineExceeded, but it is never empty, as result operation that is in the progress are getting aborted and error reported that is getting submitted into result pipeline.

But at the same time, issue reported regarding commanding Gemini to stop is real, we should work it out.
I will close this issue in the favor of #295

dkropachev closed this as completed May 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gemini may report success even when a write fails #265

Gemini may report success even when a write fails #265

kbr- commented Mar 18, 2021

dkropachev commented May 2, 2023

Gemini may report success even when a write fails #265

Gemini may report success even when a write fails #265

Comments

kbr- commented Mar 18, 2021

dkropachev commented May 2, 2023