You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This issue has been automatically marked as stale because it has not had any activity in the past 30 days. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.
I was able to reproduce by overriding the DefaultCleanupPeriod in the config and adding some delay to the loadWAL method.
I can go for a fix, but I'm not sure what's the best thing to do here:
What I'd do is to just skip any instance that's not ready in the current tick's cleanup().
This way, we'd retry in wal_cleanup_period time (eg. 30mins). Question is, what if the WAL also failed to load during the second/third tick, or if it never loads? Is it okay to just keep trying?
(Not really a fan) Exit gracefully with a relevant error. Problem is that a) it adds an implicit thing that wal_cleanup_period is used for, and b) even if the agent is re-started (eg. in a K8s environment), there's nothing ensuring that it wouldn't fail each subsequent time.
Here the WAL loading takes longer than 30mins, at which point the WALCleaner runs. But because WAL is
nil
, it panics. We should fix that.The text was updated successfully, but these errors were encountered: