-
Notifications
You must be signed in to change notification settings - Fork 377
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
wal_queue_max_size
is set too late on initial box.cfg
#10013
Labels
Comments
sergepetrenko
added
bug
Something isn't working
replication
2.11
Target is 2.11 and all newer release/master branches
labels
May 15, 2024
sergepetrenko
added a commit
to sergepetrenko/tarantool
that referenced
this issue
May 16, 2024
wal_queue_max_size took effect only after the initial box.cfg call, meaning that users with non-zero `replication_sync_timeout` still synced using the default 16 Mb queue size. In some cases the default was too big and the same issues described in tarantool#5536 arised. Fix this. Closes tarantool#10013 NO_DOC=bugfix
sergepetrenko
added a commit
to sergepetrenko/tarantool
that referenced
this issue
May 16, 2024
wal_queue_max_size took effect only after the initial box.cfg call, meaning that users with non-zero `replication_sync_timeout` still synced using the default 16 Mb queue size. In some cases the default was too big and the same issues described in tarantool#5536 arose. Fix this. Closes tarantool#10013 NO_DOC=bugfix
sergepetrenko
added a commit
to sergepetrenko/tarantool
that referenced
this issue
May 20, 2024
wal_queue_max_size took effect only after the initial box.cfg call, meaning that users with non-zero `replication_sync_timeout` still synced using the default 16 Mb queue size. In some cases the default was too big and the same issues described in tarantool#5536 arose. Fix this. Closes tarantool#10013 NO_DOC=bugfix
sergepetrenko
added a commit
to sergepetrenko/tarantool
that referenced
this issue
May 21, 2024
wal_queue_max_size took effect only after the initial box.cfg call, meaning that users with non-zero `replication_sync_timeout` still synced using the default 16 Mb queue size. In some cases the default was too big and the same issues described in tarantool#5536 arose. Fix this. Closes tarantool#10013 NO_DOC=bugfix (cherry picked from commit ab0f791)
sergepetrenko
added a commit
to sergepetrenko/tarantool
that referenced
this issue
May 21, 2024
wal_queue_max_size took effect only after the initial box.cfg call, meaning that users with non-zero `replication_sync_timeout` still synced using the default 16 Mb queue size. In some cases the default was too big and the same issues described in tarantool#5536 arose. Fix this. Closes tarantool#10013 NO_DOC=bugfix (cherry picked from commit ab0f791)
This was referenced May 21, 2024
sergepetrenko
added a commit
that referenced
this issue
May 21, 2024
wal_queue_max_size took effect only after the initial box.cfg call, meaning that users with non-zero `replication_sync_timeout` still synced using the default 16 Mb queue size. In some cases the default was too big and the same issues described in #5536 arose. Fix this. Closes #10013 NO_DOC=bugfix (cherry picked from commit ab0f791)
sergepetrenko
added a commit
that referenced
this issue
May 21, 2024
wal_queue_max_size took effect only after the initial box.cfg call, meaning that users with non-zero `replication_sync_timeout` still synced using the default 16 Mb queue size. In some cases the default was too big and the same issues described in #5536 arose. Fix this. Closes #10013 NO_DOC=bugfix (cherry picked from commit ab0f791)
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
Bug description
wal_queue_max_size
is a dynamic cfg parameter, and it's applied byload_cfg
only afterbox_cfg_xc()
finishes.In case
box_cfg_xc
includes syncing with a remote master, all the syncing wil happen with defaultwal_queue_max_size
setting, 16 megabytes. Turns out, the default is too big for some applications when rows are tiny and wal_queue allows hundreds of thousands of rows to enter the queue in one event loop cycle. If this is the case, the same symptoms already discussed in #5536 appear, and the node is never in time to sync with the master.Reproduced on Tarantool 2.8.4.
A workaround for Tarantool 2.8.4 is to start with
box.cfg{replication_sync_timeout=0.01}
. Then "sync" stage ends almost immediately and the desiredwal_queue_max_size
will be applied as soon as possible.The text was updated successfully, but these errors were encountered: