Master doesn't send heartbeats to replica while scanning wals #4461

rtokarev · 2019-08-28T12:44:32Z

I ran into a situation where a replica couldn't recover from a master because the master spent too long scanning WALs before sending the first row. The master doesn't send heartbeats to the replica while scanning, so the replication disconnect timeout fired on the replica.

It seems that in my case the first row to recover from is located at the end of 00000000000003991061.xlog, so the master had to read almost the whole file before it could send anything.
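To make the expected behaviour concrete, here is a minimal sketch of a scan loop that interleaves heartbeats with a long search for the first relevant row. It is not Tarantool's relay code and not the fix that was eventually applied; `scan_with_heartbeats`, `wanted`, `send_heartbeat`, `interval` and `clock` are all hypothetical names introduced for illustration.

```python
import time
from typing import Callable, Iterable, Iterator, TypeVar

Row = TypeVar("Row")


def scan_with_heartbeats(
    rows: Iterable[Row],
    wanted: Callable[[Row], bool],
    send_heartbeat: Callable[[], None],
    interval: float,
    clock: Callable[[], float] = time.monotonic,
) -> Iterator[Row]:
    """Yield the rows accepted by `wanted`.

    While rows are being skipped, call `send_heartbeat` whenever at least
    `interval` seconds have passed since anything was last sent, so the
    peer does not see a silent connection during a long scan.
    All parameter names are hypothetical, chosen for this sketch.
    """
    last_sent = clock()
    for row in rows:
        if wanted(row):
            yield row
            # A real row also tells the peer that the sender is alive.
            last_sent = clock()
        elif clock() - last_sent >= interval:
            send_heartbeat()
            last_sent = clock()
```

With a row source that takes one second per row and `interval=3.0`, a scan that skips eight rows before reaching the first wanted one would emit two heartbeats instead of staying silent for the whole scan.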

2019-08-28 11:29:33.793 [21848] main/4851/main I> subscribed replica ffdaa1d1-c57c-4795-9bb7-33179fadfbe0 at fd 15, aka 10.246.1.39:3301, peer of 10.246.1.6:36686
2019-08-28 11:29:33.793 [21848] main/4851/main I> remote vclock {1: 4376075, 2: 5} local vclock {1: 4376095, 2: 5}
2019-08-28 11:29:33.800 [21848] relay/10.246.1.6:36686/101/main I> recover from `/var/lib/tarantool/xtaz_2//00000000000003991061.xlog'
2019-08-28 11:29:39.845 [21848] relay/10.246.1.6:36686/101/main I> done `/var/lib/tarantool/xtaz_2//00000000000003991061.xlog'
2019-08-28 11:29:39.846 [21848] relay/10.246.1.6:36686/101/main I> recover from `/var/lib/tarantool/xtaz_2//00000000000004376090.xlog'
2019-08-28 11:29:39.846 [21848] relay/10.246.1.6:36686/101/main I> done `/var/lib/tarantool/xtaz_2//00000000000004376090.xlog'
2019-08-28 11:29:39.846 [21848] relay/10.246.1.6:36686/101/main coio.cc:370 !> SystemError unexpected EOF when reading from socket, called on fd 15, aka 10.246.1.39:3301, peer of 10.246.1.6:36686: Broken pipe
2019-08-28 11:29:39.846 [21848] relay/10.246.1.6:36686/101/main C> exiting the relay loop
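The length of the silent period can be read off the log: it is the time between the "recover from" and "done" lines for the first xlog. The sketch below computes it from those two timestamps. The issue does not state the configured disconnect timeout, so `HYPOTHETICAL_DISCONNECT_TIMEOUT` is a placeholder value, not a figure from the report.

```python
from datetime import datetime

# Timestamps copied from the relay log lines above.
SCAN_START = "2019-08-28 11:29:33.800"  # "recover from" line, first xlog
SCAN_DONE = "2019-08-28 11:29:39.845"   # "done" line, first xlog

# Placeholder: the issue does not say what timeout the replica used.
HYPOTHETICAL_DISCONNECT_TIMEOUT = 4.0  # seconds

LOG_TIME_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def silent_gap_seconds(start: str, end: str) -> float:
    """Return the number of seconds between two log timestamps."""
    delta = datetime.strptime(end, LOG_TIME_FORMAT) - datetime.strptime(
        start, LOG_TIME_FORMAT
    )
    return delta.total_seconds()


def exceeds_timeout(gap: float, disconnect_timeout: float) -> bool:
    """True if a silent gap is longer than the given disconnect timeout."""
    return gap > disconnect_timeout


gap = silent_gap_seconds(SCAN_START, SCAN_DONE)
print(f"silent for {gap:.3f} s")
print("timeout exceeded:", exceeds_timeout(gap, HYPOTHETICAL_DISCONNECT_TIMEOUT))
```

The scan of the first file took about six seconds, during which the relay sent nothing; any disconnect timeout shorter than that would drop the connection before the first row arrived.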

sergos · 2022-06-20T15:55:36Z

Closing as duplicate of #6706 (resolved)

kyukhin added the bug and replication labels Sep 26, 2019

kyukhin added this to the 2.4.1 milestone Sep 26, 2019

kyukhin modified the milestones: 2.4.1, 2.4.2 Apr 10, 2020

kyukhin modified the milestones: 2.4.2, 2.4.3 Jun 22, 2020

kyukhin modified the milestones: 2.4.3, wishlist Oct 23, 2020

kyukhin added the teamS label Jun 20, 2022

kyukhin removed this from the wishlist milestone Jun 20, 2022

kyukhin assigned sergos Jun 20, 2022

sergos closed this as completed Jun 20, 2022
