akka.remote.ResendUnfulfillableException: Unable to fulfill resend request since negatively acknowledged payload is no longer in buffer. The resend states between two systems are compromised and cannot be recovered. #23010
Comments
jrudolph added the labels 1 - triaged, bug, and t:remoting on May 23, 2017
The error message means that the sending side of a system message first receives an acknowledgement for a message and then later receives a negative acknowledgement for that same message. How could that happen? IIUC, these buffers and management data structures are kept even if the physical connection had to be re-established.
E.g., in the case above, message 0 was likely confirmed earlier. The receiver then seems to have "forgotten" about that message, and when it receives other messages later on, it sends a negative acknowledgement for it. The question is why this information gets lost at some point.
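To make the failure mode described above concrete, here is a minimal sketch of a sender-side resend buffer. It is a toy model written in Python for illustration only, not Akka's actual implementation; the names `SendBuffer`, `ResendUnfulfillable`, `acknowledge`, and `demo` are hypothetical. It assumes that an acknowledgement carries a cumulative sequence number plus a set of negatively acknowledged sequence numbers, and that the sender drops confirmed messages from its buffer.

```python
class ResendUnfulfillable(Exception):
    """Raised when a nacked message has already been dropped from the buffer."""


class SendBuffer:
    """Toy sender-side resend buffer (hypothetical model, not Akka's code)."""

    def __init__(self):
        # seq -> payload for messages that have not been confirmed yet
        self.pending = {}

    def send(self, seq, payload):
        self.pending[seq] = payload

    def acknowledge(self, cumulative_ack, nacks=()):
        nacked = set(nacks)
        # Every nacked sequence number must still be buffered; otherwise
        # there is nothing left to resend and the state cannot be repaired.
        missing = sorted(seq for seq in nacked if seq not in self.pending)
        if missing:
            raise ResendUnfulfillable(
                f"nacked payload(s) {missing} no longer in buffer"
            )
        to_resend = [(seq, self.pending[seq]) for seq in sorted(nacked)]
        # Drop everything up to the cumulative ack, except nacked messages,
        # which must stay available until they are confirmed.
        confirmed = [
            seq for seq in self.pending
            if seq <= cumulative_ack and seq not in nacked
        ]
        for seq in confirmed:
            del self.pending[seq]
        return to_resend


def demo():
    buf = SendBuffer()
    for seq in range(3):
        buf.send(seq, f"msg-{seq}")
    # The receiver confirms message 0, so the sender drops it.
    buf.acknowledge(cumulative_ack=0)
    try:
        # The receiver has "forgotten" message 0 and now nacks it.
        buf.acknowledge(cumulative_ack=2, nacks=[0])
    except ResendUnfulfillable as exc:
        return str(exc)
    return None
```

In this model the exception can only be triggered by a receiver that nacks a sequence number it has already confirmed, which matches the open question of where the receiver loses that information.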
patriknw added a commit that referenced this issue on Jun 9, 2017: 4e91f8d
patriknw added a commit that referenced this issue on Jun 9, 2017: c7cab39
This was referenced Jun 9, 2017
patriknw added a commit that referenced this issue on Jun 9, 2017: 32f0936
patriknw added a commit that referenced this issue on Jun 9, 2017: 63676ee
patriknw referenced this issue on Jun 9, 2017 (Merged): Fix ResendUnfulfillableException after transport failure detection, #23010 (for validation) #23130
patriknw added this to the 2.4.19 milestone on Jun 9, 2017
patriknw added the 3 - in progress label and removed the 1 - triaged label on Jun 9, 2017
patriknw self-assigned this on Jun 9, 2017
hi hAkkers
We will release on Monday
patriknw added a commit that referenced this issue on Jun 11, 2017: 14617d3
patriknw added a commit that referenced this issue on Jun 11, 2017: 957bdbd
patriknw removed the 3 - in progress label on Jun 11, 2017
patriknw closed this on Jun 11, 2017
patriknw added a commit that referenced this issue on Jun 11, 2017: 4f9921b
patriknw referenced this issue on Jun 11, 2017 (Merged): increase timeout in ActorsLeakSpec, #23010 #23137
patriknw added a commit that referenced this issue on Jun 11, 2017: 34ac46f
patriknw referenced this issue on Jun 11, 2017 (Merged): increase timeout in ActorsLeakSpec, #23010 #23138
patriknw added a commit that referenced this issue on Jun 11, 2017: c761772
patriknw added a commit that referenced this issue on Jun 11, 2017: e983068
patriknw added a commit that referenced this issue on Jun 12, 2017: a4ab142
patriknw added a commit that referenced this issue on Jun 12, 2017: 6685bb8
@patriknw thank you, I see the release for 2.4. Is there a chance to see a 2.5 release with this fix soon?
Yes, we will release 2.5.3 in a few days.
jrudolph commented May 23, 2017
We have seen a few (but infrequent) reports of quarantining happening with this error message.
The error messages usually look like this:
The particular acknowledgement setup can be different.
We have seen reports for this occurring on Akka 2.4.7, 2.4.11, and 2.4.17.
It seems to happen under different kinds of circumstances:
So far, we haven't been able to reproduce the issue or to get hold of a complete set of logs that would allow us to reproduce it.
Maybe related: #16623 and #19780