
"Failed to write, and not due to blocking" on Memcached 1.5.3 #458

Closed
blkperl opened this issue Mar 5, 2019 · 25 comments
@blkperl

blkperl commented Mar 5, 2019

I'm trying to debug an issue where connections to memcached time out after a stream of EAGAIN messages on the recvfrom and sendto system calls.

Temporarily upgrading to the latest version of memcached did not resolve the issue. It can be reproduced on multiple servers across different availability zones; the servers have been replaced, and the issue eventually reoccurs on the new ones.

On the client

I have been unable to isolate the issue from Drupal so far, as testing with PHP manually works fine. The initial drush9 command also works after a memcached restart but fails on all subsequent runs.

timeout 120 strace -s 2048 -Tfe trace=network drush9 php-eval 'true;'
[...]
[pid  1140] sendto(8,   [....] , 8196, MSG_NOSIGNAL|MSG_MORE, NULL, 0) = -1 EAGAIN (Resource temporarily unavailable) <0.000020>

During this time ~13,000 sets and gets are performed from the Drupal bootstrap.

On the server with -vvv flags enabled

Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: <37 GET 674e2bb21b4c41979d663e0d26dd12f9_%3Aconfig%3A-core.base_field_override.node.product_page_block.status
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: > FOUND KEY 674e2bb21b4c41979d663e0d26dd12f9_%3Aconfig%3A-core.base_field_override.node.product_page_block.status
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: >37 Writing bin response:
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: >37   0x81 0x0d 0x00 0x65
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: >37   0x04 0x00 0x00 0x00
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: >37   0x00 0x00 0x04 0x46
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: >37   0x02 0x3e 0x00 0x00
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: >37   0x00 0x00 0x00 0x00
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: >37   0x00 0x00 0x00 0x00
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: 37: going from conn_nread to conn_mwrite
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: Failed to write, and not due to blocking: Connection reset by peer
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: 37: going from conn_mwrite to conn_closing
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: <37 connection closed.
Mar 05 19:59:46 web-12345.redacted.com memcached[32007]: 37: going from conn_closing to conn_closed

Version info

   ["version"]=>
    string(5) "1.5.3"
    ["libevent"]=>
    string(13) "2.0.21-stable"

Client Config

$settings['memcache']['options'][Memcached::OPT_COMPRESSION] = TRUE;
$settings['memcache']['options'][Memcached::OPT_BINARY_PROTOCOL] = TRUE;
@dormando

dormando commented Mar 6, 2019

Sounds like the client killed the connection for some reason :( Server tries to write but gets peer'ed.

Looks like php-memcached, libmemcached based client. What client version? + What distro/where did the server and client builds come from?

@blkperl

blkperl commented Mar 6, 2019

Client is from Ubuntu repositories. Server is compiled and packaged manually.

  • php-memcached 3.1.3
  • libmemcached11 1.0.18-4.1
  • Ubuntu Xenial with memcached compiled from source (1.5.3)
  • PHP 7.2 and PHP 7.1

I also opened php-memcached-dev/php-memcached#429

@dormando

dormando commented Mar 6, 2019

Well the error is likely on the client, unless the server is returning a malformed response. Hopefully you'll have more luck with the php-memcached project? At some point somewhere the client should be emitting an error code.

@blkperl

blkperl commented Mar 6, 2019

Alright, thanks for narrowing it down. I'll wait for them to respond on php-memcached-dev/php-memcached#429

@glennpratt

glennpratt commented Mar 8, 2019

@dormando what does this mean? Do you mean the client closed its end of the connection, or something else? I'm not seeing a close in strace.

Server tries to write but gets peer'ed.

@dormando

dormando commented Mar 8, 2019

@glennpratt If the server is getting "connection reset by peer" when writing, it means it received a TCP RST (connection was reset). That could happen for a number of reasons (firewalls? client crash? fd recycle?), so I can't really tell you why offhand.

@glennpratt

Thanks for the feedback, greatly appreciated. I can't say I see any of those things from strace or tcpdump.

Here's a Wireshark summary from both sides:

https://gist.github.com/glennpratt/1973c7e5f423dfe7365c72d89c34ab21

@dormando

Got some "acked unseen segments" toward the end of the capture. Packet loss/rate issues?

@glennpratt

Memcached Server

ssh server 'netstat -s | grep -i retrans'
    169742 segments retransmited
    TCPLostRetransmit: 1129
    28874 fast retransmits
    1985 forward retransmits
    1254 retransmits in slow start
    156 SACK retransmits failed
    TCPSynRetrans: 5320

Bad Client machine

ssh bad-client 'netstat -s | grep -i retrans'
    1497604 segments retransmited
    TCPLostRetransmit: 16
    3963 fast retransmits
    353 forward retransmits
    286 retransmits in slow start
    23 SACK retransmits failed
    TCPSynRetrans: 5077

Good Client machine

ssh good-client 'netstat -s | grep -i retrans'
    318614 segments retransmited
    TCPLostRetransmit: 2
    2547 fast retransmits
    600 forward retransmits
    293 retransmits in slow start
    75 SACK retransmits failed
    TCPRetransFail: 1
    TCPSynRetrans: 25842

@glennpratt

Also, here's the wireshark summaries from a good session:

https://gist.github.com/glennpratt/9e29e54504806acdc2fc8feca35558f8

@dormando

Confused; what's a good client machine vs. a bad client machine? Only certain clients get disconnected, even from the same servers?

@glennpratt

The behavior only seems to happen from one particular server (out of three in a cluster) at a time. The behavior reliably returns on new clusters, after replacing servers, etc. The errors will stop for 1 or 2 request cycles after restarting memcached.

It seems possible that the issue may be triggered by the actual content of the cache traffic.

@dormando

That's certainly possible, which is why I was asking about getting client errors at first. It should be raising some kind of exception internally.

Also, when I said your Wireshark capture is missing data, I don't mean the machine was retransmitting; I mean your tcpdump wasn't capturing everything, so it wasn't fully correlating. It could've missed the RST in there.

@glennpratt

I am not seeing the "Connection reset by peer" log message except when the client program actually exits, in which case it is likely expected. I'm not seeing that log on the server while the client seems to be hung in a loop around EAGAIN.

The client code is not receiving an Exception in PHP land as far as I can tell. When I inspect the PHP process with gdb while it is in this loop, it is in libmemcached code.

I'll make some new captures and include something from strace and gdb if I can.

@dormando

Any chance you could re-summarize the problem from the start? The more detail the better since I don't know drupal/php/etc.

As originally written it sounds like the client EAGAIN's for some amount of time, then the server gets peer'ed trying to write. From what you say now it sounds like the client gets into EAGAIN infinite loops sometimes, and the client will eventually time out and get killed?

@dormando

dormando commented Apr 4, 2019

Hey,

How goes it? I'd like to get this figured out or close out the issue. Thanks!

@glennpratt

Hi @dormando,

Thanks for checking in!

To summarize from the start:

Drupal application operating on 3 webnodes with a single memcached daemon.

We have isolated the problem to one configuration setting for the PHP memcached extension: Memcached::OPT_BINARY_PROTOCOL.

The problem is this: during application bootstrap there are a number of gets and sets. On one or two of the three nodes, the application bootstrap becomes extremely slow. Looking at gdb, the process is looping inside libmemcached. Looking at Wireshark, we see the "TCP Window Full" message at the same time that the process goes into a very slow loop inside libmemcached.

None of this happens when Memcached::OPT_BINARY_PROTOCOL is not set.

As far as I can tell, the connection resets only happened when a timeout killed the process, as you suggested. In other words, not an issue.

@dormando

Sorry! I know I pinged but just getting back to looking through issues.

Sounds like you figured it out? Something killing your process? Okay if we close the issue then? :)

@blkperl

blkperl commented Apr 27, 2019

@dormando, no, we are stuck now. We need Memcached::OPT_BINARY_PROTOCOL for SASL but don't know how to further debug the slow loop inside libmemcached, which eventually causes the process to die with a connection reset at timeout.

The problem is this: during application bootstrap there are a number of gets and sets. On one or two of the three nodes, the application bootstrap becomes extremely slow. Looking at gdb, the process is looping inside libmemcached. Looking at Wireshark, we see the "TCP Window Full" message at the same time that the process goes into a very slow loop inside libmemcached.

@dormando

Got it. I misread that part, sorry.

From your latest update it might be possible you're just filling the TCP window? I thought libmemcached had a pressure release for that, though; it's been a while since I've thought about it.

Basically: if you issue large gets and sets but aren't reading the responses off the socket fast enough, the receive buffer fills on the client side. Then on memcached's side its send buffer fills, so it goes into a loop waiting to send more data before going back into read mode. Think that's something that might be happening here?

If that's not it I'd have to dig in really closely, or we'd need a reproducible test independent of your app that I can examine.

@dormando

Ah, I meant to say more clearly that this can happen on streams of gets: send "get foo\r\n" x 100k without reading the socket and you'll deadlock.
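
To illustrate the idea at the application level, here's a minimal sketch (not your app's or the Drupal module's actual code; the function name, the $mc client instance, and the $allKeys list are made up) of fetching in bounded batches so replies get read back before more requests are written:

// Fetch keys in bounded batches so each batch's responses are read off
// the socket before the next burst of requests is written, keeping the
// client receive buffer (and memcached's send buffer) from filling up.
function fetchInBatches(Memcached $mc, array $allKeys, int $batchSize = 500): array
{
    $results = [];
    foreach (array_chunk($allKeys, $batchSize) as $batch) {
        // getMulti() writes the requests for this batch and reads every
        // reply before returning, so the buffers drain between batches.
        $found = $mc->getMulti($batch);
        if ($found !== false) {
            $results += $found;
        }
    }
    return $results;
}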

@blkperl

blkperl commented May 16, 2019

For others who might stumble upon this: the fix was to set the socket send and recv buffer sizes, e.g. via the same options array as above:

$settings['memcache']['options'][Memcached::OPT_SOCKET_SEND_SIZE] = 1024 * 1024;
$settings['memcache']['options'][Memcached::OPT_SOCKET_RECV_SIZE] = 1024 * 1024;
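
If I understand the libmemcached behaviors correctly, these options request larger socket send/receive buffers for the connection, which presumably gives the binary-protocol bursts more headroom before the window fills, as dormando described above.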

@fgm

fgm commented May 4, 2020

FWIW, I just got this today on macOS and Drupal 8.8.5:

  • PHP 7.3.17 (cli) (built: Apr 16 2020 12:52:37) ( NTS )
    • Zend Engine v3.3.17, Copyright (c) 1998-2018 Zend Technologies
    • with Xdebug v2.9.4, Copyright (c) 2002-2020, by Derick Rethans
    • with Zend OPcache v7.3.17, Copyright (c) 1999-2018, by Zend Technologies
  • php-memcached 3.1.5
  • libmemcached 1.0.18
  • Memcached daemon 1.6.5

It happens with the default options, so I checked the combinations of all 4 options suggested in this issue separately:

  • Setting OPT_SOCKET_(SEND|RECV)_SIZE did not fix the problem
  • Setting OPT_COMPRESSION to true did not fix it either
  • Setting OPT_BINARY_PROTOCOL to true is what fixed it for me in most cases (all Drush cases), with or without the other options. However, on some pages, even enabling the three other options did not make it disappear entirely (a config sketch follows below).
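
For reference, the setting that made the difference, written in the same $settings format used earlier in this thread (the array keys are an assumption based on the Drupal memcache module config shown above):

$settings['memcache']['options'][Memcached::OPT_BINARY_PROTOCOL] = TRUE;
// OPT_COMPRESSION and OPT_SOCKET_(SEND|RECV)_SIZE made no difference on their own in my tests.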

@dormando

dormando commented May 4, 2020

The fix is to not write a million keys to the socket without reading anything back.

@fgm

fgm commented May 4, 2020

Thanks for the answer, I opened an issue on the Drupal memcache module to suggest this.
