Skip to content

Commit

Permalink
rc1: increase local connector retries
Browse files Browse the repository at this point in the history
Problem: valgrind test occasionally fails with 30 retries
on the local connector.

The timeout is in rc1 on the first access to the broker.
Raise it 40 retries.

N.B. the retry time starts at 0.016s and doubles each time until
it reaches 2s, then it's 2s each up to the maxmimum.  So times are:
  0.016, 0.032, 0.064, 0.128, 0.256, 0.512, 1.024, 2, 2, 2, ....
Therefore 30 retries is about 26s and 40 retries is about 46s.
  • Loading branch information
garlick authored and grondo committed Nov 3, 2020
1 parent 50ebbbd commit a9b32b8
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion etc/rc1
@@ -1,7 +1,7 @@
#!/bin/bash -e

# Allow connector-local more time to start listening on socket
FLUX_LOCAL_CONNECTOR_RETRY_COUNT=30 RANK=$(flux getattr rank)
FLUX_LOCAL_CONNECTOR_RETRY_COUNT=40 RANK=$(flux getattr rank)

if ! content_backing=$(flux getattr content.backing-module 2>/dev/null); then
content_backing=content-sqlite
Expand Down

0 comments on commit a9b32b8

Please sign in to comment.