Update Zipkin connection retry - 2.1.x #10486

nickjjzhao · 2021-07-09T03:40:39Z

Change Description

on networks like EOS Mainnet, 9 attempts is very small and useless for intermittent failures like running a zipkin upgrade. For production use, need to work more like:

retry every 30 seconds
have health reporting as to if connected or not
process SIGHUP or similar to force re-connect
Further if telemetry-url is a DNS name that points to multiple A or AAAA records, nodeos should try all the addreses returned before giving up.

Notes:

Existing code supports item 4 above.
Changes made in fc submodule Update Zipkin connection retry - 2.1 fc#201

Change Type

Select ONE:

Documentation

Stability bug fix

Other

Other - special case

Testing Changes

Select ANY that apply:

New Tests

Existing Tests

Test Framework

CI System

Other

Documentation Additions

Documentation Additions

Method handle_sighup() defined in zipkin is to handle signal SIGHUP, and this method is not called directly from the original SIGHUP signal handler but from other handlers, e.g., handle_sighup() of net_plugin, one of the mandatory plugins, can be used to forward signal SIGHUP by calling zipkin's handle_sighup().

Add a new option:
telemetry-retry-interval-us, optional parameter, specifies the retry interval for connecting to zipkin with default value set to 30000000

Update Zipkin connection retry

a6bd98a

heifner approved these changes Jul 9, 2021

View reviewed changes

nickjjzhao marked this pull request as draft July 9, 2021 04:08

nickjjzhao added 2 commits July 9, 2021 00:30

Revert the changes made in fc

5fd8ec6

Add Zipkin connection changes in fc submodule

9d1397e

nickjjzhao marked this pull request as ready for review July 9, 2021 13:46

heifner approved these changes Jul 9, 2021

View reviewed changes

nickjjzhao merged commit a5b0a8d into release/2.1.x Jul 9, 2021

nickjjzhao deleted the jjz-epe933-zipkin-2.1.x branch July 9, 2021 16:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update Zipkin connection retry - 2.1.x #10486

Update Zipkin connection retry - 2.1.x #10486

Uh oh!

nickjjzhao commented Jul 9, 2021 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Update Zipkin connection retry - 2.1.x #10486

Update Zipkin connection retry - 2.1.x #10486

Uh oh!

Conversation

nickjjzhao commented Jul 9, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Change Description

Change Type

Testing Changes

Documentation Additions

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

nickjjzhao commented Jul 9, 2021 •

edited

Loading