Limit the client reconnect behavior. #2876

@codelipenghui

Actual behavior

I saw some abnormal metrics in the broker stats.

wechatimg101

When this happened, I checked the network metrics. They showed that the network was saturated.

wechatimg111

So I searched the error logs in the Pulsar cluster and found some strange entries:

wechatimg113

Next, I searched the client logs in Kibana:

wechatimg143

wechatimg144

wechatimg145

Setting the client exception aside, I am concerned about the client reconnect behavior. I think the client reconnects too often, and each reconnect loads data from the bookies, which causes the high network bandwidth usage. The Pulsar client also exposes updateServiceUrl() to users, and calling it triggers reconnects as well.

So we should add a mechanism that limits reconnects to avoid the high network bandwidth usage.
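
One possible shape for such a limit is an exponential backoff with jitter between reconnect attempts. The sketch below is only an illustration of that idea, not the Pulsar client's actual implementation: the class `ReconnectBackoff`, its methods, and the millisecond parameters are all hypothetical names chosen for this example.

```java
import java.util.concurrent.ThreadLocalRandom;

/**
 * Hypothetical helper (not part of the Pulsar client API): tracks how long
 * to wait before the next reconnect attempt. The wait doubles after every
 * attempt up to a cap, and is reset once a connection succeeds.
 */
final class ReconnectBackoff {
    private final long initialMs;
    private final long maxMs;
    private long nextMs;

    ReconnectBackoff(long initialMs, long maxMs) {
        if (initialMs <= 0 || maxMs < initialMs) {
            throw new IllegalArgumentException("need 0 < initialMs <= maxMs");
        }
        this.initialMs = initialMs;
        this.maxMs = maxMs;
        this.nextMs = initialMs;
    }

    /** Returns the current delay ceiling, then doubles it (capped at maxMs). */
    synchronized long nextCeilingMs() {
        long current = nextMs;
        // Compare against maxMs / 2 so the doubling cannot overflow.
        nextMs = (nextMs > maxMs / 2) ? maxMs : nextMs * 2;
        return current;
    }

    /**
     * Returns a jittered delay in [ceiling / 2, ceiling] so that many clients
     * that lost their connection at the same moment do not retry in lockstep.
     */
    long nextDelayMs() {
        long ceiling = nextCeilingMs();
        long floor = ceiling / 2;
        return ThreadLocalRandom.current().nextLong(floor, ceiling + 1);
    }

    /** Call after a successful connection so the next failure starts small. */
    synchronized void reset() {
        nextMs = initialMs;
    }
}
```

A reconnect loop would sleep for `nextDelayMs()` before each attempt and call `reset()` once the connection is established. With `new ReconnectBackoff(100, 1000)` the delay ceilings grow as 100, 200, 400, 800, 1000, 1000 ms, so a client that keeps failing settles at roughly one attempt per second instead of retrying continuously.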

Labels: area/client, type/enhancement