Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] sending batch every minute but still gets error intermittently "The connection was inactive for more than the allowed 300000 milliseconds and is closed by container 'LinkTracker'" #18593

Closed
3 tasks done
ankygupta9999 opened this issue Jan 13, 2021 · 5 comments
Assignees
Labels
Client This issue points to a problem in the data-plane of the library. customer-reported Issues that are reported by GitHub users external to the Azure organization. Event Hubs needs-author-feedback More information is needed from author to address the issue. pillar-reliability The issue is related to reliability, one of our core engineering pillars. (includes stress testing) question The issue doesn't require a change to the product in order to be resolved. Most issues start as that

Comments

@ankygupta9999
Copy link

ankygupta9999 commented Jan 13, 2021

Describe the bug
I am sending a message batch to eventhub via EventHubProducerClient every Minute. But still intermittently the link tracker closes the link stating no activity since 30 minutes (Send or receive)

The connection was inactive for more than the allowed 300000 milliseconds and is closed by container 'LinkTracker'

Exception or Stack Trace

2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SessionHandler - onSessionRemoteClose connectionId[ehub_dev], entityName[MM_9999990000007], condition[Error{condition=null, description='null', info=null}]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SessionHandler - onSessionRemoteClose closing a local session for connectionId[MM_9999990000007], entityName[ehub_dev], condition[null], description[null]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.ReactorConnection - connectionId[MM_9999990000007] sessionName[ehub_dev]: Error occurred. Removing and disposing session.
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.ReactorSession - connectionId[MM_9999990000007], sessionId[ehub_dev], errorCondition[n/a]: Disposing of session.
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SendLinkHandler - onLinkRemoteClose connectionId[MM_9999990000007], linkName[cbs:sender], errorCondition[amqp:connection:forced], errorDescription[The connection was inactive for more than the allowed 300000 milliseconds and is closed by container 'LinkTracker'. TrackingId:9999b13b12345f12a12d, SystemTracker:gateway5, Timestamp:2021-01-13T12:55:26]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SendLinkHandler - processOnClose connectionId[MM_9999990000007], linkName[cbs:sender], errorCondition[amqp:connection:forced], errorDescription[The connection was inactive for more than the allowed 300000 milliseconds and is closed by container 'LinkTracker'. TrackingId:9999b13b12345f12a12d, SystemTracker:gateway5, Timestamp:2021-01-13T12:55:26]
2021-01-13 18:25:26 [single-1       ] WARN  c.a.c.a.i.RequestResponseChannel - Retry #1. Transient error occurred. Retrying after 60000 ms.
The connection was inactive for more than the allowed 300000 milliseconds and is closed by container 'LinkTracker'. TrackingId:9999b13b12345f12a12d, SystemTracker:gateway5, Timestamp:2021-01-13T12:55:26, errorContext[NAMESPACE: ehub_nspace.servicebus.windows.net, PATH: $cbs, REFERENCE_ID: cbs:sender, LINK_CREDIT: 98]
2021-01-13 18:25:26 [single-1       ] ERROR c.a.c.a.i.RequestResponseChannel - cbs - Exception in RequestResponse links. Disposing and clearing unconfirmed sends.
The connection was inactive for more than the allowed 300000 milliseconds and is closed by container 'LinkTracker'. TrackingId:9999b13b12345f12a12d, SystemTracker:gateway5, Timestamp:2021-01-13T12:55:26, errorContext[NAMESPACE: ehub_nspace.servicebus.windows.net, PATH: $cbs, REFERENCE_ID: cbs:sender, LINK_CREDIT: 98]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.ReceiveLinkHandler - onLinkRemoteClose connectionId[MM_9999990000007], linkName[cbs:receiver], errorCondition[amqp:connection:forced], errorDescription[The connection was inactive for more than the allowed 300000 milliseconds and is closed by container 'LinkTracker'. TrackingId:9999b13b12345f12a12d, SystemTracker:gateway5, Timestamp:2021-01-13T12:55:26]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.ReceiveLinkHandler - processOnClose connectionId[MM_9999990000007], linkName[cbs:receiver], errorCondition[amqp:connection:forced], errorDescription[The connection was inactive for more than the allowed 300000 milliseconds and is closed by container 'LinkTracker'. TrackingId:9999b13b12345f12a12d, SystemTracker:gateway5, Timestamp:2021-01-13T12:55:26]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SessionHandler - onSessionRemoteClose connectionId[cbs-session], entityName[MM_9999990000007], condition[Error{condition=null, description='null', info=null}]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SessionHandler - onSessionRemoteClose closing a local session for connectionId[MM_9999990000007], entityName[cbs-session], condition[null], description[null]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.ReactorConnection - connectionId[MM_9999990000007] sessionName[cbs-session]: Error occurred. Removing and disposing session.
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.ReactorSession - connectionId[MM_9999990000007], sessionId[cbs-session], errorCondition[n/a]: Disposing of session.
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.ConnectionHandler - onConnectionRemoteClose hostname[dummy.servicebus.windows.net:443], connectionId[MM_9999990000007], errorCondition[amqp:connection:forced], errorDescription[The connection was inactive for more than the allowed 300000 milliseconds and is closed by container 'LinkTracker'. TrackingId:9999b13b12345f12a12d, SystemTracker:gateway5, Timestamp:2021-01-13T12:55:26]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SendLinkHandler - onLinkLocalClose connectionId[MM_9999990000007], linkName[cbs:sender], errorCondition[amqp:connection:forced], errorDescription[The connection was inactive for more than the allowed 300000 milliseconds and is closed by container 'LinkTracker'. TrackingId:9999b13b12345f12a12d, SystemTracker:gateway5, Timestamp:2021-01-13T12:55:26]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.ReceiveLinkHandler - onLinkLocalClose connectionId[MM_9999990000007], linkName[cbs:receiver], errorCondition[null], errorDescription[null]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.ConnectionHandler - onConnectionLocalClose hostname[dummy.servicebus.windows.net:443], connectionId[MM_9999990000007], errorCondition[null], errorDescription[null]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.ConnectionHandler - onConnectionUnbound hostname[dummy.servicebus.windows.net:443], connectionId[MM_9999990000007], state[CLOSED], remoteState[CLOSED]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SendLinkHandler - onLinkFinal connectionId[MM_9999990000007], linkName[ehub_dev]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SessionHandler - onSessionFinal connectionId[MM_9999990000007], entityName[ehub_dev], condition[null], description[null]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SendLinkHandler - onLinkFinal connectionId[MM_9999990000007], linkName[cbs:sender]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.ReceiveLinkHandler - onLinkFinal connectionId[MM_9999990000007], linkName[cbs:receiver]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.SessionHandler - onSessionFinal connectionId[MM_9999990000007], entityName[cbs-session], condition[null], description[null]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.handler.ConnectionHandler - onConnectionFinal hostname[dummy.servicebus.windows.net:443], connectionId[MM_9999990000007], errorCondition[null], errorDescription[null]
2021-01-13 18:25:26 [single-1       ] INFO  c.a.m.e.i.EventHubConnectionProcessor - namespace[ehub_nspace.servicebus.windows.net] entityPath[ehub_dev]: Channel is closed.
2021-01-13 18:25:26 [single-1       ] INFO  c.a.m.e.i.EventHubReactorAmqpConnection - connectionId[MM_9999990000007]: Disposing of connection.
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.ReactorConnection - connectionId[MM_9999990000007], errorCondition[n/a]: Disposing of ReactorConnection.
2021-01-13 18:25:26 [single-1       ] INFO  c.a.c.a.i.AmqpExceptionHandler - Shutdown received: ReactorExecutor.close() was called., isTransient[false], initiatedByClient[true]

Repro Steps
Start a code which keeps sending a batch with dummy message to eventhub every minute.

Expected behavior
It should not close the connection by itself if a continuous sending of batch is being done by the process i.e. every 1 minute.

Setup (please complete the following information):

  • OS: MAC
  • IDE : IntelliJ
  • Version of the Library used : azure-messaging-eventhub 5.3.1 and azure-eventhub 3.2.2

Additional context
Add any other context about the problem here.

Information Checklist
Kindly make sure that you have added all the following information above and checkoff the required fields otherwise we will treat the issuer as an incomplete report

  • Bug Description Added
  • Repro Steps Added
  • Setup information Added
@ghost ghost added needs-triage This is a new issue that needs to be triaged to the appropriate team. customer-reported Issues that are reported by GitHub users external to the Azure organization. question The issue doesn't require a change to the product in order to be resolved. Most issues start as that labels Jan 13, 2021
@joshfree joshfree added Client This issue points to a problem in the data-plane of the library. Event Hubs pillar-reliability The issue is related to reliability, one of our core engineering pillars. (includes stress testing) labels Jan 13, 2021
@ghost ghost removed the needs-triage This is a new issue that needs to be triaged to the appropriate team. label Jan 13, 2021
@joshfree
Copy link
Member

@srnagar can you please take a look?

@srnagar
Copy link
Member

srnagar commented Jan 13, 2021

@ankygupta9999 I am taking a look at this issue. When you see the error message, can you confirm that the message you tried to send failed or did the message get sent successfully but there were some timeout errors in the logs?

@ankygupta9999
Copy link
Author

Most of the times, it connects and sends the message fine but at times, it tries few times based on amqpretry options and not able to connect again.

@conniey
Copy link
Member

conniey commented Apr 19, 2021

We recently released 5.7.0, which should alleviate this problem. However, could you turn on AZURE_LOG_LEVEL=INFO to give us more information around the time when it's failing? This would help us diagnose it.

@ramya-rao-a ramya-rao-a added the needs-author-feedback More information is needed from author to address the issue. label Sep 14, 2021
@ramya-rao-a
Copy link
Contributor

Hey @ankygupta9999

We will be closing this issue as we have not heard back from you regarding the logs that were asked.
Like @conniey said, we believe the issue should be resolved in the recent versions of the package.
Please feel free to open a new issue if you are still seeing problems

azure-sdk pushed a commit to azure-sdk/azure-sdk-for-java that referenced this issue Apr 11, 2022
Dev-datamigration-Microsoft.DataMigration-2022-03-30-preview (Azure#18213)

* Initial Commit

* Updating the API version of files

* Adding New APIs, examples and properties

* fixing model validation and spell check error

* Adding Identifier to array

* Solving pretty check

* Adding TableList and Force delete

* Correcting table list example

* Updating expand description

* Updating examples

* Adding filestoragetype on swagger and minmi example

* Adding tablelist and sqldbthresholds to examples response

* Removing SQLDBbackupconfiguration and RowThreshold (Azure#18473)

* fixing s360 errors and adding secure strings (Azure#18593)

* fixing s360 errors and adding secure strings

* nit error

* pretty check

* fileshare password as secure string

* Revert "pretty check"

This reverts commit 677d27c00d5a1185a942873bc02fa54c08fcde03.

* nit error

* Revert "nit error"

This reverts commit 345b38af7f3344a85d666b547acd91fd769a66d2.

* removing extra space

* moving targetDatabaseCollation and provisioningError

Co-authored-by: Vasundhra Manhas <vmanhas@microsoft.com>

Co-authored-by: Vasundhra Manhas <42461257+vasundhraManhas@users.noreply.github.com>
Co-authored-by: Vasundhra Manhas <vmanhas@microsoft.com>
@github-actions github-actions bot locked and limited conversation to collaborators Apr 12, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Client This issue points to a problem in the data-plane of the library. customer-reported Issues that are reported by GitHub users external to the Azure organization. Event Hubs needs-author-feedback More information is needed from author to address the issue. pillar-reliability The issue is related to reliability, one of our core engineering pillars. (includes stress testing) question The issue doesn't require a change to the product in order to be resolved. Most issues start as that
Projects
None yet
Development

No branches or pull requests

5 participants