Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Telegesis: Devices becoming unresponsive #57

Closed
weakfl opened this issue Dec 11, 2017 · 56 comments
Closed

Telegesis: Devices becoming unresponsive #57

weakfl opened this issue Dec 11, 2017 · 56 comments

Comments

@weakfl
Copy link
Contributor

weakfl commented Dec 11, 2017

After some time devices won't respond to commands anymore. The corresponding things are still online in OH but stop working. Judging from the comments in the community thread this problem seems to affect other users as well.

Is it possible this is caused by the growing TX queue also discussed here?

Sending a command does nothing:

10:35:28.616 [INFO ] [smarthome.event.ItemCommandEvent     ] - Item 'LivingroomLights1' received command ON
10:35:28.630 [DEBUG] [ing.zigbee.handler.ZigBeeThingHandler] - D85DEF11A1002826: Command for channel zigbee:device:49455deb:d85def11a1002826:D85DEF11A1002826_18_switch_onoff --> ON
10:35:28.632 [INFO ] [smarthome.event.ItemStateChangedEvent] - LivingroomLights1 changed from OFF to ON
10:35:28.635 [DEBUG] [rtsystems.zigbee.ZigBeeNetworkManager] - TX CMD: OnCommand [On/Off: 0/0 -> 45692/18, cluster=0006, TID=45]
10:35:28.639 [DEBUG] [rtsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=CLUSTER_SPECIFIC_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=69, commandId=1]
10:35:28.648 [DEBUG] [rtsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/1, destinationAddress=45692/18, profile=0104, cluster=6, addressMode=DEVICE, radius=31, sequence=69, payload=01 45 01]
10:35:28.652 [DEBUG] [ongle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=45692, sourceEp=0, destEp=18, profileId=260, clusterId=6, messageData=01 45 01, messageId=null]
10:35:28.655 [DEBUG] [egesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 1587

And the tx queue is just growing and growing:

2017-12-11 10:37:06.178 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: Starting mesh update
2017-12-11 10:37:06.181 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementLqiRequest [0/0 -> 0/0, cluster=0031, TID=4E, startIndex=0]
2017-12-11 10:37:06.184 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=49, addressMode=DEVICE, radius=31, sequence=78, payload=00 00]
2017-12-11 10:37:06.186 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=49, messageData=00 00, messageId=null]
2017-12-11 10:37:06.189 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 1596
2017-12-11 10:37:16.191 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: ManagementLqiRequest returned null
2017-12-11 10:37:16.194 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: IeeeAddressRequest [0/0 -> 0/0, cluster=0001, TID=4F, nwkAddrOfInterest=0, requestType=1, startIndex=0]
2017-12-11 10:37:16.197 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=1, addressMode=DEVICE, radius=31, sequence=79, payload=00 00 00 01 00]
2017-12-11 10:37:16.199 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=1, messageData=00 00 00 01 00, messageId=null]
2017-12-11 10:37:16.201 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 1597
2017-12-11 10:37:26.204 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: Ieee Address request returned null
2017-12-11 10:37:26.207 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementRoutingRequest [0/0 -> 0/0, cluster=0032, TID=50, startIndex=0]
2017-12-11 10:37:26.210 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=50, addressMode=DEVICE, radius=31, sequence=80, payload=00 00]
2017-12-11 10:37:26.213 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=50, messageData=00 00, messageId=null]
2017-12-11 10:37:26.215 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 1598
2017-12-11 10:37:36.218 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: ManagementRoutingRequest returned null
2017-12-11 10:37:36.220 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: Ending mesh update
2017-12-11 10:38:36.178 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: Starting mesh update
2017-12-11 10:38:36.182 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementLqiRequest [0/0 -> 0/0, cluster=0031, TID=51, startIndex=0]
2017-12-11 10:38:36.185 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=49, addressMode=DEVICE, radius=31, sequence=81, payload=00 00]
2017-12-11 10:38:36.188 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=49, messageData=00 00, messageId=null]
2017-12-11 10:38:36.191 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 1599
2017-12-11 10:38:46.193 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: ManagementLqiRequest returned null
2017-12-11 10:38:46.202 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: IeeeAddressRequest [0/0 -> 0/0, cluster=0001, TID=52, nwkAddrOfInterest=0, requestType=1, startIndex=0]
2017-12-11 10:38:46.211 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=1, addressMode=DEVICE, radius=31, sequence=82, payload=00 00 00 01 00]
2017-12-11 10:38:46.219 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=1, messageData=00 00 00 01 00, messageId=null]
2017-12-11 10:38:46.226 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 1600
2017-12-11 10:38:56.239 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: Ieee Address request returned null
2017-12-11 10:38:56.247 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementRoutingRequest [0/0 -> 0/0, cluster=0032, TID=53, startIndex=0]
2017-12-11 10:38:56.255 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=50, addressMode=DEVICE, radius=31, sequence=83, payload=00 00]
2017-12-11 10:38:56.264 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=50, messageData=00 00, messageId=null]
2017-12-11 10:38:56.277 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 1601
2017-12-11 10:39:06.313 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: ManagementRoutingRequest returned null
2017-12-11 10:39:06.318 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: Ending mesh update
2017-12-11 10:40:06.178 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: Starting mesh update
2017-12-11 10:40:06.181 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementLqiRequest [0/0 -> 0/0, cluster=0031, TID=54, startIndex=0]
2017-12-11 10:40:06.183 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=49, addressMode=DEVICE, radius=31, sequence=84, payload=00 00]
2017-12-11 10:40:06.187 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=49, messageData=00 00, messageId=null]
2017-12-11 10:40:06.189 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 1602

Here is a full debug log from OH startup up to a queue count of 8, maybe that helps. Queue seems to start growing around 2017-12-10 14:55:06:
zigbee_tx_queue_growing.log.zip

@cdjackson
Copy link
Contributor

cdjackson commented Dec 11, 2017 via email

@weakfl
Copy link
Contributor Author

weakfl commented Dec 11, 2017

No problem, I just wanted to provide the required information in a separate issue since it's not really related to the state update problem.

Enjoy your vacation!

@cdjackson
Copy link
Contributor

cdjackson commented Dec 11, 2017 via email

@BClark09
Copy link
Member

BClark09 commented Jan 5, 2018

Hi Chris, I've spent some time trying to narrow down my issue and believe I've narrowed it down to the same problem as this. I've been able to pin-point the moment that this occurs too:

2018-01-05 22:28:40.968 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS RX: Data 52 58 3A 33 44 45 42 2C 30 93 05 00 EA 11
2018-01-05 22:28:40.968 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS TX: Data 41 54 2B 53 45 4E 44 55 43 41 53 54 42 3A 30 32 2C 30 30 30 30 2C 30 30 2C 30 30 2C 30 30 30 30 2C 30 30 33 32 0D 00 0F 0D 0A
2018-01-05 22:28:40.968 [DEBUG] [gesis.internal.TelegesisEventFactory] - Error creating instance of EzspFrame
java.lang.NumberFormatException: For input string: "0?�"
	at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65) ~[?:?]
	at java.lang.Integer.parseInt(Integer.java:580) [?:?]
	at com.zsmartsystems.zigbee.dongle.telegesis.internal.protocol.TelegesisFrame.deserializeInt16(TelegesisFrame.java:198) [202:org.openhab.binding.zigbee:2.3.0.201801042145]
	at com.zsmartsystems.zigbee.dongle.telegesis.internal.protocol.TelegesisReceiveMessageEvent.deserialize(TelegesisReceiveMessageEvent.java:165) [202:org.openhab.binding.zigbee:2.3.0.201801042145]
	at com.zsmartsystems.zigbee.dongle.telegesis.internal.TelegesisEventFactory.getTelegesisFrame(TelegesisEventFactory.java:91) [202:org.openhab.binding.zigbee:2.3.0.201801042145]
	at com.zsmartsystems.zigbee.dongle.telegesis.internal.TelegesisFrameHandler$1.run(TelegesisFrameHandler.java:123) [202:org.openhab.binding.zigbee:2.3.0.201801042145]
2018-01-05 22:28:41.247 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS RX: Data 53 52 3A 30 31 2C 30 30 31 37 38 38 30 31 30 30 46 44 39 45 44 33 2C 31 32 42 30 2C 42
2018-01-05 22:28:41.251 [DEBUG] [gesis.internal.TelegesisFrameHandler] - Telegesis event received: TelegesisRouteRecordMessageEvent [hops=1, remoteAddress=0017880100FD9ED3, networkRoute=[4784]]
2018-01-05 22:28:41.253 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis RX: TelegesisRouteRecordMessageEvent [hops=1, remoteAddress=0017880100FD9ED3, networkRoute=[4784]]
2018-01-05 22:28:41.254 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Unhandled Telegesis Frame: TelegesisRouteRecordMessageEvent [hops=1, remoteAddress=0017880100FD9ED3, networkRoute=[4784]]
2018-01-05 22:28:41.255 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS RX: Data 41 43 4B 3A 36 32
2018-01-05 22:28:41.256 [DEBUG] [gesis.internal.TelegesisFrameHandler] - Telegesis event received: TelegesisAckMessageEvent [messageId=98]
2018-01-05 22:28:41.256 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis RX: TelegesisAckMessageEvent [messageId=98]
2018-01-05 22:28:41.257 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Unhandled Telegesis Frame: TelegesisAckMessageEvent [messageId=98]
2018-01-05 22:28:41.259 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS RX: Data 53 52 3A 30 31 2C 30 30 31 37 38 38 30 31 30 30 46 44 39 45 44 33 2C 31 32 42 30 2C 42 33
2018-01-05 22:28:41.260 [DEBUG] [gesis.internal.TelegesisFrameHandler] - Telegesis event received: TelegesisRouteRecordMessageEvent [hops=1, remoteAddress=0017880100FD9ED3, networkRoute=[4784]]
2018-01-05 22:28:41.267 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis RX: TelegesisRouteRecordMessageEvent [hops=1, remoteAddress=0017880100FD9ED3, networkRoute=[4784]]
2018-01-05 22:28:41.268 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Unhandled Telegesis Frame: TelegesisRouteRecordMessageEvent [hops=1, remoteAddress=0017880100FD9ED3, networkRoute=[4784]]
2018-01-05 22:28:41.269 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS RX: Data 52 58 3A 31 32 42 30 2C 2C

I had to miss pasting a character on that unknown input string, it completely stopped the rest of the paste!?

From then on, the TX queue just keeps on growing and none of my lights respond:

2018-01-05 22:28:41.501 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 1
2018-01-05 22:28:44.217 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FFEF9F: Polling...
2018-01-05 22:28:44.217 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FFEF9F: Polling zigbee:device:35a2b5af:0017880100ffef9f:0017880100FFEF9F_11_color_color
2018-01-05 22:28:44.217 [DEBUG] [.zsmartsystems.zigbee.zcl.ZclCluster] - readSync request: ZclAttribute [cluster=COLOR_CONTROL, id=0, name=CurrentHue, dataType=UNSIGNED_8_BIT_INTEGER, lastValue=0]
2018-01-05 22:28:44.217 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ReadAttributesCommand [Color control: 0/0 -> 15851/11, cluster=0300, TID=90, identifiers=[0]]
2018-01-05 22:28:44.217 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=ENTIRE_PROFILE_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=144, commandId=0]
2018-01-05 22:28:44.217 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/1, destinationAddress=15851/11, profile=0104, cluster=768, addressMode=DEVICE, radius=31, sequence=144, payload=00 90 00 00 00]
2018-01-05 22:28:44.217 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=15851, sourceEp=0, destEp=11, profileId=260, clusterId=768, messageData=00 90 00 00 00, messageId=null]
2018-01-05 22:28:44.217 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 2
2018-01-05 22:28:44.359 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 001788011029F4B8: Polling...
2018-01-05 22:28:44.363 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 001788011029F4B8: Polling zigbee:device:35a2b5af:001788011029f4b8:001788011029F4B8_11_switch_level
2018-01-05 22:28:44.364 [DEBUG] [.zsmartsystems.zigbee.zcl.ZclCluster] - readSync request: ZclAttribute [cluster=LEVEL_CONTROL, id=0, name=CurrentLevel, dataType=UNSIGNED_8_BIT_INTEGER, lastValue=254]
2018-01-05 22:28:44.365 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ReadAttributesCommand [Level Control: 0/0 -> 45832/11, cluster=0008, TID=91, identifiers=[0]]
2018-01-05 22:28:44.366 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=ENTIRE_PROFILE_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=145, commandId=0]
2018-01-05 22:28:44.367 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/1, destinationAddress=45832/11, profile=0104, cluster=8, addressMode=DEVICE, radius=31, sequence=145, payload=00 91 00 00 00]
2018-01-05 22:28:44.368 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=45832, sourceEp=0, destEp=11, profileId=260, clusterId=8, messageData=00 91 00 00 00, messageId=null]
2018-01-05 22:28:44.369 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 3
2018-01-05 22:28:44.377 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS TX: Data 41 54 2B 53 45 4E 44 55 43 41 53 54 42 3A 30 35 2C 39 30 33 43 2C 30 30 2C 30 42 2C 30 31 30 34 2C 30 30 30 38 0D 00 8F 00 00 00 0D 0A
2018-01-05 22:28:44.870 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FFEF9F: Command for channel zigbee:device:35a2b5af:0017880100ffef9f:0017880100FFEF9F_11_color_color --> 53,25,100
2018-01-05 22:28:45.195 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FFEF9F: Command for channel zigbee:device:35a2b5af:0017880100ffef9f:0017880100FFEF9F_11_color_color --> 53,29,100
2018-01-05 22:28:45.602 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FFEF9F: Command for channel zigbee:device:35a2b5af:0017880100ffef9f:0017880100FFEF9F_11_color_color --> 53,45,100
2018-01-05 22:28:50.173 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 45832: ManagementLqiRequest returned null
2018-01-05 22:28:50.173 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 4494: ManagementLqiRequest returned null
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: MoveToHueAndSaturationCommand [Color control: 0/0 -> 15851/11, cluster=0300, TID=94, hue=37, saturation=74, transitionTime=10]
2018-01-05 22:28:50.173 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 15851: ManagementLqiRequest returned null
2018-01-05 22:28:50.173 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 4784: ManagementLqiRequest returned null
2018-01-05 22:28:50.173 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 13497: ManagementLqiRequest returned null
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: IeeeAddressRequest [0/0 -> 4784/0, cluster=0001, TID=97, nwkAddrOfInterest=4784, requestType=1, startIndex=0]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: IeeeAddressRequest [0/0 -> 15851/0, cluster=0001, TID=96, nwkAddrOfInterest=15851, requestType=1, startIndex=0]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=CLUSTER_SPECIFIC_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=148, commandId=6]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: IeeeAddressRequest [0/0 -> 4494/0, cluster=0001, TID=95, nwkAddrOfInterest=4494, requestType=1, startIndex=0]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: MoveToHueAndSaturationCommand [Color control: 0/0 -> 15851/11, cluster=0300, TID=93, hue=37, saturation=64, transitionTime=10]
2018-01-05 22:28:50.173 [DEBUG] [.zsmartsystems.zigbee.zcl.ZclCluster] - readSync request: ZclAttribute [cluster=COLOR_CONTROL, id=1, name=CurrentSaturation, dataType=UNSIGNED_8_BIT_INTEGER, lastValue=141]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: IeeeAddressRequest [0/0 -> 45832/0, cluster=0001, TID=92, nwkAddrOfInterest=45832, requestType=1, startIndex=0]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ReadAttributesCommand [Color control: 0/0 -> 4784/11, cluster=0300, TID=99, identifiers=[1]]
2018-01-05 22:28:50.173 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: ManagementRoutingRequest returned null
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=ENTIRE_PROFILE_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=153, commandId=0]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=45832/0, profile=0000, cluster=1, addressMode=DEVICE, radius=31, sequence=146, payload=00 08 B3 01 00]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=CLUSTER_SPECIFIC_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=147, commandId=6]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=4494/0, profile=0000, cluster=1, addressMode=DEVICE, radius=31, sequence=149, payload=00 8E 11 01 00]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/1, destinationAddress=15851/11, profile=0104, cluster=768, addressMode=DEVICE, radius=31, sequence=147, payload=01 93 06 25 40 0A 00]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/1, destinationAddress=15851/11, profile=0104, cluster=768, addressMode=DEVICE, radius=31, sequence=148, payload=01 94 06 25 4A 0A 00]
2018-01-05 22:28:50.173 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=15851, sourceEp=0, destEp=11, profileId=260, clusterId=768, messageData=01 93 06 25 40 0A 00, messageId=null]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=15851/0, profile=0000, cluster=1, addressMode=DEVICE, radius=31, sequence=150, payload=00 EB 3D 01 00]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=4784/0, profile=0000, cluster=1, addressMode=DEVICE, radius=31, sequence=151, payload=00 B0 12 01 00]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: IeeeAddressRequest [0/0 -> 13497/0, cluster=0001, TID=98, nwkAddrOfInterest=13497, requestType=1, startIndex=0]
2018-01-05 22:28:50.189 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=4784, sourceEp=0, destEp=0, profileId=0, clusterId=1, messageData=00 B0 12 01 00, messageId=null]
2018-01-05 22:28:50.189 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=15851, sourceEp=0, destEp=0, profileId=0, clusterId=1, messageData=00 EB 3D 01 00, messageId=null]
2018-01-05 22:28:50.189 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 3
2018-01-05 22:28:50.173 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=15851, sourceEp=0, destEp=11, profileId=260, clusterId=768, messageData=01 94 06 25 4A 0A 00, messageId=null]
2018-01-05 22:28:50.173 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=4494, sourceEp=0, destEp=0, profileId=0, clusterId=1, messageData=00 8E 11 01 00, messageId=null]
2018-01-05 22:28:50.173 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=45832, sourceEp=0, destEp=0, profileId=0, clusterId=1, messageData=00 08 B3 01 00, messageId=null]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/1, destinationAddress=4784/11, profile=0104, cluster=768, addressMode=DEVICE, radius=31, sequence=153, payload=00 99 00 01 00]
2018-01-05 22:28:50.173 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementRoutingRequest [0/0 -> 0/0, cluster=0032, TID=9A, startIndex=15]
2018-01-05 22:28:50.189 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=4784, sourceEp=0, destEp=11, profileId=260, clusterId=768, messageData=00 99 00 01 00, messageId=null]
2018-01-05 22:28:50.189 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=50, addressMode=DEVICE, radius=31, sequence=154, payload=00 0F]
2018-01-05 22:28:50.189 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 7
2018-01-05 22:28:50.189 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 6
2018-01-05 22:28:50.189 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 5
2018-01-05 22:28:50.189 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 5
2018-01-05 22:28:50.189 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 4
2018-01-05 22:28:50.189 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=13497/0, profile=0000, cluster=1, addressMode=DEVICE, radius=31, sequence=152, payload=00 B9 34 01 00]
2018-01-05 22:28:50.204 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=50, messageData=00 0F, messageId=null]
2018-01-05 22:28:50.204 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 8
2018-01-05 22:28:50.204 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 9
2018-01-05 22:28:50.204 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=13497, sourceEp=0, destEp=0, profileId=0, clusterId=1, messageData=00 B9 34 01 00, messageId=null]
2018-01-05 22:28:50.204 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS TX: Data 41 54 2B 53 45 4E 44 55 43 41 53 54 42 3A 30 35 2C 33 44 45 42 2C 30 30 2C 30 42 2C 30 31 30 34 2C 30 33 30 30 0D 00 90 00 00 00 0D 0A
2018-01-05 22:28:50.204 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 10
2018-01-05 22:28:50.204 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS TX: Data 41 54 2B 53 45 4E 44 55 43 41 53 54 42 3A 30 35 2C 42 33 30 38 2C 30 30 2C 30 42 2C 30 31 30 34 2C 30 30 30 38 0D 00 91 00 00 00 0D 0A
2018-01-05 22:28:50.220 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS TX: Data 41 54 2B 53 45 4E 44 55 43 41 53 54 42 3A 30 37 2C 33 44 45 42 2C 30 30 2C 30 42 2C 30 31 30 34 2C 30 33 30 30 0D 01 93 06 25 40 0A 00 0D 0A
2018-01-05 22:28:54.220 [DEBUG] [.zsmartsystems.zigbee.zcl.ZclCluster] - readSync request: ZclAttribute [cluster=COLOR_CONTROL, id=1, name=CurrentSaturation, dataType=UNSIGNED_8_BIT_INTEGER, lastValue=140]
2018-01-05 22:28:54.220 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ReadAttributesCommand [Color control: 0/0 -> 15851/11, cluster=0300, TID=9B, identifiers=[1]]
2018-01-05 22:28:54.220 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: MoveToHueAndSaturationCommand [Color control: 0/0 -> 15851/11, cluster=0300, TID=9C, hue=37, saturation=114, transitionTime=10]
2018-01-05 22:28:54.220 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=ENTIRE_PROFILE_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=155, commandId=0]
2018-01-05 22:28:54.220 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=CLUSTER_SPECIFIC_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=156, commandId=6]
2018-01-05 22:28:54.220 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/1, destinationAddress=15851/11, profile=0104, cluster=768, addressMode=DEVICE, radius=31, sequence=155, payload=00 9B 00 01 00]
2018-01-05 22:28:54.220 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/1, destinationAddress=15851/11, profile=0104, cluster=768, addressMode=DEVICE, radius=31, sequence=156, payload=01 9C 06 25 72 0A 00]
2018-01-05 22:28:54.220 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=15851, sourceEp=0, destEp=11, profileId=260, clusterId=768, messageData=00 9B 00 01 00, messageId=null]
2018-01-05 22:28:54.220 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=15851, sourceEp=0, destEp=11, profileId=260, clusterId=768, messageData=01 9C 06 25 72 0A 00, messageId=null]
2018-01-05 22:28:54.220 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 9
2018-01-05 22:28:54.220 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 10
2018-01-05 22:28:54.236 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS TX: Data 41 54 2B 53 45 4E 44 55 43 41 53 54 42 3A 30 35 2C 31 32 42 30 2C 30 30 2C 30 30 2C 30 30 30 30 2C 30 30 30 31 0D 00 B0 12 01 00 0D 0A
2018-01-05 22:28:54.236 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS TX: Data 41 54 2B 53 45 4E 44 55 43 41 53 54 42 3A 30 35 2C 33 44 45 42 2C 30 30 2C 30 30 2C 30 30 30 30 2C 30 30 30 31 0D 00 EB 3D 01 00 0D 0A
2018-01-05 22:29:00.205 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: ManagementRoutingRequest returned null
2018-01-05 22:29:00.205 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 001788011029DEE4: Polling...
2018-01-05 22:29:00.205 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 4494: Ieee Address request returned null
2018-01-05 22:29:00.205 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 13497: Ieee Address request returned null
2018-01-05 22:29:00.205 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 45832: Ieee Address request returned null
2018-01-05 22:29:00.205 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 4784: Ieee Address request returned null
2018-01-05 22:29:00.205 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementRoutingRequest [0/0 -> 0/0, cluster=0032, TID=9D, startIndex=15]
2018-01-05 22:29:00.205 [DEBUG] [.zsmartsystems.zigbee.zcl.ZclCluster] - readSync request: ZclAttribute [cluster=LEVEL_CONTROL, id=0, name=CurrentLevel, dataType=UNSIGNED_8_BIT_INTEGER, lastValue=254]
2018-01-05 22:29:00.205 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 15851: Ieee Address request returned null
2018-01-05 22:29:00.205 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ReadAttributesCommand [Level Control: 0/0 -> 4784/11, cluster=0008, TID=A2, identifiers=[0]]
2018-01-05 22:29:00.205 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=50, addressMode=DEVICE, radius=31, sequence=157, payload=00 0F]
2018-01-05 22:29:00.205 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementRoutingRequest [0/0 -> 4784/0, cluster=0032, TID=A1, startIndex=0]
2018-01-05 22:29:00.205 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementRoutingRequest [0/0 -> 45832/0, cluster=0032, TID=A0, startIndex=0]
2018-01-05 22:29:00.205 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=45832/0, profile=0000, cluster=50, addressMode=DEVICE, radius=31, sequence=160, payload=00 00]
2018-01-05 22:29:00.205 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=45832, sourceEp=0, destEp=0, profileId=0, clusterId=50, messageData=00 00, messageId=null]
2018-01-05 22:29:00.205 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 9
2018-01-05 22:29:00.205 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementRoutingRequest [0/0 -> 13497/0, cluster=0032, TID=9F, startIndex=0]
2018-01-05 22:29:00.221 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=13497/0, profile=0000, cluster=50, addressMode=DEVICE, radius=31, sequence=159, payload=00 00]
2018-01-05 22:29:00.221 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=13497, sourceEp=0, destEp=0, profileId=0, clusterId=50, messageData=00 00, messageId=null]
2018-01-05 22:29:00.236 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 9
2018-01-05 22:29:00.205 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementRoutingRequest [0/0 -> 4494/0, cluster=0032, TID=9E, startIndex=0]
2018-01-05 22:29:00.236 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=4494/0, profile=0000, cluster=50, addressMode=DEVICE, radius=31, sequence=158, payload=00 00]
2018-01-05 22:29:00.236 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=4494, sourceEp=0, destEp=0, profileId=0, clusterId=50, messageData=00 00, messageId=null]
2018-01-05 22:29:00.236 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 10
2018-01-05 22:29:00.205 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 001788011029DEE4: Polling zigbee:device:35a2b5af:001788011029dee4:001788011029DEE4_11_switch_level
2018-01-05 22:29:00.236 [DEBUG] [.zsmartsystems.zigbee.zcl.ZclCluster] - readSync request: ZclAttribute [cluster=LEVEL_CONTROL, id=0, name=CurrentLevel, dataType=UNSIGNED_8_BIT_INTEGER, lastValue=254]
2018-01-05 22:29:00.236 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ReadAttributesCommand [Level Control: 0/0 -> 13497/11, cluster=0008, TID=A4, identifiers=[0]]
2018-01-05 22:29:00.236 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=ENTIRE_PROFILE_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=164, commandId=0]
2018-01-05 22:29:00.205 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FDA018: Polling...
2018-01-05 22:29:00.221 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS TX: Data 41 54 2B 53 45 4E 44 55 43 41 53 54 42 3A 30 37 2C 33 44 45 42 2C 30 30 2C 30 42 2C 30 31 30 34 2C 30 33 30 30 0D 01 94 06 25 4A 0A 00 0D 0A
2018-01-05 22:29:00.205 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=4784/0, profile=0000, cluster=50, addressMode=DEVICE, radius=31, sequence=161, payload=00 00]
2018-01-05 22:29:00.236 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=4784, sourceEp=0, destEp=0, profileId=0, clusterId=50, messageData=00 00, messageId=null]

For a full log, which includes that character:
[redacted]

@cdjackson
Copy link
Contributor

Sorry @BClark09 missed this - I'll take a look at this exception tomorrow. I'm not completely convinced it's the only reason for this though, but it's still one to solve, then we can see...

@cdjackson
Copy link
Contributor

cdjackson commented Jan 6, 2018

Note to self: This exception is 'ok' - it's handled fine. It's caused by a corrupted frame from the dongle - this is not unexpected since there is no error detection/control with the Telegesis protocol.

Edit: the problem is it is processed a second time as part of the transaction handler - the exception will kill the thread there.

@cdjackson cdjackson changed the title Devices becoming unresponsive Telegesis: Devices becoming unresponsive Jan 6, 2018
@cdjackson cdjackson added the bug label Jan 6, 2018
@cdjackson
Copy link
Contributor

I've just kicked off a build so this should be included in the binding when that's complete (give it 30 minutes).

@BClark09 the exception you saw is now fixed so if this problem is solely related to that, then this issue might be resolved ;). Actually, the exception you logged was not an issue, but the same exception would have also been logged to the console and it's that one that killed the thread. Ultimately these errors will continue to occur due to corrupted frames, but they are logged as debug messages so can be ignored.

@weakfl Let me know how it looks in a few days...

@BClark09
Copy link
Member

BClark09 commented Jan 6, 2018

Alright thanks again @cdjackson! I'll see what occurs when I get home this evening. I've setup a new openHAB instance with just the zigbee binding on it, so it'll be easy to separate and pick apart logs from now :)

@cdjackson
Copy link
Contributor

cdjackson commented Jan 6, 2018 via email

@BClark09
Copy link
Member

BClark09 commented Jan 6, 2018

I got the same type of exception:

Exception.log

It's getting better for certain, I left it running for a while and eventually the lights would take about 27 seconds to respond:

RXStop.log

The logs at which I turn on the Kitchen_Light_Switch start at these lines:

2018-01-06 21:23:06.923 [ome.event.ItemCommandEvent] - Item 'Kitchen_Light_Switch' received command ON
2018-01-06 21:23:06.924 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 001788011029F4B8: Command for channel zigbee:device:35a2b5af:001788011029f4b8:001788011029F4B8_11_switch_level --> ON
2018-01-06 21:23:06.926 [vent.ItemStateChangedEvent] - Kitchen_Light_Switch changed from OFF to ON
2018-01-06 21:23:06.926 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: OnCommand [On/Off: 0/0 -> 45832/11, cluster=0006, TID=FF]

The last RX command that is seen occurs at:

2018-01-06 20:53:44.635 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS RX: Data  53 52 3A 30 30 2C 30 30 31 37 38 38 30 31 31

Hope this helps!

@cdjackson
Copy link
Contributor

I got the same type of exception:

This is fine - as I said they will continue. They are logged as debug level and are ultimately unavoidable since it's caused by corruption on the serial port. At least for the moment I prefer to leave them in the logs - maybe at some point I'll remove them.

I don't think it's causing any sort of issue though is it - it's handled, and it ignored. If they are happening often, then it will likely cause some problems as it will result in timeouts but there's no way around that as the frame is corrupt and there's no error detection in the Telegesis protocol.

Even after the exception the responses are coming back fine....

2018-01-06 20:37:25.322 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ReadAttributesCommand [Basic: 0/0 -> 13497/11, cluster=0000, TID=52, identifiers=[2]]
2018-01-06 20:37:26.293 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - RX CMD: ReadAttributesResponse [Basic: 13497/11 -> 0/0, cluster=0000, TID=52, records=[ReadAttributeStatusRecord [attributeDataType=UNSIGNED_8_BIT_INTEGER, attributeIdentifier=2, status=SUCCESS, attributeValue=1]]]

I was however suspicious that the exceptions were not the cause of this issue and I think your second log shows that - although I've not looked at it in detail yet.

@BClark09
Copy link
Member

BClark09 commented Jan 6, 2018

Yep, that makes perfect sense. Just wanted to make sure that this is expected. Please let me know if I'm throwing lots of nothing at you.

I was sure that I uploaded my own log as a file first time, but as I look at my first post I somehow managed to reupload @weakfl's one? Sorry about that!

@cdjackson
Copy link
Contributor

cdjackson commented Jan 6, 2018 via email

@BClark09
Copy link
Member

BClark09 commented Jan 6, 2018

Which log do you mean (just so I don’t spend time looking at incorrect logs ;) )

This one was a reupload, which I have now redacted:

For a full log, which includes that character:
zigbee_tx_queue_growing.log

Based on what you've said I don't think the actual one I meant is any use as it's all contained in the post. RXStop.log may have good information though.

@weakfl
Copy link
Contributor Author

weakfl commented Jan 7, 2018

I have the latest version running since morning. TX queue is still at 2 and devices are responsive.

It's probably a bit early to tell, but things are looking good so far.

@cdjackson
Copy link
Contributor

Ok, thanks. I’ve been looking over this again over the past hour and have added a timer to time out any transactions that don’t complete. This might be another cause for the problem from looking at
your original log.

I've also found I didn't catch all exceptions caused by corrupt frames, and this is what caused the issue in @BClark09 RXStop log.

I'll post another version later tonight most likely.

@cdjackson
Copy link
Contributor

#79 includes the Telegesis handler transaction timer and improves the exception trapping.

The timer is set at 500ms - if the stick doesn't respond to a transaction in this time, it will be aborted.

In my testing, I put a bodge in to randomly drop every 1 in 4 frames and the binding continued to work ok (although probably there would be a load of other problems if the error rate was really this high!).

If you start to see timeouts in the log, please let me know - I'm not 100% sure that 500ms will cover every transaction, but I think it should ;)

@weakfl
Copy link
Contributor Author

weakfl commented Jan 8, 2018

Unfortunately the queue has grown to 1800 over night. I‘ll install the latest version now and report back tonight/tomorrow.

@weakfl
Copy link
Contributor Author

weakfl commented Jan 8, 2018

Queue is still growing:

TX Telegesis queue: 368

Here's a log from the last couple of hours in case you need it.

@weakfl
Copy link
Contributor Author

weakfl commented Jan 8, 2018

I'm on 2.3.0~20180107193004-1 btw and I'm not sure if your latest changes have been merged yet?
#79 has been merged 19 hours ago but the snapshot build seems to be older?

Is there a better way to check this than comparing build times?

@cdjackson
Copy link
Contributor

In theory there should be a list of changes in cloudbees, but this doesn't seem to work. I've just triggered a new build...

@cdjackson
Copy link
Contributor

Looking at your log, I don't think it includes the latest changes, but I also don't think the latest changes will fix this issue. It looks more like I remember from your previous logs (possibly in the thread on the BJ switches) where I think that the TX queue stops. My assumption is that there's a thread sync issue between RX and TX threads, but I might double check the exception trapping to make sure that any exception is not slipping through the net...

@weakfl
Copy link
Contributor Author

weakfl commented Jan 8, 2018

Looking at your log, I don't think it includes the latest changes

That's quite possible. I'm installing from the 'openHAB 2 Unstable Repository' and not manually. I have no idea when the cloudbees builds are pushed to the package repository and haven't figured out how to tell on which build the packages are actually based on :(

I can't thank you enough for your continued efforts! Please let me know if there's anything I can do to help (provide logs or whatever)...

@BClark09
Copy link
Member

BClark09 commented Jan 8, 2018

haven't figured out how to tell on which build the packages are actually based on

This one I can help with! The apt package is triggered automatically after the manual build completes, so whenever a snapshot is triggered, the unstable repo is updated within a couple of minutes after the build completes.

@cdjackson triggered the build https://openhab.ci.cloudbees.com/job/openHAB-Distribution/1172/ and therefore the apt version 2.3.0~20180107193004-1 will contain the changes.

That doesn't necessarily mean that your bundles would have updated automatically though. If you use bundle:list in the karaf console, then you should see 2.3.0.201801071809 or later.

@weakfl
Copy link
Contributor Author

weakfl commented Jan 11, 2018

Growth has almost stalled, queue has now reached 4 after almost 24 hours.
Here's the log, but be warned, this one is huge :)

zigbee_queue_20180110-20180111.log.zip

@cdjackson
Copy link
Contributor

cdjackson commented Jan 11, 2018 via email

@weakfl
Copy link
Contributor Author

weakfl commented Jan 11, 2018

Well almost stalled :) Previously it would grow pretty fast, now it's growing slowly but it's still growing.

It goes up two 2, then varies, but never gets below 2. After a while it goes up to 3, varies again, but never goes below 3, and so on.

@weakfl
Copy link
Contributor Author

weakfl commented Jan 11, 2018

One more observation: first I wasn't sure if it was just bad luck, but it has happened again now:

The queue will start to grow pretty fast as soon as I set the log level from DEBUG to ERROR. I had noticed that behaviour yesterday when I turned debug logging off and now it happened again. Queue going up to 7 in like 30 minutes.

Is it possible that the supposed thread sync issue has a bigger impact when debug logging is off? I'm not sure how to log this though???

@cdjackson
Copy link
Contributor

cdjackson commented Jan 11, 2018 via email

@weakfl
Copy link
Contributor Author

weakfl commented Jan 11, 2018

Whatever it is, it makes a huge difference if logging is turned off: TX Telegesis queue: 252

@cdjackson
Copy link
Contributor

cdjackson commented Jan 11, 2018 via email

@weakfl
Copy link
Contributor Author

weakfl commented Jan 12, 2018

You're right about the log, it's probably negligible. You mentioned that the queue should never go above 3 permanently, that's why I thought it might matter.

On the other hand, if it’s actually working (i.e. not locking up and becoming unresponsive) then maybe the main issue is resolved (maybe!!)

It does become unresponsive with log level set to error within a couple of hours. But as you said, I have no idea how to catch this. I'll let it run with debug level for a few days and see what happens.

@BClark09
Copy link
Member

Thought it was just a co-incidence but I get the same sort of issue. The lights have been responsive for over a full day now and still going. If I set the logging to ERROR the lights become unresponsive within the hour.

@cdjackson
Copy link
Contributor

I would say that the issue with the growing queue count was not related, but I might be wrong. At some point the queue will get to the point where it won't allow more entries, so maybe it's related. Why it's worse when debug is off, I don't know...

However, I'm reasonably confident that I have fixed this so I'll do an update later and we can see if it is the case ;) ...

@weakfl
Copy link
Contributor Author

weakfl commented Jan 12, 2018

However, I'm reasonably confident that I have fixed this so I'll do an update later and we can see if it is the case ;)

Nice 👍

Just in case, with debug logging on this time:
zigbee_issue57_20180112.log.zip

@weakfl
Copy link
Contributor Author

weakfl commented Jan 13, 2018

Devices still becoming unresponsive with 2.3.0.201801130942, log level set to error.

@BClark09 can you confirm?

@BClark09
Copy link
Member

BClark09 commented Jan 13, 2018

Was just about to post! Almost within an hour, the devices stopped responding when logging was set to WARN.

@cdjackson
Copy link
Contributor

cdjackson commented Jan 13, 2018 via email

@cdjackson
Copy link
Contributor

I'm going to increase the log level on some log entries - just certain exceptions or "minor" errors that shouldn't really happen, and also the log about the queue length.

It will allow to run with error logging, but also log some stuff so we can see if the queue is still increasing and if there are any timeouts etc.

Let's see if that shows anything.

I'll do that soon, but in the meantime if you have a log showing debug info after it's stopped responding, please provide it.

@BClark09
Copy link
Member

BClark09 commented Jan 13, 2018

Unfortunately openHAB restarted (nothing to do with this binding) before I turned on debug logging. So will try to get this to happen again in a few hours when I return home. The following is it's current state but because I'm not at home I can't say if the toggle of Bedroom_Light_Switch was successful or not, just wanted to show that the TX queue is still growing.

2018-01-13 14:46:36.788 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FD9ED3: Polling...
2018-01-13 14:46:38.860 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: Starting mesh update
2018-01-13 14:46:38.868 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementLqiRequest [0/0 -> 0/0, cluster=0031, TID=C8, startIndex=0]
2018-01-13 14:46:38.877 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=49, addressMode=DEVICE, radius=31, sequence=200, payload=00 00]
2018-01-13 14:46:38.886 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=49, messageData=00 00, messageId=null]
2018-01-13 14:46:38.895 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 148
2018-01-13 14:46:38.902 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS sendNextFrame in
2018-01-13 14:46:38.909 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS sendNextFrame ret timer
2018-01-13 14:46:43.137 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FFEF9F: Polling...
2018-01-13 14:46:44.382 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FDA018: Polling...
2018-01-13 14:46:46.995 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FD9ED3: Polling...
2018-01-13 14:46:48.917 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: ManagementLqiRequest returned null
2018-01-13 14:46:48.924 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: IeeeAddressRequest [0/0 -> 0/0, cluster=0001, TID=C9, nwkAddrOfInterest=0, requestType=1, startIndex=0]
2018-01-13 14:46:48.933 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=1, addressMode=DEVICE, radius=31, sequence=201, payload=00 00 00 01 00]
2018-01-13 14:46:48.944 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=1, messageData=00 00 00 01 00, messageId=null]
2018-01-13 14:46:48.951 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 149
2018-01-13 14:46:48.962 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS sendNextFrame in
2018-01-13 14:46:48.968 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS sendNextFrame ret timer
2018-01-13 14:46:53.586 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FFEF9F: Polling...
2018-01-13 14:46:54.520 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FDA018: Polling...
2018-01-13 14:46:56.047 [ome.event.ItemCommandEvent] - Item 'Bedroom_Light_Switch' received command ON
2018-01-13 14:46:56.049 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FD9ED3: Command for channel zigbee:device:4554161b:0017880100fd9ed3:0017880100FD9ED3_11_color_color --> ON
2018-01-13 14:46:56.055 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: MoveToLevelWithOnOffCommand [Level Control: 0/0 -> 30411/11, cluster=0008, TID=CA, level=254, transitionTime=10]
2018-01-13 14:46:56.060 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=CLUSTER_SPECIFIC_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=202, commandId=4]
2018-01-13 14:46:56.066 [vent.ItemStateChangedEvent] - Bedroom_Light_Switch changed from OFF to ON
2018-01-13 14:46:56.070 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/1, destinationAddress=30411/11, profile=0104, cluster=8, addressMode=DEVICE, radius=31, sequence=202, payload=01 CA 04 FE 0A 00]
2018-01-13 14:46:56.077 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=30411, sourceEp=0, destEp=11, profileId=260, clusterId=8, messageData=01 CA 04 FE 0A 00, messageId=null]
2018-01-13 14:46:56.083 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 150
2018-01-13 14:46:56.087 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS sendNextFrame in
2018-01-13 14:46:56.090 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS sendNextFrame ret timer
2018-01-13 14:46:57.207 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FD9ED3: Polling...
2018-01-13 14:46:57.246 [ome.event.ItemCommandEvent] - Item 'Bedroom_Light_Switch' received command OFF
2018-01-13 14:46:57.259 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FD9ED3: Command for channel zigbee:device:4554161b:0017880100fd9ed3:0017880100FD9ED3_11_color_color --> OFF
2018-01-13 14:46:57.270 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: Ieee Address request returned null
2018-01-13 14:46:57.271 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: MoveToLevelWithOnOffCommand [Level Control: 0/0 -> 30411/11, cluster=0008, TID=CB, level=0, transitionTime=10]
2018-01-13 14:46:57.279 [vent.ItemStateChangedEvent] - Bedroom_Light_Switch changed from ON to OFF
2018-01-13 14:46:57.279 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX CMD: ManagementRoutingRequest [0/0 -> 0/0, cluster=0032, TID=CC, startIndex=0]
2018-01-13 14:46:57.289 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX ZCL: ZclHeader [frameType=CLUSTER_SPECIFIC_COMMAND, manufacturerSpecific=false, direction=CLIENT_TO_SERVER, disableDefaultResponse=false, manufacturerCode=0, sequenceNumber=203, commandId=4]
2018-01-13 14:46:57.297 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/0, destinationAddress=0/0, profile=0000, cluster=50, addressMode=DEVICE, radius=31, sequence=204, payload=00 00]
2018-01-13 14:46:57.305 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=0, sourceEp=0, destEp=0, profileId=0, clusterId=50, messageData=00 00, messageId=null]
2018-01-13 14:46:57.305 [DEBUG] [tsystems.zigbee.ZigBeeNetworkManager] - TX APS: ZigBeeApsFrame [sourceAddress=0/1, destinationAddress=30411/11, profile=0104, cluster=8, addressMode=DEVICE, radius=31, sequence=203, payload=01 CB 04 00 0A 00]
2018-01-13 14:46:57.313 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 151
2018-01-13 14:46:57.319 [DEBUG] [ngle.telegesis.ZigBeeDongleTelegesis] - Telegesis send: TelegesisSendUnicastCommand [address=30411, sourceEp=0, destEp=11, profileId=260, clusterId=8, messageData=01 CB 04 00 0A 00, messageId=null]
2018-01-13 14:46:57.324 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TX Telegesis queue: 152
2018-01-13 14:46:57.323 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS sendNextFrame in
2018-01-13 14:46:57.329 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS sendNextFrame in
2018-01-13 14:46:57.334 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS sendNextFrame ret timer
2018-01-13 14:46:57.341 [DEBUG] [gesis.internal.TelegesisFrameHandler] - TELEGESIS sendNextFrame ret timer
2018-01-13 14:47:04.034 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FFEF9F: Polling...
2018-01-13 14:47:04.658 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FDA018: Polling...
2018-01-13 14:47:07.342 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: ManagementRoutingRequest returned null
2018-01-13 14:47:07.351 [DEBUG] [tems.zigbee.ZigBeeNetworkMeshMonitor] - 0: Ending mesh update
2018-01-13 14:47:07.412 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FD9ED3: Polling...
2018-01-13 14:47:14.483 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FFEF9F: Polling...
2018-01-13 14:47:14.796 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FDA018: Polling...
2018-01-13 14:47:17.619 [DEBUG] [ng.zigbee.handler.ZigBeeThingHandler] - 0017880100FD9ED3: Polling...

@cdjackson
Copy link
Contributor

cdjackson commented Jan 13, 2018 via email

@BClark09
Copy link
Member

BClark09 commented Jan 13, 2018

Already looking good. TX Queue is constantly 1 and my lights are responding to polling updates. Will let you know how it goes when I return home!

P.S Found that I can know if my lights are changing because my zwave devices are reporting Luminance changes in each room ;)

@weakfl
Copy link
Contributor Author

weakfl commented Jan 13, 2018

Looking very good here too. I‘d like to let it run a bit longer, but seems to be fixed now.

@cdjackson
Copy link
Contributor

cdjackson commented Jan 13, 2018 via email

@cdjackson
Copy link
Contributor

@weakfl
Copy link
Contributor Author

weakfl commented Jan 14, 2018

Still responsive, queue still at 1.
I‘d say ship it ;)

@BClark09
Copy link
Member

All working great here too! Thanks a lot @cdjackson!!

@weakfl
Copy link
Contributor Author

weakfl commented Jan 15, 2018

Everything still working, I think we can close here. Thanks Chris!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants