Edge Hub failure: messages get stuck while being sent upstream #715
Comments
Looking at the logs, it appears the device lost connectivity to IoT Hub. Are you saying the device had connectivity but Edge Hub was still unable to connect?
The device had connectivity. I am running many devices for testing, all configured with the MQTT protocol instead of AMQP. The attached logs are from two different devices.
When you say "messages get stuck", what exactly do you mean? From the logs I see two kinds of issues. The first is timeouts when Edge Hub connects to IoT Hub; it does eventually recover and connect successfully, so I think this is some kind of transient connectivity issue. The second is that Edge Hub seems to be trying to route a message to a module and did not receive an acknowledgement from the module. Subsequently, an exception seems to have caused Edge Hub to close the connection to the module, but the module does not seem to detect this and recover. We are trying to repro this issue in our long haul tests. In the meantime, could you check which version of the Device SDK your module is using? If it's not the latest version (v1.19.0), you might want to try upgrading to that version and see if that helps. We will keep this thread posted on our findings.
I am using the transparent gateway solution. I have a dedicated Wi-Fi network for the gateway, so there is no chance of a connectivity problem. Messages not reaching the module should not happen either, because I have enabled the offline capability: when I switched off the Wi-Fi network, data from EdgeHub was still reaching the module properly and was stored for up to 24 hours. I am using the following Device Client version: Microsoft.Azure.Devices.Client/1.18.1
If I am correct, there are two other open threads that point to the same issue. According to the comments, this looks like a Device SDK issue? Facing issue like-
Yes, we suspect that this is a Device SDK issue which might get resolved by upgrading to 1.19.0.
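For anyone checking several gateways, the version comparison discussed above can be sketched as follows. This is only an illustration: the helper names `parse_sdk_version` and `needs_upgrade` are hypothetical, and the input is assumed to be a product string in the form quoted in this thread (`Microsoft.Azure.Devices.Client/1.18.1`).

```python
import re

# Minimum Device SDK version suggested in this thread.
MIN_SDK_VERSION = (1, 19, 0)


def parse_sdk_version(product_info: str) -> tuple:
    """Extract (major, minor, patch) from a string such as
    'Microsoft.Azure.Devices.Client/1.18.1'."""
    match = re.search(
        r"Microsoft\.Azure\.Devices\.Client/(\d+)\.(\d+)\.(\d+)", product_info
    )
    if match is None:
        raise ValueError(f"no Device SDK version found in {product_info!r}")
    return tuple(int(part) for part in match.groups())


def needs_upgrade(product_info: str) -> bool:
    """Return True when the reported SDK is older than MIN_SDK_VERSION."""
    return parse_sdk_version(product_info) < MIN_SDK_VERSION


print(needs_upgrade("Microsoft.Azure.Devices.Client/1.18.1"))  # True
print(needs_upgrade("Microsoft.Azure.Devices.Client/1.19.0"))  # False
```

Comparing integer tuples rather than raw strings avoids ordering mistakes such as "1.9.0" sorting after "1.19.0".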
I have upgraded the Device SDK to 1.19.0 on a few gateway devices. I will update this thread if the issue persists.
I haven't seen this issue since updating the SDK to 1.19.0. It seems to be resolved by the Device SDK update.
I'm going to close this issue for now as it seems to be resolved with the new SDK. Please feel free to reopen.
Hi, I am facing an issue with EdgeHub: sending data to IoT Hub gets stuck after 7-10 days.
My downstream device reads data from RS485 and sends it to Edge Hub using the transparent gateway pattern. An Edge module sends the data to IoT Hub. I have also enabled the Edge offline feature.
This is a complete blocker for me in production. EdgeHub does not restart after getting stuck either. Can you please help me find an alternative solution?
Expected Behavior
The following is configured:
DPS with TPM on an Aaeon gateway
Offline capability for message storage: 24 hours
Gateway hardware: 4 GB RAM and 32 GB SSD
Modules should keep running and be restarted after any failure, but this is not happening.
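The reporter's deployment manifest is not included in the issue. As a rough sketch, and assuming the 24-hour offline storage is set through the `storeAndForwardConfiguration.timeToLiveSecs` desired property of the edgeHub module twin, the value could be derived like this. The function name `edge_hub_store_and_forward` is hypothetical, and the output is only a fragment (routes and schema version are omitted).

```python
import json

# Offline message storage duration stated in the issue.
OFFLINE_STORAGE_HOURS = 24


def edge_hub_store_and_forward(hours: int) -> dict:
    """Build the store-and-forward fragment of the edgeHub desired
    properties, converting the retention period from hours to seconds."""
    return {
        "storeAndForwardConfiguration": {
            "timeToLiveSecs": hours * 60 * 60,
        }
    }


fragment = edge_hub_store_and_forward(OFFLINE_STORAGE_HOURS)
print(json.dumps(fragment, indent=2))
```

For 24 hours this gives a `timeToLiveSecs` of 86400.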
I currently work around the issue with the following step whenever EdgeHub gets disconnected from IoT Hub.
Workaround: manually remove all the module containers with "sudo docker rm -f edgeHub edgeAgent mymodule". After running this command, the modules start running perfectly fine again.
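A minimal sketch of scripting that workaround is shown below. Note that `docker rm -f` force-removes containers, not images. The helper names `build_remove_command` and `remove_containers` are hypothetical, and the script defaults to a dry run that only prints the command; this is a stopgap under those assumptions, not a fix for the underlying problem.

```python
import subprocess


def build_remove_command(containers, use_sudo=True):
    """Return the argv list for force-removing the given containers."""
    if not containers:
        raise ValueError("at least one container name is required")
    prefix = ["sudo"] if use_sudo else []
    return prefix + ["docker", "rm", "-f", *containers]


def remove_containers(containers, dry_run=True):
    """Print the command (dry run) or execute it; returns the argv list."""
    command = build_remove_command(containers)
    if dry_run:
        print(" ".join(command))
        return command
    # check=True raises CalledProcessError if docker exits with an error.
    subprocess.run(command, check=True)
    return command


# Container names taken from the command quoted in the issue.
remove_containers(["edgeHub", "edgeAgent", "mymodule"])
```

Passing `dry_run=False` actually runs the command, so it is worth confirming the container names first.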
Current Behavior
I am facing an issue while running modules on the Edge runtime. The device runs perfectly fine for 7-10 days, but after that messages get stuck and I see exceptions in EdgeHub. The EdgeHub connection to IoT Hub fails. Please find the attached logs for your reference. I have seen this issue on two different gateways and am attaching logs from both.
Device (Host) Operating System
<Ubuntu 16.04>
failed_logs_edgeAgent_10_days_08_01_2019.txt
failed_logs_edgeAgent_05012019.txt
failed_logs_edgeHub_10_days_08_01_2019.txt
failed_logs_edgeHub_05012019.txt
Architecture
Container Operating System
Runtime Versions
iotedged
<Run iotedge version: 1.0.5>
Edge Agent
<Image tag (i.e. 1.0.5)>
Edge Hub
<Image tag (i.e. 1.0.5)>
Docker
<Run docker version: 3.0.2>
Logs
Additional Information