-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Artemis ATP firmware compiled with Arduino_Apollo3 versions 2.x.x Hangs after about 1430 seconds (clock time) #388
Comments
Thanks for the report. This sounds like a pretty big issue. Testing now. |
Im at 3300 right now on the latest release-candidate with no hang.... interesting....
|
I am not getting this failure on v2.0.6
Lets try to find the difference in what I am doing vs you. Board: Sparkfun RedBoard Artemis ATP Is it possible you are having a problem with your serial monitor? I am running this sketch unedited
Were you having problems with a sketch that was doing something else/more. I know the first few revisions of v2 had a problem where after a certain number of BLE messages, the heap would fill and it would stop sending. Anything like that? |
FYI: Just tried the same. no issues it keeps counting... Board: Sparkfun RedBoard Artemis ATP Count: 1754 1755756 regards, |
Update:
@nigelb am still very interested in this, as it was reproducible between you and a friend. Let me know if you can figure out what we are doing different. |
Hi All, In my original post I did not mention that my friend and I are running Windows 10 and are using the Arduino Serial Monitor. I just tested this issue on my Ubuntu PC:
and after compiling and flashing the ATP it did not hang:
Surprisingly the MD5 sum of the firmwares compiled on my Win10 and Linux machines matched:
Linux:
So I copied the firmware compile on my Win10 machine over to my Linux machine and flashed it onto my ATP board:
🎉 The hanging issue no longer occurs:
At this point, on the Win10 machine, I closed Arduino, and deleted the
The hanging issue no longer occurs. |
Reopened after a report from @nseidle that he was getting the same issue. A failure after ~23 minutes. His original example was a BLE project that would quit after 23 minutes, but he is able to reproduce the problem with this sketch
I am trying to reproduce the problem on my end. |
I'll do the same on 2.10. |
Thanks @paulvha, any luck? I should add that he was running with: I used the same platform and version and am not getting the problem. This is an interesting problem. |
I think its unlikely to be a serial issue. The original observation from Nate was a BLE sketch (that was not connected over serial), that simply stopped showing up and blinking a heartbeat LED after 23 minutes. I am also unable to reproduce the problem. I am trying to figure out why this problem seems to happen to only some users. I find it interesting that nigel was able to fix the problem on his end by re-installing, and that the compiled binaries appeared to be the same between working an non-working builds. perhaps there is a problem with the tools, but I cant see how it would cause this issue. Mostly just typing my thought right now, |
more than 2.5 hours and counting.. the computer is still terrible slow.. but the ATP is still running. I will let it run for the night... maybe tell Nate to get a fast computer with a real OS as I still think this is a buffer issue 🙂. The overhead on BLE (having studied that in detail) is huge. A buffer overrun is easily happening.
regards
Paul
…________________________________
Van: Wenn0101 ***@***.***>
Verzonden: maandag 24 mei 2021 21:25
Aan: sparkfun/Arduino_Apollo3 ***@***.***>
CC: paulvha ***@***.***>; Mention ***@***.***>
Onderwerp: Re: [sparkfun/Arduino_Apollo3] Artemis ATP firmware compiled with Arduino_Apollo3 versions 2.x.x Hangs after about 1430 seconds (clock time) (#388)
I think its unlikely to be a serial issue, since the original observation from nate was a BLE sketch (that was not connected over serial), that simply stopped showing up and blinking a heartbeat LED after 23 minutes.
I am also unable to reproduce the problem. I am trying to figure out why this problem seems to happen to only some users. I find it interesting that nigel was able to fix the problem on his end by re-installing, and that the compiled binaries appeared to be the same between working an non-working builds. perhaps there is a problem with the tools, but I cant see how it would cause this issue. Mostly just typing my thought right now,
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub<https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fsparkfun%2FArduino_Apollo3%2Fissues%2F388%23issuecomment-847278809&data=04%7C01%7C%7C50578ae70977487ac9f308d91ee9aeba%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C637574811261909898%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=HZfXK9pWKcuIqIF33KUwCqoeL5ovA5YMbRwAfnfA4xY%3D&reserved=0>, or unsubscribe<https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAD2GBPFXCWKLVK4ASW5FPX3TPKR2JANCNFSM44GCVINA&data=04%7C01%7C%7C50578ae70977487ac9f308d91ee9aeba%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C637574811261919895%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=W2tr1GF2L8XjjJsnPE7a6yV9oyw5b%2FhMoPTPY3k7iKI%3D&reserved=0>.
|
it has been running for more than 12 hours without a problem. regards, |
Thanks for the help Paul! This testing is so valuable in helping me figure narrow down who this affects. -Kyle |
Ok, here is what I have found. The problem appeared to be mostly board specific - a failing board would typically fail regardless of the computer it was being used on, and a nonfailing board would work on computers originally thought to be suspect. Interesting information that lead to the resolution: Explicitly disabling this interrupt in the us_ticker setup should resolve this issue on affected boards.
I have confirmed this on the 1 board I have that would fail, waiting on Nate to confirm that the problem is resolved on his boards. |
interesting... could it be a leftover from the SVL bootloader, where it is set and used to autodetect the baud rate. It does perform a disable before starting the loaded app, but maybe it not happening on all boards. timing ?? I load the sketches on the ATP board with the ASB uploader. |
I was thinking maybe an older version of the SVL bootloader doesn't disable it before loading the app. I did notice all of the "problem" boards are older. I haven't looked into it yet, but I like the reasoning that it could be the bootloader. Either way I think the fix is to have the app explicitly disable it, to be safe. |
look at this post that I saw today... https://forum.sparkfun.com/viewtopic.php?f=168&t=5139. They had the same issue. |
@Wenn0101 and @paulvha my friend actually could not fix the issue by re-installing. When I think back I believe I tried both the SVL and ASB bootloaders and managed to break the SVL bootloader somehow. I had to "Burn Bootloader" to fix the issue. In the same test iteration I re-installed the "Sparkfun Apollo3 Boards" platform. After this everything was working. My friend is going to try updating the bootloader and see if this solves his problem. We both purchased our boards in around 2019. |
This should be fixed as of v2.1.1 |
Board: Sparkfun RedBoard Artemis ATP
Arduino Version: 1.8.13
Sparkfun Apollo3 Boards version 2.0.6, 2.0.3, and I assume the versions in between. This issue does not occur in version 1.2.1.
With the following sketch (derived from this example):
After uploading and running it hangs at step 1430 (or, occasionally 1431):
It seems to happen after a certain amount of time running because if I change the delay to 100 we get this:
I have tested this on two different ATP boards and had a friend try on his as well.
This Issue occurred in all cases that was compiled with the V2 series Arduino_Apollo3.
The text was updated successfully, but these errors were encountered: