Branch picobob pr2 #4

andrewmarles · 2023-08-28T21:31:25Z

This is the best performance I have been able to get to with the RP2040 PRU. I really have a much better understanding of Remora now and how it interacts with LinuxCNC. The main point of all of these changes has been to reduce the amount of jitter in the base thread running on the MCU to eliminate timing jitter on the step pulses. There were 3 major sources of timing jitter:

Jitter caused by data-copy delays
Jitter caused by interrupt contention between the base thread and servo thread.
Jitter caused by inconsistent execution times in the base thread/step generator.

To address the data copy delays (having to pause the base thread while data is copied in and out with the host) I made a simple double-buffer and now the base thread only needs to be interrupted long enough to switch the pointers over. This did require some small changes to the stepgen code. I am not blocking the servo thread at the moment as I don't think it needs it, but this is maybe something to look at if you start doing more than just basic I/O/Blink in the servo thread.

To address number 2 I made the following changes:

Swapped the timers over to the main microsecond timer and used two different timer alarms, one for the base thread and one for the servo thread. This allows the two threads to have different IRQ priority levels as well as nesting and makes step 2 below a bit easier to accomplish.
I moved the execution of the servo thread outside of an interrupt context. This allows the base thread to interrupt the servo thread and pulse the step pins. Along with this, a servo thread of 2 KHz ensures that there is minimal latency on any updates to/from the LinuxCNC host for the slower servo thread I/O.

The above changes are fine and good but there are some gotchas now (issue 3) because the base thread is running in an interrupt context and doing floating-point math. Since the RP2040 doesn't have a FPU, this math requires calls to the software floating point libraries. The issue here is that those libraries are stored in the SPI flash and contention on that bus (networking code on the other core mainly) plus it is not really fast to begin with means that the base thread can still be interrupted by the rest of the system even if you use critical sections. And because of the library access it's non-trivial to try to get specific functions (stepgen) loaded into RAM. So I just load the entirety of Remora into RAM "set(PICO_COPY_TO_RAM 1)" and it all fits and this eliminates the jitter from the FP instructions.

Might need to keep an eye on this as the networking packet buffers are dynamic, but there is a decent amount of memory left as Remora is fairly compact and the config file is still stored in flash:

Running from flash:
[build] Memory region Used Size Region Size %age Used
[build] FLASH: 168008 B 2 MB 8.01%
[build] RAM: 45788 B 256 KB 17.47%
[build] SCRATCH_X: 2 KB 4 KB 50.00%
[build] SCRATCH_Y: 0 GB 4 KB 0.00%

Running from RAM:
[build] Memory region Used Size Region Size %age Used
[build] FLASH: 167452 B 2 MB 7.98%
[build] RAM: 202908 B 256 KB 77.40%
[build] SCRATCH_X: 2 KB 4 KB 50.00%
[build] SCRATCH_Y: 0 GB 4 KB 0.00%

So the Remora application is taking up about 157K.

A fixed point implementation of the stepgen would be pretty helpful here as it would avoid software FP on the M0 processors, but that's outside my scope for now. I am pretty happy with the performance of the little RP2040 PRU with the above optimizations. I think there is still work to be done on the component side to tune up the gains, but I am getting pretty good results with a 1ms servo thread on a RPI4.

…terrupt context (preserving base thread priority) and also execute Remora from RAM to avoid bus contention.

…-W5500

andrewmarles added 10 commits August 9, 2023 19:24

Changed config to enable compilation on windows.

9630b5e

Updated config to improve ping times. Removed build products from index.

a39aa46

commit current wip.

aa96883

Commit current WIP.

3910e32

WIP double buffered PRUs.

156ddc9

buffered implementation comes out of estop.

51f9f0d

Major changes to use double-buffering, run servo thread outside of in…

f833324

…terrupt context (preserving base thread priority) and also execute Remora from RAM to avoid bus contention.

Restored semaphore for base thread.

4216a7c

Merge branch 'main' of github.com:Expatria-Technologies/Remora-RP2040…

e4404bd

…-W5500

Merge branch 'main' into picobob-followerror

1031601

scottalford75 merged commit 53748dc into scottalford75:main Aug 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Branch picobob pr2 #4

Branch picobob pr2 #4

andrewmarles commented Aug 28, 2023 •

edited

Branch picobob pr2 #4

Branch picobob pr2 #4

Conversation

andrewmarles commented Aug 28, 2023 • edited

andrewmarles commented Aug 28, 2023 •

edited