Z21 pending request queue #73

gfgit · 2023-06-24T21:43:30Z

Pending request queue allows to retry sending messages after a timeout if expected reply is not received.
This also allows to detect a message is a reply to our own changes and react to it differently

See #60 for earlier discussion.

NOTE: needs #82 applied first

Both this and #82 LocoCache try to resolve the "async loop" issue (See #57).
This 2 solutions are orthogonal and can live together.

gfgit · 2023-06-24T22:13:05Z

Quick comparison with #59 approach (not fully accurate):

LocoCache approach

PROS:

Less memory footprint when number of loco is low and many messages are sent
Can extract individual changes from LanXLocoInfo like change only direction or only emergency stop
This is possible because we cache last value of direction, speed and emergency stop per each loco.
Can somewhat detect speed trend change: user reduces speed while train is accelerating or viceversa

CONS:

Weak logic based on heuristic. Many edge cases
Logic is more obscure
More memory when many loco are running
Annoying "ignore" delay when user moves throttle right after Traintastic has updated same loco
This is needed to avoid receiving our own reply but creates discrepancy between real loco state and cached one.

Pending Request Queue approach

PROS:

While logic is more complex, it's easier to understand
More accurate for own replies (which get then ignored)
Expandable to other message types
Allows re-sending a message if no reply is received so it workarounds UDP packet loss
Logic is much more robust
No delay needed for throttle messages

CONS:

Can detect external changes as our own replies if messages match.
This shouldn't be a big problem but there are at least some edge cases (see below)
Potential risk of big memory footprint is sending a lot of messages in a fast manner
Does not check received message order so 2 replies referring to same object can be
detected in wrong order.
Cannot extract individual changes because it has no track of last value

Edge case for reply ordering and external changes

Set a Train to speed 100.
It will slowly set speed from 0 to 100 at regular intervals
Say you already have enqueued from 0 to 20 and Z21 still has not answered
You should expect answers in ascending order from 0 to 20
If you get speed 15 after speed 5 it means someone else has explicitly set speed to 15
This "15" might be detected as our own reply because is less than 20 and anyway less than target 100
So Train will continue to accelerate to 100 exceeding user set speed 15

We should detect the "bump" from 5 to 15 which should not interpreted as "train is already at 15"
but instead as "accelerate until 15"

This can be fixed adding some sort of counter to tracked replies.

Future work

The solution of course will be "best of both worlds". I'll try to tune both approaches to work together
and understand which one is doing what exactly and what are the edge cases left.

It already shows big improvement when simulating enormous network delays (100 ms for every message on Z21 side)
but it's not yet perfect.
It really needs testing with real hardware (which unfortunately I don't have) because different behaviors might arise

reinder · 2023-06-26T21:28:23Z

Looking good, I really like it. Nice work! I don't own a z21 or Z21 yet so can't test it against a real one, I only have a Digikeijs DR5000 which supports the Z21 protocol.

If you get speed 15 after speed 5 it means someone else has explicitly set speed to 15

Or that 6 .. 14 are dropped, it's UDP after all. But under normal circumstances without a high network load it probably won't happen.

Maybe you can add a debug option to log some additional info about the retries, that might be useful if we find people to do some testing with it.

p.s. As the Z21 implementation is receiving many improvements, it seems fair to me that you add a Copyright line for your name. (Don't know how this normally works...this is my first Open Source project with contributions :))

gfgit · 2023-06-27T14:44:45Z

p.s. As the Z21 implementation is receiving many improvements, it seems fair to me that you add a Copyright line for your name. (Don't know how this normally works...this is my first Open Source project with contributions :))

I'm glad!

I forgot build is failing because I've based it on top of #56 so I'll need to modify it a bit

gfgit · 2023-11-02T17:12:44Z

Rebased onto #82

Handle LanXLocoInfo inside ClientKernel This allows reacting to external decoder state changes

Z21 Firmware 1.42 adds F29 to F31 to LAN_X_LOCO_INFO

Decoder changes caused by Z21 should not be sent back to Z21

React to Emenrgency Stop and direction change regardless of timeout status

server/src/hardware/protocol/z21/clientkernel.hpp

server/src/hardware/protocol/z21/messages.cpp

server/src/hardware/protocol/z21/messages.hpp

Now m_isUpdatingFromKernel P.S. I'm tired of rebasing!

It cannot be null

This makes it possible to detect replies from Z21 originated by our own requests and process them differently than externally generated messages. This also enables resending requests which did not receive the expected reply in timeout

gfgit force-pushed the work/z21_pending_queue branch from 9f5851f to 0bf0698 Compare July 10, 2023 14:59

gfgit force-pushed the work/z21_pending_queue branch from 0bf0698 to 094f4ef Compare November 2, 2023 14:18

gfgit mentioned this pull request Nov 2, 2023

New Z21 fixes #81

Merged

gfgit force-pushed the work/z21_pending_queue branch 2 times, most recently from 08a1cb3 to 2f9cc61 Compare November 2, 2023 16:57

gfgit force-pushed the work/z21_pending_queue branch 2 times, most recently from a7a7a1b to 14a13c5 Compare November 5, 2023 18:56

gfgit added 9 commits November 5, 2023 22:28

DecoderChangeFlags: add operator|=()

46fae8c

server: Z21 handle LAN_X_LOCO_INFO

3e3eaca

Handle LanXLocoInfo inside ClientKernel This allows reacting to external decoder state changes

server: Z21 ClientKernel, try/catch when updating decoder

50279a1

server: Z21 LanXLocoInfo support F29 to F31

373c277

Z21 Firmware 1.42 adds F29 to F31 to LAN_X_LOCO_INFO

WIP: add LocoCache to Z21 ClientKernel

5bdbeeb

Z21 ClientKernel: do not propagate external changes

1e6d350

Decoder changes caused by Z21 should not be sent back to Z21

Z21 ClientKernel: always react to stop and direction

4d36f51

React to Emenrgency Stop and direction change regardless of timeout status

Z21: ClientKernel store last received step in 126 scale

379c22c

server: Z21 ClientKernel prevent speed trend override

5b48969

reinder requested changes Nov 5, 2023

View reviewed changes

gfgit force-pushed the work/z21_pending_queue branch from 14a13c5 to 57f524a Compare November 6, 2023 00:13

gfgit mentioned this pull request Nov 6, 2023

Z21 handle loco info 2 #82

Merged

gfgit added 6 commits November 6, 2023 01:26

ClientKernel: use Doxygen \note in comment

45cbc58

ClientKernel: prefix members with m_

39eef0f

Now m_isUpdatingFromKernel P.S. I'm tired of rebasing!

LanXLocoInfo: rework max function index support

ff428c4

ClientKernel: make getLocoCache() return reference

7a2eea5

It cannot be null

ClientKernel: do not abbreviate variable names

3bd9a87

server: Z21 add pending request tracking

9700f0a

This makes it possible to detect replies from Z21 originated by our own requests and process them differently than externally generated messages. This also enables resending requests which did not receive the expected reply in timeout

gfgit force-pushed the work/z21_pending_queue branch from 57f524a to 9700f0a Compare November 6, 2023 00:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Z21 pending request queue #73

Z21 pending request queue #73

gfgit commented Jun 24, 2023 •

edited

gfgit commented Jun 24, 2023 •

edited

reinder commented Jun 26, 2023

gfgit commented Jun 27, 2023

gfgit commented Nov 2, 2023

Z21 pending request queue #73

Are you sure you want to change the base?

Z21 pending request queue #73

Conversation

gfgit commented Jun 24, 2023 • edited

gfgit commented Jun 24, 2023 • edited

LocoCache approach

Pending Request Queue approach

Edge case for reply ordering and external changes

Future work

reinder commented Jun 26, 2023

gfgit commented Jun 27, 2023

gfgit commented Nov 2, 2023

gfgit commented Jun 24, 2023 •

edited

gfgit commented Jun 24, 2023 •

edited