Active socket, implementation with state machine #68

Merged — 48 commits, Jul 18, 2023

Conversation

@bokner (Contributor) commented Jul 10, 2023

Switch to use active mode for the socket. Addresses #66.

@mfos239 commented Jul 16, 2023

Confirmed the latest changes fix this issue #65 (comment)
Everything seems to be working great on this end. Thanks so much @bokner and @starbelly!

```elixir
defp reply_to_caller(reply, %{caller: caller, context: context} = data) do
  caller && :gen_statem.reply(caller, format_reply(reply, context))
end
```
Contributor:
I think we can avoid the boolean logic here. In fact, if we're ending up here when caller is nil, we have a bug somewhere further up, and it would be best to crash.

Contributor (author):

This is to avoid situations where the receiver sends more packets after the full MLLP message has been received. It would not be a bug on our side, but rather a misbehaving receiver. Do you think we should crash in this case?
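For illustration, the two approaches under discussion might look like this (a sketch based on the snippet above; `format_reply/2` is the existing helper, everything else mirrors the PR's function shape but is not its literal code):

```elixir
# Option A (current): silently drop the reply when there is no caller,
# e.g. when the receiver sends extra packets after a full MLLP frame.
defp reply_to_caller(reply, %{caller: caller, context: context} = _data) do
  caller && :gen_statem.reply(caller, format_reply(reply, context))
end

# Option B (crash on nil caller): pattern-match so that reaching this
# function without a caller raises, surfacing the bug instead of hiding it.
defp reply_to_caller(reply, %{caller: caller, context: context} = _data)
     when not is_nil(caller) do
  :gen_statem.reply(caller, format_reply(reply, context))
end
```

The trade-off is exactly the one debated below: Option A tolerates a misbehaving remote, Option B turns an unexpected state into a crash.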

Contributor:

Seems like we'd hit the clause at line 577, no? This raises an interesting question nonetheless. Crashing aside, one place an RST is appropriate is when the remote sends data that is unexpected. I wonder if we should shut down in this case? Feels heavy-handed, but yeah 🤔

Contributor (author):

> Seems like we'd hit the clause at line 577, no?

You are correct, we are probably covered for the "more packets" case. Although I'm not entirely sure, as there could be a race condition with the timer in the "receiving" state. There is actually a more serious reason for checking whether we have a caller: the receiver can disconnect at any given time. If that happens while a `send` call is in progress, we notify the caller; otherwise, we don't (there is no caller).

If we want to shut down on the "more packets" case, the natural place is indeed around line 577.

Contributor:

> Although I'm not entirely sure, as there could be a race condition with the timer in the "receiving" state.

Not sure, I'll have to stare a bit more, and we'll both have to put it through the wringer :)

I think shutting down makes sense. I think we can just use maybe_close, though we might need to make a modification (i.e., sometimes we want a graceful close and sometimes an abortive close).

Right now maybe_close attempts a graceful close, which in this case is not what we want. We want :gen_tcp.shutdown(socket, :read_write). This is the same behavior as send_timeout_close: true, which ends up resulting in a sock_select() call that does an abortive close (RST). This should be double-checked, though.

Speaking of send_timeout_close: true, if we don't have this set as the default in the socket opts, we should; the only reason it defaults to false in OTP is backwards compatibility (though it should be configurable by the user).
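A sketch of the modification being discussed — a maybe_close that can do either kind of close. The two-argument shape and mode names are assumptions for illustration; the real maybe_close in this codebase looks different:

```elixir
# Illustrative only. :graceful does a normal close; :abortive shuts down
# both directions first, which (per the discussion above, to be double-
# checked) is the same behavior send_timeout_close: true produces.
defp maybe_close(_mode, %{socket: nil} = data), do: data

defp maybe_close(:graceful, %{socket: socket} = data) do
  :gen_tcp.close(socket)
  %{data | socket: nil}
end

defp maybe_close(:abortive, %{socket: socket} = data) do
  :gen_tcp.shutdown(socket, :read_write)
  :gen_tcp.close(socket)
  %{data | socket: nil}
end
```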

Contributor:

The send_timeout_close bit was resolved by your last commit @bokner. We still need to hang up the phone on receive_timeout as well. Basically, given the nature of MLLP and the implicit behaviour when shipping HL7 around, there may be no recourse in a situation where the receive timed out.

Specifically, the client can either hang up the phone or make assumptions about how the server is implemented. In this regard, both the client and server may end up in a bad state and form an error loop. One example where a receive timeout can result in bad things happening: you peel off part of the response and time out waiting for the rest. The caller of the client decides to send again, yet there is still data in transit (i.e., not in the local buffers and not bubbled up to our app yet); the send happens, and now the rest of the ack from the previous response arrives. Meanwhile, the remote might get what you just sent and either accept it, send back a nack, or hang up the phone.

Tricky business 😁

Thoughts?
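To make "hang up the phone" concrete, here is one way a receive timeout could be handled in the state machine. The state name, event name, and data fields are assumptions for illustration, not the PR's actual code:

```elixir
# Illustrative gen_statem clause: on receive timeout, reply with an error,
# close the socket so a late partial ack can't be mistaken for the reply
# to a later send, and drop any partially received buffer.
def receiving(:state_timeout, :receive_timeout, %{caller: caller, socket: socket} = data) do
  caller && :gen_statem.reply(caller, {:error, :receive_timeout})
  socket && :gen_tcp.close(socket)
  {:next_state, :disconnected, %{data | socket: nil, caller: nil, buffer: <<>>}}
end
```

Closing here forces a reconnect on the next send, which is exactly the trade-off discussed below.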

Contributor (author):

The downside of dropping the connection on the client side for whatever reason (say, receive_timeout) is that it would then have to reconnect. Maybe it is the better recourse after all.

I guess a sane client would try to check the connection on any error anyway.

Contributor:

Yeah, and to add to that, there need to be options for all of it, which really bothers me. I think having too many options is never great, but sadly, since MLLP isn't much of a protocol, the behaviour is going to differ from system to system. All we can do is try to provide a mostly sane set of defaults that covers most cases. If I go by the "protocol" and how TCP works, it leads me to these decisions.
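As a rough sketch of "sane defaults with user overrides": the socket options below are standard `:inet`/`:gen_tcp` options, but the attribute name, values, and merge helper are illustrative, not this library's actual API:

```elixir
# Hypothetical defaults, merged with user-supplied overrides.
@default_socket_opts [
  mode: :binary,
  active: :once,
  send_timeout: 60_000,
  # Abortive close when a send times out; OTP defaults this to false
  # only for backwards compatibility, so we opt in here.
  send_timeout_close: true
]

defp socket_opts(user_opts), do: Keyword.merge(@default_socket_opts, user_opts)
```

Users who need different behavior per remote system can override any entry, which keeps the option surface small while still covering the "differs from system to system" reality.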

Contributor:

I tell ya what, let's not agonize over this right now. We're not at 1.0, code is easy to change, etc. Let's get the tests passing, merge, and take it from there.

bokner and others added 5 commits July 16, 2023 15:19
Co-authored-by: Bryan Paxton <39971740+starbelly@users.noreply.github.com>
@starbelly left a comment:

Beautiful! ❤️🧡💛💚💙💜

@starbelly starbelly merged commit 0727b47 into HCA-Healthcare:main Jul 18, 2023
13 checks passed
bokner added a commit to bokner/elixir-mllp that referenced this pull request Jul 18, 2023
* hca/main:
  Active socket, implementation with state machine (HCA-Healthcare#68)