Update docs & comments for the behavior of the RecordSubscribe service, #148

haussli · 2024-01-30T00:40:58Z

Removes the keepalive concept and mentions the behavior the server can adopt to time out a dead client.

Addresses issue #137

nmahabaleshwar · 2024-01-30T16:39:53Z

acctz/acctz.proto

-  // order to signal its liveliness to the system. Failure to send a
-  // RecordRequest for more than 120 seconds will cause the connection to be
-  // reset.
+  // The stream continues ad infinitum, until the gNSI session is severed.


Can we also update the comment at line 56?

If this results in no records to send, the server should return an OK error and zero records.

I don't think an OK error should be sent to the collector if no records need to be sent; the connection stays open and RecordResponses will be sent for future events.

What should it return?

If the subscription succeeds, but either the history is empty or the requested time neither corresponds to a record in the history nor precedes any record in it, then the current text seems reasonable. For example, the client might request now() or a future time (T+60s, which the server should probably be permitted to treat as now()).

If we return an OK error in those scenarios, the stream will be closed, which we don't want, right?

In all those scenarios, we want the stream to be open and the collector should get RecordResponses when accounting events occur after the given timestamp in the original RecordRequest message.

If the server wants to explicitly signal to the client "I don't have anything for you now, but will send records in the future", maybe we create a new message or add a new field in the RecordResponse that indicates that.

I think that I must misunderstand/misremember how the streaming works due to my limited use of it. I thought that the return was the result of the subscription, i.e., RecordSubscribe() received, parsed, and started successfully. After that, records would be sent asynchronously, allowing the client to send and receive other messages as it wishes, until the session is closed.

Ah, perhaps: "if there's an error on the RPC the connection is closed"

is the missing part here in the conversation?
Generally, an error on a streaming RPC means: "oops, bad things, please restart your process",
so we should say that no error is sent, but also that no records are sent (because there are none).

Perhaps what the text about the startup REALLY should say about the reply is something like:

"device returns as many records as will satisfy the request, or none if there are no records which satisfy the request. The connection remains open such that future records may be streamed to the client as they are available."

Or something along those lines, at any rate.
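To make the proposed wording concrete, here is a minimal sketch of the behavior in plain Python. It is not the gNSI acctz API: `Record`, `record_subscribe`, `SESSION_CLOSED`, and the queue of live events are all hypothetical names, and it assumes that records at or after the requested timestamp are the ones that "satisfy the request". The point is only that zero matching records produces no status and no error; the stream simply stays open.

```python
import queue
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """Hypothetical stand-in for an accounting record."""
    timestamp: float
    detail: str


# Hypothetical sentinel meaning "the session was severed".
SESSION_CLOSED = object()


def record_subscribe(history, start_ts, live_events):
    """Yield backlog records, then keep streaming until the session closes.

    Assumption: a record satisfies the request if its timestamp is at or
    after start_ts. If nothing matches, nothing is yielded for the backlog
    and no status is returned; the generator just waits for future events.
    """
    for rec in sorted(history, key=lambda r: r.timestamp):
        if rec.timestamp >= start_ts:
            yield rec

    # The stream remains open: block until a new event or session close.
    while True:
        event = live_events.get()
        if event is SESSION_CLOSED:
            return
        yield event
```

With an empty or non-matching history, the client receives nothing at first and then gets records as events arrive, which matches the "or none if there are no records" case in the suggested text.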

that's a fair point, we COULD have the server send back a status: ok (this is nirajan's text mostly I think)
but really .... we connect, we send a 'hope you have mail for me!, but I'm hanging out here anyway for later.'
and that's ok. Whether we get 0, 1, tons of records on our first connect is a little unimportant.

There IS a 'did you hear me???' problem, but that might be solved either in the metrics/alerting pipeline ("that device is oddly silent on acctz.. what's up?") or with a simple: "if nothing heard in N mins, disconnect and reconnect"

that model (disco/reco) is ALSO problematic, because it doesn't really solve anything and never surfaces the 'that device is oddly silent...' to the alerting pipeline.
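One way to picture combining the two ideas, so that a reconnect does not hide the silence from alerting, is a small client-side watchdog. This is only a sketch under assumptions: `SilenceWatchdog`, `note_record`, `should_reconnect`, and the `silence_alerts` counter are hypothetical names, not part of acctz or any collector, and the counter stands in for whatever the metrics pipeline would actually export.

```python
import time


class SilenceWatchdog:
    """Hypothetical client-side helper: tracks how long the stream has been quiet."""

    def __init__(self, max_silence_s, clock=time.monotonic):
        self._max_silence_s = max_silence_s
        self._clock = clock
        self._last_heard = clock()
        # Stand-in for a metric exported to the alerting pipeline.
        self.silence_alerts = 0

    def note_record(self):
        """Call whenever a record arrives on the stream."""
        self._last_heard = self._clock()

    def should_reconnect(self):
        """Return True if the stream has been silent longer than the limit.

        Each time the limit is exceeded, the silence is counted so that
        alerting can see it, and the timer restarts for the next attempt.
        """
        if self._clock() - self._last_heard <= self._max_silence_s:
            return False
        self.silence_alerts += 1
        self._last_heard = self._clock()
        return True
```

A collector loop could call `should_reconnect()` periodically and re-issue the subscription when it returns True, while the counter keeps a record of how often the device went quiet.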

oops... and.. nirajan, if you think there's wording fixes / etc please don't hesitate to send along :)

Yes, thanks for the explanation, morrow. Please suggest the text, nirajan; alternatively, a suggestion for the text to consider is here: haussli@33b831a

Raised #151 for this

Update docs & comments for the behavior of the RecordSubscribe service,

4c104c2

removing the keepalive concept and mentioning the behavior the server can adopt to timeout a dead client.

haussli mentioned this pull request Jan 30, 2024

Clarity needed (in a comment) for acctz RecordRequest keepalives #137

Closed

morrowc approved these changes Jan 30, 2024

morrowc merged commit c71f52c into openconfig:main Jan 30, 2024
2 checks passed

nmahabaleshwar reviewed Jan 30, 2024

haussli mentioned this pull request Jan 30, 2024

Remove incorrect comments about the behavior of RecordSubscribe when there are no records to send. #150

Closed
