Fix lag time #1320

pstratem · 2015-03-15T00:54:03Z

prefs.hex_net_ping_timeout is currently more or less entirely useless.

Timeout should be based on when the last successful recv() occurred, not based on ping round trip times.

remove broken hex_net_ping_timeout logic use timestamps in lag_check instead of relying on the function being called every 30 seconds use prefs.hex_net_ping_timeout if available fix whitespace fix whitespace

Arnavion · 2015-03-15T09:38:02Z

If client sends lagcheck ping at t = 0, receives a chat line at t = 1, and receives lagcheck pong at t = 2, the lag is 2 - 0 = 2, not 1 - 0 = 1 nor 2 - 1 = 1. The current code is correct.

Also, sending a lagcheck ping every second is insane.

pstratem · 2015-03-15T22:36:33Z

You've misinterpretted the patch set.

This doesn't change the logic for calculating lagtime as indicated in the GUI.

This only changes the lagtime calculation for purposes of detecting a disconnect/timeout event.

And pings are sent at most once every ping_interval seconds.

The lag_check function no longer sends a PING everytime it's called.

Arnavion · 2015-03-15T23:19:45Z

This doesn't change the logic for calculating lagtime as indicated in the GUI.
This only changes the lagtime calculation for purposes of detecting a disconnect/timeout event.

The two are identical.

pstratem · 2015-03-15T23:23:12Z

This patch separates them.

lagtime based on rtt of PING/PONG makes sense for the gui indicator

but it's entirely useless for detecting a network disconnect

Arnavion · 2015-03-15T23:24:51Z

Incorrect. Reread my first post.

pstratem · 2015-03-15T23:26:04Z

So you're saying that you dont want to change the logic.

That you want to use the ping time lag indicator to detect a tcp socket disconnect?

(I just want to be 100% clear on this)

Arnavion · 2015-03-15T23:44:12Z

Again, you cannot detect a TCP socket timeout without detecting that a pong for a corresponding ping is taking longer than expected. You cannot use non-pong traffic to detect this, because you don't know when the traffic originated. Such non-pong traffic might have originated 5 minutes ago and you may be only receiving it now, which still means the socket has timed out and needs to be reset. However when you receive a pong, you know it could only have been sent after you sent the corresponding ping, which gives you an upper bound on the timeout.

Thus if you don't receive a pong within the timeout period, that means the socket has timed out, even if you're receiving non-pong traffic during this period.

Now, looking at the current code, lag_check only seems to run if the GUI lagometer is enabled. That definitely looks like a bug.

pstratem · 2015-03-16T01:09:17Z

You're not taking into account various queuing effects.

There's client side throttling, server side recv() throttling, and server side send() throttling.

In practice the only reasonably way to detect a network distruption is to assume that anything received means forward progress.

pstratem · 2015-03-16T01:15:33Z

You're right the lag_check function should be called regardless of the gui lag indicator setting.

Arnavion · 2015-03-16T02:01:28Z

I'm aware what throttling is, thanks. I'm afraid it doesn't matter how stubbornly you refuse to understand what I'm saying - it will not change the fact that the code in this pull request is wrong, for reasons I've explained twice now.

pstratem · 2015-03-16T02:03:14Z

That's fine, I can just maintain my own branch that has sane behaviour.

CyberShadow · 2016-03-12T22:54:05Z

Again, you cannot detect a TCP socket timeout without detecting that a pong for a corresponding ping is taking longer than expected. You cannot use non-pong traffic to detect this, because you don't know when the traffic originated. Such non-pong traffic might have originated 5 minutes ago and you may be only receiving it now, which still means the socket has timed out and needs to be reset.

I've written so many network applications and this makes no sense to me. @Arnavion Do you have any sources or concrete examples to back this up? I'm fairly sure @pstratem is in the right here.

Arnavion · 2016-03-12T23:27:44Z

If you think he's right then reread all of my posts in this thread until you've convinced yourself otherwise. It is adequately explained, including in the snippet you quoted.

CyberShadow · 2016-03-12T23:33:36Z

I did, and I think you're wrong.

To be specific, you have an unusual definition of "timeout". Most importantly, as far as I can see, this definition is simply not useful.

So far I have never encountered a situation where receiving any traffic on a socket not implying that the socket is alive, and should not be timed out. Can you provide an example of one?

until you've convinced yourself otherwise

Not very open-minded are we... why are you so convinced you're right?

CyberShadow · 2016-03-12T23:34:36Z

So far I have never encountered a situation where receiving any traffic on a socket not implying that the socket is alive, and should not be timed out.

Sorry, that's wrong. I just remembered one, Slow-Loris. But it doesn't apply to IRC.

Arnavion · 2016-03-12T23:38:42Z

My definition of timeout is the same as every IRC network's. If you don't believe me, open a TCP connection to an IRC network, play out the same IRC conversation as a real client would, but don't respond to any PINGs. Tell me if it doesn't disconnect you after some time (eg 240 seconds) because it thinks you've timed out.

CyberShadow · 2016-03-12T23:43:13Z

OK, but 1) HexChat is not an IRC server 2) Do you know what the reason for this is? Could be simply a technical limitation. 3) This test will fail on the IRC servers I wrote :) I never noticed any issues with this behavior of course. I'll investigate a little...

Arnavion · 2016-03-12T23:47:39Z

HexChat is not an IRC server

But it is a client connecting to an IRC server. There's no sense for the client to pretend the connection is alive if the server doesn't think it is. Breaking the connection by the same rules ensures that the connection breaks as soon as possible and can be reestablished.

Could be simply a technical limitation.

The behavior exists in multiple ircds, atleast Rizon's and Freenode's. So even if it was an implementation detail, it's a de-facto one.

CyberShadow · 2016-03-12T23:54:20Z

There's no sense for the client to pretend the connection is alive if the server doesn't think it is.

I'm not following, what does what the server think have to do with this?

This behavior exists in multiple ircds, atleast Rizon's and Freenode's.

Actually I just tried Freenode and it's not even sending PINGs at all as long as I keep sending traffic.

CyberShadow · 2016-03-13T00:00:51Z

If you don't believe me, open a TCP connection to an IRC network, play out the same IRC conversation as a real client would, but don't respond to any PINGs. Tell me if it doesn't disconnect you after some time (eg 240 seconds) because it thinks you've timed out.

OK, I idled until I got a PING and started sending traffic again (PRIVMSG, not PONG). It's not timing me out.

Have you tested this on Freenode before and got another result? I'm guessing something must've changed? I was connected to rajaniemi.freenode.net.

Edit: after idling again I got a second PING (with no PONG in-between) which just proves that Freenode accepts any traffic as a PONG.

Arnavion · 2016-03-13T00:13:04Z

Yes you're right. My testing with both reveals the same. I'm sure it used to be that the networks sent PINGs always but atleast they don't do that now...

CyberShadow · 2016-03-13T00:18:51Z

You might be thinking of GameSurge, which sends a PING as soon as it receives a NICK, and has a rather low timeout (at least for registration or that initial PING).

So in light of this, do you still think the current behaviour is preferable?

Arnavion · 2016-03-13T00:19:46Z

So since networks don't have this behavior, it doesn't make sense for HC to have this behavior either. I'll let TingPing assess the PR.

Arnavion · 2016-03-13T02:21:59Z

#1631

fix timeout logic

085d9c3

remove broken hex_net_ping_timeout logic use timestamps in lag_check instead of relying on the function being called every 30 seconds use prefs.hex_net_ping_timeout if available fix whitespace fix whitespace

run lagcheck even if gui is disabled

ca2b4d4

pstratem closed this Mar 16, 2015

Arnavion reopened this Mar 13, 2016

pstratem closed this Mar 13, 2016

Arnavion mentioned this pull request Mar 29, 2016

PR 1320 #1631

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix lag time #1320

Fix lag time #1320

pstratem commented Mar 15, 2015

Arnavion commented Mar 15, 2015

pstratem commented Mar 15, 2015

Arnavion commented Mar 15, 2015

pstratem commented Mar 15, 2015

Arnavion commented Mar 15, 2015

pstratem commented Mar 15, 2015

Arnavion commented Mar 15, 2015

pstratem commented Mar 16, 2015

pstratem commented Mar 16, 2015

Arnavion commented Mar 16, 2015

pstratem commented Mar 16, 2015

CyberShadow commented Mar 12, 2016

Arnavion commented Mar 12, 2016

CyberShadow commented Mar 12, 2016

CyberShadow commented Mar 12, 2016

Arnavion commented Mar 12, 2016

CyberShadow commented Mar 12, 2016

Arnavion commented Mar 12, 2016

CyberShadow commented Mar 12, 2016

CyberShadow commented Mar 13, 2016

Arnavion commented Mar 13, 2016

CyberShadow commented Mar 13, 2016

Arnavion commented Mar 13, 2016

Arnavion commented Mar 13, 2016

Fix lag time #1320

Fix lag time #1320

Conversation

pstratem commented Mar 15, 2015

Arnavion commented Mar 15, 2015

pstratem commented Mar 15, 2015

Arnavion commented Mar 15, 2015

pstratem commented Mar 15, 2015

Arnavion commented Mar 15, 2015

pstratem commented Mar 15, 2015

Arnavion commented Mar 15, 2015

pstratem commented Mar 16, 2015

pstratem commented Mar 16, 2015

Arnavion commented Mar 16, 2015

pstratem commented Mar 16, 2015

CyberShadow commented Mar 12, 2016

Arnavion commented Mar 12, 2016

CyberShadow commented Mar 12, 2016

CyberShadow commented Mar 12, 2016

Arnavion commented Mar 12, 2016

CyberShadow commented Mar 12, 2016

Arnavion commented Mar 12, 2016

CyberShadow commented Mar 12, 2016

CyberShadow commented Mar 13, 2016

Arnavion commented Mar 13, 2016

CyberShadow commented Mar 13, 2016

Arnavion commented Mar 13, 2016

Arnavion commented Mar 13, 2016