remove stream if AutoStream enabled when no more subscribers #109

frederikhors · 2021-11-08T23:52:44Z

@purehyperbole, @codewinch I think this is needed to cleanup a bit especially if there are many streams for short periods, something like /events?stream=<eventIDEachTimeDifferent>.

But the new test sometimes is green sometimes not. What do you think?

It always turns green if I add this line:

time.Sleep(1*time.Second)

after the line:

sub.close()

in http.go.

But I don't like time.Sleep() in code.

Can we better "wait" for sub.close() to finish?

codewinch · 2021-11-09T01:29:04Z

You should handle this in your own code. I don't think this is the job of the library.

codewinch · 2021-11-09T01:30:31Z

Hmm. Actually, I'm not sure; maybe removing streams if autostream is enabled should be a job of the library.

purehyperbole

Hey @frederikhors, thanks for submitting this PR.

I would request a few changes however. If you need any more guidance to make them, please let me know

http.go

purehyperbole · 2021-11-09T09:16:10Z

http_test.go

+
+	events := make(chan *Event)
+
+	var cErr error


You may be better to create a channel for this error, like:

cErr := make(chan error) go func() { cErr <- c.SubscribeChan("test", events) }() require.Nil(t, <-cErr)

This is thread safe and avoids having to sleep

You should probably use something like the wait() function though to stop the test from blocking forever however.

Done, but it doesn't work. Can you help me here?

Is this resolved, @purehyperbole?

…unt()

frederikhors · 2021-11-09T12:12:38Z

I updated the code but the count is still 1 launching test here:

frederikhors · 2021-11-09T12:13:07Z

sub.close() isn't removing the subscriber I think...

purehyperbole · 2021-11-09T12:36:57Z

That is most likely related to the fact that sub.close() is not a synchronous call, so the effects of removing that subscriber will not be seen immediately until the stream has removed that subscriber:

https://github.com/r3labs/sse/blob/master/stream.go#L54

frederikhors · 2021-11-09T12:59:48Z

Can we somehow make it synchronous with some channels?

purehyperbole · 2021-11-09T13:33:11Z

yes, I think adding removed chan struct{} to the subscriber struct would be useful.

You can add:

This to addSubscriber in stream.go

removed: make(chan struct{}, 1),

This to removeSubscriber and removeAllSubscribers in stream.go

str.subscribers[i].removed <- struct{}{}
close(str.subscribers[i].removed)

and then in close() you can add this after sending on quit:

<-s.removed

frederikhors · 2021-11-09T14:20:17Z

Added. The test is green now.

But I have a doubt.

We have three channels now:

quit       chan *Subscriber
connection chan *Event
removed    chan struct{}

Can't we reuse one of them (quit or connection) instead of a new one (removed)?

purehyperbole · 2021-11-09T14:38:12Z

I think in this case, hijacking the other two channels is going to complicate things. quit is a shared channel (dereigster on stream) and we ideally don't want to push a close message to connection, as we'd have to wait for all pending messages in that queue to clear before we received whatever signal we want to wait for.

codewinch · 2021-11-09T17:59:43Z

I haven't reviewed the changes, but I'm also managing my own channels and also using multiple channels per connection... will this conflict?

codewinch · 2021-11-09T18:00:23Z

(I'm not using Autostream)

frederikhors · 2021-11-09T18:00:58Z

I haven't reviewed the changes, but I'm also managing my own channels and also using multiple channels per connection... will this conflict?

I think not. But What do you mean? Can you post an example?

codewinch · 2021-11-09T18:02:06Z

It's very complex... that's what concerns me. If these additional channels are created for each additional connection (even if Autostream is not enabled), then I will face a combinatorial explosion.

frederikhors · 2021-11-09T18:03:52Z

It's very complex... that's what concerns me. If these additional channels are created for each additional connection (even if Autostream is not enabled), then I will face a combinatorial explosion.

Get this branch and test it. But this code is only for this package. maybe the problem is in your own code?

codewinch · 2021-11-09T18:04:57Z

I think this is wrong because If I'm not using AutoStream it means I created manually the streams and I don't want they die. IMO.

This is true.

Accessing len(stream.subscribers) == 0 from the handler will not be thread safe and will cause a data race.

Frederik, are you running with data race detector on (https://golang.org/doc/articles/race_detector)? Just run them as a matter of course in this kind of work.

Also, the race detector will only kick in if there's actually a data race; in other words, you should exercise with some load.

codewinch · 2021-11-09T18:06:05Z

Get this branch and test it. But this code is only for this package. maybe the problem is in your own code?

lol I don't have a problem in my code. It's battle tested.

I'm asking if this patch will be creating new Golang channels even if Autostream is disabled.

frederikhors · 2021-11-09T18:07:24Z

Get this branch and test it. But this code is only for this package. maybe the problem is in your own code?

lol I don't have a problem in my code. It's battle tested.

I'm asking if this patch will be creating new Golang channels even if Autostream is disabled.

I think yes. At least one channel because of the new detection. There is no alternative. Read: #109 (comment)

codewinch · 2021-11-09T18:08:38Z

That's what I thought. :( I think this overcomplicates the library then, and would be better a part of a wrapper library (or a separate server package) that manages your connections for you.

frederikhors · 2021-11-09T18:09:48Z

That's what I thought. :( I think this overcomplicates the library then, and would be better a part of a wrapper library that manages your connections for you.

I don't think so. Effective detection is part of this library.

codewinch · 2021-11-09T18:28:15Z

I respectfully disagree. I chose this library over the alternatives both because I didn't find any data races in it and because it is a lower-level library to build servers with, not a server in itself that takes responsibility for managing all the streams.

purehyperbole · 2021-11-09T19:09:51Z

@frederikhors thanks for updating the PR, i will review the changes again.

@codewinch If it alleviates some of your concerns, the overhead from this in terms of memory is around extra 100 bytes. If you have 100k subscribers, that's only an extra 10MB of memory used. Definitely not free, but far from terrible.

You can verify this as follows:

package main

import (
	"fmt"
	"runtime"
)

func usage() {
	runtime.GC()
	var m runtime.MemStats
	runtime.ReadMemStats(&m)
	fmt.Printf("Alloc = %v MiB", bToMb(m.Alloc))
}

func bToMb(b uint64) uint64 {
	return b / 1024 / 1024
}

func main() {
	usage()
	
	x := make([]chan struct{}, 100000)

	usage()
	
	for i := 0; i < 100000; i++ {
		x[i] = make(chan struct{}, 1)
	}

	usage()
}

As the channels are mostly dormant before only being used once, there is a negligible amount of scheduler overhead to handling it this way.

If you forsee of any specific concerns then please let me know so we can try to address them.

Also, it's worth pointing out that why the library is race free, the tests as implemented are not (removeSubscriber causes a race). I disabled the race detector from the CI tests many years ago for that reason. It's probably worth expending the effort to correct those tests, so i'll make that a priority to do soon.

codewinch · 2021-11-09T19:19:43Z

Hi Tom,

Thanks! This does help alleviate some concerns. I'm not concerned about RAM too much, especially since we have lots of RAM and it'll probably be reclaimed if inactive; I'm only really concerned about CPU, especially for millions of channels (we backend with another Go-based MQ system.)

I guess we could always internally re-benchmark to see if these changes will cause any significant changes, but it'd be great if these management layers were separated into a separate higher-level library or server (the way socket servers in Node wrap around lower-level socket.io libraries, to enable separation of concerns and reduction of surface area for races and other bugs.)

purehyperbole · 2021-11-09T20:13:33Z

@codewinch Thanks for providing a bit more info about your usecase.

Your concerns are definitely valid, as there will be CPU overhead (from my understanding only from GC having to scan the heap, but benchmarking is probably going to be the only way to know for sure) from those unused channels, provided you're dealing with a big enough number of subscribers. Channels that aren't being read from/written to should have no more overhead than any other value.

We should be able to gate this functionality behind a conditional, so if you're not using AutoStream, this channel never gets setup or used. If you think that approach would work for you, please let me know.

Generally, I agree that streams should be completely separate from the core SSE functionality and if I were to start from scratch, this library would handle only the core SSE protocol (I actually split this out a long time ago with broadcast).

With that said, it seems that at least a decent number of users rely in some way on the stream functionality, so it's not something we can remove from this library without disrupting some of our users (which is something I'd generally like to avoid if necessary).

frederikhors · 2021-11-09T20:15:49Z

Generally, I agree that streams should be completely separate from the core SSE functionality

@purehyperbole maybe we can work on a new breaking version for this.

codewinch · 2021-11-09T20:16:40Z

Excellent! Yes, I agree completely with this approach. If the user is not using autostream, they don't get the extra functionality and have to implement it all themselves, while if they are, then they get to take advantage of these features with the associated tradeoff.

Seems like a great way to go and I'm totally on board with it!

codewinch · 2021-11-09T20:17:46Z

(I actually split this out a long time ago with broadcast).

"A library to handle broadcasting messages to a number of user channels"

Oh, that is excellent -- don't know how I missed that. Thank you! Would love some docs on how to use it and how it interops with this lib.

purehyperbole · 2021-11-09T20:56:05Z

@frederikhors If it's not too much trouble, could you please kindly make the following changes to this PR?

We should pass through the AutoStream to newStream and store it on the stream struct:

func newStream(bufsize int, replay, isAutoStream bool) *Stream {
	return &Stream{
		AutoReplay:   replay,
		subscribers:  make([]*Subscriber, 0),
		isAutoStream: isAutoStream,
		register:     make(chan *Subscriber),
		deregister:   make(chan *Subscriber),
		event:        make(chan *Event, bufsize),
		quit:         make(chan bool),
		Eventlog:     make(EventLog, 0),
	}
}

For addSubscriber, we can then only allocate the channel if AutoStream is enabled:

// addSubscriber will create a new subscriber on a stream
func (str *Stream) addSubscriber(eventid int) *Subscriber {
	atomic.AddInt32(&str.subscriberCount, 1)
	sub := &Subscriber{
		eventid:    eventid,
		quit:       str.deregister,
		connection: make(chan *Event, 64),
	}

	if str.isAutoStream {
		sub.removed = make(chan struct{}, 1)
	}
      
	str.register <- sub
	return sub
}

removeSubscriber and removeAllSubscribers can be changed to

if str.subscribers[i].removed != nil {
	str.subscribers[i].removed <- struct{}{}
}

The subscriber closed method can be implemented as

if s.removed != nil {
	<-s.removed
}

Understandably, if you don't have time to make these changes, I can merge this and implement those changes before releasing.

@codewinch are you happy with the above changes?

codewinch · 2021-11-09T21:08:29Z

Excellent! I believe that resolves my concerns completely. Thanks to both of you!

frederikhors · 2021-11-09T21:11:55Z

@purehyperbole done!

purehyperbole · 2021-11-09T21:39:40Z

@frederikhors Thanks for your hard work on this PR, looks good to merge!

@codewinch Thank you for your feedback and guidance!

I'll merge this now and create a new release 🎉

codewinch · 2021-11-09T22:47:25Z

Thanks to both of you!

frederikhors changed the title ~~remove stream if AutoStream enabled when no more subsribers~~ remove stream if AutoStream enabled when no more subscribers Nov 8, 2021

purehyperbole requested changes Nov 9, 2021

View reviewed changes

frederikhors added 3 commits November 9, 2021 12:45

remove stream if AutoStream enabled when no more subsribers

bb11f24

change test

57c52e4

rebased, do not removeStream if AutoReplay is on, use getSubscriberCo…

1950b67

…unt()

set AutoReplay to false in test

e0d4e01

add "removed" chan

bab0433

check for isAutoStream

f5aadd8

fix

9e3a66d

purehyperbole approved these changes Nov 9, 2021

View reviewed changes

purehyperbole merged commit f3bbc01 into r3labs:master Nov 9, 2021

frederikhors deleted the remove-stream-if-AutoStream-enabled-when-no-more-subsribers branch November 9, 2021 21:54

remove stream if AutoStream enabled when no more subscribers #109

remove stream if AutoStream enabled when no more subscribers #109

Conversation

frederikhors commented Nov 8, 2021 • edited Loading

codewinch commented Nov 9, 2021

codewinch commented Nov 9, 2021

purehyperbole left a comment

Choose a reason for hiding this comment

purehyperbole Nov 9, 2021

Choose a reason for hiding this comment

frederikhors Nov 9, 2021

Choose a reason for hiding this comment

frederikhors Nov 9, 2021

Choose a reason for hiding this comment

frederikhors commented Nov 9, 2021

frederikhors commented Nov 9, 2021

purehyperbole commented Nov 9, 2021

frederikhors commented Nov 9, 2021

purehyperbole commented Nov 9, 2021 • edited Loading

frederikhors commented Nov 9, 2021

purehyperbole commented Nov 9, 2021

codewinch commented Nov 9, 2021

codewinch commented Nov 9, 2021

frederikhors commented Nov 9, 2021

codewinch commented Nov 9, 2021

frederikhors commented Nov 9, 2021

codewinch commented Nov 9, 2021

codewinch commented Nov 9, 2021

frederikhors commented Nov 9, 2021

codewinch commented Nov 9, 2021 • edited Loading

frederikhors commented Nov 9, 2021

codewinch commented Nov 9, 2021 • edited Loading

purehyperbole commented Nov 9, 2021 • edited Loading

codewinch commented Nov 9, 2021 • edited Loading

purehyperbole commented Nov 9, 2021

frederikhors commented Nov 9, 2021

codewinch commented Nov 9, 2021 • edited Loading

codewinch commented Nov 9, 2021 • edited Loading

purehyperbole commented Nov 9, 2021

codewinch commented Nov 9, 2021

frederikhors commented Nov 9, 2021

purehyperbole commented Nov 9, 2021

codewinch commented Nov 9, 2021

frederikhors commented Nov 8, 2021 •

edited

Loading

purehyperbole commented Nov 9, 2021 •

edited

Loading

codewinch commented Nov 9, 2021 •

edited

Loading

codewinch commented Nov 9, 2021 •

edited

Loading

purehyperbole commented Nov 9, 2021 •

edited

Loading

codewinch commented Nov 9, 2021 •

edited

Loading

codewinch commented Nov 9, 2021 •

edited

Loading

codewinch commented Nov 9, 2021 •

edited

Loading