stat/distmv: nil checks for sigma race against setSigma in Normal and Student's T #163

btracey · 2017-02-22T22:39:27Z

We protect setSigma with a once so it only gets set once, but this still races against checks that it is non-nil, for example in normal.ConditionNormalSingle

The text was updated successfully, but these errors were encountered:

kortschak · 2017-02-22T22:52:08Z

We don't say that it is safe for concurrent use do we? Since we don't, it's actually probably overkill to use the sync.Once; we could just rewrite setSigma as

func (n *Normal) setSigma() {
	if n.sigma != nil {
		return
	}
	n.sigma = mat64.NewSymDense(n.Dim(), nil)
	n.sigma.FromCholesky(&n.chol)
}

and

func (s *StudentsT) setSigma() {
	if s.sigma != nil {
		return
	}
	s.sigma = mat64.NewSymDense(s.Dim(), nil)
	s.sigma.FromCholesky(&s.chol)
}

btracey · 2017-02-22T22:53:56Z

We don't say that, but it should all be save for concurrent usage. One of Go's strengths is concurrency, and we should work with it as much as possible. I came across this bug while trying to use these methods concurrently, for instance.

kortschak · 2017-02-22T23:11:49Z

The general practice is to not make thing safe for concurrent use unless specifically designed for that. Here if a Normal or StudentsT is intended to be used concurrently the client can either wrap it in a struct with a mutex, or pass it through a channel.

btracey · 2017-02-22T23:19:40Z

I strongly disagree with that. Good library design for Go should be that all functions and methods are safe for concurrent use unless it’s obvious why they shouldn’t be. `Dense.Mul` isn’t safe for concurrent use, but of course it isn’t, it’s modifying the value of the receiver. On the other hand, `mat64.Trace` is safe for concurrent use, as it should be since it only reads data. In the particular case here, all of the methods on `Normal` are safe for concurrent use, as they should be, since none of them inherently write data to the struct. Random number generators, computing probabilities, these are things that one might want to do many of using the same probability distribution, and so they are rightly thread safe. In the specific case, `MarginalNormal` is only not thread safe because of an implementation detail, not because the operation is inherently impossible to make concurrent. We should change the implementation to be thread safe, not force users to work around an implementation problem.

…

On Feb 22, 2017, at 4:11 PM, Dan Kortschak ***@***.***> wrote: The general practice is to not make thing safe for concurrent use unless specifically designed for that. Here if a Normal or StudentsT is intended to be used concurrently the client can either wrap it in a struct with a mutex, or pass it through a channel. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#163 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ADgqWwwmWgUOBOsP2NkuVcjzb-NxcoL0ks5rfMC1gaJpZM4MJSaZ>.

kortschak · 2017-02-22T23:24:45Z

The standard practice in std is that things are not safe for concurrent use unless marked as such. The rationale for this is that mostly things are either not used concurrently, or are used concurrently under a system of user control of synchronisation. Adding synchronisation to types in the first case would mean that you may end up with unnecessary synchronisation and in the second case doubled layers of synchronisation, both resulting in an unnecessary performance hit. In the standard library types that are very likely to be used concurrently, or are difficult to protect in a useful way have sychronisation built in.

kortschak · 2017-02-22T23:28:45Z

Random number generators, computing probabilities, these are things that one might want to do many of using the same probability distribution, and so they are rightly thread safe.

This is not true. The rand.Func functions are thread safe, but that is done by wrapping the global rand.Rand variable in a locked wrapper, var globalRand = New(&lockedSource{src: NewSource(1).(Source64)}). This is done for the very reasons I describe above; the rand.Rand types are not protected since that would in most cases cause an unnecessary performance hit, hence the existence of rand.lockedSource.

btracey · 2017-02-22T23:37:14Z

I have never run into an issue with a race condition from calling a standard library function concurrent when I thought there shouldn't be any concurrency issues.

The rand.Rand types aren't concurrent, but if you know how random number generators work, you know that they have to have some state that gets modified when you call one of the methods. If the user supplies their own random number generator to Normal, then of course Rand is not going to be thread safe unless the random number generator is also thread safe.

The performance hit of not making Marginal thread safe is likely much higher than making it thread safe. Rather than generating one *Normal, I now would have to copy the mean and cholesky decomposition many times. That's much higher than checking a read lock that can only be locked once.

Right now, in all of Gonum, it's easy to predict when code is thread safe, and when it is not. This is a great property, and not one we should break.

btracey · 2017-02-22T23:37:55Z

I also opened golang/go#19245 which would be an efficient fix to this issue.

kortschak · 2017-02-22T23:47:29Z

The performance hit of not making Marginal thread safe is likely much higher than making it thread safe. Rather than generating one *Normal, I now would have to copy the mean and cholesky decomposition many times. That's much higher than checking a read lock that can only be locked once.

Not really,

type lockedNormal struct {
    mu sync.Mutex
    n  *distmv.Normal
}

// relevant methods for client, e.g....
func (n *lockedNormal) Rand(x []float64) []float64 {
    n.mu.Lock()
    defer n.mu.Unlock()
    return n.n.Rand(x)
}

The setup of the Normal needs only be done once and does not need protection because of that.

Right now, in all of Gonum, it's easy to predict when code is thread safe, and when it is not. This is a great property, and not one we should break.

I think this is not a safe assumption to make. There should be no need to predict, the documentation should say when things are thread safe.

Having said all this, it is clear that in Normal and StudentsT there should be an accessor for the sigma field that is used instead of direct access. With that the exact semantics of synchronisation (of whatever type) of that field are easily reasoned about.

btracey mentioned this issue Feb 23, 2017

Remove race in checking for sigma #164

Closed

btracey mentioned this issue Mar 7, 2017

Store sigma in Normal and StudentsT #168

Merged

btracey closed this as completed Dec 19, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stat/distmv: nil checks for sigma race against setSigma in Normal and Student's T #163

stat/distmv: nil checks for sigma race against setSigma in Normal and Student's T #163

btracey commented Feb 22, 2017

kortschak commented Feb 22, 2017

btracey commented Feb 22, 2017

kortschak commented Feb 22, 2017

btracey commented Feb 22, 2017 via email

kortschak commented Feb 22, 2017

kortschak commented Feb 22, 2017

btracey commented Feb 22, 2017

btracey commented Feb 22, 2017

kortschak commented Feb 22, 2017 •

edited

Loading

stat/distmv: nil checks for sigma race against setSigma in Normal and Student's T #163

stat/distmv: nil checks for sigma race against setSigma in Normal and Student's T #163

Comments

btracey commented Feb 22, 2017

kortschak commented Feb 22, 2017

btracey commented Feb 22, 2017

kortschak commented Feb 22, 2017

btracey commented Feb 22, 2017 via email

kortschak commented Feb 22, 2017

kortschak commented Feb 22, 2017

btracey commented Feb 22, 2017

btracey commented Feb 22, 2017

kortschak commented Feb 22, 2017 • edited Loading

kortschak commented Feb 22, 2017 •

edited

Loading