Add `LKJ` #1066

johnczito · 2020-02-11T11:10:21Z

This PR adds the LKJ distribution, which is a distribution over correlation matrices. It can be found in other statistical computing platforms (here, here, or here, for instance).

Note: One of the unit tests runs a hypothesis test, so this PR adds HypothesisTests.jl as an extra testing dependency in the .toml. Let me know if I have done this incorrectly, or if I should exclude this change for now.

johnczito · 2020-02-11T11:11:02Z

(As I mentioned here, I'm not sure if folks will want to wait on this indefinitely until #951 is sorted out. Hopefully not...)

codecov-io · 2020-02-11T12:49:56Z

Codecov Report

Merging #1066 into master will increase coverage by 1.17%.
The diff coverage is 98.94%.

@@            Coverage Diff             @@
##           master    #1066      +/-   ##
==========================================
+ Coverage   79.45%   80.62%   +1.17%     
==========================================
  Files         112      113       +1     
  Lines        5514     5611      +97     
==========================================
+ Hits         4381     4524     +143     
+ Misses       1133     1087      -46

Impacted Files	Coverage Δ
src/Distributions.jl	`100% <ø> (ø)`	⬆️
src/matrixvariates.jl	`92.15% <ø> (ø)`	⬆️
src/matrix/lkj.jl	`98.94% <98.94%> (ø)`
src/univariate/continuous/ksonesided.jl	`0% <0%> (ø)`	⬆️
src/univariate/continuous/locationscale.jl	`94.11% <0%> (+2.94%)`	⬆️
src/univariate/continuous/ksdist.jl	`67.6% <0%> (+67.6%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update fb90c3d...2c010df. Read the comment docs.

mschauer · 2020-02-11T12:52:11Z

Hi, could you post a screenshot of the docstring of LKJ?

johnczito · 2020-02-11T12:56:36Z

Hi, could you post a screenshot of the docstring of LKJ?

^Updated.

mschauer · 2020-02-11T15:04:38Z

Do you know in which sense is this "uniform" if eta=1?

mschauer · 2020-02-11T15:08:27Z

src/matrix/lkj.jl

+    #  Section 3.2 in LKJ (2009 JMA)
+    #  1. Initialization
+    R = ones(typeof(η), d, d)
+    β = η + 0.5d - 1


Maybe

Suggested change

β = η + 0.5d - 1

β = η + d/2 - 1

or should this have some oftype?

β won't be promoted to whatever η is?

η = 1f0 d = 4 julia> typeof(η), typeof(η + 0.5d - 1) (Float32, Float64)

Right, thanks. Since Beta suffers from #960 it doesn't end up mattering, unfortunately.

Lets fix it anyway.

mschauer · 2020-02-11T15:09:25Z

src/matrix/lkj.jl

+    #  2.
+    for k in 2:d - 1
+        #  (a)
+        β -= 0.5


mschauer · 2020-02-11T15:16:00Z

One could test that importance sampling is successful, e.g.

E f(R) = E f(U) c_0(eta)/c_0(1) | U | (eta-1)

for R ~ LKJ(eta) and U ~ LKJ(1) and some functional f

johnczito · 2020-02-11T15:16:42Z

Do you know in which sense is this "uniform" if eta=1?

The density is constant in R. The determinant term collapses to 1 and you just have f(R; η) = c₀.

johnczito · 2020-02-11T15:29:09Z

One could test that importance sampling is successful, e.g.

Fair enough. I can add that.

mschauer · 2020-02-11T15:40:29Z

Yes, I can see that, but I suspect there is something to say what that distribution actually is. Or in other words, what is the reference measure for the densities?

johnczito · 2020-02-11T23:14:35Z

@mschauer It's a density with respect to Lebesgue measure, so uniform here just means the density is one over the Lebesgue measure of the support. The m = d(d-1)/2 free elements of a correlation matrix live in a subset of [-1, 1]ᵐ that has finite and strictly positive Lebesgue measure. So not some measure zero submanifold that requires a funky dominating measure to define a density. More here and here.

Consequently though, the log integrating constant here is currently off by a sign. Fixing that now.

mschauer · 2020-02-12T10:37:10Z

Thank you, this is nice. What is the price we pay for the test in terms of time?

test/lkj.jl

johnczito · 2020-02-12T11:59:50Z

Thank you, this is nice. What is the price we pay for the test in terms of time?

0.21 seconds

src/matrix/lkj.jl

johnczito · 2020-02-14T23:15:23Z

Thanks for all of the help on this, as usual. Let me know what else you want to see for this to be considered for merging.

matbesancon · 2020-02-14T23:18:17Z

Looks great to me. @johnczito maybe add a reference in the testset from the last commit to indicate where the stan test set comes from (documentation link or something)

matbesancon · 2020-02-14T23:27:15Z

Awesome, the tests were green before, so I'll just merge it here, thanks!

mschauer · 2020-02-14T23:42:24Z

I now believe that it is uniform in the constraint space of correlation matrices!

using Distributions
using Makie
K = 50000; ρ = [Point3f0(rand(LKJ(3, 1.0))[[2,3,6]]) for k in 1:K]
scatter(ρ, markersize=0.009, color=(:black, 0.2))

johnczito · 2020-02-14T23:50:55Z

@mschauer Nice! And that's a code snippet I'll definitely be stealing...

johnczito added 3 commits February 11, 2020 04:19

add LKJ

db1e5cd

test LKJ

2b99fce

Document LKJ

df10ccd

mschauer reviewed Feb 11, 2020

View reviewed changes

src/matrix/lkj.jl

# 2.

for k in 2:d - 1

# (a)

β -= 0.5

Copy link

Member

mschauer Feb 11, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cf 115

johnczito added 2 commits February 12, 2020 00:26

update integrating constant helpers, fix sign, update docstring

038d248

add explicit volume tests, and an importance sampling check

7c72ab1

mschauer reviewed Feb 12, 2020

View reviewed changes

test/lkj.jl Outdated Show resolved Hide resolved

johnczito added 5 commits February 12, 2020 14:35

use nifty comprehension to simplify IS test

771f16f

lil wording change

cfe56ec

fix mode(), handle d = 1 edge case

a963335

update tests

1a726ae

constant behavior in edge case

b490f4d

matbesancon reviewed Feb 13, 2020

View reviewed changes

src/matrix/lkj.jl Show resolved Hide resolved

matbesancon reviewed Feb 13, 2020

View reviewed changes

src/matrix/lkj.jl Show resolved Hide resolved

johnczito added 2 commits February 14, 2020 08:05

short circuit arg check and add little space

8166be8

test short circuit

bb6f0d7

matbesancon approved these changes Feb 14, 2020

View reviewed changes

Test LKJ logpdf against archived output from Stan

90c0ba7

add links to Stan test set

2c010df

matbesancon merged commit d521695 into JuliaStats:master Feb 14, 2020

johnczito deleted the add_lkj branch February 15, 2020 00:22

This was referenced May 26, 2020

Add LKJ Correlation Distribution TuringLang/Turing.jl#924

Closed

Varying Slopes StatisticalRethinkingJulia/TuringModels.jl#1

Closed

Add LKJ Matrix Distribution TuringLang/Bijectors.jl#108

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `LKJ` #1066

Add `LKJ` #1066

johnczito commented Feb 11, 2020 •

edited

Loading

johnczito commented Feb 11, 2020

codecov-io commented Feb 11, 2020 •

edited

Loading

mschauer commented Feb 11, 2020

johnczito commented Feb 11, 2020 •

edited

Loading

mschauer commented Feb 11, 2020

mschauer Feb 11, 2020

johnczito Feb 11, 2020

mschauer Feb 12, 2020

johnczito Feb 12, 2020

mschauer Feb 14, 2020

mschauer Feb 11, 2020

mschauer commented Feb 11, 2020

johnczito commented Feb 11, 2020

johnczito commented Feb 11, 2020

mschauer commented Feb 11, 2020

johnczito commented Feb 11, 2020 •

edited

Loading

mschauer commented Feb 12, 2020

johnczito commented Feb 12, 2020

johnczito commented Feb 14, 2020

matbesancon commented Feb 14, 2020

matbesancon commented Feb 14, 2020

mschauer commented Feb 14, 2020

johnczito commented Feb 14, 2020

Add LKJ #1066

Add LKJ #1066

Conversation

johnczito commented Feb 11, 2020 • edited Loading

johnczito commented Feb 11, 2020

codecov-io commented Feb 11, 2020 • edited Loading

Codecov Report

mschauer commented Feb 11, 2020

johnczito commented Feb 11, 2020 • edited Loading

mschauer commented Feb 11, 2020

mschauer Feb 11, 2020

Choose a reason for hiding this comment

johnczito Feb 11, 2020

Choose a reason for hiding this comment

mschauer Feb 12, 2020

Choose a reason for hiding this comment

johnczito Feb 12, 2020

Choose a reason for hiding this comment

mschauer Feb 14, 2020

Choose a reason for hiding this comment

mschauer Feb 11, 2020

Choose a reason for hiding this comment

mschauer commented Feb 11, 2020

johnczito commented Feb 11, 2020

johnczito commented Feb 11, 2020

mschauer commented Feb 11, 2020

johnczito commented Feb 11, 2020 • edited Loading

mschauer commented Feb 12, 2020

johnczito commented Feb 12, 2020

johnczito commented Feb 14, 2020

matbesancon commented Feb 14, 2020

matbesancon commented Feb 14, 2020

mschauer commented Feb 14, 2020

johnczito commented Feb 14, 2020

Add `LKJ` #1066

Add `LKJ` #1066

johnczito commented Feb 11, 2020 •

edited

Loading

codecov-io commented Feb 11, 2020 •

edited

Loading

johnczito commented Feb 11, 2020 •

edited

Loading

johnczito commented Feb 11, 2020 •

edited

Loading