diff --git a/docs/articles/mr_mash_intro.html b/docs/articles/mr_mash_intro.html
index dd33009..360c3d1 100644
--- a/docs/articles/mr_mash_intro.html
+++ b/docs/articles/mr_mash_intro.html
@@ -33,7 +33,7 @@
       </button>
       <span class="navbar-brand">
         <a class="navbar-link" href="../index.html">mr.mash.alpha</a>
-        <span class="version label label-default" data-toggle="tooltip" data-placement="bottom" title="">0.2-26</span>
+        <span class="version label label-default" data-toggle="tooltip" data-placement="bottom" title="">0.2-27</span>
       </span>
     </div>
 
@@ -42,6 +42,9 @@
 <li>
   <a href="../index.html">Home</a>
 </li>
+<li>
+  <a href="../articles/index.html">Vignettes</a>
+</li>
 <li>
   <a href="../reference/index.html">Functions</a>
 </li>
@@ -67,7 +70,7 @@ <h1 data-toc-skip>Introduction to mr.mash</h1>
                         <h4 data-toc-skip class="author">Peter
 Carbonetto &amp; Fabio Morgante</h4>
             
-            <h4 data-toc-skip class="date">2023-05-16</h4>
+            <h4 data-toc-skip class="date">2023-05-19</h4>
       
       <small class="dont-index">Source: <a href="https://github.com/stephenslab/mr.mash.alpha/blob/HEAD/vignettes/mr_mash_intro.Rmd" class="external-link"><code>vignettes/mr_mash_intro.Rmd</code></a></small>
       <div class="hidden name"><code>mr_mash_intro.Rmd</code></div>
@@ -76,105 +79,173 @@ <h4 data-toc-skip class="date">2023-05-16</h4>
 
     
     
-<p>The aim of this vignette is to introduce the basic steps of a
-<em>mr.mash</em> analysis, fitting the <em>mr.mash</em> model then using
-it to make predictions.</p>
-<p>First, we set the seed and load the <code>mr.mash.alpha</code> R
-package.</p>
+<p>The aim of this vignette is to introduce the basic steps of a mr.mash
+analysis through a toy example. To learn more about mr.mash, please see
+the <a href="https://doi.org/10.1101/2022.11.22.517471" class="external-link">paper</a>.</p>
+<p>First, we set the seed to make the results more easily reproducible,
+and we load the “mr.mash.alpha” package.</p>
 <div class="sourceCode" id="cb1"><pre class="downlit sourceCode r">
-<code class="sourceCode R"><span><span class="kw"><a href="https://rdrr.io/r/base/library.html" class="external-link">library</a></span><span class="op">(</span><span class="va"><a href="https://github.com/stephenslab/mr.mash.alpha" class="external-link">mr.mash.alpha</a></span><span class="op">)</span></span>
-<span><span class="fu"><a href="https://rdrr.io/r/base/Random.html" class="external-link">set.seed</a></span><span class="op">(</span><span class="fl">123</span><span class="op">)</span></span></code></pre></div>
-<div class="section level2">
-<h2 id="step-1-simulate-example-data">Step 1 – Simulate example data<a class="anchor" aria-label="anchor" href="#step-1-simulate-example-data"></a>
-</h2>
-<p>We start by simulating a data set with 800 individuals, 1000
-predictors and 5 responses. We then 5 causal variables (randomly sampled
-from the total 1000) are assigned equal effects across responses and
-explain 20% of the total per-response variance. This would be roughly
-equivalent to one gene in the “equal effects” scenario in the
-<em>mr.mash</em> (with the difference being that genotypes are simulated
-here).</p>
+<code class="sourceCode R"><span class="kw"><a href="https://rdrr.io/r/base/library.html" class="external-link">library</a></span><span class="op">(</span><span class="va"><a href="https://github.com/stephenslab/mr.mash.alpha" class="external-link">mr.mash.alpha</a></span><span class="op">)</span>
+<span class="fu"><a href="https://rdrr.io/r/base/Random.html" class="external-link">set.seed</a></span><span class="op">(</span><span class="fl">123</span><span class="op">)</span></code></pre></div>
+<p>We illustrate the application of mr.mash to a data set simulated from
+a multivariate, multiple linear regression with 5 responses in which the
+coefficients are the same for all responses. In the target application
+considered in the paper—prediction of multi-tissue gene expression from
+genotypes—this would correspond to the situation in which we would like
+to predict expression of a single gene in 5 different tissues from
+genotype data at multiple SNPs, and the SNPs have the same effects on
+gene expression in all 5 tissues. (In multi-tissue gene expression we
+would normally like to predict expression of many genes, but to simplify
+this vignette here we illustrate the key ideas with a single gene.)</p>
+<p>Although this simulation is not a particularly realistic, this is
+meant to illustrate the benefits of mr.mash: by modeling the sharing of
+effects across tissues, mr.mash is able to more accurately estimate the
+effects in multiple tissues, and therefore is able to obtain better
+predictions.</p>
+<p>We start by simulating 150 samples from a multivariate, multiple
+linear regression model in which 5 out of the 800 variables (SNPs)
+affect the 5 responses (expression levels).</p>
 <div class="sourceCode" id="cb2"><pre class="downlit sourceCode r">
-<code class="sourceCode R"><span><span class="va">n</span> <span class="op">&lt;-</span> <span class="fl">800</span></span>
-<span><span class="va">p</span> <span class="op">&lt;-</span> <span class="fl">1000</span></span>
-<span><span class="va">p_causal</span> <span class="op">&lt;-</span> <span class="fl">5</span></span>
-<span><span class="va">r</span> <span class="op">&lt;-</span> <span class="fl">5</span></span>
-<span><span class="va">pve</span> <span class="op">&lt;-</span> <span class="fl">0.2</span></span>
-<span><span class="va">B_cor</span> <span class="op">&lt;-</span> <span class="fl">1</span></span>
-<span><span class="va">out</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/simulate_mr_mash_data.html">simulate_mr_mash_data</a></span><span class="op">(</span><span class="va">n</span>, <span class="va">p</span>, <span class="va">p_causal</span>, <span class="va">r</span>, pve<span class="op">=</span><span class="va">pve</span>, B_cor<span class="op">=</span><span class="va">B_cor</span>,</span>
-<span>                             B_scale<span class="op">=</span><span class="fl">1</span>, X_cor<span class="op">=</span><span class="fl">0</span>, X_scale<span class="op">=</span><span class="fl">1</span>, V_cor<span class="op">=</span><span class="fl">0</span><span class="op">)</span></span></code></pre></div>
-</div>
-<div class="section level2">
-<h2 id="step-2-split-the-data-into-training-and-test-sets">Step 2 – Split the data into training and test sets<a class="anchor" aria-label="anchor" href="#step-2-split-the-data-into-training-and-test-sets"></a>
-</h2>
-<p>We then split the data into a training set and a test set.</p>
+<code class="sourceCode R"><span class="va">dat</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/simulate_mr_mash_data.html">simulate_mr_mash_data</a></span><span class="op">(</span>n <span class="op">=</span> <span class="fl">150</span>,p <span class="op">=</span> <span class="fl">800</span>,p_causal <span class="op">=</span> <span class="fl">5</span>,r <span class="op">=</span> <span class="fl">5</span>,pve <span class="op">=</span> <span class="fl">0.5</span>,
+                             V_cor <span class="op">=</span> <span class="fl">0.25</span><span class="op">)</span></code></pre></div>
+<p>Next we split the samples into a training set (with 100 samples) and
+test set (with 50 samples).</p>
 <div class="sourceCode" id="cb3"><pre class="downlit sourceCode r">
-<code class="sourceCode R"><span><span class="va">Ytrain</span> <span class="op">&lt;-</span> <span class="va">out</span><span class="op">$</span><span class="va">Y</span><span class="op">[</span><span class="op">-</span><span class="fu"><a href="https://rdrr.io/r/base/c.html" class="external-link">c</a></span><span class="op">(</span><span class="fl">1</span><span class="op">:</span><span class="fl">200</span><span class="op">)</span>,<span class="op">]</span></span>
-<span><span class="va">Xtrain</span> <span class="op">&lt;-</span> <span class="va">out</span><span class="op">$</span><span class="va">X</span><span class="op">[</span><span class="op">-</span><span class="fu"><a href="https://rdrr.io/r/base/c.html" class="external-link">c</a></span><span class="op">(</span><span class="fl">1</span><span class="op">:</span><span class="fl">200</span><span class="op">)</span>,<span class="op">]</span></span>
-<span><span class="va">Ytest</span> <span class="op">&lt;-</span> <span class="va">out</span><span class="op">$</span><span class="va">Y</span><span class="op">[</span><span class="fu"><a href="https://rdrr.io/r/base/c.html" class="external-link">c</a></span><span class="op">(</span><span class="fl">1</span><span class="op">:</span><span class="fl">200</span><span class="op">)</span>,<span class="op">]</span></span>
-<span><span class="va">Xtest</span> <span class="op">&lt;-</span> <span class="va">out</span><span class="op">$</span><span class="va">X</span><span class="op">[</span><span class="fu"><a href="https://rdrr.io/r/base/c.html" class="external-link">c</a></span><span class="op">(</span><span class="fl">1</span><span class="op">:</span><span class="fl">200</span><span class="op">)</span>,<span class="op">]</span></span></code></pre></div>
-</div>
+<code class="sourceCode R"><span class="va">ntest</span>  <span class="op">&lt;-</span> <span class="fl">50</span> 
+<span class="va">Ytrain</span> <span class="op">&lt;-</span> <span class="va">dat</span><span class="op">$</span><span class="va">Y</span><span class="op">[</span><span class="op">-</span><span class="op">(</span><span class="fl">1</span><span class="op">:</span><span class="va">ntest</span><span class="op">)</span>,<span class="op">]</span>
+<span class="va">Xtrain</span> <span class="op">&lt;-</span> <span class="va">dat</span><span class="op">$</span><span class="va">X</span><span class="op">[</span><span class="op">-</span><span class="op">(</span><span class="fl">1</span><span class="op">:</span><span class="va">ntest</span><span class="op">)</span>,<span class="op">]</span>
+<span class="va">Ytest</span>  <span class="op">&lt;-</span> <span class="va">dat</span><span class="op">$</span><span class="va">Y</span><span class="op">[</span><span class="fl">1</span><span class="op">:</span><span class="va">ntest</span>,<span class="op">]</span>
+<span class="va">Xtest</span>  <span class="op">&lt;-</span> <span class="va">dat</span><span class="op">$</span><span class="va">X</span><span class="op">[</span><span class="fl">1</span><span class="op">:</span><span class="va">ntest</span>,<span class="op">]</span></code></pre></div>
 <div class="section level2">
-<h2 id="step-3-define-the-mixture-prior">Step 3 – Define the mixture prior<a class="anchor" aria-label="anchor" href="#step-3-define-the-mixture-prior"></a>
+<h2 id="define-the-mr-mash-prior">Define the mr.mash prior<a class="anchor" aria-label="anchor" href="#define-the-mr-mash-prior"></a>
 </h2>
-<p>To run <em>mr.mash</em>, we need to first specify the covariances in
-the mixture-of-normals prior, which are supposed to capture the effect
-sharing patterns across responses. In this example, we use a mixture of
-“canonical” covariances computed using
-<code><a href="../reference/compute_canonical_covs.html">compute_canonical_covs()</a></code>. However, “data-driven”
-covariances can also be used – here’s an <a href="https://stephenslab.github.io/mashr/articles/intro_mash_dd.html" class="external-link">example</a>
-of how to compute these matrices. Regardless of the type of covariance
-matrices, these are each multiplied by a grid of scaling factors
-computed using <code><a href="../reference/autoselect.mixsd.html">autoselect.mixsd()</a></code>, which are supposed to
-capture the magnitude of the effects. The expansion is done using
-<code><a href="../reference/expand_covs.html">expand_covs()</a></code>, which also adds a matrix of all zeros (our
-spike) when requested. The grid is derived from the regression
-coefficients and their standard errors from univariate simple linear
-regression which can be computed using
-<code><a href="../reference/compute_univariate_sumstats.html">compute_univariate_sumstats()</a></code>.</p>
+<p>To run mr.mash, we need to first specify the covariances in the
+mixture of normals prior. The idea is that the chosen collection of
+covariance matrices should include a variety of potential effect sharing
+patterns, and in the model fitting stage the prior should then assign
+most weight to the sharing patterns that are present in the data, and
+little or no weight on patterns that are inconsistent with the data. In
+general, we recommend learning <a href="https://stephenslab.github.io/mashr/articles/intro_mash_dd.html" class="external-link">“data-driven”
+covariance matrices</a>. But here, for simplicity, we instead use
+“canonical” covariances which are not adaptive, but nonetheless well
+suited for this toy example since the true effects are the same across
+responses/tissues.</p>
 <div class="sourceCode" id="cb4"><pre class="downlit sourceCode r">
-<code class="sourceCode R"><span><span class="va">univ_sumstats</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/compute_univariate_sumstats.html">compute_univariate_sumstats</a></span><span class="op">(</span><span class="va">Xtrain</span>, <span class="va">Ytrain</span>,</span>
-<span>                   standardize<span class="op">=</span><span class="cn">TRUE</span>, standardize.response<span class="op">=</span><span class="cn">FALSE</span><span class="op">)</span></span>
-<span><span class="va">grid</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/autoselect.mixsd.html">autoselect.mixsd</a></span><span class="op">(</span><span class="va">univ_sumstats</span>, mult<span class="op">=</span><span class="fu"><a href="https://rdrr.io/r/base/MathFun.html" class="external-link">sqrt</a></span><span class="op">(</span><span class="fl">2</span><span class="op">)</span><span class="op">)</span><span class="op">^</span><span class="fl">2</span></span>
-<span><span class="va">S0</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/compute_canonical_covs.html">compute_canonical_covs</a></span><span class="op">(</span><span class="fu"><a href="https://rdrr.io/r/base/nrow.html" class="external-link">ncol</a></span><span class="op">(</span><span class="va">Ytrain</span><span class="op">)</span>, singletons<span class="op">=</span><span class="cn">TRUE</span>,</span>
-<span>                             hetgrid<span class="op">=</span><span class="fu"><a href="https://rdrr.io/r/base/c.html" class="external-link">c</a></span><span class="op">(</span><span class="fl">0</span>, <span class="fl">0.25</span>, <span class="fl">0.5</span>, <span class="fl">0.75</span>, <span class="fl">1</span><span class="op">)</span><span class="op">)</span></span>
-<span><span class="va">S0</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/expand_covs.html">expand_covs</a></span><span class="op">(</span><span class="va">S0</span>, <span class="va">grid</span>, zeromat<span class="op">=</span><span class="cn">TRUE</span><span class="op">)</span></span></code></pre></div>
+<code class="sourceCode R"><span class="va">S0</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/compute_canonical_covs.html">compute_canonical_covs</a></span><span class="op">(</span>r <span class="op">=</span> <span class="fl">5</span>,singletons <span class="op">=</span> <span class="cn">TRUE</span>,
+                             hetgrid <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/seq.html" class="external-link">seq</a></span><span class="op">(</span><span class="fl">0</span>,<span class="fl">1</span>,<span class="fl">0.25</span><span class="op">)</span><span class="op">)</span></code></pre></div>
+<p>This gives a mixture of 10 covariance matrices capturing a variety of
+“canonical” effect-sharing patterns:</p>
+<div class="sourceCode" id="cb5"><pre class="downlit sourceCode r">
+<code class="sourceCode R"><span class="fu"><a href="https://rdrr.io/r/base/names.html" class="external-link">names</a></span><span class="op">(</span><span class="va">S0</span><span class="op">)</span>
+<span class="co">#  [1] "singleton1"  "singleton2"  "singleton3"  "singleton4"  "singleton5" </span>
+<span class="co">#  [6] "independent" "shared0.25"  "shared0.5"   "shared0.75"  "shared1"</span></code></pre></div>
+<p>To illustrate the benefits of modeling a variety of effect-sharing
+patterns, we also try out mr.mash with a simpler mixture of covariance
+matrices in which the effects are effectively independent across
+tissues. Although this may seem to be a very poor choice of prior,
+particularly for this example, it turns out that several multivariate
+regression methods assume, implicitly or explicitly, this prior.</p>
+<div class="sourceCode" id="cb6"><pre class="downlit sourceCode r">
+<code class="sourceCode R"><span class="va">S0_ind</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/compute_canonical_covs.html">compute_canonical_covs</a></span><span class="op">(</span>r <span class="op">=</span> <span class="fl">5</span>,singletons <span class="op">=</span> <span class="cn">FALSE</span>,
+                                 hetgrid <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/c.html" class="external-link">c</a></span><span class="op">(</span><span class="fl">0</span>,<span class="fl">0.001</span>,<span class="fl">0.01</span><span class="op">)</span><span class="op">)</span>
+<span class="fu"><a href="https://rdrr.io/r/base/names.html" class="external-link">names</a></span><span class="op">(</span><span class="va">S0_ind</span><span class="op">)</span>
+<span class="co"># [1] "independent" "shared0.001" "shared0.01"</span></code></pre></div>
+<p>Regardless of the covariance matrices are chosen, it is recommended
+to also consider a variety of effect scales in the prior. This is
+normally achieved in mr.mash by expanding the mixture across a specifed
+grid of scaling factors. Here we choose this grid in an adaptive fashion
+based on the data:</p>
+<div class="sourceCode" id="cb7"><pre class="downlit sourceCode r">
+<code class="sourceCode R"><span class="va">univ_sumstats</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/compute_univariate_sumstats.html">compute_univariate_sumstats</a></span><span class="op">(</span><span class="va">Xtrain</span>,<span class="va">Ytrain</span>,standardize <span class="op">=</span> <span class="cn">TRUE</span><span class="op">)</span>
+<span class="va">scaling_grid</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/autoselect.mixsd.html">autoselect.mixsd</a></span><span class="op">(</span><span class="va">univ_sumstats</span>,mult <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/MathFun.html" class="external-link">sqrt</a></span><span class="op">(</span><span class="fl">2</span><span class="op">)</span><span class="op">)</span><span class="op">^</span><span class="fl">2</span>
+<span class="va">S0</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/expand_covs.html">expand_covs</a></span><span class="op">(</span><span class="va">S0</span>,<span class="va">scaling_grid</span><span class="op">)</span>
+<span class="va">S0_ind</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/expand_covs.html">expand_covs</a></span><span class="op">(</span><span class="va">S0_ind</span>,<span class="va">scaling_grid</span><span class="op">)</span></code></pre></div>
 </div>
 <div class="section level2">
-<h2 id="step-4-fit-mr-mash-to-the-training-data">Step 4 – Fit <em>mr.mash</em> to the training data<a class="anchor" aria-label="anchor" href="#step-4-fit-mr-mash-to-the-training-data"></a>
+<h2 id="fit-a-mr-mash-model-to-the-data">Fit a mr.mash model to the data<a class="anchor" aria-label="anchor" href="#fit-a-mr-mash-model-to-the-data"></a>
 </h2>
-<p>Now we are ready to fit a mr.mash model to the training data using
-<code><a href="../reference/mr.mash.html">mr.mash()</a></code>, to estimate the posterior mean of the regression
-coefficients.</p>
-<div class="sourceCode" id="cb5"><pre class="downlit sourceCode r">
-<code class="sourceCode R"><span><span class="va">fit</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/mr.mash.html">mr.mash</a></span><span class="op">(</span><span class="va">Xtrain</span>, <span class="va">Ytrain</span>, <span class="va">S0</span>, update_V<span class="op">=</span><span class="cn">TRUE</span>, verbose<span class="op">=</span><span class="cn">FALSE</span><span class="op">)</span></span></code></pre></div>
+<p>Having specified the mr.mash prior, we are now ready to fit a mr.mash
+model to the training data (this may take a few minutes to run):</p>
+<div class="sourceCode" id="cb8"><pre class="downlit sourceCode r">
+<code class="sourceCode R"><span class="va">fit</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/mr.mash.html">mr.mash</a></span><span class="op">(</span><span class="va">Xtrain</span>,<span class="va">Ytrain</span>,<span class="va">S0</span>,update_V <span class="op">=</span> <span class="cn">TRUE</span><span class="op">)</span></code></pre></div>
+<p>And for comparison we fit a second mr.mash model using the simpler
+and less flexible prior:</p>
+<div class="sourceCode" id="cb9"><pre class="downlit sourceCode r">
+<code class="sourceCode R"><span class="va">fit_ind</span> <span class="op">&lt;-</span> <span class="fu"><a href="../reference/mr.mash.html">mr.mash</a></span><span class="op">(</span><span class="va">Xtrain</span>,<span class="va">Ytrain</span>,<span class="va">S0_ind</span>,update_V <span class="op">=</span> <span class="cn">TRUE</span><span class="op">)</span></code></pre></div>
+<p>(Notice that the less complex model also takes less time to fit.)</p>
+<p>For prediction, the key output is the posterior mean estimtes of the
+regression coefficients, stored in the “mu1” output. Let’s compare the
+estimates to the ground truth:</p>
+<div class="sourceCode" id="cb10"><pre class="downlit sourceCode r">
+<code class="sourceCode R"><span class="fu"><a href="https://rdrr.io/r/graphics/par.html" class="external-link">par</a></span><span class="op">(</span>mfrow <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/c.html" class="external-link">c</a></span><span class="op">(</span><span class="fl">1</span>,<span class="fl">2</span><span class="op">)</span><span class="op">)</span>
+<span class="fu"><a href="https://rdrr.io/r/graphics/plot.html" class="external-link">plot</a></span><span class="op">(</span><span class="va">dat</span><span class="op">$</span><span class="va">B</span>,<span class="va">fit_ind</span><span class="op">$</span><span class="va">mu1</span>,pch <span class="op">=</span> <span class="fl">20</span>,xlab <span class="op">=</span> <span class="st">"true"</span>,ylab <span class="op">=</span> <span class="st">"estimated"</span>,
+     main <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/sprintf.html" class="external-link">sprintf</a></span><span class="op">(</span><span class="st">"cor = %0.3f"</span>,
+                    <span class="fu"><a href="https://rdrr.io/r/stats/cor.html" class="external-link">cor</a></span><span class="op">(</span><span class="fu"><a href="https://rdrr.io/r/base/vector.html" class="external-link">as.vector</a></span><span class="op">(</span><span class="va">dat</span><span class="op">$</span><span class="va">B</span><span class="op">)</span>,<span class="fu"><a href="https://rdrr.io/r/base/vector.html" class="external-link">as.vector</a></span><span class="op">(</span><span class="va">fit_ind</span><span class="op">$</span><span class="va">mu1</span><span class="op">)</span><span class="op">)</span><span class="op">)</span><span class="op">)</span>
+<span class="fu"><a href="https://rdrr.io/r/graphics/abline.html" class="external-link">abline</a></span><span class="op">(</span>a <span class="op">=</span> <span class="fl">0</span>,b <span class="op">=</span> <span class="fl">1</span>,col <span class="op">=</span> <span class="st">"royalblue"</span>,lty <span class="op">=</span> <span class="st">"dotted"</span><span class="op">)</span>
+<span class="fu"><a href="https://rdrr.io/r/graphics/plot.html" class="external-link">plot</a></span><span class="op">(</span><span class="va">dat</span><span class="op">$</span><span class="va">B</span>,<span class="va">fit</span><span class="op">$</span><span class="va">mu1</span>,pch <span class="op">=</span> <span class="fl">20</span>,xlab <span class="op">=</span> <span class="st">"true"</span>,ylab <span class="op">=</span> <span class="st">"estimated"</span>,
+     main <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/sprintf.html" class="external-link">sprintf</a></span><span class="op">(</span><span class="st">"cor = %0.3f"</span>,
+                    <span class="fu"><a href="https://rdrr.io/r/stats/cor.html" class="external-link">cor</a></span><span class="op">(</span><span class="fu"><a href="https://rdrr.io/r/base/vector.html" class="external-link">as.vector</a></span><span class="op">(</span><span class="va">dat</span><span class="op">$</span><span class="va">B</span><span class="op">)</span>,<span class="fu"><a href="https://rdrr.io/r/base/vector.html" class="external-link">as.vector</a></span><span class="op">(</span><span class="va">fit</span><span class="op">$</span><span class="va">mu1</span><span class="op">)</span><span class="op">)</span><span class="op">)</span><span class="op">)</span>
+<span class="fu"><a href="https://rdrr.io/r/graphics/abline.html" class="external-link">abline</a></span><span class="op">(</span>a <span class="op">=</span> <span class="fl">0</span>,b <span class="op">=</span> <span class="fl">1</span>,col <span class="op">=</span> <span class="st">"royalblue"</span>,lty <span class="op">=</span> <span class="st">"dotted"</span><span class="op">)</span></code></pre></div>
+<p><img src="mr_mash_intro_files/figure-html/plot-coefs-1.png" width="720" style="display: block; margin: auto;"></p>
+<p>As expected, the coefficients on the left-hand side obtained using an
+“independent effects” prior are not as accurate as the the coefficients
+estimated using the more flexible prior (right-hand side).</p>
+<p>While perhaps not of primary interest, for diagnostic purposes it is
+often helpfl to examine the estimated mixture weights in the prior as
+well as the estimated residual covariance matrix.</p>
+<p>Inspecting the top prior mixture weights from the better model, it is
+helpful to see that the “null” and “shared1” components are among the
+top components by weight. (The top component is the null component
+because most of the SNPs have no effect on gene expression.)</p>
+<div class="sourceCode" id="cb11"><pre class="downlit sourceCode r">
+<code class="sourceCode R"><span class="fu"><a href="https://rdrr.io/r/utils/head.html" class="external-link">head</a></span><span class="op">(</span><span class="fu"><a href="https://rdrr.io/r/base/sort.html" class="external-link">sort</a></span><span class="op">(</span><span class="va">fit</span><span class="op">$</span><span class="va">w0</span>,decreasing <span class="op">=</span> <span class="cn">TRUE</span><span class="op">)</span>,n <span class="op">=</span> <span class="fl">10</span><span class="op">)</span>
+<span class="co">#             null singleton2_grid1 singleton1_grid1 singleton5_grid1 </span>
+<span class="co">#       0.04741471       0.04243749       0.04202594       0.04141742 </span>
+<span class="co"># singleton4_grid1 singleton3_grid1 singleton2_grid2    shared1_grid1 </span>
+<span class="co">#       0.04038929       0.04022118       0.03798986       0.03788223 </span>
+<span class="co"># singleton1_grid2 singleton5_grid2 </span>
+<span class="co">#       0.03724834       0.03618774</span></code></pre></div>
+<p>Also, reassuringly, the estimated residual variance-covariance matrix
+is close to the matrix used to simulate the data:</p>
+<div class="sourceCode" id="cb12"><pre class="downlit sourceCode r">
+<code class="sourceCode R"><span class="va">dat</span><span class="op">$</span><span class="va">V</span>
+<span class="co">#           [,1]      [,2]      [,3]      [,4]      [,5]</span>
+<span class="co"># [1,] 3.5145375 0.8786344 0.8786344 0.8786344 0.8786344</span>
+<span class="co"># [2,] 0.8786344 3.5145373 0.8786343 0.8786343 0.8786343</span>
+<span class="co"># [3,] 0.8786344 0.8786343 3.5145373 0.8786343 0.8786343</span>
+<span class="co"># [4,] 0.8786344 0.8786343 0.8786343 3.5145373 0.8786343</span>
+<span class="co"># [5,] 0.8786344 0.8786343 0.8786343 0.8786343 3.5145373</span>
+<span class="va">fit</span><span class="op">$</span><span class="va">V</span>
+<span class="co">#           [,1]      [,2]      [,3]      [,4]      [,5]</span>
+<span class="co"># [1,] 3.2407233 0.8502217 0.8841572 1.3563118 1.0189386</span>
+<span class="co"># [2,] 0.8502217 3.5174997 1.2133736 0.6212035 0.1021557</span>
+<span class="co"># [3,] 0.8841572 1.2133736 2.5068761 0.6631672 1.0174394</span>
+<span class="co"># [4,] 1.3563118 0.6212035 0.6631672 2.7869990 0.7854292</span>
+<span class="co"># [5,] 1.0189386 0.1021557 1.0174394 0.7854292 2.9687632</span></code></pre></div>
 </div>
 <div class="section level2">
-<h2 id="step-5-predict-responses-in-the-test">Step 5 – Predict responses in the test<a class="anchor" aria-label="anchor" href="#step-5-predict-responses-in-the-test"></a>
+<h2 id="use-the-fitted-mr-mash-model-to-make-predictions">Use the fitted mr.mash model to make predictions<a class="anchor" aria-label="anchor" href="#use-the-fitted-mr-mash-model-to-make-predictions"></a>
 </h2>
-<p>We then use the fitted model from step 4 to predict the response
-values in the test set. In this plot, we compare the true and predicted
-values</p>
-<div class="sourceCode" id="cb6"><pre class="downlit sourceCode r">
-<code class="sourceCode R"><span><span class="va">Ytest_est</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/r/stats/predict.html" class="external-link">predict</a></span><span class="op">(</span><span class="va">fit</span>,<span class="va">Xtest</span><span class="op">)</span></span>
-<span><span class="fu"><a href="https://rdrr.io/r/graphics/plot.default.html" class="external-link">plot</a></span><span class="op">(</span><span class="va">Ytest_est</span>,<span class="va">Ytest</span>,pch <span class="op">=</span> <span class="fl">20</span>,col <span class="op">=</span> <span class="st">"darkblue"</span>,xlab <span class="op">=</span> <span class="st">"true"</span>,</span>
-<span>     ylab <span class="op">=</span> <span class="st">"predicted"</span><span class="op">)</span></span>
-<span><span class="fu"><a href="https://rdrr.io/r/graphics/abline.html" class="external-link">abline</a></span><span class="op">(</span>a <span class="op">=</span> <span class="fl">0</span>,b <span class="op">=</span> <span class="fl">1</span>,col <span class="op">=</span> <span class="st">"magenta"</span>,lty <span class="op">=</span> <span class="st">"dotted"</span><span class="op">)</span></span></code></pre></div>
-<p><img src="mr_mash_intro_files/figure-html/plot-pred-test-1.png" width="600" style="display: block; margin: auto;"></p>
-<p>However, we also want to assess prediction accuracy more formally.
-Here, we do that in terms of <span class="math inline">\(R^2\)</span>
-which is easy to interpret – its maximum value would be the proportion
-of variance explained, 0.2 in this case).</p>
-<div class="sourceCode" id="cb7"><pre class="downlit sourceCode r">
-<code class="sourceCode R"><span><span class="va">r2</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/r/base/vector.html" class="external-link">vector</a></span><span class="op">(</span><span class="st">"numeric"</span>, <span class="va">r</span><span class="op">)</span></span>
-<span><span class="kw">for</span><span class="op">(</span><span class="va">i</span> <span class="kw">in</span> <span class="fl">1</span><span class="op">:</span><span class="va">r</span><span class="op">)</span><span class="op">{</span></span>
-<span>  <span class="va">fit_acc</span>  <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/r/stats/lm.html" class="external-link">lm</a></span><span class="op">(</span><span class="va">Ytest</span><span class="op">[</span>, <span class="va">i</span><span class="op">]</span> <span class="op">~</span> <span class="va">Ytest_est</span><span class="op">[</span>, <span class="va">i</span><span class="op">]</span><span class="op">)</span></span>
-<span>  <span class="va">r2</span><span class="op">[</span><span class="va">i</span><span class="op">]</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/r/base/summary.html" class="external-link">summary</a></span><span class="op">(</span><span class="va">fit_acc</span><span class="op">)</span><span class="op">$</span><span class="va">r.squared</span></span>
-<span><span class="op">}</span></span>
-<span></span>
-<span><span class="va">r2</span></span>
-<span><span class="co"># [1] 0.1426443 0.1937896 0.1248826 0.1756781 0.2181555</span></span></code></pre></div>
-<p>We can see that the predictions are pretty accurate.</p>
+<p>We can use the fitted mr.mash model to predict gene expression from a
+genotype sample, including a sample not included in the training set.
+This is implemented by the “predict” method. Let’s compare the
+predictions from the two mr.mash models:</p>
+<div class="sourceCode" id="cb13"><pre class="downlit sourceCode r">
+<code class="sourceCode R"><span class="fu"><a href="https://rdrr.io/r/graphics/par.html" class="external-link">par</a></span><span class="op">(</span>mfrow <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/c.html" class="external-link">c</a></span><span class="op">(</span><span class="fl">1</span>,<span class="fl">2</span><span class="op">)</span><span class="op">)</span>
+<span class="va">Ypred</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/r/stats/predict.html" class="external-link">predict</a></span><span class="op">(</span><span class="va">fit</span>,<span class="va">Xtest</span><span class="op">)</span>
+<span class="va">Ypred_ind</span> <span class="op">&lt;-</span> <span class="fu"><a href="https://rdrr.io/r/stats/predict.html" class="external-link">predict</a></span><span class="op">(</span><span class="va">fit_ind</span>,<span class="va">Xtest</span><span class="op">)</span>
+<span class="fu"><a href="https://rdrr.io/r/graphics/plot.html" class="external-link">plot</a></span><span class="op">(</span><span class="va">Ytest</span>,<span class="va">Ypred_ind</span>,pch <span class="op">=</span> <span class="fl">20</span>,col <span class="op">=</span> <span class="st">"darkblue"</span>,xlab <span class="op">=</span> <span class="st">"true"</span>,
+     ylab <span class="op">=</span> <span class="st">"predicted"</span>,
+     main <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/sprintf.html" class="external-link">sprintf</a></span><span class="op">(</span><span class="st">"cor = %0.3f"</span>,<span class="fu"><a href="https://rdrr.io/r/stats/cor.html" class="external-link">cor</a></span><span class="op">(</span><span class="fu"><a href="https://rdrr.io/r/base/vector.html" class="external-link">as.vector</a></span><span class="op">(</span><span class="va">Ytest</span><span class="op">)</span>,<span class="fu"><a href="https://rdrr.io/r/base/vector.html" class="external-link">as.vector</a></span><span class="op">(</span><span class="va">Ypred_ind</span><span class="op">)</span><span class="op">)</span><span class="op">)</span><span class="op">)</span>
+<span class="fu"><a href="https://rdrr.io/r/graphics/abline.html" class="external-link">abline</a></span><span class="op">(</span>a <span class="op">=</span> <span class="fl">0</span>,b <span class="op">=</span> <span class="fl">1</span>,col <span class="op">=</span> <span class="st">"magenta"</span>,lty <span class="op">=</span> <span class="st">"dotted"</span><span class="op">)</span>
+<span class="fu"><a href="https://rdrr.io/r/graphics/plot.html" class="external-link">plot</a></span><span class="op">(</span><span class="va">Ytest</span>,<span class="va">Ypred</span>,pch <span class="op">=</span> <span class="fl">20</span>,col <span class="op">=</span> <span class="st">"darkblue"</span>,xlab <span class="op">=</span> <span class="st">"true"</span>,
+     ylab <span class="op">=</span> <span class="st">"predicted"</span>,
+     main <span class="op">=</span> <span class="fu"><a href="https://rdrr.io/r/base/sprintf.html" class="external-link">sprintf</a></span><span class="op">(</span><span class="st">"cor = %0.3f"</span>,<span class="fu"><a href="https://rdrr.io/r/stats/cor.html" class="external-link">cor</a></span><span class="op">(</span><span class="fu"><a href="https://rdrr.io/r/base/vector.html" class="external-link">as.vector</a></span><span class="op">(</span><span class="va">Ytest</span><span class="op">)</span>,<span class="fu"><a href="https://rdrr.io/r/base/vector.html" class="external-link">as.vector</a></span><span class="op">(</span><span class="va">Ypred</span><span class="op">)</span><span class="op">)</span><span class="op">)</span><span class="op">)</span>
+<span class="fu"><a href="https://rdrr.io/r/graphics/abline.html" class="external-link">abline</a></span><span class="op">(</span>a <span class="op">=</span> <span class="fl">0</span>,b <span class="op">=</span> <span class="fl">1</span>,col <span class="op">=</span> <span class="st">"magenta"</span>,lty <span class="op">=</span> <span class="st">"dotted"</span><span class="op">)</span></code></pre></div>
+<p><img src="mr_mash_intro_files/figure-html/plot-pred-test-1.png" width="720" style="display: block; margin: auto;"></p>
+<p>Indeed, mr.mash with the more flexible prior (right-hand plot)
+produces more accurate predictions than mr.mash with the “independent
+effects” prior.</p>
 </div>
   </div>
 
diff --git a/docs/articles/mr_mash_intro_files/figure-html/plot-coefs-1.png b/docs/articles/mr_mash_intro_files/figure-html/plot-coefs-1.png
new file mode 100644
index 0000000..a8ad78c
Binary files /dev/null and b/docs/articles/mr_mash_intro_files/figure-html/plot-coefs-1.png differ
diff --git a/docs/articles/mr_mash_intro_files/figure-html/plot-pred-test-1.png b/docs/articles/mr_mash_intro_files/figure-html/plot-pred-test-1.png
index 766aaff..d513fc6 100644
Binary files a/docs/articles/mr_mash_intro_files/figure-html/plot-pred-test-1.png and b/docs/articles/mr_mash_intro_files/figure-html/plot-pred-test-1.png differ
diff --git a/vignettes/mr_mash_intro.Rmd b/vignettes/mr_mash_intro.Rmd
index 63fd66c..4e6da3a 100644
--- a/vignettes/mr_mash_intro.Rmd
+++ b/vignettes/mr_mash_intro.Rmd
@@ -19,7 +19,7 @@ mr.mash analysis through a toy example. To learn more about
 mr.mash, please see the [paper][mr-mash-biorxiv].
 
 ```{r knitr-opts, include=FALSE}
-knitr::opts_chunk$set(comment = "#",collapse = TRUE,results = "hold",
+knitr::opts_chunk$set(comment = "#",collapse = TRUE,
                       fig.align = "center",dpi = 120)
 ```
 
@@ -128,14 +128,14 @@ Fit a mr.mash model to the data
 Having specified the mr.mash prior, we are now ready to fit a mr.mash
 model to the training data (this may take a few minutes to run):
 
-```{r fit-mr-mash-1}
+```{r fit-mr-mash-1, results="hide"}
 fit <- mr.mash(Xtrain,Ytrain,S0,update_V = TRUE)
 ```
 
 And for comparison we fit a second mr.mash model using the simpler and
 less flexible prior:
 
-```{r fit-mr-mash-2}
+```{r fit-mr-mash-2, results="hide"}
 fit_ind <- mr.mash(Xtrain,Ytrain,S0_ind,update_V = TRUE)
 ```
 
@@ -165,25 +165,48 @@ While perhaps not of primary interest, for diagnostic purposes it is
 often helpfl to examine the estimated mixture weights in the prior as
 well as the estimated residual covariance matrix.
 
+Inspecting the top prior mixture weights from the better model, it is
+helpful to see that the "null" and "shared1" components are among the
+top components by weight. (The top component is the null component
+because most of the SNPs have no effect on gene expression.)
+
 ```{r prior-mixture-weights}
+head(sort(fit$w0,decreasing = TRUE),n = 10)
+```
+
+Also, reassuringly, the estimated residual variance-covariance matrix
+is close to the matrix used to simulate the data:
 
+```{r resid-var}
+dat$V
+fit$V
 ```
 
-## Step 5 -- Predict responses in the test
-We then use the fitted model from step 4 to predict the response values in the 
-test set. In this plot, we compare the true and predicted values
+Use the fitted mr.mash model to make predictions
+------------------------------------------------
 
-```{r plot-pred-test, fig.height=5, fig.width=5}
+We can use the fitted mr.mash model to predict gene expression from
+a genotype sample, including a sample not included in the training
+set. This is implemented by the "predict" method. Let's compare the
+predictions from the two mr.mash models:
+
+```{r plot-pred-test, fig.height=3.5, fig.width=6}
 par(mfrow = c(1,2))
-Ytest_est <- predict(fit_ind,Xtest)
-plot(Ytest_est,Ytest,pch = 20,col = "darkblue",xlab = "true",
-     ylab = "predicted",main = cor(as.vector(Ytest_est),as.vector(Ytest)))
+Ypred <- predict(fit,Xtest)
+Ypred_ind <- predict(fit_ind,Xtest)
+plot(Ytest,Ypred_ind,pch = 20,col = "darkblue",xlab = "true",
+     ylab = "predicted",
+     main = sprintf("cor = %0.3f",cor(as.vector(Ytest),as.vector(Ypred_ind))))
 abline(a = 0,b = 1,col = "magenta",lty = "dotted")
-Ytest_est <- predict(fit,Xtest)
-plot(Ytest_est,Ytest,pch = 20,col = "darkblue",xlab = "true",
-     ylab = "predicted",main = cor(as.vector(Ytest_est),as.vector(Ytest)))
+plot(Ytest,Ypred,pch = 20,col = "darkblue",xlab = "true",
+     ylab = "predicted",
+     main = sprintf("cor = %0.3f",cor(as.vector(Ytest),as.vector(Ypred))))
 abline(a = 0,b = 1,col = "magenta",lty = "dotted")
 ```
 
+Indeed, mr.mash with the more flexible prior (right-hand plot)
+produces more accurate predictions than mr.mash with the "independent
+effects" prior.
+
 [mr-mash-biorxiv]: https://doi.org/10.1101/2022.11.22.517471 
 [mashr-dd-vignette]: https://stephenslab.github.io/mashr/articles/intro_mash_dd.html