«

»

Aug 18

Elliptical Slice Sampling for priors with non-zero mean

I’ve been playing around with elliptical slice sampling (ESS) lately.  This is a new MCMC technique developed by Iain Murray, Ryan Adams, and David MacKay. Here is the original AISTATS paper. The punchline is that this method allows exact joint sampling for vectors whose posterior distribution can be expressed using any Gaussian prior and any (non-conjugate) likelihood. ESS does not have any free parameters, and always accepts its proposal.  This makes it favorable compared to, say, Metropolis-Hastings methods, since tuning the lengthscales that control a random walk to achieve good mixing rates requires expensive human time.

One tricky aspect of this method is that it requires a change of variables when applied to sample a variable with non-zero prior mean.  This is mentioned in the original paper, but not all that coherently. I thought this deserved a blog post, so that practitioners make sure they get it right.  The necessary steps are quite simple for experienced statisticians, so there’s nothing really novel or exciting happening. Read on for more.  Hopefully in the near future I’ll report on experiments comparing ESS to other methods on some more interesting models.

Change of Variables

Suppose we wish to draw variable of interest x from its posterior: p(x ) \propto \mathcal{N}( \mu_0, S_0 ) L(x), where the prior is Gaussian with mean \mu_0 and covariance S_0, and the likelihood L(x) is some generic function of x.

If x has zero-mean prior, we can easily use ESS as a “black box” to get posterior draws, by iterating the following for many samples x_1, x_2, \ldots x_T

  • For t = 1\ldots T
    • \nu \sim \mathcal{N}( 0, S_0 )
    • x_{t+1} \sim \mbox{ESS}(  x_{t}, \nu, L( ) )

where the variable \nu defines the ellipse that makes ESS possible (see original paper for details).

However, if x has a non-zero mean Gaussian prior Prior(x) = \mathcal{N}(\mu_0, S_0), one might be tempted to modify the above procedure as follows:

  • For t = 1\ldots T
    • \nu \sim \mathcal{N}( \mu_0, S_0 )  (WRONG!)
    • x_{t+1} \sim \mbox{ESS}(  x_{t}, \nu, L( ) )

This is just flat out wrong, because ESS is *only* valid when we are sampling a variable that has a zero mean prior.

To overcome this obstacle, we simply change our variable of interest x into \bar{x}, which has zero mean, by simply setting \bar{x} = x - \mu_0.   However, we need redefine our likelihood function to compensate for this change, so that it “adds back in” this prior mean.  Thus, we set \bar{L}( \bar{x} ) = L( \bar{x} + \mu_0 ) = L(x).

Then, it is quite simple to get posterior draws x_1, x_2, \ldots x_T from p(x) by repeating

  • For t = 1\ldots T
    • \nu \sim \mathcal{N}( 0, S_0 )
    • \bar{x}_{t+1} \sim \mbox{ESS}(  \bar{x}_{t}, \nu, \bar{L}( ) )

After all samples are collected, set \mathbf{x} = \mathbf{\bar{x}} + \mu_0, so that the resulting samples are correctly centered.

Toy Experiment

Here, I’ve completed a simple toy experiment to verify this procedure works as expected.  I’ve set up an easy problem, where both prior and likelihood for x have Gaussian distributions. This means we have a closed-form posterior for our variable x, and thus we know the “right answer”.

I’ve then sampled many draws from this posterior, using the “CORRECT” change of variables method, and the “WRONG” method that naively adds the mean directly into the \nu auxiliary variable within the ESS procedure.

Clearly, the change of variables method correctly finds the mean of the posterior, while the WRONG “include the prior mean” method is way off.  So don’t be fooled, and make sure you do the change of variables correctly.

Code to Try this Yourself

You can get a good matlab implementation of the basic ESS procedure from Iain Murray’s website: elliptical_slice.m

I then just used this matlab script to run this experiment and create the above plot:  ExploreESSWithMean.m

Along with this helper script for plotting the contours of Gaussian covariances: plotGauss.m

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>