Skip to main content

PyMC3 Extras extracted from the

Project description

PyMC3 Extras

This library include various experimental or otherwise special purpose extras for use with PyMC3 that have been extracted from the exoplanet project. The most widely useful component is probably the custom tuning functions for the PyMC3 NUTS sampler that is described below, but it also includes some helper functions for non-linear optimization and some custom distributions.

You'll find the usage instructions below and automatically generated tutorial notebooks on the notebooks branch on GitHub.

Installation

You'll need a Python installation (tested on versions 3.7 and 3.8) and it can often be best to install PyMC3 using conda so that it can handle all the details of compiler setup. This step is optional, but I would normally create a clean conda environment for projects that use PyMC3:

# Optional
conda create -n name-of-my-project python=3.8 pymc3
conda activate name-of-my-project

The easiest way to install this package is using pip:

python -m pip install -U pymc3-ext

This will also update the dependencies like PyMC3, which is probably what you want because this is only tested on recent versions of both of those packages.

NUTS tuning

The main function that the pymc3-ext library provides is a drop-in replacement for the pm.sample function in PyMC3:

import pymc3_ext as pmx

with model:
    trace = pmx.sample()

This adjusts the defaults (dense mass matrix adaptation and higher target acceptance fraction, for example) to be more in line with the needs of users in astrophysics, but it also provides some other features that might improve sampling performance.

The main features are described below, but more details can be found in the Sampling tutorial notebook.

Tuning schedule

One main difference between pmx.sample and the standard one implemented in PyMC3 is the tuning schedule. In PyMC3, the default tuning schedule is very simple: (a) update the step size parameter every step, and (b) update the mass matrix every N steps. In reality the procedure is slightly more convoluted, but this is the gist. pmx.sample instead uses a "fast" and "slow" update schedule with doubling adaptation windows that is nearly identical to the scheme used by the Stan project. We have found that this can significantly outperform the default algorithm and rarely leads to worse performance.

Parameter groups

Since this pmx.sample function defaults to estimating a dense mass matrix, the memory and computational requirements (as well as the numerical error) for the sampler can grow significantly for problems with a large number of parameters. This library includes support for grouping parameters that are known to have covariances so that the mass matrix is block diagonal rather than dense. For example, in the following snippet, the parameter x is known to be correlated, while the parameter y only requires a diagonal mass matrix:

import pymc3 as pm
import pymc3_ext as pmx

with pm.Model():
    # `L` is the Cholesky factor of a covariance matrix with off-diagonal elements
    x = pm.MvNormal("x", mu=np.zeros(L.shape[0]), chol=L, shape=L.shape[0])
    y = pm.Normal("y", shape=5)

    trace = pmx.sample(
        parameter_groups=[
            [x],
            pmx.sampling.ParameterGroup([y], "diag"),
        ],
    )

Variable target acceptance fraction (experimental)

The pmx.sample function also includes support for an experimental feature where the target acceptance fraction is adjusted throughout the tuning phase. This can be useful when the early warm up windows are slow to run because a high target acceptance fraction will require tiny step sizes that aren't well suited for searching for the typical set. This feature can be used by providing the initial_accept parameter (usually set to something like 0.5) to pmx.sample. This will be the target acceptance fraction at the beginning and it will be increased to target_accept (0.9 by default) throught the tuning phase.

Optimization

When PyMC3 added a warning to the pm.find_MAP function, we implemented a custom non-linear optimization framework in exoplanet because it is often useful to be able to optimize (at least) some parameters when initializing the sampler for many problems in astrophysics (and probably elsewhere). While pm.find_MAP no longer complains, the pymc3_ext.optimize function is included here for backwards compatibility even though it should have similar behavior to pm.find_MAP. To use this function, you'll do something like the following:

import pymc3_ext as pmx

with model:
    soln = pmx.optimize(vars=[var1, var2])
    soln = pmx.optimize(start=soln, vars=[var3])

Distributions

Most of the custom distributions in this library are there to make working with periodic parameters (like angles) easier. All of these reparameterizations could be implemented manually without too much trouble, but it can be useful to have them in a more compact form. Here is a list of the included distributions and a short description:

  • pmx.UnitVector: A vector where the sum of squares is fixed to unity. For a multidimensional shape, the normalization is performed along the last dimension.
  • pmx.UnitDisk: Two dimensional parameters constrianed to live within the unit disk. This will be useful when you have an angle and a magnitude that must be in the range zero to one (for example, an eccentricity vector for a bound orbit). This distribution is constrained such that the sum of squares along the zeroth axis will always be less than one. Note that the shape of this distribution must be two in the zeroth axis.
  • pmx.Angle: An angle constrained to be in the range -pi to pi. The actual sampling is performed in the two dimensional vector space (sin(theta), cos(theta)) so that the sampler doesn't see a discontinuity at pi. As a technical detail, the performance of this distribution can be affected using the regularization parameter which helps deal with pathelogical geometries introduced when this parameter is well/poorly constrained. The default value (10.0) was selected as a reasonable default choice, but you might get better performance by adjusting this.
  • pmx.Periodic: An extension to pmx.Angle that supports arbitrary upper and lower bounds for the allowed range.
  • pmx.UnitUniform: This distribution is equivalent to pm.Uniform(lower=0, upper=1), but it can be more numerically stable in some cases.

License

Copyright 2020 Dan Foreman-Mackey and contributors.

pymc3-ext is free software made available under the MIT License. For details see the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pymc3_ext-0.1.1.tar.gz (35.0 kB view hashes)

Uploaded Source

Built Distribution

pymc3_ext-0.1.1-py2.py3-none-any.whl (22.6 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page