class

LKJCholesky

extendsDistribution

LKJCholesky(dim: int, concentration: Tensor | float = 1.0, validate_args: bool | None = None)

source edit

LKJ distribution over Cholesky factors of correlation matrices.

Generative prior for the lower-triangular Cholesky factor $L$ of a correlation matrix $R = L L^\top$ of size $D \times D$ . Density is proportional to $\det(R)^{\eta - 1}$ so the $\eta$ parameter ("concentration") controls how concentrated the prior is around the identity (uncorrelated). Widely used as a weakly-informative prior on correlation structure in Bayesian hierarchical models — replacing an inverse-Wishart prior (which couples scale and correlation) with a clean separation.

Parameters

dimint

Size

D

of the correlation matrix (must be

\geq 2

concentrationTensor or float= 1.0

Shape parameter

\eta > 0

. Default 1.0 (uniform over the space of correlation matrices).

\eta > 1

concentrates mass near the identity (low correlation),

\eta < 1

favours high-correlation factors.

validate_argsbool= None

If True, validate parameter constraints at construction.

Notes

Probability density on the manifold of valid Cholesky factors $L$ :

p(L; \eta) \propto |\det R(L)|^{\eta - 1} = \prod_{i=1}^{D-1} L_{ii}^{D - i + 2(\eta - 1)}

where the second equality follows from the Jacobian of the transformation between $R$ and its Cholesky factor.

Sampling uses the vectorised Onion method (Lewandowski, Kurowicka & Joe, 2009, §3) — row $i$ is built incrementally so that its squared norm equals one, with off-diagonal magnitude drawn from a Beta and angular components from a uniform on the sphere. The full sampler runs in $\mathcal{O}(D^2)$ per draw with no rejections.

For $\eta = 1$ the distribution is uniform over valid correlation matrices (specifically, over the set of valid Cholesky factors).

Examples

>>> import lucid
>>> from lucid.distributions import LKJCholesky
>>> dist = LKJCholesky(dim=4, concentration=2.0)
>>> L = dist.sample()           # (4, 4) lower-triangular Cholesky factor
>>> R = L @ L.T                 # implied correlation matrix
>>> R.diagonal()                # diagonal of R is all 1's
Tensor([1., 1., 1., 1.])
Use as a Bayesian prior over correlation structure:
>>> prior = LKJCholesky(dim=8, concentration=1.5)
>>> # ... pair with marginal scales / standard deviations to form a
>>> #     covariance prior: Σ = diag(σ) · R · diag(σ)

Used by 1

lucid.distributions

Constructors

dunder

init

→None

__init__(dim: int, concentration: Tensor | float = 1.0, validate_args: bool | None = None)

source edit

Initialise an LKJ distribution over Cholesky factors.

Pre-computes the Beta distribution parameters used by the vectorised Onion sampler (Lewandowski et al. 2009, §3).

Parameters

dimint

Dimension

D \geq 2

of the correlation matrix.

concentrationTensor | float= 1.0

Shape parameter

\eta > 0

. Default is 1.0 (uniform distribution over correlation matrices). Larger values concentrate mass near the identity (near-zero correlations).

validate_argsbool | None= None

If True, validate parameter constraints at construction time.

Properties

prop

support

→Constraint

support: Constraint

source edit

Constraint: positive-definite matrices (proxy for correlation-Cholesky support).

Instance methods

log_prob

→Tensor

log_prob(value: Tensor)

source edit

Log-density of the LKJ distribution on a Cholesky factor.

The density is proportional to ∏_{i=2}^{d} L_{ii}^{order_i} where order_i = 2(η − 1) + (d − i).

The normaliser follows Eq. (15) of Lewandowski et al. (2009).

sample

→Tensor

sample(sample_shape: tuple[int, ...] = ())

source edit

Draw a sample via the vectorised Onion method (Lewandowski 2009 §3).

Returns the lower Cholesky factor L of a random correlation matrix.

>>> import lucid >>> from lucid.distributions import LKJCholesky >>> dist = LKJCholesky(dim=4, concentration=2.0) >>> L = dist.sample() # (4, 4) lower-triangular Cholesky factor >>> R = L @ L.T # implied correlation matrix >>> R.diagonal() # diagonal of R is all 1's Tensor([1., 1., 1., 1.]) Use as a Bayesian prior over correlation structure: >>> prior = LKJCholesky(dim=8, concentration=1.5) >>> # ... pair with marginal scales / standard deviations to form a >>> # covariance prior: Σ = diag(σ) · R · diag(σ)