class

LowerCholeskyTransform

extends Transform

LowerCholeskyTransform()


Bijection mapping an unconstrained matrix to a positive-diagonal Cholesky factor.

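The standard reparameterisation for learning covariance or scale matrices: a free $D \times D$ matrix is mapped to a lower-triangular matrix with strictly positive diagonal by zeroing the strict upper triangle and applying softplus to the diagonal. Only the $D(D+1)/2$ lower-triangular entries of the input (diagonal included) affect the output, so the map is one-to-one on those entries and the strict upper triangle of the input is discarded. Composing with a base distribution on $\mathbb{R}^{D \times D}$ produces a pushforward distribution over the cone of valid Cholesky factors. event_dim = 2.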

Notes

Forward (element-wise on a $D \times D$ input $X$):

$$L_{ij} = \begin{cases} \operatorname{softplus}(X_{ii}) & i = j \\ X_{ij} & i > j \\ 0 & i < j \end{cases}$$

Inverse:

$$X_{ii} = \operatorname{softplus}^{-1}(L_{ii}) = \log(e^{L_{ii}} - 1), \qquad X_{ij} = L_{ij}\;\;(i > j)$$
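The forward and inverse maps above can be sketched in plain Python, independent of lucid. The helper names (`softplus`, `softplus_inv`, `forward`, `inverse`) are illustrative and are not part of the library. Filling the strict upper triangle with zeros in the inverse is an assumption, since the inverse formula leaves those entries unspecified.

```python
import math

def softplus(x):
    # Stable softplus: log(1 + e^x) = max(x, 0) + log1p(e^{-|x|})
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def softplus_inv(y):
    # For y > 0: log(e^y - 1) = y + log(1 - e^{-y}), written with expm1 for accuracy
    return y + math.log(-math.expm1(-y))

def forward(X):
    # softplus on the diagonal, identity below it, zero above it
    D = len(X)
    return [[softplus(X[i][j]) if i == j else (X[i][j] if i > j else 0.0)
             for j in range(D)] for i in range(D)]

def inverse(L):
    # Assumption: the strict upper triangle of the recovered X is set to zero
    D = len(L)
    return [[softplus_inv(L[i][j]) if i == j else (L[i][j] if i > j else 0.0)
             for j in range(D)] for i in range(D)]

X = [[0.0, 0.0], [0.5, 0.0]]
L = forward(X)        # diagonal entries equal log(2)
X_back = inverse(L)   # recovers X up to floating-point error
```

The round trip only recovers inputs whose strict upper triangle is already zero, because the forward map discards those entries.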

Log Jacobian determinant (summed over the matrix event dims):

$$\log|\det J| = \sum_{i=1}^{D} \log \sigma(X_{ii}) = -\sum_{i=1}^{D} \operatorname{softplus}(-X_{ii})$$

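Strictly lower-triangular entries pass through unchanged (identity map) and contribute a unit Jacobian factor; only the softplus applied to the diagonal carries a non-trivial factor, since $\frac{d}{dx}\operatorname{softplus}(x) = \sigma(x)$.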
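As a sanity check on the log-determinant formula, the sketch below compares $-\sum_i \operatorname{softplus}(-X_{ii})$ against a central-difference estimate of the softplus derivative on each diagonal entry. It uses plain Python rather than lucid; the function names and the test matrix are made up for illustration.

```python
import math

def softplus(x):
    # Stable softplus: log(1 + e^x) = max(x, 0) + log1p(e^{-|x|})
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def log_abs_det_jacobian(X):
    # Only the diagonal contributes: log sigma(x) = -softplus(-x)
    return -sum(softplus(-X[i][i]) for i in range(len(X)))

# Entries above the diagonal are arbitrary; they do not affect the result.
X = [[0.3, 9.0, 9.0],
     [0.5, -1.2, 9.0],
     [-0.7, 2.0, 2.5]]

# Reference: log of the central-difference slope of softplus at each diagonal entry.
h = 1e-6
ref = sum(
    math.log((softplus(X[i][i] + h) - softplus(X[i][i] - h)) / (2 * h))
    for i in range(len(X))
)
ladj = log_abs_det_jacobian(X)
```

The two values agree to within finite-difference error, and the result is always negative because $\sigma(x) < 1$ for every finite $x$.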

For Cholesky factors of correlation matrices (rows of unit norm, so that $LL^\top$ has unit diagonal), use CorrCholeskyTransform instead.

Examples

>>> import lucid
>>> from lucid.distributions.transforms import LowerCholeskyTransform
>>> T = LowerCholeskyTransform()
>>> X = lucid.tensor([[0.0, 0.0], [0.5, 0.0]])
>>> L = T(X)  # softplus on diagonal, raw on strict lower

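Working this example through by hand with the forward and log-Jacobian formulas under Notes (rounded values, not output captured from the library): $\operatorname{softplus}(0) = \log 2 \approx 0.6931$, so

$$
L = \begin{pmatrix} \log 2 & 0 \\ 0.5 & \log 2 \end{pmatrix} \approx \begin{pmatrix} 0.6931 & 0 \\ 0.5 & 0.6931 \end{pmatrix}, \qquad \log|\det J| = 2 \log \sigma(0) = -2 \log 2 \approx -1.3863.
$$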

Instance methods

log_abs_det_jacobian

→Tensor

log_abs_det_jacobian(x: Tensor, y: Tensor)


Sum of $\log \sigma(x_{ii})$ over the diagonal.

Strictly lower-triangular entries pass through unchanged and contribute a unit Jacobian factor; only the softplus applied to the diagonal contributes a non-trivial factor, $\operatorname{softplus}'(x) = \sigma(x)$, computed via the stable identity $\log \sigma(x) = -\operatorname{softplus}(-x)$.
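To illustrate why the identity $\log \sigma(x) = -\operatorname{softplus}(-x)$ matters, the sketch below evaluates both forms at a large negative input. This is plain Python with hypothetical helper names, not lucid code. In pure Python the direct form raises `OverflowError`; a floating-point tensor implementation would typically return `-inf` there instead.

```python
import math

def log_sigmoid_naive(x):
    # Direct form log(1 / (1 + e^{-x})); e^{-x} overflows for large negative x
    return math.log(1.0 / (1.0 + math.exp(-x)))

def softplus(x):
    # Stable softplus: log(1 + e^x) = max(x, 0) + log1p(e^{-|x|})
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def log_sigmoid_stable(x):
    # log sigma(x) = -softplus(-x); never exponentiates a large positive number
    return -softplus(-x)

x = -800.0
try:
    naive = log_sigmoid_naive(x)
except OverflowError:
    naive = None                 # math.exp(800.0) is out of range for a float

stable = log_sigmoid_stable(x)   # finite: e^{-800} underflows harmlessly to 0.0

# For moderate inputs the two forms agree.
gap = abs(log_sigmoid_stable(1.5) - log_sigmoid_naive(1.5))
```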

Parameters

x: Tensor

Pre-transform input matrix (last two dims square).

y: Tensor

Post-transform output. Ignored: the log-Jacobian depends only on the diagonal of x.

Returns

Tensor

Per-sample log-Jacobian (reduces the last two matrix axes).
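The reduction over the two matrix axes can be pictured with a batch of inputs: each $D \times D$ matrix yields one scalar. The sketch below uses nested Python lists in place of tensors, so an input of "shape" (3, 2, 2) produces 3 values; the function name and batch are illustrative, and the library's own batching behaviour is assumed to follow the event_dim = 2 convention stated above.

```python
import math

def softplus(x):
    # Stable softplus: log(1 + e^x) = max(x, 0) + log1p(e^{-|x|})
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def batched_log_abs_det_jacobian(Xs):
    # One scalar per matrix: -sum_i softplus(-X_ii)
    return [-sum(softplus(-X[i][i]) for i in range(len(X))) for X in Xs]

batch = [
    [[0.0, 0.0], [0.5, 0.0]],
    [[1.0, 0.0], [-2.0, -1.0]],
    [[3.0, 0.0], [0.1, 3.0]],
]
ladj = batched_log_abs_det_jacobian(batch)  # three scalars; ladj[0] equals -2*log(2)
```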