Cholesky Decomposition¶

Formula¶

\[ A = L L^T \]

Parameters¶

\(A\): symmetric positive definite matrix
\(L\): lower triangular with positive diagonal

What it means¶

Efficient factorization for SPD matrices.

What it's used for¶

Solving SPD linear systems efficiently.
Sampling from multivariate Gaussians.

Key properties¶

Unique for SPD \(A\)
About half the cost of LU for SPD matrices

Common gotchas¶

Fails if \(A\) is not positive definite.
Numerical issues can arise if \(A\) is ill-conditioned.

Example¶

For \(A=egin{bmatrix}4&2\2&3 \end{bmatrix}\), \(L=egin{bmatrix}2&0\1&\sqrt{2} \end{bmatrix}\) since \(LL^T=A\).

How to Compute (Pseudocode)¶

Input: SPD matrix A (n x n)
Output: lower-triangular L such that A = L L^T

initialize L as zero matrix
for i from 1 to n:
  for j from 1 to i:
    s <- sum_{k=1}^{j-1} L[i,k] * L[j,k]
    if i == j:
      L[i,j] <- sqrt(A[i,i] - s)
    else:
      L[i,j] <- (A[i,j] - s) / L[j,j]

return L

Complexity¶

Time: \(O(n^3)\) for dense \(n \times n\) matrices (about half the constant factor of LU)
Space: \(O(n^2)\) to store \(A\) and/or \(L\)
Assumptions: Dense SPD matrix; in-place implementations may reduce extra memory

Cholesky Decomposition¶

Formula¶

Parameters¶

What it means¶

What it's used for¶

Key properties¶

Common gotchas¶

Example¶

How to Compute (Pseudocode)¶

Complexity¶

See also¶