Skip to content

CDF (Cumulative Distribution Function)

Formula

\[ F_X(x)=P(X\le x) \]

Plot

fn: 1/(1+exp(-x))
xmin: -6
xmax: 6
ymin: -0.05
ymax: 1.05
height: 280
title: Example CDF shape (logistic CDF)

Parameters

  • \(X\): random variable
  • \(F_X(x)\): cumulative probability up to value \(x\)

What it means

The CDF gives the probability that a random variable is less than or equal to a threshold.

What it's used for

  • Computing interval probabilities from differences of CDF values.
  • Describing distributions in one unified form (discrete or continuous).

Key properties

  • Nondecreasing and right-continuous.
  • \(\lim_{x\to -\infty}F_X(x)=0\), \(\lim_{x\to \infty}F_X(x)=1\).
  • For continuous \(X\), \(f_X(x)=F_X'(x)\) where differentiable.

Common gotchas

  • For discrete variables, the CDF has jumps.
  • The distinction between \(<\) and \(\le\) matters for discrete distributions.

Example

For a fair die, \(F_X(3)=P(X\le 3)=3/6=0.5\).

How to Compute (Pseudocode)

Input: distribution specification and query value(s)
Output: CDF values (or probabilities from it)

for each query value/interval:
  evaluate the distribution rule for the card's representation
  (PMF: point probability, PDF: density, CDF: cumulative probability)
return the computed value(s)

Complexity

  • Time: Depends on the distribution family and number of queries (often \(O(q)\) for \(q\) query points once parameters are known)
  • Space: \(O(q)\) for returned values (or \(O(1)\) for a single query)
  • Assumptions: Parameter-estimation cost is excluded; exact formulas and numerical methods vary by distribution family

See also