Standard Deviation¶
Formula¶
\[
\sigma = \sqrt{\operatorname{Var}(X)}
\]
\[
s = \sqrt{\frac{1}{n-1}\sum_{i=1}^n (x_i-\bar x)^2}
\]
Parameters¶
- \(X\): random variable
- \(x_i\): samples
- \(\bar x\): sample mean
- \(n\): number of samples
What it means¶
Typical distance of values from the mean.
What it's used for¶
- Measuring spread in data.
- Standardizing values (z-scores).
Key properties¶
- Same units as the data.
- \(\sigma \ge 0\); zero means all values are equal.
Common gotchas¶
- Sample standard deviation uses \(n-1\) (Bessel's correction).
- Sensitive to outliers.
Example¶
For samples \([1, 2, 4]\), \(s=1.528\).
How to Compute (Pseudocode)¶
Input: sample data (and any reference values needed by the statistic)
Output: statistic value
compute the summary quantities required by the formula (for example, mean, deviations, counts)
apply the statistic formula from the card
return the result
Complexity¶
- Time: Typically \(O(n)\) for \(n\) samples for common one-pass or two-pass summary-statistic computations (sorting-based medians are \(O(n\log n)\) unless selection is used)
- Space: \(O(1)\) to \(O(n)\) depending on whether values must be stored/sorted
- Assumptions: Sample-statistic workflow shown; parameter-estimation and streaming/online algorithms can change constants and memory usage