Learning Rate Schedule¶
Formula¶
\[
\eta_t = s(t)
\]
Plot¶
Example decay schedule (normalized): \(s(t) = e^{-0.6\,t}\) plotted for \(t \in [0, 8]\).
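A minimal matplotlib sketch that reproduces the plot above; the axis labels are assumptions, since the original figure carries only a title:

import numpy as np
import matplotlib.pyplot as plt

t = np.linspace(0, 8, 200)
plt.plot(t, np.exp(-0.6 * t))  # normalized decay curve s(t) = exp(-0.6 t)
plt.ylim(0, 1.05)
plt.title("Example decay schedule (normalized)")
plt.xlabel("step t")                    # assumed label, not in the original figure
plt.ylabel("normalized learning rate")  # assumed label, not in the original figure
plt.show()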
Parameters¶
- \(\eta_t\): learning rate at step/epoch \(t\)
- \(s(t)\): schedule function (step, cosine, exponential, etc.)
What it means¶
A learning-rate schedule changes the step size over training instead of keeping it constant.
What it's used for¶
- Faster early training and better final convergence.
- Stabilizing large-model optimization.
Key properties¶
- Common schedules: step decay, cosine decay, exponential decay (sketched in plain Python after this list).
- Often combined with a warmup phase that ramps the rate up at the start of training.
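A minimal plain-Python sketch of these schedules; the function names, default constants, and the with_warmup helper are illustrative assumptions, not any library's API:

import math

def step_decay(t, eta0=0.1, drop=0.5, every=30):
    # Multiply the rate by `drop` every `every` steps/epochs.
    return eta0 * (drop ** (t // every))

def exponential_decay(t, eta0=0.1, k=0.01):
    # Smooth decay: eta0 * exp(-k t).
    return eta0 * math.exp(-k * t)

def cosine_decay(t, T, eta0=0.1, eta_min=0.0):
    # Anneal from eta0 down to eta_min over T steps, following half a cosine wave.
    return eta_min + 0.5 * (eta0 - eta_min) * (1 + math.cos(math.pi * t / T))

def with_warmup(schedule, t, warmup=500, **kwargs):
    # Linear warmup to the schedule's starting value, then the schedule itself.
    if t < warmup:
        return schedule(0, **kwargs) * (t + 1) / warmup
    return schedule(t - warmup, **kwargs)

For example, with_warmup(cosine_decay, t, warmup=500, T=10_000) ramps the rate up for 500 steps and then cosine-decays over the remaining horizon.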
Common gotchas¶
- Scheduler step timing (per batch step vs. per epoch) matters: stepping in the wrong unit decays the rate far faster or slower than intended (see the sketch after this list).
- Misconfigured schedules can decay too quickly and stall training.
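As an illustration of the timing gotcha, a PyTorch sketch (the toy model and loss are placeholders): T_max is counted in whatever unit scheduler.step() is called, so the call site must match the intended horizon.

import torch

model = torch.nn.Linear(10, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=100)

for epoch in range(100):
    for _ in range(50):  # 50 batches per epoch
        optimizer.zero_grad()
        loss = model(torch.randn(8, 10)).pow(2).mean()
        loss.backward()
        optimizer.step()
        # Calling scheduler.step() here instead would finish the cosine
        # cycle after just 2 epochs (100 batch steps), decaying ~50x too fast.
    scheduler.step()  # per-epoch stepping matches T_max=100 epochs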
Example¶
Cosine decay starts high and gradually reduces the learning rate toward a small final value.
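One standard parameterization (cosine annealing without restarts) over a horizon of \(T\) steps, decaying from \(\eta_{\max}\) to \(\eta_{\min}\):

\[
\eta_t = \eta_{\min} + \tfrac{1}{2}\left(\eta_{\max} - \eta_{\min}\right)\left(1 + \cos\frac{\pi t}{T}\right)
\]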
How to Compute (Pseudocode)¶
Input: current step/epoch t and a schedule definition s(t)
Output: learning rate eta_t
eta_t <- s(t)
return eta_t
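A direct Python transcription of the pseudocode, assuming the schedule definition is passed in as a callable (a hypothetical interface):

import math

def learning_rate(t, s):
    # Evaluate the schedule s at step/epoch t to obtain eta_t.
    return s(t)

# e.g., the exponential schedule from the plot above: s(t) = exp(-0.6 t)
eta_5 = learning_rate(5, lambda t: math.exp(-0.6 * t))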
Complexity¶
- Time: \(O(1)\) per step for common closed-form schedules (step, cosine, exponential, linear)
- Space: \(O(1)\)
- Assumptions: these bounds cover evaluating the schedule value only; total training cost is dominated by the optimizer and model updates performed over many steps