Skip to content
The loss curve

Gradient descent

Optimization procedure: subtract a small multiple of the gradient from the parameters at each step.