Description:

  • The gradient of a function at a point where is differentiable, denoted with , is a column vector of first derivatives of with respect to :
  • The gradient of an Affine Function is

Geometric Interpretation:

  • The gradient of a Vector Function can be interpreted in the context of the level and sublevel sets.
  • The -level set of a function is the set
    • ie, the contours
    • and the -sublevel set is
  • Geometrically, the gradient of at a point is a vector perpendicular to the contour line of at level , pointing from outwards the -sublevel set
  • That is, the gradient, represents the direction along which the function has the max rate of increase
  • Let be a unit vector and
  • Consider the point . We have for
    • equivalently,
    • Think of it as the gradient projecting on the vector
  • Whenever and is such that then is increasing along the direction , for small ??
  • The inner product measures the rate of variation of at , along direction , and it is usually referred to as the directional derivative of along ???

Minimization via Gradient Descent Method:

  • To solve an optimization problem of the form
    • where is differentiable
  • Start from , iterate the rule
    • where is the step size