If for some reason I take another linear algebra class, I’ll move this file there.
Definition (Hessian Matrix)
Let . Recall that
and that gives the Gradient of . The Hessian Matrix denoted as is defined as or from a linear algebra perspective, . In particular,
describes the Hessian. In particular, we can describe as a matrix in . The Hessian describes the second order mixed partials of a scalar field.
Example 1
Suppose . Then
and
Furthermore, is equal to
and likewise with the other values. Note the order.
Hessian Intuitively
We can think of the Hessian operator as a double-derivative in the same way the Gradient or Jacobian is treated as a single derivative. Recall that the first derivative is no longer a scalar but rather a linear map (i.e. derivative of is , a linear map). The second derivative is instead a bilinear map.
Let be a smooth function. The first-derivative
is a smooth function from to the set of linear maps . From the example above, is a map that is given from the derivative of . We can do the same with Hessian
where the codomain is (trivially) isomorphic to the set of bilinear maps
But this is precisely what represents. It is a matrix operator in . Similar to how double-derivatives model the concavity of a point on some curve/surface, models the concavity of a hypersurface at a point.1
Footnotes
-
See this MSE post. ↩