If for some reason I take another linear algebra class, I’ll move this file there.

Definition (Hessian Matrix)

Let . Recall that

and that gives the Gradient of . The Hessian Matrix denoted as is defined as or from a linear algebra perspective, . In particular,

describes the Hessian. In particular, we can describe as a matrix in . The Hessian describes the second order mixed partials of a scalar field.

Example 1

Suppose . Then

and

Furthermore, is equal to

and likewise with the other values. Note the order.

Hessian Intuitively

We can think of the Hessian operator as a double-derivative in the same way the Gradient or Jacobian is treated as a single derivative. Recall that the first derivative is no longer a scalar but rather a linear map (i.e. derivative of is , a linear map). The second derivative is instead a bilinear map.

Let be a smooth function. The first-derivative

is a smooth function from to the set of linear maps . From the example above, is a map that is given from the derivative of . We can do the same with Hessian

where the codomain is (trivially) isomorphic to the set of bilinear maps

But this is precisely what represents. It is a matrix operator in . Similar to how double-derivatives model the concavity of a point on some curve/surface, models the concavity of a hypersurface at a point.1

Footnotes

  1. See this MSE post.