We wish to show that
p ≤
p +
p for p ≥ 1.
For p = ∞, this is maxi
≤ maxi
≤ maxi
+ maxi
. □
For 1 ≤ p < ∞, first assume without loss that x,y≠0 (the inequality is trivial if either vector is zero). Let α =
p, and
β =
p, from which we define x0 =
x, and y0 =
y. Then:

Since f
=
p is convex for 1 ≤ p < ∞, and noting that
,
≥ 0 and
+
= 1:

Recalling that
pp =
pp = 1:

WLOG But α =
p and β =
p, so:

□
A norm, by definition, satisfies the triangle inequality. Suppose that
is a norm. Then, if λ 
:

Suppose that f :
d →
is twice differentiable. The gradient of f is the vector of partial derivatives:

The Hessian is the matrix of partial second derivatives:

Note that since
=
, this matrix is symmetric. It’s important to realize that, like a derivative
(or gradient), the Hessian is a function of the vector x–it takes on different values on different parts of the
domain.
A word on notation: ▽2 is frequently used for the Laplacian operator ▽2f = ∑
i=1d
, but not in this
class.
An eigenvalue is a scalar λ which satisfies the following:

For some x≠0 (which is called an eigenvector). Note that if the above holds for x, it holds for αx where α≠0 is an arbitrary scalar. That is, the scale of eigenvectors is irrelevant. A n × n symmetric matrix will always have exactly n eigenvalue/eigenvector pairs, though the eigenvalues are not necessarily distinct. Additionally, the eigenvectors may always be taken to be orthogonal.
For small matrices, we may solve for the eigenvalues & eigenvectors explicitly:
![[ 17 - 6 ] [ x1] [ x1 ]
- 6 8 x2 = λ x2
[ ] [ ]
17x1 - 6x2 = λ x1
[ - 6x1 + 8x2] [ x2 ]
(17- λ) x1 = 6x2
(8- λ)x2 6x1](rec134x.png)
Soving for x1 gives that x1 =
x2 and x1 =
x2, so that:

A matrix is positive definite if all of its eigenvalues are strictly positive, and positive semi-definite if they are all nonnegative. Is the above matrix positive definite? (yes)
One simple test for positive (semi-)definiteness is that a matrix A is positive (semi-)definite if xTAx is positive (nonnegative) for all x≠0.
A twice-differentiable function is convex if its Hessian is positive semidefinite everywhere, and is strictly convex if its Hessian is positive definite.
The definitions of negative (semi-) definite and (strictly) concave are the same, with the signs reversed.
Consider f
=
for y > 0. Then:
![[ ∂f ]
▽f = ∂∂xf
[ ∂y ]
2xy-
= - x22
y](rec140x.png)
And:
![[ 2 2 ]
2 ∂∂2fx- ∂∂xf∂y
▽ f = -∂2f- ∂22f
[ ∂y∂x ∂ y ]
2y - 2yx2
= - 2xy2- 2yx32](rec141x.png)
Which we may write as:
![[ 2 ]
▽2f = 23 y - xy2
y [ - xy] x
= 2- y [ y - x ]
y3 - x](rec142x.png)
By inspection, one eigenvector of
is
, with eigenvalue y2 + x2 ≥ 0:
![[ 2 ] [ ] [ ][ ][ ]
y - xy2 y = y y - x y
- xy x - x [- x ] - x
= y (y2 + x2)
- x](rec145x.png)
While the other eigenvalue is zero, since
has rank 1 (eigenvector
). Since
≥ 0 for y > 0, this gives
that:

Showing that f is convex. Is it strictly convex? (no)
Consider f
= 
for x > 0. We may write log f
=
∑
i=1n log xi. By the concavity of the
logarithm:

Exponentiating both sides gives:

Which is the arithmetic-geometric means inequality.
Again consider f
= 
for x > 0. Its gradient is:

And its Hessian is:

Which we may write as:

Consider vT ▽2fv:

Note that we may write ∑
i=1n
as:

And by the Cauchy-Schwarz inequality (
2 ≤
22
22) with a = 1 and bi =
:

So that vT
v ≤ 0, showing that f is concave.