Convexity
Instructor: Taylor Berg-Kirkpatrick. Slides: Sanjoy Dasgupta
Course website: http://cseweb.ucsd.edu/classes/wi19/cse151-b/
Convexity

[Figure: a convex function; the chord joining the points of the graph above a and b lies above the graph between them.]

A function f : R^d → R is convex if for all a, b ∈ R^d and 0 < θ < 1,

    f(θa + (1 − θ)b) ≤ θ f(a) + (1 − θ) f(b).

It is strictly convex if strict inequality holds for all a ≠ b.

f is concave ⇔ −f is convex.
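The defining inequality can be spot-checked numerically. Below is a small sketch (using NumPy; the helper name and the test functions are illustrative choices, not from the slides). Passing such a check does not prove convexity, but a single violation disproves it:

```python
import numpy as np

def is_convex_on_samples(f, d, trials=1000, seed=0):
    """Spot-check f(θa + (1-θ)b) <= θf(a) + (1-θ)f(b) on random points.

    Passing the check is only evidence of convexity;
    failing it is a proof of non-convexity.
    """
    rng = np.random.default_rng(seed)
    for _ in range(trials):
        a, b = rng.normal(size=d), rng.normal(size=d)
        theta = rng.uniform()
        lhs = f(theta * a + (1 - theta) * b)
        rhs = theta * f(a) + (1 - theta) * f(b)
        if lhs > rhs + 1e-9:
            return False
    return True

print(is_convex_on_samples(lambda z: np.dot(z, z), d=3))   # ‖z‖² passes
print(is_convex_on_samples(lambda z: -np.dot(z, z), d=3))  # -‖z‖² (concave) fails
```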
Checking convexity

A function of one variable, f : R → R, is convex if its second derivative is ≥ 0 everywhere.

Example: f(z) = z^2 has f''(z) = 2 ≥ 0 everywhere, so it is convex.
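For one-variable functions this test is easy to approximate numerically with a central finite difference (a sketch; the helper name and sample points are illustrative):

```python
def second_derivative(f, z, h=1e-4):
    """Central finite-difference approximation of f''(z)."""
    return (f(z + h) - 2 * f(z) + f(z - h)) / h**2

f = lambda z: z * z
# f''(z) = 2 at every point, consistent with convexity of z^2
print([round(second_derivative(f, z), 3) for z in (-2.0, 0.0, 3.5)])
```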
First and second derivatives of multivariate functions

For a function f : R^d → R,

• the first derivative is a vector with d entries:

    ∇f(z) = (∂f/∂z_1, …, ∂f/∂z_d)^T

• the second derivative is a d × d matrix, the Hessian H(z), with entries

    H_jk = ∂²f / ∂z_j ∂z_k
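Both objects can be approximated by finite differences. A minimal sketch (NumPy; the helper name and the test point are illustrative), verified here on f(z) = ‖z‖², whose Hessian is 2I:

```python
import numpy as np

def hessian_fd(f, z, h=1e-5):
    """Finite-difference Hessian: H_jk approximates d²f / dz_j dz_k."""
    d = len(z)
    H = np.empty((d, d))
    for j in range(d):
        for k in range(d):
            ej, ek = np.eye(d)[j] * h, np.eye(d)[k] * h
            H[j, k] = (f(z + ej + ek) - f(z + ej - ek)
                       - f(z - ej + ek) + f(z - ej - ek)) / (4 * h * h)
    return H

z = np.array([1.0, -2.0, 0.5])
H = hessian_fd(lambda v: v @ v, z)
print(np.round(H, 3))   # ≈ 2·I for f(z) = ‖z‖²
```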
Example

Find the second derivative matrix of f(z) = ‖z‖².

    f(z) = Σ_{i=1}^d z_i^2

    ∇f(z) = (2z_1, …, 2z_d)^T = 2z

    ∇²f(z) = 2I
Second-derivative test for convexity

A function f : R^d → R is convex if its matrix of second derivatives is positive semidefinite everywhere.

Recall: every square matrix M encodes a quadratic function:

    x ↦ x^T M x = Σ_{i,j=1}^d M_ij x_i x_j

(M is a d × d matrix and x is a vector in R^d.)

Sometimes x^T M x is always ≥ 0, no matter what x you plug in.
A hierarchy of square matrices (each class contains the next):

    Square:                  M ∈ R^{d×d}
    Symmetric:               M = M^T
    Positive semidefinite:   z^T M z ≥ 0 for all z ∈ R^d
    Positive definite:       z^T M z > 0 for all z ≠ 0
A symmetric matrix M is positive semidefinite (psd) if:

    z^T M z ≥ 0 for all vectors z

PSD or not?

• M = [1 1; 1 1]:

    x^T M x = x_1^2 + 2 x_1 x_2 + x_2^2 = (x_1 + x_2)^2 ≥ 0 for all x, so M is PSD.

• M = [1 2; 2 1]:

    x^T M x = x_1^2 + 4 x_1 x_2 + x_2^2 = (x_1 + x_2)^2 + 2 x_1 x_2,

  which is negative at, e.g., x = (1, −1) (it equals −2), so M is not PSD.
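These two examples can also be checked with eigenvalues: a symmetric matrix is PSD exactly when all its eigenvalues are ≥ 0 (a standard fact, not stated on the slides). A NumPy sketch:

```python
import numpy as np

def is_psd(M, tol=1e-10):
    """A symmetric matrix is PSD iff all its eigenvalues are >= 0."""
    return bool(np.all(np.linalg.eigvalsh(M) >= -tol))

A = np.array([[1.0, 1.0], [1.0, 1.0]])   # x^T A x = (x1 + x2)^2
B = np.array([[1.0, 2.0], [2.0, 1.0]])   # x^T B x = (x1 + x2)^2 + 2 x1 x2

print(is_psd(A))     # True
print(is_psd(B))     # False
x = np.array([1.0, -1.0])
print(x @ B @ x)     # -2.0: a witness vector showing B is not PSD
```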
A symmetric matrix M is positive semidefinite (psd) if:

    z^T M z ≥ 0 for all vectors z

When is a diagonal matrix PSD?

    A = diag(a_1, a_2, …, a_n)

    x^T A x = a_1 x_1^2 + a_2 x_2^2 + ⋯ + a_n x_n^2

This is ≥ 0 for every x exactly when all the diagonal entries a_i are ≥ 0.
A symmetric matrix M is positive semidefinite (psd) if:

    z^T M z ≥ 0 for all vectors z

If M, N are of the same size and PSD, must M + N be PSD?

    x^T (M + N) x = x^T M x + x^T N x ≥ 0 + 0 = 0,

so yes: the sum of PSD matrices is PSD.
Checking if a matrix is PSD

A matrix M is PSD if and only if it can be written as M = UU^T for some matrix U.

Quick check: say U ∈ R^{r×d} and M = UU^T.

1. M is square (r × r).
2. M is symmetric: M^T = (UU^T)^T = UU^T = M.
3. Pick any z ∈ R^r. Then

    z^T M z = z^T U U^T z = (z^T U)(U^T z) = (U^T z)^T (U^T z) = ‖U^T z‖^2 ≥ 0.

Another useful fact: any covariance matrix is PSD.
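The quick check is easy to reproduce numerically for a random U (a sketch; the shapes are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(0)
U = rng.normal(size=(4, 6))      # any U works; here r = 4, d = 6
M = U @ U.T                      # M is 4 x 4

print(np.allclose(M, M.T))                         # symmetric
print(bool(np.all(np.linalg.eigvalsh(M) >= -1e-10)))  # PSD
z = rng.normal(size=4)
print(np.isclose(z @ M @ z, np.linalg.norm(U.T @ z) ** 2))  # z^T M z = ‖U^T z‖²
```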
Another useful fact: any covariance matrix is PSD.

    Σ = (1/N) (X − M)^T (X − M)
      = ( (1/√N)(X − M) )^T ( (1/√N)(X − M) ),

where X, M ∈ R^{N×d} and M stacks the mean vector μ in every row:

    M = (μ; …; μ)

So Σ = UU^T with U = (1/√N)(X − M)^T, and it is PSD by the fact above.
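A numerical illustration of this factorization on synthetic data (a sketch; the data and shapes are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(1)
N, d = 100, 3
X = rng.normal(size=(N, d))            # N data points in R^d
Mu = np.tile(X.mean(axis=0), (N, 1))   # each row is the mean vector μ
Sigma = (X - Mu).T @ (X - Mu) / N      # covariance matrix, d x d

# Sigma = U U^T with U = (X - Mu)^T / sqrt(N), hence PSD:
print(bool(np.all(np.linalg.eigvalsh(Sigma) >= -1e-10)))   # True
```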
Second-derivative test for convexity

A function (of several variables) is convex if its second-derivative matrix is positive semidefinite everywhere.

More formally: suppose that for f : R^d → R, the second partial derivatives exist everywhere and are continuous functions of z. Then:

1. H(z) is a symmetric matrix.
2. f is convex ⇔ H(z) is positive semidefinite for all z ∈ R^d.
Example

Is f(x) = ‖x‖^2 convex?

• Recall: ∇²f(x) = 2I.
• This is a diagonal matrix with all positive entries along the diagonal, hence PSD everywhere, so f is convex.
Fix any vector u ∈ R^d. Is this function f : R^d → R convex?

    f(z) = (u · z)^2

    ∇f(z) = 2(u · z) u

    ∇²f(z) = 2 [ u_1^2    u_1 u_2  ⋯  u_1 u_d
                 u_1 u_2  u_2^2    ⋯  u_2 u_d
                   ⋮        ⋮             ⋮
                 u_1 u_d  u_2 u_d  ⋯  u_d^2  ]  = 2uu^T

This has the form UU^T (take U = √2 u as a single column), so the Hessian is PSD everywhere: f is convex.
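Checking this Hessian numerically (a sketch; the choice of u is arbitrary) also shows it is PSD but not positive definite, since 2uu^T has rank 1:

```python
import numpy as np

u = np.array([3.0, -1.0, 2.0])
H = 2 * np.outer(u, u)           # ∇²f for f(z) = (u · z)²

# H = U U^T with U the single column √2·u, so it is PSD:
print(bool(np.all(np.linalg.eigvalsh(H) >= -1e-10)))   # True
print(np.linalg.matrix_rank(H))  # 1: PSD but not positive definite
```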
Least-squares regression

Recall loss function: for data points (x^(i), y^(i)) ∈ R^d × R,

    L(w) = Σ_{i=1}^n (y^(i) − w · x^(i))^2
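Differentiating this loss twice (a standard computation not shown on the slide) gives the Hessian 2 Σ_i x^(i) x^(i)T = 2 X^T X, where the rows of X are the x^(i). That is again of the UU^T form, so the loss is convex; a NumPy sketch with synthetic data:

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(50, 4))   # rows are the data points x^(i)

# Hessian of L(w) = Σ (y^(i) - w·x^(i))² is 2 Σ x^(i) x^(i)T = 2 X^T X,
# i.e. U U^T with U = √2 X^T, hence PSD:
H = 2 * X.T @ X
print(bool(np.all(np.linalg.eigvalsh(H) >= -1e-10)))   # True: the loss is convex
```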
Logistic regression

Recall loss function: for data (x^(i), y^(i)) ∈ R^d × {−1, +1},

    L(w) = Σ_{i=1}^n ln(1 + e^{−y^(i) (w · x^(i))})

We earlier found the first derivative:

    ∂L/∂w_j = − Σ_{i=1}^n y^(i) x_j^(i) · 1 / (1 + e^{y^(i) (w · x^(i))}).
Logistic regression, cont’d

Second derivative: the (j, k) entry of the Hessian H(w) is

    ∂²L/∂w_k ∂w_j = Σ_{i=1}^n x_j^(i) x_k^(i) · 1/(1 + e^{w · x^(i)}) · 1/(1 + e^{−w · x^(i)})

This is u_j · u_k, where vectors u_1, …, u_d ∈ R^n are defined as follows: u_j has ith coordinate

    x_j^(i) · √( 1 / ((1 + e^{w · x^(i)})(1 + e^{−w · x^(i)})) )

Therefore H(w) = UU^T, where U is the matrix with rows u_j. Convex!
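This factorization can be reproduced numerically (a sketch on synthetic data; the shapes and the random w are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(3)
n, d = 40, 3
X = rng.normal(size=(n, d))      # rows are the x^(i)
w = rng.normal(size=d)

s = X @ w                                            # w · x^(i) for each i
scale = 1.0 / ((1 + np.exp(s)) * (1 + np.exp(-s)))   # per-example factor

# U has rows u_j: the (j, i) entry is x_j^(i) * sqrt(scale_i)
U = (X * np.sqrt(scale)[:, None]).T                  # d x n
H = U @ U.T                                          # the Hessian H(w)
print(bool(np.all(np.linalg.eigvalsh(H) >= -1e-10)))  # True: PSD, so L is convex
```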