How to determine whether a function of many variables is convex or non-convex?

Question

A function $f$ is convex if $$f(\theta x + (1 − \theta)y) \leq \theta f(x) + (1 − \theta)f(y)$$ for all $x, y \in \mathcal{D}(f)$, the domain of $f$, and $\theta \in [0, 1]$.

How do I determine whether a function of many variables is convex or non-convex?

If I find the Hessian of my function and they are not all positive, is it a non-convex function?

First i think that the domain must be convex. Second it is depends on your function. If it is only continuous, you have only continuous techniques to prove it. If it is $C^1$ you have more techniques and if it is $C^2$, you can compute the Hessian and verify if it is positive. — Tomás, Feb 02 '13 at 16:28
Convexity makes sense for functions $f:A\rightarrow \mathbb R$ where $A$ is a convex subset of a vectorial space $V$. In particular, $f$ must be real-valued. Otherwise the codomain should be equipped with a partial order relation such that the definition of convexity still makes sense. — AndreasT, Feb 02 '13 at 16:40

score 31 · Accepted Answer · answered Feb 02 '13 at 17:55

31

If the function is twice differentiable and the Hessian is positive semidefinite in the entire domain, then the function is convex. Note that the domain must be assumed to be convex too. If the Hessian has a negative eigenvalue at a point in the interior of the domain, then the function is not convex.

answered Feb 02 '13 at 17:55

Peder

2,182

7

This is brilliant. Would you care to show me a reference for the proof? – Vim Mar 20 '16 at 15:31
2

I added a proof of that as an answer. – Apr 07 '16 at 21:07

score 16 · Answer 2 · 2017-12-21T21:49:54.373

This is a proof that if $f:U \rightarrow \mathbb{R}$, $U$ is an open and convex set of $\mathbb{R}^n$ and f is twice differentiable, then $f''(x)$ positive definite imply $f$ convex.

Let $f''(x)(h²) \geq 0 $ $\forall x \in U, \forall h \in \mathbb{R}^n. $

If $f$ isn't convex, $\exists x_{1},x_{2} \in U , \exists \lambda $ st. $ f((1- \lambda)x_{1} + \lambda x_{2}) > (1-\lambda)f(x_{1}) + \lambda f(x_{2})$

Let $\phi:[0,1] \rightarrow \mathbb{R}, \phi(t) = f((1-t)x_{1} + tx_{2})$

Then $ \phi(\lambda) > (1- \lambda) \phi(0) + \lambda \phi(1)$

Using the Mean Value Theorem, $\exists t_{1}, t_{2}$ with $0<t_{1}<\lambda<t_{2}<1$ st.

$\phi(\lambda) - \phi(0) = \lambda \frac{d}{dt} \phi(t_{1}) $ and $\phi(1)-\phi(\lambda) = (1-\lambda) \frac{d}{dt} \phi(t_{2}) $

Observe that

$\lambda ( \phi(1) - \phi(0)) < \phi(\lambda) - \phi(0) = \lambda \frac{d}{dt} \phi(t_{1})$

and that

$ (1-\lambda) \frac{d}{dt} \phi(t_{2}) = \phi(1) - \phi(\lambda) < (1-\lambda) [ \phi(1) - \phi(0) ]$

Then $ \frac{d}{dt} \phi (t_{2}) < \phi(1) - \phi(0) < \frac{d}{dt} \phi(t_{1}).$

Again, using MVT,

$\exists t_{3} \in (t_{1}, t_{2})$ st. $\frac{d²}{dt²} \phi(t_3) < 0$

This is an absurd, since by hypothesis $\frac{d²}{dt²} \phi(t) \geq 0, \frac{d²}{dt²} \phi(t) = f''((1-t)x_{1} + tx_{2})(x_{2}-x_{1})²$

$\square$

Isn't $\frac{d^2}{dt^2}\phi{(t)}$ equal to $f''((1-t)x_1+tx_2)(x_1-x_2)^2$ instead of $f''(\phi{(t)})(x_1-x_2)^2$? — Rasputin, Jun 21 '17 at 17:58

How to determine whether a function of many variables is convex or non-convex?

2 Answers2