Proving convexity of a function is a local property

Question

Let $I$ be an open interval of $\mathbb R$. Let $\varphi:I\to\mathbb R$ be a continuous function such that $\forall x\in I$, there is an open interval $V(x)\subset I$ containing $x$ such that $\varphi$ is convex on $V(x)$. Is it true that $\varphi$ is convex on $I$? What about if we remove the continuity hypothesis?

It's tempting to say that for any $x\in I$, $\exists y\in I$, $V(x)\cap V(y)\neq\emptyset$ and so $\varphi$ is convex on $V(x)\cup V(y)$ since in this case $V(x)\cup V(y)$ is convex. In that case, we get that $\varphi$ is convex on $\bigcup_{x\in I}V(x)=I$. The problem is I dont know if we can necessarily find such a $y$. Furthermore, we don't use the fact that $\varphi$ is continuous at all, which feels weird. But then on the other hand, I don't really know how to get from the local convexity to convexity without using a similar reasoning. Any help on this question would be greatly appreciated.

Neat question, I think it follows from this:https://math.stackexchange.com/questions/3960180/locally-convex-implies-convex?noredirect=1&lq=1. — Vivaan Daga, Feb 20 '23 at 17:21
We certainly could find some $y$ that works (by taking $y \in V(x)$ where $y \neq x$); the problem is that in order for your line of reasoning to work as written out, it would have to be the case that for all $x, y \in I$, $V(x) \cap V(y) \neq \emptyset$, which is basically what you're trying to prove. — Jonathan Pal, Feb 21 '23 at 23:56
One thought which may or may not be useful: if $I$ were a closed interval, then we could use compactness to conclude that $I$ is the union of a finite number of the $V(x)$'s, and then your idea would work great since each of these $V(x)$'s would have nonempty intersection with at least one other of the $V(x)$'s. But here we don't have that luxury since $I$ is open. — Jonathan Pal, Feb 22 '23 at 00:06
Wouldn't the following argument work: By @JonathanPal s comment $\phi$ is convex on every interval of the form $[a+\epsilon, b-\epsilon]$ where $I = [a,b]$. But every pair $x,y \in I$ is contained in $[a+\epsilon, b-\epsilon]$ for some small $\epsilon$. Hence $\phi(\lambda x +(1-\lambda)y) \leq \lambda \phi(x) + (1-\lambda)\phi(y)$. — Bruno Krams, Feb 22 '23 at 19:18
If you remove the continuity absumption it should be still true because if $\varphi$ is convex on an open set, then it is continuous on this set, so your assumption on $\varphi$ implies continuouty of $\varphi$ if I'm not wrong.
Concerning the main question, maybe is easier to prove the converse: if a continuous function $\varphi : I \to \mathbb{R}$ is not convex, then there is a point $z \in I$ such that for every $\varepsilon>0$ the function $\varphi$ restricted to $(z-\varepsilon,z+\varepsilon)$ is not convex. Anyway, I don't know how to choose $z$ in a smart way. — Motticoz, Feb 22 '23 at 19:24

Ewan Delanoy · Accepted Answer · 2023-02-25T11:13:32.110

I think this is simple enough, and that the continuity of $\phi$ is not needed. Let $c\in(a,b)$. By the hypothesis, there are $c^{-},c^{+}$ with $a \lt c^{-} \lt c \lt c^{+} \lt b$ such that $\phi$ is convex on $[c^{-},c^{+}]$. Now, let

$$ \Omega=\bigg\lbrace d\in [c^{-},b) \ \bigg| \ \phi \ \textrm{is convex on } [c^{-},d] \bigg\rbrace $$

We have $[c^{-},c^{+}] \subseteq \Omega \subseteq [c^{-},b)$, so $\omega=\sup(\Omega)$ is $\leq b$, and further $\Omega=[c^{-},\omega)$.

In fact, we can show that $\omega=b$, by contradiction as follows. Suppose that $\omega \lt b$. By the hypothesis, there are $\omega^{-},\omega^{+}$ with $a \lt \omega^{-} \lt \omega \lt \omega^{+} \lt b$ such that $\phi$ is convex on $J=[\omega^{-},\omega^{+}]$. Then $\frac{\omega^{-}+\omega}{2}\in [c^{-},\omega)=\Omega$, so $\phi$ is convex on $K=[c^{-},\frac{\omega^{-}+\omega}{2}]$. Now the interior of $J\cap K$ is nonempty (it contains a neighborhood of $\omega^{-}$), so by gluing, $\phi$ must be convex on $J\cup K=[c^{-},\omega^{+}]$. But then $\omega^{+}\in \Omega$, contradicting $\omega=\sup(\Omega)$. So $\omega=b$, and $\phi$ is therefore convex on $[c^{-},b)$.

A symmetrical argument shows that phi is convex on $(a,c^{+}]$. By gluing, $\phi$ is convex on $(a,b)$ as wished.

Appendix: Since there is a userin the comments who is skeptical about the gluing property, I add a proof for it :

Theorem(gluing property). Suppose a function $\phi$ is convex on two intervals $J,K$ of $\mathbb R$, and that $J\cap K$ has nonempty interior. Then $\phi$ is convex on $J\cup K$.

Proof. For $x_1\lt x_2\lt x_3$ define

$$ \Delta_{f}(x_1,x_2,x_3)= (x_3-x_2)f(x_1)+(x_2-x_1)f(x_3)-(x_3-x_1)f(x_2) $$

Notice that $f$ is convex an interval $I$ iff $\Delta_{f}(x_1,x_2,x_3)\geq 0$ for any $x_1\lt x_2\lt x_3\in I$.

Now, we have the purely algebraic identity

$$ a\Delta_{f}(x_1,x_2,x_5)= b_1\Delta_{f}(x_1,x_2,x_3)+ b_2\Delta_{f}(x_2,x_3,x_4)+ b_3\Delta_{f}(x_3,x_4,x_5) \tag{1} $$

where $a,b_1,b_2,b_3$ are the nonnegative numbers defined by

$$ \begin{array}{lcl} a &=&(x_4-x_3)(x_3-x_2) \\ b_1&=&(x_4-x_3)(x_5-x_2) \\ b_2&=&(x_2-x_1)(x_5-x_3) \\ b_3&=&(x_2-x_1)(x_3-x_2) \end{array}\tag{2} $$

More generally, for an arbitrary number of points $x_1\lt x_2\lt \ldots \lt x_n$, it can be shown that any $\Delta_f(x_i,x_j,x_k)$ with $i\leq j\leq k$ can be written as a nonnegative linear combination of the $\Delta_f(x_t,x_{t+1},x_{t+2})$ for $i\leq t \leq j-2$ ; this can be shown by induction on $k-i$, starting with the trivial cases when some variables are equal. The formulas in (2) reflect this.

In particular, if the three deltas in the RHS of (1) are nonnegative, so is the $\Delta$ in the LHS.

Let us now show the theorem. Take $x_1 \lt x_2 \lt x_5$ in $J\cup K$ ; our goal is to show that $\Delta_{f}(x_1,x_2,x_5)\geq 0$.

If $x_1,x_2,x_5$ are all in $J$ or all in $K$, we are done by the hypothesis. So we can assume without loss that (say) $x_1,x_2\in J,x_5\in K$. Then we can find $x_3 \lt x_4\in J\cap K$ such that $x_2\lt x_3 \lt x_4 \lt x_5$. Applying (1) finishes the proof.

imho the last term in (1) should be $(x_3-x_2)(x_3-x_1)\Delta_{f}(x_3,x_4,x_5)$ — daw, Feb 24 '23 at 11:20
very neat proof of the gluing property! is there some intuition behind? Also this 'algebraic identity' is not so easy to spot. — daw, Feb 24 '23 at 11:21
@daw : Fixed the last term in (1), thx. While the exact terms of the algebraic identity are perhaps not easy to remember, I find it rather intuitive. I find it easy to remember than $\Delta(x_1,x_3,x_5)$ is a linear combination of the $\Delta(x_{k},x_{k+1},x_{k+2}), 1 \leq k \leq 3$ ; think of two different frogs, with one frog's leap being the double of the other's. — Ewan Delanoy, Feb 24 '23 at 13:06
How do you know that $\Delta_f(x_3,x_4,x_5)\ge0$? $x_3\in J$, $x_5\in K$, only. I think the proof needs to be modified to: Take $x_1<x_2<x_5$ with $x_1,x_2\in J$, $x_5\in K$. Then choose $x_3,x_4\in J\cap K$. This needs an identity to express $\Delta_f(x_1,x_2,x_5)$ in terms of non-negative expressions. As now $\Delta_f(x_1\dots x_3),\Delta_f(x_2\dots x_4),\Delta_f(x_3\dots x_5),$ are non-negative. — daw, Feb 25 '23 at 10:08
@daw There is indeed an expression of $\Delta(x_1,x_2,x_5)$ as a non-negative linear combination of the $\Delta(x_k,x_{k+1},x_{k+2})$, see the revised version. Thx again for that new correction ! By the way, another confirmation that the new proof is correct is that we really use the fact that $J\cap K$ has non-empty interior. — Ewan Delanoy, Feb 25 '23 at 10:58

score 0 · Answer 2 · answered Feb 23 '23 at 10:48

Picking up this material from

Li,Yuan-Chuan;Yeh,Cheh-Chih, Some characterizations of convex functions,Computers & Mathematics with Applications : Volume $59$, Issue $1$, January $2010$, Pages $327-337$

Definition : Let $\Omega \subset X$ be a subset of a normed linear space $X$. A function $f : X \to \mathbb R$ is called locally convex on $\Omega$ if for all $x \in \Omega$ there exists $x \in U_x \subset \Omega$ open (relative to $\Omega$) such that $f$ is convex on $U_x$. That is, for all $y_1,y_2 \in U_x, \lambda \in [0,1]$,$$ f(\lambda y_1 + (1-\lambda)y_2) \leq \lambda f(y_1)+(1-\lambda)f(y_2) $$

Note that we did not insist that $\Omega$ is convex, but the domain of $f$ is $X$ so the definition still makes sense. For a given $y_1,y_2 \in \Omega$, we call the statement above as the convexity condition for $y_1,y_2$. That is, the convexity condition for $y_1,y_2$ is true if the convexity definition holds for all $\lambda \in [0,1]$.

Of course, a function $f : X \to \mathbb R$ is called convex if the convexity condition holds for all $y_1,y_2 \in \Omega$, not merely those restricted to a neighborhood of some point.

Theorem : Every locally convex function on a convex subset $\Omega \subset X$ is also convex.

Proof : We will first prove the statement when $X = \mathbb R$ and $\Omega=I$ is an interval (and hence a convex set).

Start with the following idea : if $I$ can be broken into two convex parts which intersect, then it should be convex as well, so let's prove this.

Claim : If $a,b \in I, a <b$ are such that $f$ is convex is $I_R = I \cap [a,\infty)$ and on $I_L = I \cap (-\infty,b]$ then $f$ is convex on $I$.

Proof of claim : If $a$ or $b$ is an endpoint of $I$ then we are clearly done since either $I_L$ or $I_R$ will equal $I$.

Note that convex functions are automatically continuous on the interior of an interval (this was mentioned in a comment, and can be proved "graphically" by sandwiching the graph of the convex function on a neighborhood of a point between chords of reducing slope, or using the result I proved here as a consequence of left and right derivatives existing at every interior point).

Let $x<y \in I$. If either of $x,y \geq a$ or $x,y \leq b$ are true then either $x,y \in I_R$ or $x,y \in I_L$. Hence, the convexity condition is satisfied for $x,y$. We will then assume that $x<a$ and $y>b$ now.

Now, suppose that $x<z<y$. Either $x<z<a,a<z<b$ or $b<z<y$ must be true : note that $z=a,z=b$ are possible, but these can be handled by continuity of $f$.

We handle $a<z<b$ first.

Note that $x<a<z<b<y$. Thus we can find $r,s \in (0,1)$ such that $a = rx+(1-r)z$ and $z = sa+(1-s)y$. This implies that $$ z = sa+(1-s)y = s(rx+(1-r)z) +(1-s)y = tx+(1-t)y $$ where $t = \frac{sr}{1-s(1-r)}$. As $a,z,y \in I_R$ and $x,a,z \in I_L$, we get by convexity of $f$ in each of these regions, $$ f(z) \leq sf(a) + (1-s)f(y) \leq s(rf(x)+(1-r)f(z)) - (1-s)f(y) $$ From here, taking the $f(z)$ to the other side and making $f(z)$ the subject of the inequation leads to $f(z) \leq tf(x)+(1-t)f(y)$, as desired.

We now handle $b<z<y$. The case $x<z<a$ is similarly handled. Now, $x<a<b<z<y$ so there exist $r,s,t \in (0,1)$ such that $$ z = rb+(1-r)y,b = sa+(1-s)y,a=tx+(1-t)b $$ Now, as an exercise similar to the one in the previous case, one can easily see that $z = tx+(1-t)y$ where $$ t = \frac{rst}{1-s(1-t)} $$ Finally, using convexity for $b<z<y \in I_R, a<b<y \in I_R,x<a<b \in I_L$ and putting those conditions together, it is easy to see that $f(z) \leq tf(x) + (1-t)f(y)$ as desired.

As I mentioned earlier, the third case is clear. The result is now complete. $\blacksquare$

Use this claim to show that if $f : \mathbb I \to \mathbb R$ is locally convex then $f$ is convex, where $I$ is an interval.

Fix $z \in I$ not an endpoint. Let $d := \sup\{y \in [z,b] : f \text{ is convex on } I \cap [z,y]\}$. We claim that $d$ is an endpoint of $I$. Clearly $d \neq z$ by local convexity at the point $z$.

If not, then by local convexity at $d$ we have an $r>0$ such that $f$ is convex on $(d-r,d+r) \subset I$, and because $d$ is the supremum, $f$ is convex on $[z,d)$. Now, let $I' = [z,d+r)$. Then, $f$ is convex on $I'\cap [z,\infty)$ and on $I' \cap (-\infty,d]$ where $d>z$. By the claim, $f$ is convex on $I'$, a contradiction to $d$ being the supremum.

It follows that $d$ is an endpoint of $I$. In other words, we have shown that $f$ is convex on $I \cap [z,+\infty)$. A very similar argument but moving in the other direction shows that for any $z'>z \in I$ which is not an endpoint, $f$ is convex on $I \cap (-\infty,z']$. As $z'>z$, the claim finishes off the proof and $f$ is convex on $I$.

For general normed linear spaces

This follows from the convexity of $\Omega$ and the fact that convexity is essentially a phenomena restricted to functional behavior on one-dimensional line segments.

Let $y_1,y_2 \in \Omega$ be given, and let $f$ be locally convex on $\Omega$. As $\Omega$ is convex, $f$ is also locally convex on every point joining the line segment between $y_1$ and $y_2$, which lies entirely in $\Omega$. Call this line segment $L$.

Let $g(t) = f(ty_1+(1-t)y_2)$ for $t \in [0,1]$. We claim that $g$ is locally convex on $[0,1]$. Indeed, let $t \in [0,1]$ be given. By definition, $z = tx+(1-t)y$ has a neighborhood $U_z \subset \Omega$ in which $f$ is convex. By definition of a neighborhood in a normed linear space, $U_z$ also contains part of the line segment $L$, and corresponding to the values of $t'$ such that $t'y_1+(1-t')y_2$ fall in that part, there is a neighborhood of $t$ in $[0,1]$ over which $g$ is convex. Thus, $g$ is locally convex.

By our previous results, $g$ is convex. This obviously implies that $g(t) \leq tg(0)+(1-t)g(1)$ for any $t \in [0,1]$. If $z = tx+(1-t)y$ then $g(t)=f(z)$ and $tg(0)+(1-t)g(1) = tf(x)+(1-t)f(y)$. Hence, we are done.

Pete Caradonna · Answer 3 · 2023-02-23T20:32:30.397

To prove the desired result directly, I find it easier to glue monotone functions, rather than convex ones. (N.B. removing the continuity hypothesis doesn't do anything, as any convex function is continuous on the interior of its domain, thus $\phi$ is continuous on a neighborhood of every point and hence local convexity implies continuity everywhere.)

Since $\varphi$ is convex on some neighborhood of every point, it is right-differentiable everywhere. Moreover, at any point $\bar{x} \in I$, the function $\partial_+ \varphi$ (i.e. the right derivative of $\varphi$) is monotone increasing on the open neighborhood $V(\bar{x})$. To establish the claim, it then suffices to show that $\partial_+ \varphi$ is monotone on all of $I$.

Proof: Consider $\mathcal{I} = \{\textrm{relatively open intervals $\tilde{I}$ of } I : \partial_+ \varphi \vert_{\tilde{I}} \textrm{ is monotone}\}$, partially ordered by set-inclusion, $\subseteq$. Given any chain $\{I_\lambda\}_{\lambda \in \Lambda}$ of elements of $\mathcal{I}$, clearly $\cup_\lambda I_\lambda \in \mathcal{I}$ is an upper bound for $\{I_\lambda\}_{\lambda \in \Lambda}$, thus by Zorn's lemma, there exist $\subseteq$-maximal elements of $\mathcal{I}$. Suppose, toward a contradiction, that there exists a maximal element $I^* \in \mathcal{I}$, with $I^* \subsetneq I$. Let $x^* \in \textrm{cl}(I^*) \setminus I^*$; without loss let $x^* = \sup I^*$. By hypothesis, $\partial_+ \varphi \vert_{V(x^*)}$ is monotone, thus for all $x \in V(x^*) \cap I^*$, $\partial_+ \varphi(x) \le \partial_+ \varphi(x^*)$, and thus $\partial_+ \varphi(x) \le \partial_+ \varphi(x^*)$ for all $x \in I^*$ by monotonicity on $I^*$. Similarly, for any $y > x^*$, $\partial_+ \varphi(x^*) \le \partial_+ \varphi(y)$, and hence $\partial_+ \varphi$ is monotone on $I^* \cup V(x^*)$. Since without loss we may assume $V(x^*)$ is an open interval, this contradicts the maximality of $I^*$. Thus any maximal element of $\mathcal{I}$ equals $I$ itself, and $\varphi$ is convex, as desired.

To address the specific question about finding, for any $x$, a $y$ such that (i) $V(y) \not \subseteq V(x)$ and (ii) $V(x) \cap V(y)$ is non-empty, pick an arbitrary $x$ and let $\bar{x} \in I$, $\bar{x}> x$ be arbitrary. Then the sets $\{V(y)\}_{y \in [x , \bar{x}]}$ are an open cover for $[x, \bar{x}]$. Since $[x, \bar{x}]$ is compact, there exists a finite set of points $y_1,\ldots, y_N \in [x, \bar{x}]$ such that the $\{V(y_i)\}_{i=1}^N$ cover $[x, \bar{x}]$. Without loss, we can take this collection to be minimal, in the sense that we can throw out any $V(y_i)$ that is contained in any other $V(y_j)$. We can also, without loss, let $y_1 = x$. Then there is some $V(y_i)$ such that $V(y_i) \cap V(x) \neq \varnothing$ and $V(y_i) \not \subseteq V(x)$ or vice versa.

Why is the set $I$ non-empty for you to apply Zorn’s lemma? — Vivaan Daga, Feb 23 '23 at 18:07
@Shinrin-Yoku Do you mean $\mathcal{I}$? Because for any $x \in I$, each open set $V(x)$ contains some open ball about $x$ on which the restriction of $\varphi$ is convex, or equivalently that the restriction of $\partial_+ \varphi$ is non-decreasing. Thus this open ball is an element in $\mathcal{I}$. — Pete Caradonna, Feb 23 '23 at 18:08

score 0 · Answer 4 · answered Dec 10 '23 at 05:03

Let $a<b<c$ points in the domain of the function. We have to show for the two slopes of secants of the graph are in this order

$$\frac{f(b)-f(a)}{b-a}\le \frac{f(c)-f(b)}{c-b}$$

Now since $f$ is locally convex, and because of the compactness of $[a,c]$, there exists a division

$$a = a_0 < a_1 < \ldots < a_k = b< a_{k+1} < \ldots < a_n = c$$ and such that on every $[a_i, a_{i+2}]$ the function $f$ is convex. We get the following inequalities for slopes

$$\frac{f(a_{i+1})-f(a_i)}{a_{i+1}-a_i} \le \frac{f(a_{i+2})-f(a_{i+1})}{a_{i+2}-a_{i+1}}$$

Now the slope $\frac{f(b)-f(a)}{b-a}$ is a convex combination of the slopes $\frac{f(a_{i+1})-f(a_i)}{a_{i+1}-a_i}$ for $0\le i \le k-1$, while $\frac{f(c)-f(b)}{c-b}$ is a convex combination of slopes $\frac{f(a_{i+1})-f(a_i)}{a_{i+1}-a_i}$, with $k\le i \le n-1$. Since the latter slopes are all larger then the other ones, we get the inequality.

Note: one could say $f$ convex means $f^{''}\ge 0$, but then $f$ may not have a second derivative everywhere. However, this is correct if we consider $f''\le 0$ in the distribution sense. Now being $\ge 0$ for a distribution is a local property, hence the conclusion ( so it seems $f''$ is indeed a distribution of order $0$, but may not be of function type, for instance consider $f = \max(x,0)$ ).

Proving convexity of a function is a local property

4 Answers4

Linked