5

I have already learned about what a supremum means from wikipedia and from another answer here. However I am not quite sure what 'supremum over a set of functions' means exactly.

As an example, my book gives an example called "Support function of a set", where it states that the support function "associated with a set C" is:

$$ S_c(x) = \sup \{ x^Ty \ |\ y \in C \} $$

It goes on to say:

"Since $x^Ty$ is an affine function of $x$, so $S_c$ is the pointwise supremum of a family of affine functions, hence convex"

What I am not too clear about, is, what exactly is a 'supremum of a set of functions' mean? Does this mean that in this case, I am simply taking the supremum of all the dot product results fpr any $x$, across all the vectors $y$ in $C$?

Spacey
  • 1,057
  • 1
    Yes. Note that for fixed $x$, the set is just a set of real numbers, so the supremum is just the ordinary one. – Lutz Lehmann Feb 23 '14 at 02:22
  • @LutzL Thanks, so what's the relevance of the word 'support' in this example then? Why is he using it? ('Support function'). What is being 'supported' and by what? – Spacey Feb 23 '14 at 02:24
  • This comes from convex geometry. A supporting line or supporting hyperplane of a convex set $C$ is an affine codimension 1 subspace outside of $C$ that touches $C$ in some point or larger facette. $S_C(x)$ is how much you have to shift the plane ${y:x^Ty=0}$ to be a supporting hyperplane. – Lutz Lehmann Feb 23 '14 at 02:31
  • @LutzL OOHHHH! Ok that makes sense. Then if we shift the plane by $S_c(x)$, we guarantee that the set $C$ is in the half space made by $x = S_c(x)$ right? – Spacey Feb 23 '14 at 02:35
  • Be careful with the dual of the dual, $C\subset {y:x^Ty\le S_C(x)}$, actually the intersection over all $x\ne 0$ is the closure of the convex hull of any $C$. – Lutz Lehmann Feb 23 '14 at 02:38
  • @LutzL Hmm, would you mind consolidating your comments into an answer with an example? Thanks in advance. – Spacey Feb 23 '14 at 02:40

1 Answers1

6

In convex geometry, a supporting line or supporting hyperplane of a convex set $C$ is an affine co-dimension-$1$ subspace $\{y:a^Ty=b\}$ outside of $C$ that touches $C$ in some point or larger facet, i.e., so that $a^TC\le b$ with equality assumed at least once.

From there it is just a twist of thought to compute the constant as $b=\sup a^TC$ and formalize it by giving it a name as a function

$$S_C(x)=\sup\{x^Ty:y\in C\},$$

i.e., $S_C(x)$ is how much you have to shift the plane $\{y:x^Ty=0\}$ to be a supporting hyperplane, thus support function.

Then $C⊂\{y:x^Ty≤S_C(x)\}$. Starting from any even non-convex $C$, the intersection of these supporting half-spaces is the closure of the convex hull of $C$.

As to $S_C$ itself, convex functions are closed under suprema, and since the family of linear functions $x\mapsto x^Ty$, $y\in C$ is a family of convex functions, their pointwise supremum is again convex.

Lutz Lehmann
  • 126,666
  • Thanks LutzL, so just to be clear though, $S_c(x)$ is simply a scalar right? – Spacey Feb 23 '14 at 03:04
  • Yes, $S_C:\mathbb R^n\to\mathbb R\cup{\infty}$. – Lutz Lehmann Feb 23 '14 at 03:06
  • Oh! We are cycling across all $x$'s, (that is, all possible vectors "$a$", and for EACH one, we get a $sup(x^Ty)$, (a scalar), and that defines how much to move the hyperplane by, such that it touches $C$, FOR that particular "$a$". Is that right? – Spacey Feb 23 '14 at 03:09
  • Yes. In the function $S_C$, it is $x$. Note that $S_C$ is positive homogenous, for $t>0$ one has $S_C(tx)=tS_C(x)$, so that one only needs one $x$ per ray from the origin. – Lutz Lehmann Feb 23 '14 at 03:13
  • Right exactly. Thanks it is a great answer! :-) – Spacey Feb 23 '14 at 03:14
  • @LutzL: Could you help me to understand where we can infer your sentence "how much you have to shift the plane ${y:x^Ty=0}$ to be a supporting hyperplane" from ${y:x^Ty=0}$. I am struggling why you build the hyperplane with constant zero. We could have had any hyperplane with any scalar let's say $d$ so ${y:x^Ty=d}$. Indeed, my question is why support function describes the (signed) distances of supporting hyperplanes of A from the origin. –  Nov 08 '18 at 06:41
  • You need some point of reference, the most natural one is $0$. All these hyperplanes with different $d$ are parallel, you need only one as model that can be translated/shifted into all the others. – Lutz Lehmann Nov 08 '18 at 06:52