Problem following proof of Šmulian theorem for separable space

Question

I tried to solve Problem 10 on p. 464 of Brezis to get a proof of part of the Eberlein-Šmulian theorem, precisely the equivalence between compactness and sequential compactness in the weak topology of a Banach space. I think I have managed everything except the first point, which is to prove that:

Let $X$ be a separable Banach space and $A\subseteq X$ be (relatively) weakly compact. Then $A$ is (relatively) weakly sequentially compact.

I basically didn't know where to start, so I googled and found this. I have two problems here though. The proof starts by taking a sequence $a_n$ in a weakly compact set $A$ and a countable subset $H\subseteq X^\ast$, say $H=\{f_i\}_{i\in\mathbb{N}}$, and lining up $f_i(a_j)$ in a table. Then since the sequences are all bounded ($A$ is bounded after all) we conclude that every row $\{f_i(x_j)\}_j$ has a convergent subsequence $\{f_i(a_{n(i,j)}\}_j$. Then it takes the diagonal sequence $f_i(a_{n(i,i)})$ and the corresponding subsequence $a_{n(i,i)}$ and says $f_i(a_{n(j,j)})$ converges for $j\to\infty$ for all $i$. I really cannot see how this is proved.

How do I show that converges?

The second problem is in the following paragraph.

This can be done for every countable $H$. Let us now make $H$ "big" to get closer to weak convergence of $(y_n)$. Let $(x_n)$ be dense in $S_X$ and pick $x_n^*\subset S_X$ such that $x_n^*x_n=1$. Let $H=(x_n^*)$. Then $H$ is total in $X^*$. To see this, let $x\in S_X$ be such that $x_n^*x=0$ for every $n$ and let $x^*\in S_X$. Let $\epsilon>0$ and find some $x_n$ such that $\lVert x-x_n\rVert<\epsilon$. Then, since $\lVert x^*\rVert=1$, $|x^*x|<\epsilon$. Since $\epsilon$ was arbitrary, it follows from the Hahn-Banach separation theorem that $H$ is total.

I understand practically everything (though I had to look up "total" to find it means that if all functionals in it evalue a point to 0 then the point is 0) except how it deduces $|x^\ast(x)|<\epsilon$.

How do I go about showing that?

I thought of adding and subtracting some terms. $x^\ast(x)=x^\ast(x-x_n)+x^\ast(x_n)$, put absolute values, triangular inequality, $|x^\ast(x-x_n)|\leq\|x^\ast\|\|x-x_n\|=1\cdot\epsilon$, but then I'd need the other bit to be 0, how do I know that? $x^\ast(x)=(x^\ast-x^\ast_n)(x)+x^\ast_n(x)$, but I know nothing of the first term, and the second one is zero. If I combine the two, i.e. $x^\ast(x)=x^\ast(x-x_n)+(x^\ast-x^\ast_n)(x_n)+x^\ast_n(x_n)$, the first is less than $\epsilon$, the third is 1, and the other one? I don't know. So how do I prove this can be estimated by $\epsilon$?

Update

Over here is a slightly different but totally understandable way of constructing the desired total set, so the second part of the question is no longer a problem. I am, however, still left with the first part. @Qiyu Wen noted that we have metrizability of bounded sets in $X^\ast$ under the weak-* topology by theorem 3.16 of Rudin's Functional Analysis. But I need to prove compactness in the weak topology of $X$ implies sequential compactness in that same topology, nothing to do with $X^\ast$. Maybe I can try to see things up in $X^{\ast\ast}$, where the weak-* topology is metrizable, and if a weak closure in $X$ embeds to be weak-* closed in $X^{\ast\ast}$ then I'll be done. But can I prove that $\overline{A}^{\sigma(X^{\ast\ast},X^\ast)}\subseteq\overline{A}^{\sigma(X,X^\ast)}$, where $A$ and its weak closure are being identified to their images via the canonical embedding $J$? And in any case, I'm still curious about why that thing converges, i.e. why the argument given in that pdf works.

I think you might want to check Chapter 3 of Rudin's Functional Analysis. He argued that separability leads to metrizability, hence the equivalence of compactness and sequential compactness. — Ningxin, Apr 24 '16 at 18:17
Wait @Qiyu. I know separability of the dual leads to metrizability of the unit ball of the base space in the weak topology and vice versa. So separability of $X$ in general does not imply $B_X$ is weakly metrizable. Otherwise this would be so easy :). — MickG, Apr 24 '16 at 18:43
It doesn't hold for $X^$, but it holds for all compact subsets of $X^$. It's either his 3.16 or 3.17, following the Banach-Alaoglu theorem. — Ningxin, Apr 24 '16 at 18:50
The paragraph that caused your second problem is nonsense. To see that $H$ is total, pick $x\in S_X$. Choose an $m$ such that $\lVert x-x_m\rVert < \frac{1}{2}$. Then $$\lvert x_m^{\ast}(x)\rvert = \lvert x_m^{\ast}(x_m) + x_m^{\ast}(x-x_m)\rvert \geqslant \lvert x_m^{\ast}(x_m)\rvert - \lvert x_m^{\ast}(x-x_m)\rvert > 1-\frac{1}{2},$$ whence $x_m^{\ast}(x)\neq 0$. So $(\forall n)(x_n^{\ast}(y) = 0) \implies y = 0$. — Daniel Fischer, Apr 24 '16 at 21:41
I've added a proof [or almost two] to my answer. One uses a quite different argument (and is much shorter), the other is a more detailed version of the argument in Nygaard's paper. You might find either of them interesting or illuminating. — Daniel Fischer, Apr 26 '16 at 14:54

Daniel Fischer · Accepted Answer · 2016-04-26T14:49:38.333

Presumably the author assumed that all readers are familiar with diagonal sequence constructions and therefore just gave a broad outline, skipping some important details of the procedure.

You don't choose the subsequences independently. One uses the diagonal sequence construction to make sure that the selected subsequence (the diagonal sequence) is a subsequence - except for finitely many terms at the start - of all of the selected subsequences. To ensure that, every subsequence $(a_{n(i+1,j)})_{j\in \mathbb{N}}$ must be a subsequence of $(a_{n(i,j)})_{j\in \mathbb{N}}$.

So you start by choosing a subsequence $(a_{n(0,j)})_{j\in \mathbb{N}}$ of $(a_j)_{j\in \mathbb{N}}$ such that the sequence $\bigl(f_0(a_{n(0,j)})\bigr)_{j\in \mathbb{N}}$ converges. Then you choose a subsequence $(a_{n(1,j)})_{j\in \mathbb{N}}$ of $(a_{n(0,j)})_{j\in \mathbb{N}}$ such that the sequence $\bigl(f_1(a_{n(1,j)})\bigr)_{j\in \mathbb{N}}$ converges. Since it is a subsequence of $(a_{n(0,j)})$, the sequence $\bigl(f_0(a_{n(1,j)}\bigr)$ is also convergent. One continues in that way, if the subsequences $(a_{n(i,j)})_{j\in\mathbb{N}}$ for $i \leqslant k$ have been selected, in the next step we choose a subsequence $(a_{n(k+1,j)})_{j\in \mathbb{N}}$ of $(a_{n(k,j)})_{j\in \mathbb{N}}$ such that $\bigl(f_{k+1}(a_{n(k+1,j)})\bigr)_{j\in \mathbb{N}}$ converges. By construction, since $(a_{n(k+1,j)})$ is a subsequence of $(a_{n(i,j)})$ for $i \leqslant k$, it follows that $\bigl(f_i(a_{n(k+1,j)}\bigr)$ converges for all $i \leqslant k+1$.

Then the diagonal sequence - given by $b_j = a_{n(j,j)}$ - is a subsequence of $(a_j)$, and, importantly, for each $i\in \mathbb{N}$ the tail $(b_j)_{j \geqslant i}$ is a subsequence of $(a_{n(i,j)})_{j\in \mathbb{N}}$. From that fact, we obtain that the sequence $\bigl(f_i(b_j)\bigr)_{j\in\mathbb{N}}$ converges for every $i$.

Let's have a complete proof (not completely complete, we're not proving all tools we use) of Šmulian's theorem here:

Definition: Let $E$ be a topological vector space and $E'$ its (topological) dual. A set $S\subset E'$ is called separating if $\bigcap\limits_{\lambda\in S} \ker \lambda = \{0\}$.

Your reference calls such sets total, but since there is another notion of totality in topological vector spaces, namely that the set spans a dense subspace, I prefer to use a different term. These two notions aren't unrelated, however. If $E$ is such that $E'$ is separating (e.g. if $E$ is a Hausdorff locally convex space), then a subset of $E'$ is separating if and only if its span is $\sigma(E',E)$-dense, i.e. if and only if it is $\sigma(E',E)$-total.

Definition: We say that a topological vector space $E$ has a countable separation, if there is a countable separating subset of $E'$.

(I don't like the term "countable separation", but it's the best I could come up with. Mathematics is easier than computer science in so far as we don't have to deal with cache invalidation, but naming things and off-by-one-errors are still hard.) We note in passing that a TVS having a countable separation is necessarily Hausdorff.

Lemma: Let $(X,\tau)$ be a quasicompact space. If $\tau' \subset \tau$ is a Hausdorff topology, then $\tau' = \tau$.

Proof: The identity $\operatorname{id} \colon (X,\tau) \to (X,\tau')$ is a continuous (since $\tau'\subset\tau$) and closed - if $F$ is $\tau$-closed, it is $\tau$-quasicompact, hence $F$ is $\tau'$-quasicompact, and since $\tau'$ is Hausdorff, $\tau'$-closed - bijection, i.e. a homeomorphism.

The main part of the proof of Šmulian's theorem is

Theorem: Let $E$ be a TVS having a countable separation. Then every (relatively) weakly compact subset of $E$ is (relatively) weakly sequentially compact.

We give two proofs of this theorem, one more topological, and a slightly modified version of the proof in your reference.

First proof: Let $S\subset E'$ be a countable separating set. For $\lambda \in E'$, we let $p_\lambda \colon x \mapsto \lvert \lambda(x)\rvert$. This is a $\sigma(E,E')$-continuous seminorm on $E$, and the countable family $\{p_{\lambda} : \lambda \in S\}$ defines a first-countable locally convex topology $\tau_S$ on $E$. Since $S$ is separating, $\tau_S$ is a Hausdorff topology, hence metrisable. Since all of the seminorms generating $\tau_S$ are $\sigma(E,E')$-continuous, it follows that $\tau_S \subset \sigma(E,E')$. In fact, $\tau_S = \sigma(E, \operatorname{span} S)$, so the inclusion is strict, unless $S$ spans $E'$ (which never happens if $E$ is an infinite-dimensional normed space).

If $A\subset E$ is weakly compact, by the lemma we have $\tau_S\lvert_A = \sigma(E,E')\lvert_A$, hence $\sigma(E,E')\lvert_A$ is metrisable. Metrisable compact spaces are sequentially compact.

Second proof: This is, if I understood correctly, essentially the proof in Dunford-Schwartz. I only include it because I think the version in Nygaard's paper that you linked to is not detailed enough (and hence somewhat unclear) for the beginner. In principle, as a more topologically-minded person, I find the first proof much more instructive.

Let $\Lambda = \{ \lambda_n : n \in \mathbb{N}\}$ be a countable separating subset of $E'$, and $A\subset E$ weakly compact. Let $(a_n)_{n\in \mathbb{N}}$ be a sequence in $A$ (if $A = \varnothing$, there is nothing to prove). For every $k\in\mathbb{N}$, $\lambda_k(A)$ is a compact subset of $\mathbb{C}$ (or $\mathbb{R}$), so from every sequence in $\lambda_k(A)$ we can extract a convergent subsequence. We begin with the sequence $\bigl(\lambda_0(a_n)\bigr)_{n\in\mathbb{N}}$ and find a strictly increasing map $\sigma_0 \colon \mathbb{N}\to \mathbb{N}$ such that the sequence $\bigl(\lambda_0(a_{\sigma_0(n)})\bigr)_{n\in\mathbb{N}}$ is convergent. We let $k(0,n) := \sigma_0(n)$ for $n\in \mathbb{N}$. Having found $k(j,\,\cdot\,)$ for $0 \leqslant j \leqslant i$, we consider the sequence $\bigl(\lambda_{i+1}(a_{k(i,n)})\bigr)_{n\in\mathbb{N}}$ and by the (sequential) compactness of $\lambda_{i+1}(A)$ we find a strictly increasing $\sigma_{i+1} \colon \mathbb{N} \to \mathbb{N}$ such that $\bigl(\lambda_{i+1}(a_{k(i,\sigma_{i+1}(n))})\bigr)_{n\in \mathbb{N}}$ converges. We set $k(i+1,n) := k(i,\sigma_{i+1}(n))$. Continuing in this way, we obtain a map $k \colon \mathbb{N}\times \mathbb{N} \to \mathbb{N}$ such that for every $i\in \mathbb{N}$ the sequence $s_i \colon n \mapsto k(i,n)$ is strictly increasing, and $s_{i+1}$ is a subsequence of $s_i$ such that $\bigl(\lambda_{i+1}(a_{s_{i+1}(n)})\bigr)_{n \in \mathbb{N}}$ converges. Taking the diagonal sequence, we find a subsequence $(b_n)_{n \in \mathbb{N}}$ of $(a_n)_{n \in \mathbb{N}}$, namely $b_n = a_{k(n,n)}$, such that $\bigl(\lambda_k(b_n)\bigr)_{n\in \mathbb{N}}$ converges for every $k \in \mathbb{N}$.

We assert that the sequence $(b_n)$ is weakly convergent. For $m \in \mathbb{N}$, let $F_m = \overline{\{ b_n : n \geqslant m\}}$, where the closure is taken with respect to $\sigma(E,E')$. Then $(F_m)$ is a nested sequence of nonempty (weakly) compact sets, so $F := \bigcap\limits_{m\in \mathbb{N}} F_m \neq \varnothing$. Let $y\in F$. We claim that $\lambda_k(y) = \lim\limits_{n\to \infty} \lambda_k(b_n)$ for every $k\in \mathbb{N}$. For that, fix arbitrary $k \in \mathbb{N}$ and $\varepsilon > 0$. Since $\bigl(\lambda_k(b_n)\bigr)$ is convergent, we can find an $m\in\mathbb{N}$ such that $\lvert \lambda_k(b_n) - \lambda_k(b_r)\rvert \leqslant \varepsilon$ forall $n,r \geqslant m$. Since $y \in F_m$, we can find an $n \geqslant m$ with $\lvert \lambda_k(b_n) - \lambda_k(y)\rvert \leqslant \varepsilon$. It follows that $\bigl\lvert \lambda_k(y) - \lim\limits_{n\to\infty} \lambda_k(b_n)\bigr\rvert \leqslant 2\varepsilon$. The claim follows because $k$ and $\varepsilon$ were arbitrary. Since $\Lambda$ is separating, we further conclude that $F = \{y\}$. Now let $\lambda \in E'$. Assume for the sake of contradiction that $\bigl(\lambda(b_n)\bigr)$ does not converge to $\lambda(y)$. Then we can extract a subsequence $(c_n)$ such that $\lvert \lambda(c_n) - \lambda(y)\rvert \geqslant \delta$ for all $n$ and some $\delta > 0$. By extracting a further subsequence, we can assume that $\bigl(\lambda(c_n)\bigr)$ converges. But since $(c_n)$ is a subsequence of $(b_n)$, we have $\{c_n : n \geqslant m\} \subset \{b_n : n \geqslant m\}$ for all $m\in \mathbb{N}$, and hence

$$\varnothing \neq \bigcap_{m \in \mathbb{N}} \overline{\{ c_n : n \geqslant m\}} \subset \bigcap_{m \in \mathbb{N}} F_m = \{y\},$$

and the argument above shows that $\lim\limits_{n\to\infty} \lambda(c_n) = \lambda(y)$, contradicting the assumption. Thus indeed $(b_n)$ converges weakly to $y$.

To complete the proof of Šmulian's theorem, we now need the

Proposition: Let $X$ be a separable Banach (or normed, completeness is irrelevant) space. Then $X$ has a countable separation.

Proof: Let $\{x_n : n \in \mathbb{N}\}$ be a countable dense subset of the unit sphere of $X$. For each $n$, choose $\lambda_n$ in the unit sphere of $X'$ with $\lambda_n(x_n) = 1$. Such a $\lambda_n$ exists by the Hahn-Banach theorem. Then $\{\lambda_n : n \in \mathbb{N}\}$ is separating. For if $\lVert x\rVert = 1$, choose $n$ so that $\lVert x_n - x\rVert < \frac{1}{2}$. Then

$$\lvert \lambda_n(x)\rvert = \lvert \lambda_n(x_n) - \lambda_n(x_n - x)\rvert \geqslant \lvert \lambda_n(x_n)\rvert - \lvert \lambda_n(x_n - x)\rvert \geqslant 1 - \lVert\lambda_n\rVert\cdot \lVert x_n - x\rVert > \frac{1}{2},$$

so $\lambda_n(x) \neq 0$.

Finally, we come to the

Theorem (Šmulian): Let $X$ be a Banach (or normed, completeness is unimportant here) space. Then every (relatively) weakly compact subset of $X$ is (relatively) weakly sequentially compact.

Proof: Let $A \subset X$ be weakly compact and $(a_n)_{n\in \mathbb{N}}$ a sequence in $A$ (again, for $A = \varnothing$ there is nothing to prove). Let $Y := \overline{\operatorname{span} \{ a_n : n \in \mathbb{N}\}}$. Then $Y$ is a separable closed subspace of $X$. Since subspaces are convex, $Y$ is weakly closed, and hence $B = A \cap Y$ is $\sigma(X,X')$-compact. By Hahn-Banach, $\sigma(X,X')\lvert_Y = \sigma(Y,Y')$, so by the proposition and the previous theorem, $B$ is $\sigma(Y,Y')$-sequentially compact. We can therefore extract a subsequence $(b_n)$ of $(a_n)$ such that $$b_n \xrightarrow{\sigma(Y,Y')} b \in B.$$ Since $\sigma(X,X')\lvert_Y = \sigma(Y,Y')$, we have $b_n \xrightarrow{\sigma(X,X')} b$, and the proof is complete.

No wonder. I had to turn off preview, since it took a minute to type every word. (And now I need to proofread. Before I turned off mathjax, it seriously screwed around with what I was typing when I corrected something. Entire phrases mysteriously went awol.) — Daniel Fischer, Apr 26 '16 at 14:39
When you say "quasicompact", do you mean every open cover has a finite subcover but the space is not necessarily Hausdorff? — MickG, Apr 26 '16 at 16:56
Why is $\tau_S|_A$ or $\sigma(E,E')|_A$ metrizable? Series of seminorms divided by $\frac{1}{2^n}$ as metric (this)? — MickG, Apr 26 '16 at 17:02
Right, @MickG, I've been brought up on Bourbaki's terminology, so "compact = quasicompact + Hausdorff", and quasicompact is defined as "every open cover has a finite subcover". It's not only $\tau_S\lvert_A$ that is metrisable, $\tau_S$ itself is metrisable, for example by the construction in your link, or with $d(x,y) = \sup \bigl{ 2^{-n}\cdot \min { p_n(x-y), 1} : n \in \mathbb{N}\bigr}$. Every first countable Hausdorff vector space topology is metrisable (cf. e.g. theorem 1.24 in Rudin's FA), in particular $\tau_S$ for a countable separating $S$. — Daniel Fischer, Apr 26 '16 at 17:19
Topological subspaces of metrisable spaces are again metrisable, so $\tau_S\lvert_A$ is metrisable for every $A\subset E$. If $A$ is weakly compact, then $\tau_S\lvert_A = \sigma(E,E')\lvert_A$, so every weakly compact subset is metrisable in the weak topology (under the assumption that a countable separating subset of $E'$ exists). — Daniel Fischer, Apr 26 '16 at 17:21

Problem following proof of Šmulian theorem for separable space

1 Answers1

Linked