We give an approach using the extreme value theorem. The following definition is a key ingredient in the proof: $\newcommand{\diam}{\operatorname{diam}}$
For each $x \in X$, define $h(x) \in \mathbb R^{\geqslant 0}$ to be the infimum of $\diam S$ over all $S \subseteq X$ satisfying the following conditions:
- $x \in S$, and
- $S \nsubseteq U$ for any $U \in \mathcal A$.
We now study the map $h: X \to \mathbb R^{\geqslant 0}$ as defined above. Note first that $h(x) > 0$ for every $x \in X$. [Proof is left as exercise.]
Lipschitzness. The main technical idea is to show that $h$ is $1$-Lipschitz. Fix any $x, y \in X$; we want to show that $h(y) \geqslant h(x) - d(x,y)$. Further fix an arbitrary $\varepsilon > 0$. By the definition of $h(y)$, there exists $T \subseteq X$ such that
- $y \in T$;
- $T$ is not contained in any $U \in \mathcal A$;
- $\diam T \leqslant h(y) + \varepsilon$.
Now, consider $S = T \cup \{ x \}$. Clearly,
- $x \in S$;
- $S$ is not contained in any $U \in \mathcal A$ (why?);
- $\diam S \stackrel{\color{Red}{(!!)}}{\leqslant} \diam T + d(x,y) \leqslant h(y) + \varepsilon + d(x,y)$. [Exercise: Justify the inequality marked $\color{Red}{(!!)}$.]
Therefore, by definition of $h(x)$, we can see that $h(x) \leqslant h(y) + d(x,y) + \varepsilon$. Since this is true for all $\varepsilon > 0$, it follows that $h(x) \leqslant h(y) + d(x,y)$.
Wrap-up of the proof. Since $h$ is Lipschitz, it is also continuous on $X$. Furthermore, being a continuous function over a compact set, $h$ is guaranteed (by the extreme value theorem) to attains its minimum over $X$, and this minimum is strictly positive. Let $\delta > 0$ be any number that is strictly smaller than $h(x)$ for all $x \in X$. It only remains to check that such a $\delta$ satisfies the requirements of the problem. I leave that as a simple exercise.