The problem comes from the definition of $\zeta$.
The authors define $\zeta$ to be exactly the "missing term" so that (2) equals (3), that is,
$$
\zeta = \mathbb{E}\left[(1-\beta_2)\sum_{i=1}^t \beta_2^{t-i}g_i^2\right] - \mathbb E\left[g_t^2\right](1-\beta_2)\sum_{i=1}^t \beta_2^{t-i}.
$$
Once that is said, the equality is straightforward to obtain. The authors then explain that one may expect $\zeta$ to be small in general. In particular, if for all $i\in\{1,\ldots,t\}$, $\mathbb E[g_i^2] = \mathbb E[g_t^2]$ (i.e., the second moment does not depend on $i$), then one can see from the definition of $\zeta$ that $\zeta=0$.