First, in the general case, there may be several distinct criteria for an object to have an interesting property. The different criteria may each correctly identify some of the objects having that property, but be mute on the other objects having that property. This leads to summary theorems of the form "If any of the following conditions hold: (1) ... (2) ... ... (13) ... Then (object) has (property)." So it is entirely reasonable to have several if-then criteria for an object to have a property.
This is not what is happening here.
Here, you have if-and-only-if criteria. (All definitions are if-and-only-if, even though the mathematical style is to write them as if-thens. For instance, from https://en.wikipedia.org/wiki/Continuous_function , "a function is continuous if arbitrarily small changes in its output can be assured by restricting to sufficiently small changes in its input". Strictly interpreting this sentence, this is "if arbitrarily small changes in a function's, $f$'s, output can be assured by restricting to sufficiently small changes in its input, then $f$ is continuous" and only gives an inference "... $\implies$ $f$ is continuous". In contraposition, it also gives an inference "$f$ not continuous $\implies$ ...". Notice that, strictly, there is no inference given of the form "$f$ is continuous $\implies$ ...". This obvious absurdity is removed when we establish that definitions written as implications are actually biimplications.)
In general, a text takes one definition as "the definition", then proves equivalence between that definition and other characterizations of the property under study.
This could be what is happening here, but it isn't. So what is happening here?
Here, you have the same criterion written in apparently different ways. Can we show this equivalence?
Let $\varepsilon > 0$ and let $\delta$ be chosen so that any partition, $P$, of $[a,b]$ satisfies
$$ \mu(P) < \delta \implies |S_P - I| < \varepsilon/2 \text{.} $$
One detail that has been omitted is the choice of $c_k$ in each interval of $P$. In the definition we have been reciting through so far, the choice of the $c_k$ is universally quantified. For all choices of the $c_k$, $S_P$ and $I$ are close in value. This means the inequality holds if we make specific choices.
Suppose $c_k$ is chosen to give the minimum (or, if $f$ is so wild it doesn't have a minimum, infimal) value of $f$ in each interval of $P$. Then $S_P = L(f,P)$. Choosing instead the maximal (or supremal) points, $S_P = U(f,P)$. Our inequality promises
$$ |L(f,P) - I| < \varepsilon/2 $$
and
$$ |U(f,P) - I| < \varepsilon/2 $$
giving
$$ |U(f,P) - L(f,P)| = U(f,P) - L(f,P) < \varepsilon \text{.} $$
So the first definition is saying pretty much the same thing as the second: "any choice of the $c_k$" gives small difference between the Riemann sum and $I$" implies "the most extreme difference between choices of $c_k$, the upper sum choice of the $c_k$ and the lower sum choice of the $c_k$, yields small differences in the Riemann sum (because both sums are close to $I$)".
Going in the reverse direction: we have that $L(f,P_\varepsilon) \leq S_{P_\varepsilon} \leq U(f, P_\varepsilon)$ for any choice of the $c_k$, and we apply the squeeze theorem (as $\varepsilon \rightarrow 0$) to show that all three of these (or actually $L(...)$, $U(...)$, and all infinitely many $S_P$ as we range over the possible choices of the $c_k$) converge to the same value, $I$.
One might ask why any particular criterion is "natural" (i.e., is one we might choose as the definition), since once we have all these equivalences, any of them could be the definition. In the cases you cite, the definitions are modeled on two different ways to approach convergence. The first definition is modeled on the definition of convergence of a sequence in the limit, so can be taught very early during rigorous analysis. The second definition is modeled on an alternative definition of convergence of a sequence: that the limit infimum and limit supremum agree. (One nice property of these two limits is that they always exist, so we don't have to say "... both exist and agree".) Why do we have both ideas?
Convergence of a sequence requires a sequence, a collection indexed by a countable set. So, in some sense, the first definition is only useful if you are interested in convergence of a small (ordered) set of values. Limits infimum and supremum can be defined for uncountably large sets, so are useful in more settings that the first definition.