Originally, people realized that the set $S_X$ of all the bijections on a given set $X$, endowed with map composition as "combining rule", fulfils the properties of closure, associativity, existence of the identity map, existence of the inverse map (one for each bijection). Later on, someone thought: what if we replace the set $S_X$ by any set $G$, endowed with a "combining rule" among its elements (so that the closure property is natively embedded in this abstractization process), such that associativity and indentity/inverses existence are ensured by axioms? We get so the definition of "abstract group", which is then the result of a typical mimicking process. Of course, being the prototypical one, $S_X$ is also an abstract group. The next step was realizing that two abstract groups are algebraically the same when an isomorphism exists between them. Well, Cayley theorem proves that $S_G$ contains an isomorphic copy of any abstract group you can build up with the elements of the set $G$.
Therefore, in a sense, Cayley theorem closes the hystorical loop: "group of permutations" $\rightarrow$ "abstract group" $\rightarrow$ "group of permutations".