Very often the first proof of a result which appears in the literature is extremely messy because the mathematician who proved it is working at the very edge of what is possible with the tools of the day; then it gets simplified over time as other mathematicians better understand what is going on and develop better machinery for streamlining the proofs. These first proofs are typically not presented to students because they are terrible, but the disadvantage of not knowing them is that you don't see how valuable the machinery that streamlines the modern proofs is.
There are many examples of this sort of thing, some of which you can find at this MO question; here's one that I came across while writing a blog post about the Sylow theorems. It is about
Cauchy's theorem: if a finite group $G$ has the property that its order $|G|$ is divisible by a prime $p$, then $G$ has an element of order $p$.
There is an extremely slick proof of this theorem which comes from consider the set of solutions to the equation
$$\{ (g_1, \dots g_p) \in G^p : g_1 g_2 \dots g_p = e \}$$
and then considering the action of the cyclic group by rotation $(g_1, g_2 \dots g_{p-1}, g_p) \mapsto (g_2, g_3, \dots g_p, g_1)$, which you can see in the link. It takes maybe three sentences to give.
By contrast, Cauchy's original proof took 9 pages. He does it by explicitly constructing the Sylow $p$-subgroups of the symmetric group, then (I believe Cauchy was working at a time when "finite group" always meant "finite group of permutations" so for him all finite groups were already embedded into symmetric groups) using a clever counting argument to show that if a finite group $G$ has the property that $p \mid |G|$ and also embeds into another finite group which has Sylow $p$-subgroups, then $G$ has an element of order $p$; you can see the details in the link. I give a very abbreviated sketch of the proof; the full construction of the Sylow $p$-subgroups of the symmetric group is very tedious (I have never seen anyone give it in full, and tried doing it in a follow-up blog post but gave up because it was too tedious).
This is a good example of what I mean; Cauchy was working at a very early time in group theory before anyone had even defined an abstract group, and people just didn't understand group theory that well yet. There was not even the notion of a quotient group at the time. Once group theory was better understood better proofs were possible. Actually I have no idea who the above slick proof of Cauchy's theorem is due to nor how many decades it took after Cauchy's original proof for someone to find it.
Cauchy's original proof does have the advantage that it is much closer to being a proof of the first Sylow theorem. It has a generalization due to Frobenius which shows that if a finite group $G$ embeds into a finite group $H$ which has a Sylow $p$-subgroup, then $G$ must have a Sylow $p$-subgroup. And then you can prove Sylow I by exhibiting the Sylow $p$-subgroups of the symmetric groups, or somewhat more easily, the general linear groups $GL_n(\mathbb{F}_p)$, then invoking Cayley's theorem.