I think the most naturally visual interpretation of Fourier series is in a complex setting. The most basic example would be the vanishing of the integral of integer-scaled complex exponentials:
$$\int_{-\pi}^\pi e^{inx}dx=\oint_{\gamma}zdz=\delta_n.$$
The reason this is zero is simple: it traces out the unit circle in the complex plane ($n$ times), having symmetry in both reflection and rotation and thus must evaluate to the origin, geometrically speaking. When we create a linear combination of these functions, we can pinpoint the coefficient of $e^{inx}$ by multipling by its complex conjugate and then integrating (and normalizing): if we regard the function as a system of "clockwork," this process is calibrated to merely "delay" all the "pieces" of the system by the equivalent of what the $e^{inx}$ "gear" contributes to the clock; all of the other pieces will still be turning around and thus will evaluate to zero by the exact same considerations of symmetry, while the actual $e^{inx}$ gear will come to a standstill and thus we measure its coefficient.
This works because the "clockwork system" is a formal linear combination of "gears," and the act of integration is also linear and thus can distinguish between "running" versus "stopped" gears.
To bring this understanding down to the real setting, we project down by intermittently making the coefficients of the $e^{inx}$ gear and its conjugate $e^{-inx}$ the same. Conjugation reverses the direction of rotation, and when we add the effect of two gears rotating in opposite directions, their imaginary parts cancel out and we are left with a "real" gear: one that is a sinusoidal wave on $[-1,1]$ (after we average, anyway). This type of gear always decomposes into complex gears that are mirror images of each other, so there is the same visual picture at work as before, but it has "collapsed" down, like a fold-up tent into existing purely on the real line.
That works for $\cos (nx)$, but $\sin(nx)$ is created a little differently: one complex exponential minus another. Negation has the effect of reflection on a gear, so geometrically $e^{inx}-e^{-inx}$ will be a gear plus its "mirror image" again, but this time the "mirror" is the imaginary axis instead of the real axis. When we divide out by $2i$ we turn it onto the real line. Going from a complex Fourier series to a trig one is effectively decomposing its gears into real and imaginary sinusoids, then situating these sinusoids onto the same subspace of $\mathbb{C}$ by rotating the imaginary one onto it (and compensating by rotating the coefficients of these waves in the opposite direction, so that we represent the same function throughout). Going from the cosine-sine expansion to the complex one is therefore the exact reverse of this process: sieving out waves and putting up their corresponding tents.
Finally, how is it that we glean the coefficients of the sinusoids the same way we do it with complex exponentials? The explanation is simple: linearity. Obtaining the coefficients of the sinusoids is equivalent to obtaining the coefficients of the mirrored complex exponentials they decompose into and then putting them together. When we write the latter, we can put the two integrals together and get a single trigonometric one, i.e. the former.
Note that this graphical perspective interprets $x$ as "time." I'm also uncertain of how to "see" that complex exponentials or sinusoidal waves form a complete basis for $L^2\big([-\pi,\pi]\big)$...