$\mathrm dx $ and $\mathrm dy$ can't be taken as meaningless symbols. If they were, how could you talk about what it means to divide them (or even add them)? How could you talk about $\mathrm df$ as a linear combination of them, unless that also makes it a meaningless symbol?
No, these are well-defined quantities called differential forms.
A differential $k$-form is a function of the coordinate position as well as a function of $k$ vectors. $\mathrm dx$ and $\mathrm dy$ are differential 1-forms, and the equation you were given for the total differential can be seen as defining $\mathrm df$ as a differential 1-form in terms of $\mathrm dx$ and $\mathrm dy$:
$$\begin{align*}\mathrm df&: \mathbb R^2 \times \mathbb R^2 \to \mathbb R\\
\mathrm df(r,a) &= \partial_1 f(r) \mathrm dx(r, a) + \partial_2 f(r) \mathrm dy(r,a)\end{align*}$$
This is a very verbose and explicit way of writing the total differential. I'm being extremely, extremely pedantic here: I'm not even calling $\partial_1 f$ by $\partial f/\partial x$ because, from a strict mathematics perspective, $f$ is a function of a vector argument, and the components of that argument (the coordinates) need not be called $x,y$.
When we consider a level curve of $f$, that means we have some curve $C: \mathbb R \to \mathbb R^2$ such that $(f \circ C)(t) = K$ for some constant $K$. Taking a derivative of this function yields, using the chain rule,
$$\begin{align*}0 &= (f \circ C)'(t) \\ &= \mathrm df(C(t), C'(t)) \\ &= [(\partial_1 f) \circ C](t) \mathrm dx(C(t), C'(t)) + [(\partial_2 f) \circ C](t) \mathrm dy(C(t), C'(t))\end{align*}$$
Again, being painfully explicit here. Most people would not even bother writing the function $C$ in here, and the derivative $C'$ would be considered implied.
Now at this point, you can write
$$\frac{\mathrm dy(C(t), C'(t))}{\mathrm dx(C(t), C'(t))} = - \frac{(\partial_1 f)\circ C(t)}{(\partial_2 f) \circ C(t)}$$
That, however, is far from considering $y$ a function of $x$ and taking a derivative.
Now, what some people might do is define the level curve such that $C$ takes one of the coordinates (like $x$) and spits out the coordinate pair $(x,y)$ that corresponds to a point on the curve. When this is done, $C(x) = (x, Y(x))$ and $C'(x) = (1, Y'(x))$ for some function $Y$. Moreover, $\mathrm dx$ and $\mathrm dy$ don't actually depend on position at all: they only look at the second argument.
When this is done, the resulting equation looks like
$$Y'(x) = -\frac{\partial _2 f(x,Y(x))}{\partial _1 f(x,Y(x))}$$
So while the argument put forth is very powerful and very suggestive, and it gets the gist of things right, the bookkeeping required "under the hood" of this argument may require some additional thought.
$\frac{dy}{dt} \cdot \frac{dt}{dx} = \frac{dy}{dx}.$
I understand that. But I don't quite see where the $t$ came from. And why is the surface one-dimensional? It seems two-dimensional to me.
– ra1nmaster Dec 02 '15 at 01:26$$\nabla f \cdot \frac{\mathrm{d}\mathbf{r}}{\mathrm{d}t} = 0$$ where $\mathbf{r}$ is the parametrization of the curve you described. This works because the gradient is perpendicular to $\frac{\mathrm{d}\mathbf{r}}{\mathrm{d}t}.$
And then I would just use the dot product to get that equation?
– ra1nmaster Dec 02 '15 at 01:33$\mathrm{d}f = \frac{\partial f}{\partial x} \mathrm{d}x + \frac{\partial f}{\partial y}\mathrm{d}y.$
Clearly, if these kinds of equations are going to be taught, they ought be given a meaning. One way I see it is that $dy$ gives the linear change in $f(x)$ along a tangent line given a change in $x$ ($dx$), but this definition doesn't seem to hold in say, integrals.
– ra1nmaster Dec 02 '15 at 02:28