My understanding in univariate case is:
Suppose $f:\mathbb{R}\rightarrow\mathbb{R}$.
Then, $\frac{dy}{dx}|_{x=x_o}=f'(x)|_{x=x_o}$=$\lim_{\Delta x \to 0}\frac{\Delta f}{\Delta x}$, where $\Delta x=(x-x_o)$ and $\Delta f=f(x)-f(x_o)$.
Part of confusion is when I see something like: $dy=f'(x)dx$.
What is happening here exactly? Why some books move around $dy$ and $dx$ as if they are discrete changes? Can somebody explain the difference between discrete changes and differential changes with clarity?
My understanding of $f'(x)$ at $x_o$ is the best affine approximation of $f$ at that point. Then, I use the term "sensitivity", because it is my understanding the derivative (or slope here) provides the information how sensitive function value at that point is to the very little change in $x$.
Then, what does $dy$ mean? Doesn't $dy$ also represent infinitesimal change? So I am confused difference between $dy$ and $\frac{dy}{dx}$ as well.
The terminologies that are used "instantaneous rate of change" or "slope" or "best affine approximation" or "how sensitive f is with respect to the argument at that point" amplifying the confusion.
Please help.