I looked up a proof of the chain rule here: https://web.williams.edu/Mathematics/lg5/A37W12/Chain.pdf
It made sense from a computational standpoint. However, the proof does not give me any intuition for what the chain rule is actually doing or why it works.