I was looking over Langtangen's book on finite volume methods, and he mentions in section 5.5.1 that the second order or Laplacian term in a PDE can be written either as:
$$ \nabla^2 u \quad \text{or} \quad \nabla \cdot \nabla u = \frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2} $$
Both of these notations are analytically equivalent, but the $\nabla \cdot \nabla u$ approach seems to focus on this idea of the divergence of the gradient.
I was just trying to understand the physical intuition behind defining the Laplacian as the divergence of the gradient? I understand that the Laplacian operator represents diffusion and that it looks to reduce the deviation between a point and its neighboring points. But I was not clear on how this idea relates to the ideas of divergence--which relates to flux, and gradient--which describes how a scalar valued function is changing at a point given its variables.