Wikipedia lists the identity for the gradient of a composition as $$\nabla(f\circ \mathbf A) = (\nabla f\circ \mathbf A)\nabla \mathbf A$$
First, is this formula correct?
Assuming it is:
Second, what does $\nabla\mathbf A$ mean? Is it the Jacobian of $\mathbf A$?
And finally, what does it mean to multiply the vector $\nabla f\circ \mathbf A$ and the matrix $\nabla \mathbf A$? Is it some type of tensor product? That doesn't seem like it would result in the correct type of object. Or is $\nabla f\circ \mathbf A$ supposed to be a row matrix and then you just use matrix multiplication?