Some of the properties of the matrix you are talking about are
- In my area of research (signal processing), this is referred to as the gram matrix. I believe, this is the standard name as well.
- It is a positive (semi) definite (PD) matrix. If you know about PD matrices, proving this is straight forward. The trick is this relation $x^Tx=||x||_2^2\geq 0$. Thus all results applicable to PD matrices are also applicable to the above matrix. (For eg: all eigenvalues are non-negative, cholesky decomposition and so on).
- Then, if $det(A^TA)\neq 0$, then columns of $A$ are linearly independent. Try to think about the converse.