I understand that the idea of principle component analysis is to find the projection onto a vector with the largest variance.
The book says this can be achieved with eigen decomposition of the covariance matrix and Im wondering why that is. It writes that the eigenvector associated with the eigenvalue with the largest eigenvalue will have the highest variance when the matrix is projected on it. Then it doesn't go into why this is so. What is the intuition?