Suppose I have a data set of $m$ vectors in $\mathbb{R}^d$, $D = \{x_1,\ldots,x_m\}$.
Let $S = \sum_{i=1}^{m}x_ix_i^T$ be the scatter matrix.
My question is: do the eigenvectors of $S$ form a basis of $\mathbb{R}^d$?
I know that if $S$ has $d$ eigenvectors with distinct eigenvalues, then those vectors are orthogonal to each other because $S$ is symmetric. I also know that nonzero orthogonal vectors are linearly independent. Thus, if $S$ has $d$ eigenvectors with distinct eigenvalues, those are $d$ (= the dimension of the space) linearly independent vectors, and hence they form a basis.
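To make the distinct-eigenvalue case concrete, here is a minimal numerical sketch of that argument. It assumes NumPy and uses made-up random data (the sizes `m = 50`, `d = 4` and the seed are my own choices for illustration). It deliberately uses the general solver `np.linalg.eig`, which does not enforce orthogonality, so any orthogonality it shows comes from the symmetry of $S$ together with the distinct eigenvalues.

```python
import numpy as np

# Hypothetical data for illustration only: m = 50 random vectors in R^d, d = 4.
rng = np.random.default_rng(0)
m, d = 50, 4
X = rng.standard_normal((m, d))  # row i of X is x_i^T

# Scatter matrix S = sum_i x_i x_i^T, which is the same as X^T X.
S = X.T @ X
symmetric = bool(np.allclose(S, S.T))

# General eigen-solver: columns of V are unit-norm eigenvectors of S.
eigenvalues, V = np.linalg.eig(S)

# Check the three steps of the argument for this particular S.
distinct = bool(np.all(np.diff(np.sort(eigenvalues)) > 1e-8))  # no repeated eigenvalue
orthonormal = bool(np.allclose(V.T @ V, np.eye(d)))            # pairwise orthogonal
is_basis = int(np.linalg.matrix_rank(V)) == d                  # d independent vectors

print(symmetric, distinct, orthonormal, is_basis)
```

This only checks one randomly generated $S$ whose eigenvalues happen to be distinct; it says nothing about the repeated-eigenvalue case.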
However, I am not sure that the eigenvectors of $S$ necessarily form a basis, because there may be eigenvectors that share an eigenvalue and are therefore not necessarily orthogonal to each other. I am also not sure whether $S$ necessarily has $d$ linearly independent eigenvectors; maybe it can have fewer.
Can $S$ have repeated (non-distinct) eigenvalues?
Can $S$ have fewer than $d$ linearly independent eigenvectors?
An elaboration on when the eigenvectors of $S$ form a basis (maybe always, maybe only under some conditions) would be helpful.