30

For a given $n \times n$ matrix $A$ and $J\subseteq\{1,\dots,n\}$, let us denote by $A[J]$ the principal minor formed by the rows and columns with indices from $J$.

If the characteristic polynomial of $A$ is $x^n+a_{n-1}x^{n-1}+\cdots+a_1x+a_0$, then why $$a_k=(-1)^{n-k}\sum_{|J|=n-k}A[J],$$ that is, why is each coefficient the sum of the appropriately sized principal minors of $A$?
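(For a concrete sanity check of what I am asking, here is a small sympy sketch with an arbitrarily chosen $4\times 4$ integer matrix; it only confirms the identity numerically, it does not explain it.)

```python
import sympy as sp
from itertools import combinations

n = 4
A = sp.Matrix(n, n, lambda i, j: (3 * i - 2 * j) % 5 - 2)  # arbitrary integer test matrix
x = sp.symbols('x')
charpoly = (x * sp.eye(n) - A).det().expand()

# check a_k = (-1)^(n-k) * (sum of principal minors of size n-k), for k < n
for k in range(n):
    minors = sum(A.extract(list(J), list(J)).det()
                 for J in combinations(range(n), n - k))
    assert sp.simplify(charpoly.coeff(x, k) - (-1) ** (n - k) * minors) == 0
print("coefficients match the signed sums of principal minors")
```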

5 Answers

16

Use the fact that $\begin{vmatrix} a & b+e \\ c & d+f \end{vmatrix} = \begin{vmatrix} a & b \\ c & d \end{vmatrix} + \begin{vmatrix} a & e \\ c & f \end{vmatrix} $

We can use this fact to separate out powers of $\lambda$. Following is an example for a $2 \times 2$ matrix. $$ \begin{vmatrix} a-\lambda & b \\ c & d-\lambda \end{vmatrix} = \begin{vmatrix} a & b \\ c & d-\lambda \end{vmatrix} + \begin{vmatrix} -\lambda & b \\ 0 & d-\lambda \end{vmatrix} = \begin{vmatrix} a & b \\ c & d \end{vmatrix} + \begin{vmatrix} a & 0 \\ c & -\lambda \end{vmatrix} + \begin{vmatrix} -\lambda & b \\ 0 & d \end{vmatrix} + \begin{vmatrix} -\lambda & 0 \\ 0 & -\lambda \end{vmatrix} $$

This decomposes the determinant into a sum of terms involving the various powers of $\lambda$.

Now try it with a $3 \times 3$ matrix and then generalize it.
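If you want to check the bookkeeping, here is a small sympy sketch (purely illustrative, with symbolic entries) that performs exactly this column-by-column splitting for a $3 \times 3$ matrix and verifies that the $2^3$ resulting determinants add up to $\det(A-\lambda I)$:

```python
import sympy as sp
from itertools import combinations

lam = sp.symbols('lambda')
A = sp.Matrix(3, 3, sp.symbols('a0:9'))  # generic symbolic 3x3 matrix

# By multilinearity in the columns, det(A - lambda*I) splits into 2^3 determinants:
# for each subset S of column indices, take the column of -lambda*I, otherwise the column of A.
total = 0
for r in range(4):
    for S in combinations(range(3), r):
        M = sp.zeros(3, 3)
        for j in range(3):
            M[:, j] = -lam * sp.eye(3)[:, j] if j in S else A[:, j]
        # this term equals (-lambda)^{|S|} times the principal minor of A
        # on the complementary index set
        total += M.det()

assert sp.expand(total - (A - lam * sp.eye(3)).det()) == 0
```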

user26857
  • 52,094
Dilawar
  • 6,125
  • I couldn't understand how you went from the LHS to the RHS at the first equal sign in the $2\times 2$ matrix example. I mean, I can see the equality by just calculating the determinants, but I couldn't get the method you used while separating the determinants. – Our Jul 18 '17 at 15:17
  • @Leth This is a well-known fact which you can prove yourself using the definition of the determinant. See here https://math.stackexchange.com/questions/1148302/effect-of-row-operations-on-determinant-for-matrices-in-row-form?noredirect=1&lq=1 for pointers. – Dilawar Jul 19 '17 at 07:39
11

One way to see it: $A:V\to V$ induces the (again linear) maps $\wedge^k A:\wedge^k V\to \wedge^k V$. Your formula (restated in an invariant way, i.e. independently of basis) says that $$\det(xI-A)=x^n-x^{n-1}\operatorname{Tr}(A)+ x^{n-2}\operatorname{Tr}(\wedge^2 A)-\cdots\tag{$*$}$$ We can conjugate $A$ so that it becomes upper-triangular with diagonal elements $\lambda_i$ ($\lambda_i$'s are the roots of the char. polynomial). Now for upper triangular matrices the formula $(*)$ says that $$(x-\lambda_1)\cdots(x-\lambda_n)=x^n-x^{n-1}(\sum\lambda_i)+x^{n-2}(\sum\lambda_i\lambda_j)-\cdots$$ which is certainly true, hence $(*)$ is true.
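As a concrete illustration of the identification $\operatorname{Tr}(\wedge^k A)=\sum_{|I|=k}\det A_{I,I}$ (not part of the argument above, just a sympy sanity check with an arbitrarily chosen matrix), one can build the matrix of $\wedge^k A$ in the basis $\{e_{i_1}\wedge\cdots\wedge e_{i_k}\}$, whose entries are the $k\times k$ minors, and compare its trace with the corresponding coefficient of the characteristic polynomial:

```python
import sympy as sp
from itertools import combinations

n, k = 4, 2
A = sp.Matrix(n, n, lambda i, j: (2 * i - 3 * j) % 7 - 3)  # arbitrary integer test matrix

subsets = list(combinations(range(n), k))
# (wedge^k A)_{I,J} = det(A[I, J]) in the lexicographically ordered basis of k-subsets
WkA = sp.Matrix(len(subsets), len(subsets),
                lambda p, q: A.extract(list(subsets[p]), list(subsets[q])).det())

x = sp.symbols('x')
charpoly = (x * sp.eye(n) - A).det().expand()

# the coefficient of x^{n-k} should be (-1)^k * Tr(wedge^k A)
assert sp.simplify(charpoly.coeff(x, n - k) - (-1) ** k * WkA.trace()) == 0
```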

user26857
  • 52,094
user8268
  • 21,348
  • This is highbrow but doesn't explain the combinatorial equality of the OP. – Duchamp Gérard H. E. Feb 05 '19 at 08:36
  • As for why $\operatorname{tr}(\Lambda^k A)$ is the sum of principal $k\times k$ minors of $A$, see https://math.stackexchange.com/q/1604461/. The references at https://mathoverflow.net/a/372497/ are also useful, and https://math.stackexchange.com/q/23899/ discusses an interesting extension of this result. – ho boon suan Jan 16 '21 at 02:29
7

$\newcommand\sgn{\operatorname{sgn}}$ I learned of the following proof from @J_P's answer to what is effectively the same question. It arises from expanding the usual definition $\det A=\sum_{\sigma\in S_n}\sgn\sigma\prod_{1\le k\le n}A_{k,\sigma(k)}$, and deserves to be better known than it currently is.

Let $[n]:=\{1,\dots,n\}$, and write $\delta_{i,j}$ for the Kronecker delta, which is equal to $1$ if $i=j$, and is $0$ otherwise. Note that $\prod_{1\le k\le n}(a_k+b_k)=\sum_{C\subseteq[n]}\prod_{i\in C}a_i\prod_{j\in[n]-C}b_j$, since every term in the expansion on the left hand side will choose from each expression $(a_k+b_k)$ either $a_k$ or $b_k$, and so we may sum over all possible ways $C$ of choosing the $a_k$ terms. We compute \begin{align*} \det(tI-A) &=\sum_{\sigma\in S_n}\sgn\sigma\prod_{1\le k\le n} (t\delta_{k,\sigma(k)}-A_{k,\sigma(k)})\\ &=\sum_{\sigma\in S_n}\sgn\sigma\sum_{C\subseteq[n]} \prod_{i\in C}(-A_{i,\sigma(i)})\prod_{j\in[n]-C}t\delta_{j,\sigma(j)}\\ &=\sum_{C\subseteq[n]}(-1)^{|C|}\sum_{\sigma\in S_n}\sgn\sigma \prod_{i\in C}A_{i,\sigma(i)}\prod_{j\in[n]-C}t\delta_{j,\sigma(j)}. \end{align*} For fixed $C\subseteq[n]$ and $\sigma\in S_n$, the last product $\prod_{j\in[n]-C}t\delta_{j,\sigma(j)}$ vanishes unless $\sigma$ fixes the elements of $[n]-C$, in which case the product is just $t^{n-|C|}$. So we need only consider the contributions of the permutations of $C$ in our sum, by thinking of a permutation $\sigma\in S_n$ that fixes $[n]-C$ as a permutation in $S_C$. The sign of this permutation considered as an element of $S_C$ remains the same, as can be seen if we consider the sign as $(-1)^{T(\sigma)}$, where $T(\sigma)$ is the number of transpositions of $\sigma$. We thus have \begin{align*} \sum_{C\subseteq[n]}(-1)^{|C|}\sum_{\sigma\in S_n}\sgn\sigma \prod_{i\in C}A_{i,\sigma(i)}\prod_{j\in[n]-C}t\delta_{j,\sigma(j)} &=\sum_{C\subseteq[n]}(-1)^{|C|}\sum_{\sigma\in S_C}\sgn\sigma \prod_{i\in C}A_{i,\sigma(i)}t^{n-|C|}\\ &=\sum_{C\subseteq[n]}(-1)^{|C|}t^{n-|C|}\sum_{\sigma\in S_C}\sgn\sigma \prod_{i\in C}A_{i,\sigma(i)}. \end{align*} The term $\sum_{\sigma\in S_C}\sgn\sigma\prod_{i\in C}A_{i,\sigma(i)}$ is precisely the determinant of the principal submatrix $A_{C\times C}$, which is the $|C|\times|C|$ matrix with rows and columns indexed by $C$, and so \begin{align*} \sum_{C\subseteq[n]}(-1)^{|C|}t^{n-|C|}\sum_{\sigma\in S_C}\sgn\sigma \prod_{i\in C}A_{i,\sigma(i)} &=\sum_{C\subseteq[n]}(-1)^{|C|}t^{n-|C|}\det(A_{C\times C})\\ &=\sum_{0\le k\le n}\sum_{\substack{C\subseteq[n]\\|C|=k}}(-1)^kt^{n-k} \det(A_{C\times C})\\ &=\sum_{0\le k\le n}t^{n-k}\left((-1)^k \sum_{\substack{C\subseteq[n]\\|C|=k}} \det(A_{C\times C})\right)\\ &=\sum_{0\le k\le n}t^k\left((-1)^{n-k} \sum_{\substack{C\subseteq[n]\\|C|=n-k}} \det(A_{C\times C})\right). \end{align*}
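Here is a brute-force sympy sketch of this bookkeeping (illustrative only, for a symbolic $3\times 3$ matrix): it sums $(-1)^{|C|}t^{n-|C|}\operatorname{sgn}\sigma\prod_{i\in C}A_{i,\sigma(i)}$ over all subsets $C\subseteq[n]$ and permutations $\sigma$ of $C$, and checks that the result equals $\det(tI-A)$:

```python
import sympy as sp
from sympy.combinatorics import Permutation
from itertools import combinations, permutations

n = 3
A = sp.Matrix(n, n, sp.symbols('a0:9'))  # generic symbolic 3x3 matrix
t = sp.symbols('t')

expansion = 0
for m in range(n + 1):
    for C in combinations(range(n), m):
        # inner sum over permutations sigma of C: sgn(sigma) * prod_{i in C} A[i, sigma(i)]
        minor = 0
        for image in permutations(C):
            sigma = dict(zip(C, image))
            # sign of sigma viewed as a permutation of C (relabel C as 0..m-1)
            sgn = Permutation([C.index(v) for v in image]).signature() if C else 1
            minor += sgn * sp.Mul(*[A[i, sigma[i]] for i in C])
        expansion += (-1) ** m * t ** (n - m) * minor

assert sp.expand(expansion - (t * sp.eye(n) - A).det()) == 0
```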

ho boon suan
  • 1,579
4

Here's another way, using Taylor's theorem.

Consider $\det (xI+A)$ as a polynomial $p(x)$; from Taylor's theorem we have: $$ p(x)=\sum_{i=0}^n\frac{p^{(i)}(0)}{i!}x^i. $$ Computing $p^{(i)}(0)$ leads quickly to the conclusion.


How do we compute $p^{(i)}(x)$ at $x=0$? Well, here's a trick:

For instance, to compute $p'(0)$, go back to the determinant, replace the $x$ in the $k$th row by $x_k$, and use the total derivative. Then you'll find: $$p'(0)=\sum_{|J|=n-1}A[J].$$

And using induction we can show in general that: $$p^{(i)}(0)=i!\sum_{|J|=n-i}A[J]$$
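For what it's worth, here is a small sympy check of these derivative formulas on a concrete $4\times 4$ integer matrix (chosen arbitrarily); it is only a sanity check, not a proof:

```python
import sympy as sp
from itertools import combinations

n = 4
A = sp.Matrix(n, n, lambda i, j: (3 * i + 5 * j) % 7 - 3)  # arbitrary integer test matrix
x = sp.symbols('x')
p = (x * sp.eye(n) + A).det()

for i in range(n):
    # sum of principal minors of size n - i
    minors = sum(A.extract(list(J), list(J)).det()
                 for J in combinations(range(n), n - i))
    # p^{(i)}(0) should equal i! times that sum
    assert sp.simplify(sp.diff(p, x, i).subs(x, 0) - sp.factorial(i) * minors) == 0
```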

1

Let ${ A \in \mathbb{R} ^{m \times n} }.$ Recall the notation ${ [m] = \lbrace 1, \ldots, m \rbrace, }$ and for ${ I \subseteq [m], J \subseteq [n] }$ we have the submatrix $${ A _{I,J} = (a _{ij}) _{i \in I, j \in J}. }$$

Let ${ A \in \mathbb{R} ^{n \times n} }.$ Let $${ f(A,t) = \det(tI _n + A) }.$$
Using the result on differentiating determinants, $${ \frac{d}{dt} f(A,t) = \sum _{\substack{I \subseteq [n] \\ \vert I \vert = n-1}} f(A _{I,I}, t) }.$$ So the second derivative is $${ \frac{d ^2}{dt ^2} f(A,t) = \sum _{\substack{I _1 \subseteq [n] \\ \vert I _1 \vert = n-1}} \left( \sum _{\substack{I _2 \subseteq I _1 \\ \vert I _2 \vert = \vert I _1 \vert - 1}} f(A _{I _2, I _2}, t) \right) }$$ that is $${ \frac{d ^2}{dt ^2} f(A,t) = 2 \sum _{\substack{I \subseteq [n] \\ \vert I \vert = n-2}} f(A _{I,I}, t). }$$ In general, we see $${ \frac{d ^k}{dt ^k} f(A,t) = \sum _{\substack{I _1 \subseteq [n] \\ \vert I _1 \vert = n -1}} \left( \ldots \left( \sum _{\substack{I _k \subseteq I _{k-1} \\ \vert I _k \vert = \vert I _{k-1} \vert - 1}} f(A _{I _k, I _k} , t) \right) \right) }$$ that is $${ \frac{d^k}{dt^k} f(A,t) = k! \sum _{\substack{I \subseteq [n] \\ \vert I \vert = n-k}} f(A _{I,I}, t). }$$ Hence $${ \begin{align*} f(A,t) &= \sum _{k=0} ^{n} \frac{f ^{(k)} (A, 0)}{k!} t ^k \\ &= \sum _{k=0} ^{n} \left( \sum _{\substack{I \subseteq [n] \\ \vert I \vert = n-k}} f(A _{I,I}, 0)\right) t ^k \end{align*} }$$ that is $${ \det(tI _n + A) = \sum _{k=0} ^{n} \left( \sum _{\substack{I \subseteq [n] \\ \vert I \vert = n-k}} \det(A _{I,I})\right) t ^k .}$$ Replacing ${ A }$ by ${ -A }$ and noting that ${ \det((-A) _{I,I}) = (-1) ^{\vert I \vert} \det(A _{I,I}) }$ recovers the signed formula in the question.
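The first differentiation step above can be sanity-checked directly with sympy; the following sketch (with an arbitrarily chosen integer matrix) verifies that ${ \frac{d}{dt}\det(tI _n + A) = \sum _{\vert I \vert = n-1} \det(tI _{n-1} + A _{I,I}) }$:

```python
import sympy as sp
from itertools import combinations

n = 4
A = sp.Matrix(n, n, lambda i, j: (i * i + 2 * j) % 5 - 2)  # arbitrary integer test matrix
t = sp.symbols('t')

f = (t * sp.eye(n) + A).det()
# sum of f(A_{I,I}, t) over all (n-1)-element index sets I
rhs = sum((t * sp.eye(n - 1) + A.extract(list(I), list(I))).det()
          for I in combinations(range(n), n - 1))

assert sp.expand(sp.diff(f, t) - rhs) == 0
```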