Section PEE Properties of Eigenvalues and Eigenvectors
Subsection BPE Basic Properties of Eigenvalues
Theorem EDELI. Eigenvectors with Distinct Eigenvalues are Linearly Independent.
Suppose that A is an n×n square matrix and S={x1,x2,x3,…,xp} is a set of eigenvectors with eigenvalues λ1,λ2,λ3,…,λp such that λi≠λj whenever i≠j. Then S is a linearly independent set.
Proof.
If \(p=1\text{,}\) then the set \(S=\set{\vect{x}_1}\) is linearly independent since eigenvectors are nonzero (Definition EEM), so assume for the remainder that \(p\geq 2\text{.}\)
We will prove this result by contradiction (Proof Technique CD). Suppose to the contrary that \(S\) is a linearly dependent set. Define \(S_i=\set{\vectorlist{x}{i}}\) and let \(k\) be an integer such that \(S_{k-1}=\set{\vectorlist{x}{k-1}}\) is linearly independent and \(S_k=\set{\vectorlist{x}{k}}\) is linearly dependent. We have to ask if there is even such an integer \(k\text{?}\) First, since eigenvectors are nonzero, the set \(\set{\vect{x}_1}\) is linearly independent. Since we are assuming that \(S=S_p\) is linearly dependent, there must be an integer \(k\text{,}\) \(2\leq k\leq p\text{,}\) where the sets \(S_i\) transition from linear independence to linear dependence (and stay that way). In other words, \(\vect{x}_k\) is the vector with the smallest index that is a linear combination of just vectors with smaller indices.
Since \(\set{\vectorlist{x}{k}}\) is a linearly dependent set there must be scalars, \(\scalarlist{a}{k}\text{,}\) not all zero (Definition LI), so that
Then,
This equation is a relation of linear dependence on the linearly independent set \(\set{\vectorlist{x}{k-1}}\text{,}\) so the scalars must all be zero. That is, \(a_i\left(\lambda_i-\lambda_k\right)=0\) for \(1\leq i\leq k-1\text{.}\) However, we have the hypothesis that the eigenvalues are distinct, so \(\lambda_i\neq\lambda_k\) for \(1\leq i\leq k-1\) and so \(\lambda_i-\lambda_k\neq 0\text{.}\) So Theorem ZPZT implies \(a_i=0\) for \(1\leq i\leq k-1\text{.}\)
This reduces the original relation of linear dependence on \(\set{\vectorlist{x}{k}}\) to the simpler equation \(a_k\vect{x}_k=\zerovector\text{.}\) By Theorem SMEZV we conclude that \(a_k=0\) or \(\vect{x}_k=\zerovector\text{.}\) Eigenvectors are never the zero vector (Definition EEM), so \(a_k=0\text{.}\) So now all of the scalars \(a_i\text{,}\) \(1\leq i\leq k\) are zero, contradicting their introduction as the scalars creating a nontrivial relation of linear dependence on the set \(\set{\vectorlist{x}{k}}\text{.}\) With a contradiction in hand, we conclude that \(S\) must be linearly independent.
Theorem MNEM. Maximum Number of Eigenvalues of a Matrix.
Suppose that A is a square matrix of size n. Then A cannot have more than n distinct eigenvalues.
Proof.
We argue by contradiction (Proof Technique CD). Assume that \(A\) has \(n+1\) or more distinct eigenvalues. Then there is a set of \(n+1\) or more eigenvectors of \(A\text{,}\) with distinct eigenvalues. This is a set of \(n+1\) or more vectors from \(\complex{n}\) that will be linearly independent by Theorem EDELI. But this contradicts Theorem MVSLD, so our assumption is false, and there are no more than \(n\) distinct eigenvalues.
Theorem SMZE. Singular Matrices have Zero Eigenvalues.
Suppose A is a square matrix. Then A is singular if and only if 0 is an eigenvalue of A.
Proof.
We have the following equivalences:
Theorem NME8. Nonsingular Matrix Equivalences, Round 8.
Suppose that A is a square matrix of size n. The following are equivalent.
- A is nonsingular.
- A row-reduces to the identity matrix.
- The null space of A contains only the zero vector, N(A)={0}.
- The linear system LS(A,b) has a unique solution for every possible choice of b.
- The columns of A are a linearly independent set.
- A is invertible.
- The column space of A is Cn, C(A)=Cn.
- The columns of A are a basis for Cn.
- The rank of A is n, r(A)=n.
- The nullity of A is zero, n(A)=0.
- The determinant of A is nonzero, det(A)≠0.
- λ=0 is not an eigenvalue of A.
Proof.
The equivalence of the first and last statements is Theorem SMZE, reformulated by negating each statement in the equivalence. So we are able to improve on Theorem NME7 with this addition.
Sage NME8. Nonsingular Matrix Equivalences, Round 8.
Zero eigenvalues are another marker of singular matrices. We illustrate with two matrices, the first nonsingular, the second not.
Theorem ESMM. Eigenvalues of a Scalar Multiple of a Matrix.
Suppose A is a square matrix and λ is an eigenvalue of A. Then αλ is an eigenvalue of αA.
Proof.
Let \(\vect{x}\neq\zerovector\) be one eigenvector of \(A\) for \(\lambda\text{.}\) Then
So \(\vect{x}\neq\zerovector\) is an eigenvector of \(\alpha A\) for the eigenvalue \(\alpha\lambda\text{.}\)
Theorem EOMP. Eigenvalues Of Matrix Powers.
Suppose A is a square matrix, λ is an eigenvalue of A, and s≥0 is an integer. Then λs is an eigenvalue of As.
Proof.
Let \(\vect{x}\neq\zerovector\) be one eigenvector of \(A\) for \(\lambda\text{.}\) Then we proceed by induction on \(s\) (Proof Technique I). First, for \(s=0\text{,}\) employing Theorem MMIM and Property OC to establish the base case,
So \(\lambda^s\) is an eigenvalue of \(A^s\) in this special case. For the induction step, we assume the theorem is true for \(s\text{,}\) and find
So \(\vect{x}\neq\zerovector\) is an eigenvector of \(A^{s+1}\) for \(\lambda^{s+1}\text{,}\) and induction tells us the theorem is true for all \(s\geq 0\text{.}\)
Theorem EPM. Eigenvalues of the Polynomial of a Matrix.
Suppose A is a square matrix and λ is an eigenvalue of A. Let q(x) be a polynomial in the variable x. Then q(λ) is an eigenvalue of the matrix q(A).
Proof.
Let \(\vect{x}\neq\zerovector\) be one eigenvector of \(A\) for \(\lambda\text{,}\) and write \(q(x)=a_0+a_1x+a_2x^2+\cdots+a_mx^m\text{.}\) Then
So \(\vect{x}\neq 0\) is an eigenvector of \(q(A)\) for the eigenvalue \(q(\lambda)\text{.}\)
Example BDE. Building desired eigenvalues.
In Example ESMS4 the \(4\times 4\) symmetric matrix
is shown to have the three eigenvalues \(\lambda=3,\,1,\,-1\text{.}\) Suppose we wanted a \(4\times 4\) matrix that has the three eigenvalues \(\lambda=4,\,0,\,-2\text{.}\) We can employ Theorem EPM by finding a polynomial that converts \(3\) to \(4\text{,}\) \(1\) to \(0\text{,}\) and \(-1\) to \(-2\text{.}\) Such a polynomial is called an interpolating polynomial, and in this example we can use
We will not discuss how to concoct this polynomial, but a text on numerical analysis should provide the details. For now, simply verify that \(r(3)=4\text{,}\) \(r(1)=0\) and \(r(-1)=-2\text{.}\)
Now compute
Theorem EPM tells us that if \(r(x)\) transforms the eigenvalues in the desired manner, then \(r(C)\) will have the desired eigenvalues. You can check this by computing the eigenvalues of \(r(C)\) directly. Furthermore, notice that the multiplicities are the same, and the eigenspaces of \(C\) and \(r(C)\) are identical.
Theorem EIM. Eigenvalues of the Inverse of a Matrix.
Suppose A is a square nonsingular matrix and λ is an eigenvalue of A. Then λ−1 is an eigenvalue of the matrix A−1.
Proof.
Notice that since \(A\) is assumed nonsingular, \(\inverse{A}\) exists by Theorem NI, but more importantly, \(\lambda^{-1}=1/\lambda\) does not involve division by zero since Theorem SMZE prohibits this possibility.
Let \(\vect{x}\neq\zerovector\) be one eigenvector of \(A\) for \(\lambda\text{.}\) Suppose \(A\) has size \(n\text{.}\) Then
So \(\vect{x}\neq\zerovector\) is an eigenvector of \(\inverse{A}\) for the eigenvalue \(\frac{1}{\lambda}\text{.}\)
Theorem ETM. Eigenvalues of the Transpose of a Matrix.
Suppose A is a square matrix and λ is an eigenvalue of A. Then λ is an eigenvalue of the matrix At.
Proof.
Suppose \(A\) has size \(n\text{.}\) Then
So \(A\) and \(\transpose{A}\) have the same characteristic polynomial, and by Theorem EMRCP, their eigenvalues are identical and have equal algebraic multiplicities. Notice that what we have proved here is a bit stronger than the stated conclusion in the theorem.
Theorem ERMCP. Eigenvalues of Real Matrices come in Conjugate Pairs.
Suppose A is a square matrix with real entries and x is an eigenvector of A for the eigenvalue λ. Then ¯x is an eigenvector of A for the eigenvalue ¯λ.
Proof.
We have
So \(\conjugate{\vect{x}}\) is an eigenvector of \(A\) for the eigenvalue \(\conjugate{\lambda}\text{.}\)
Subsection ME Multiplicities of Eigenvalues
A polynomial of degree n will have exactly n roots. From this fact about polynomial equations we can say more about the algebraic multiplicities of eigenvalues.Theorem DCP. Degree of the Characteristic Polynomial.
Suppose that A is a square matrix of size n. Then the characteristic polynomial of A, pA(x), has degree n.
Proof.
We will prove a more general result by induction (Proof Technique I). Then the theorem will be true as a special case. We will carefully state this result as a proposition indexed by \(m\text{,}\) \(m\geq 1\text{.}\)
\(P(m)\text{:}\) Suppose that \(A\) is an \(m\times m\) matrix whose entries are complex numbers or linear polynomials in the variable \(x\) of the form \(c-x\text{,}\) where \(c\) is a complex number. Suppose further that there are exactly \(k\) entries that contain \(x\) and that no row or column contains more than one such entry. Then, when \(k=m\text{,}\) \(\detname{A}\) is a polynomial in \(x\) of degree \(m\text{,}\) with leading coefficient \(\pm 1\text{,}\) and when \(k\lt m\text{,}\) \(\detname{A}\) is a polynomial in \(x\) of degree \(k\) or less.
Base Case: Suppose \(A\) is a \(1\times 1\) matrix. Then its determinant is equal to the lone entry (Definition DM). When \(k=m=1\text{,}\) the entry is of the form \(c-x\text{,}\) a polynomial in \(x\) of degree \(m=1\) with leading coefficient \(-1\text{.}\) When \(k\lt m\text{,}\) then \(k=0\) and the entry is simply a complex number, a polynomial of degree \(0\leq k\text{.}\) So \(P(1)\) is true.
Induction Step: Assume \(P(m)\) is true, and that \(A\) is an \((m+1)\times(m+1)\) matrix with \(k\) entries of the form \(c-x\text{.}\) There are two cases to consider.
Suppose \(k=m+1\text{.}\) Then every row and every column will contain an entry of the form \(c-x\text{.}\) Suppose that for the first row, this entry is in column \(t\text{.}\) Compute the determinant of \(A\) by an expansion about this first row (Definition DM). The term associated with entry \(t\) of this row will be of the form
The submatrix \(\submatrix{A}{1}{t}\) is an \(m\times m\) matrix with \(k=m\) terms of the form \(c-x\text{,}\) no more than one per row or column. By the induction hypothesis, \(\detname{\submatrix{A}{1}{t}}\) will be a polynomial in \(x\) of degree \(m\) with coefficient \(\pm 1\text{.}\) So this entire term is then a polynomial of degree \(m+1\) with leading coefficient \(\pm 1\text{.}\)
The remaining terms (which constitute the sum that is the determinant of \(A\)) are products of complex numbers from the first row with cofactors built from submatrices that lack the first row of \(A\) and lack some column of \(A\text{,}\) other than column \(t\text{.}\) As such, these submatrices are \(m\times m\) matrices with \(k=m-1\lt m\) entries of the form \(c-x\text{,}\) no more than one per row or column. Applying the induction hypothesis, we see that these terms are polynomials in \(x\) of degree \(m-1\) or less. Adding the single term from the entry in column \(t\) with all these others, we see that \(\detname{A}\) is a polynomial in \(x\) of degree \(m+1\) and leading coefficient \(\pm 1\text{.}\)
The second case occurs when \(k\lt m+1\text{.}\) Now there is a row of \(A\) that does not contain an entry of the form \(c-x\text{.}\) We consider the determinant of \(A\) by expanding about this row (Theorem DER), whose entries are all complex numbers. The cofactors employed are built from submatrices that are \(m\times m\) matrices with either \(k\) or \(k-1\) entries of the form \(c-x\text{,}\) no more than one per row or column. In either case, \(k\leq m\text{,}\) and we can apply the induction hypothesis to see that the determinants computed for the cofactors are all polynomials of degree \(k\) or less. Summing these contributions to the determinant of \(A\) yields a polynomial in \(x\) of degree \(k\) or less, as desired.
Definition CP tells us that the characteristic polynomial of an \(n\times n\) matrix is the determinant of a matrix having exactly \(n\) entries of the form \(c-x\text{,}\) no more than one per row or column. As such we can apply \(P(n)\) to see that the characteristic polynomial has degree \(n\text{.}\)
Theorem NEM. Number of Eigenvalues of a Matrix.
Suppose that λ1,λ2,λ3,…,λk are the distinct eigenvalues of a square matrix A of size n. Then
Proof.
By the definition of the algebraic multiplicity (Definition AME), we can factor the characteristic polynomial as
where \(c\) is a nonzero constant. (We could prove that \(c=(-1)^{n}\text{,}\) but we do not need that specificity right now. See Exercise PEE.T30) The left-hand side is a polynomial of degree \(n\) by Theorem DCP and the right-hand side is a polynomial of degree \(\sum_{i=1}^{k}\algmult{A}{\lambda_i}\text{.}\) So the equality of the polynomials' degrees gives the equality \(\sum_{i=1}^{k}\algmult{A}{\lambda_i}=n\text{.}\)
Theorem ME. Multiplicities of an Eigenvalue.
Suppose that A is a square matrix of size n and λ is an eigenvalue. Then
Proof.
Since \(\lambda\) is an eigenvalue of \(A\text{,}\) there is an eigenvector of \(A\) for \(\lambda\text{,}\) \(\vect{x}\text{.}\) Then \(\vect{x}\in\eigenspace{A}{\lambda}\text{,}\) so \(\geomult{A}{\lambda}\geq 1\text{,}\) since we can extend \(\set{\vect{x}}\) into a basis of \(\eigenspace{A}{\lambda}\) (Theorem ELIS).
To show \(\geomult{A}{\lambda}\leq\algmult{A}{\lambda}\) is the most involved portion of this proof. To this end, let \(g=\geomult{A}{\lambda}\) and let \(\vectorlist{x}{g}\) be a basis for the eigenspace of \(\lambda\text{,}\) \(\eigenspace{A}{\lambda}\text{.}\) Construct another \(n-g\) vectors, \(\vectorlist{y}{n-g}\text{,}\) so that
is a basis of \(\complex{n}\text{.}\) This can be done by repeated applications of Theorem ELIS.
Finally, define a matrix \(S\) by
where \(R\) is an \(n\times(n-g)\) matrix whose columns are \(\vectorlist{y}{n-g}\text{.}\) The columns of \(S\) are linearly independent by design, so \(S\) is nonsingular (Theorem NMLIC) and therefore invertible (Theorem NI).
Then
So
Preparations in place, we compute the characteristic polynomial of \(A\text{,}\)
What can we learn then about the matrix \(\inverse{S}AS\text{?}\)
Now imagine computing the characteristic polynomial of \(A\) by computing the characteristic polynomial of \(\inverse{S}AS\) using the form just obtained. The first \(g\) columns of \(\inverse{S}AS\) are all zero, save for a \(\lambda\) on the diagonal. So if we compute the determinant by expanding about the first column, successively, we will get successive factors of \((\lambda-x)\text{.}\) More precisely, let \(T\) be the square matrix of size \(n-g\) that is formed from the last \(n-g\) rows and last \(n-g\) columns of \(\inverse{S}AR\text{.}\) Then
This says that \((x-\lambda)\) is a factor of the characteristic polynomial at least \(g\) times, so the algebraic multiplicity of \(\lambda\) as an eigenvalue of \(A\) is greater than or equal to \(g\) (Definition AME). In other words,
as desired.
Theorem NEM says that the sum of the algebraic multiplicities for all the eigenvalues of \(A\) is equal to \(n\text{.}\) Since the algebraic multiplicity is a positive quantity, no single algebraic multiplicity can exceed \(n\) without the sum of all of the algebraic multiplicities doing the same.
Subsection EHM Eigenvalues of Hermitian Matrices
Recall that a matrix is Hermitian (or self-adjoint) if A=A∗ (Definition HM). In the case where A is a matrix whose entries are all real numbers, being Hermitian is identical to being symmetric (Definition SYM). Keep this in mind as you read the next two theorems. Their hypotheses could be changed to “suppose A is a real symmetric matrix.”Theorem HMRE. Hermitian Matrices have Real Eigenvalues.
Suppose that A is a Hermitian matrix and λ is an eigenvalue of A. Then λ∈R.
Proof.
Let \(\vect{x}\neq\zerovector\) be one eigenvector of \(A\) for the eigenvalue \(\lambda\text{.}\) Then
Since \(\vect{x}\neq\zerovector\text{,}\) by Theorem PIP we know \(\innerproduct{\vect{x}}{\vect{x}}\neq 0\text{.}\) Then by Theorem ZPZT, \(\lambda - \conjugate{\lambda}=0\text{,}\) and so \(\lambda = \conjugate{\lambda}\text{.}\) If a complex number is equal to its conjugate, then it has a complex part equal to zero, and therefore is a real number.
Theorem HMOE. Hermitian Matrices have Orthogonal Eigenvectors.
Suppose that A is a Hermitian matrix and x and y are two eigenvectors of A for different eigenvalues. Then x and y are orthogonal vectors.
Proof.
Let \(\vect{x}\) be an eigenvector of \(A\) for \(\lambda\) and let \(\vect{y}\) be an eigenvector of \(A\) for a different eigenvalue \(\rho\text{.}\) So we have \(\lambda-\rho\neq 0\text{.}\) Then
Because \(\lambda\) and \(\rho\) are presumed to be different, \(\lambda-\rho\neq 0\text{,}\) and Theorem ZPZT implies that \(\innerproduct{\vect{x}}{\vect{y}}=0\text{.}\) In other words, \(\vect{x}\) and \(\vect{y}\) are orthogonal vectors according to Definition OV.
Reading Questions PEE Reading Questions
1.
How can you identify a nonsingular matrix just by looking at its eigenvalues?
2.
How many different eigenvalues may a square matrix of size n have?
3.
What is amazing about the eigenvalues of a Hermitian matrix and why is it amazing?
Exercises PEE Exercises
T10.
Suppose that A is a square matrix. Prove that the constant term of the characteristic polynomial of A is equal to the determinant of A.
Suppose that the characteristic polynomial of \(A\) is
Then
T20.
Suppose that A is a square matrix. Prove that a single vector may not be an eigenvector of A for two different eigenvalues.
Suppose that the vector \(\vect{x}\neq\zerovector\) is an eigenvector of \(A\) for the two eigenvalues \(\lambda\) and \(\rho\text{,}\) where \(\lambda\neq\rho\text{.}\) Then \(\lambda-\rho\neq 0\text{,}\) and we also have
By Theorem SMEZV, either \(\lambda-\rho=0\) or \(\vect{x}=\zerovector\text{,}\) which are both contradictions.
T21.
Suppose that A is a square matrix of size n with an eigenvalue λ. For α∈C, prove that λ+α is an eigenvalue of the matrix A+αIn.
- Construct a proof that begins by employing a vector x that is an eigenvector of A for λ.
- Construct a proof that establishes that λ+α is a root of the characteristic polynomial of A+αIn.
T22.
Suppose that U is a unitary matrix with eigenvalue λ. Prove that λ has modulus 1, i.e. |λ|=1. This says that all of the eigenvalues of a unitary matrix lie on the unit circle of the complex plane.
T30.
Theorem DCP tells us that the characteristic polynomial of a square matrix of size n has degree n. By suitably augmenting the proof of Theorem DCP prove that the coefficient of xn in the characteristic polynomial is (−1)n.
T50.
Theorem EIM says that if λ is an eigenvalue of the nonsingular matrix A, then 1λ is an eigenvalue of A−1. Write an alternate proof of this theorem using the characteristic polynomial and without making reference to an eigenvector of A for λ.
Since \(\lambda\) is an eigenvalue of a nonsingular matrix, \(\lambda\neq 0\) (Theorem SMZE). \(A\) is invertible (Theorem NI), and so \(-\lambda A\) is invertible (Theorem MISM). Thus \(-\lambda A\) is nonsingular (Theorem NI) and \(\detname{-\lambda A}\neq 0\) (Theorem SMZD). We have
So \(\frac{1}{\lambda}\) is a root of the characteristic polynomial of \(\inverse{A}\) and so is an eigenvalue of \(\inverse{A}\text{.}\) This proof is due to Sara Bucht.