Section PDM  Properties of Determinants of Matrices

From A First Course in Linear Algebra
Version 2.23
© 2004.
Licensed under the GNU Free Documentation License.
http://linear.ups.edu/

We have seen how to compute the determinant of a matrix, and the incredible fact that we can perform expansion about any row or column to make this computation. In this largely theoretical section, we will state and prove several more intriguing properties about determinants. Our main goal will be the two results in Theorem SMZD and Theorem DRMM, but more specifically, we will see how the value of a determinant will allow us to gain insight into the various properties of a square matrix.

Subsection DRO: Determinants and Row Operations

We start easy with a straightforward theorem whose proof presages the style of subsequent proofs in this subsection.

Theorem DZRC
Determinant with Zero Row or Column
Suppose that A is a square matrix with a row where every entry is zero, or a column where every entry is zero. Then \mathop{ det} \left (A\right ) = 0.

Proof   Suppose that A is a square matrix of size n and row i has every entry equal to zero. We compute \mathop{ det} \left (A\right ) via expansion about row i.

\eqalignno{ \mathop{ det} \left (A\right ) & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{i+j}{\left [A\right ]}_{ ij}\mathop{ det} \left (A\left (i|j\right )\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#theorem.DER")Theorem DER@(/a)} & & & & \cr & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{i+j}\kern 1.95872pt 0\kern 1.95872pt \mathop{ det} \left (A\left (i|j\right )\right ) & &\text{Row $i$ is zeros} & & & & \cr & ={ \mathop{∑ }}_{j=1}^{n}0 = 0 & & & & }

The proof for the case of a zero column is entirely similar, or could be derived from an application of Theorem DT employing the transpose of the matrix.

Theorem DRCS
Determinant for Row or Column Swap
Suppose that A is a square matrix. Let B be the square matrix obtained from A by interchanging the location of two rows, or interchanging the location of two columns. Then \mathop{ det} \left (B\right ) = −\mathop{ det} \left (A\right ).

Proof   Begin with the special case where A is a square matrix of size n and we form B by swapping adjacent rows i and i + 1 for some 1 ≤ i ≤ n − 1. Notice that the assumption about swapping adjacent rows means that B\left (i + 1|j\right ) = A\left (i|j\right ) for all 1 ≤ j ≤ n, and {\left [B\right ]}_{i+1,j} ={ \left [A\right ]}_{ij} for all 1 ≤ j ≤ n. We compute \mathop{ det} \left (B\right ) via expansion about row i + 1.

\eqalignno{ \mathop{ det} \left (B\right ) & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{(i+1)+j}{\left [B\right ]}_{ i+1,j}\mathop{ det} \left (B\left (i + 1|j\right )\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#theorem.DER")Theorem DER@(/a)} & & & & \cr & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{(i+1)+j}{\left [A\right ]}_{ ij}\mathop{ det} \left (A\left (i|j\right )\right ) & &\text{Hypothesis} & & & & \cr & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{1}{(−1)}^{i+j}{\left [A\right ]}_{ ij}\mathop{ det} \left (A\left (i|j\right )\right ) & & & & \cr & = (−1){ \mathop{∑ }}_{j=1}^{n}{(−1)}^{i+j}{\left [A\right ]}_{ ij}\mathop{ det} \left (A\left (i|j\right )\right ) & & & & \cr & = −\mathop{ det} \left (A\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#theorem.DER")Theorem DER@(/a)} & & & & }

So the result holds for the special case where we swap adjacent rows of the matrix. As any computer scientist knows, we can accomplish any rearrangement of an ordered list by swapping adjacent elements. This principle can be demonstrated by naïve sorting algorithms such as “bubble sort.” In any event, we don’t need to discuss every possible reordering, we just need to consider a swap of two rows, say rows s and t with 1 ≤ s < t ≤ n.

Begin with row s, and repeatedly swap it with each row just below it, including row t and stopping there. This will total t − s swaps. Now swap the former row t, which currently lives in row t − 1, with each row above it, stopping when it becomes row s. This will total another t − s − 1 swaps. In this way, we create B through a sequence of 2(t − s) − 1 swaps of adjacent rows, each of which adjusts \mathop{ det} \left (A\right ) by a multiplicative factor of − 1. So

\mathop{det} \left (B\right ) = {(−1)}^{2(t−s)−1}\mathop{ det} \left (A\right ) ={ \left ({(−1)}^{2}\right )}^{t−s}{(−1)}^{−1}\mathop{ det} \left (A\right ) = −\mathop{ det} \left (A\right )

as desired.

The proof for the case of swapping two columns is entirely similar, or could be derived from an application of Theorem DT employing the transpose of the matrix.

So Theorem DRCS tells us the effect of the first row operation (Definition RO) on the determinant of a matrix. Here’s the effect of the second row operation.

Theorem DRCM
Determinant for Row or Column Multiples
Suppose that A is a square matrix. Let B be the square matrix obtained from A by multiplying a single row by the scalar α, or by multiplying a single column by the scalar α. Then \mathop{ det} \left (B\right ) = α\mathop{ det} \left (A\right ).

Proof   Suppose that A is a square matrix of size n and we form the square matrix B by multiplying each entry of row i of A by α. Notice that the other rows of A and B are equal, so A\left (i|j\right ) = B\left (i|j\right ), for all 1 ≤ j ≤ n. We compute \mathop{ det} \left (B\right ) via expansion about row i.

\eqalignno{ \mathop{ det} \left (B\right ) & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{i+j}{\left [B\right ]}_{ ij}\mathop{ det} \left (B\left (i|j\right )\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#theorem.DER")Theorem DER@(/a)} & & & & \cr & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{i+j}{\left [B\right ]}_{ ij}\mathop{ det} \left (A\left (i|j\right )\right ) & &\text{Hypothesis} & & & & \cr & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{i+j}α{\left [A\right ]}_{ ij}\mathop{ det} \left (A\left (i|j\right )\right ) & &\text{Hypothesis} & & & & \cr & = α{\mathop{∑ }}_{j=1}^{n}{(−1)}^{i+j}{\left [A\right ]}_{ ij}\mathop{ det} \left (A\left (i|j\right )\right ) & & & & \cr & = α\mathop{ det} \left (A\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#theorem.DER")Theorem DER@(/a)} & & & & \cr & & & & }

The proof for the case of a multiple of a column is entirely similar, or could be derived from an application of Theorem DT employing the transpose of the matrix.

Let’s go for understanding the effect of all three row operations. But first we need an intermediate result, but it is an easy one.

Theorem DERC
Determinant with Equal Rows or Columns
Suppose that A is a square matrix with two equal rows, or two equal columns. Then \mathop{ det} \left (A\right ) = 0.

Proof   Suppose that A is a square matrix of size n where the two rows s and t are equal. Form the matrix B by swapping rows s and t. Notice that as a consequence of our hypothesis, A = B. Then

\eqalignno{ \mathop{ det} \left (A\right ) & = {1\over 2}\left (\mathop{det} \left (A\right ) +\mathop{ det} \left (A\right )\right ) & & & & \cr & = {1\over 2}\left (\mathop{det} \left (A\right ) −\mathop{ det} \left (B\right )\right ) & &\text{@(a href="#theorem.DRCS")Theorem DRCS@(/a)} & & & & \cr & = {1\over 2}\left (\mathop{det} \left (A\right ) −\mathop{ det} \left (A\right )\right ) & &\text{Hypothesis, $A = B$} & & & & \cr & = {1\over 2}\kern 1.95872pt (0) = 0 & & & & }

The proof for the case of two equal columns is entirely similar, or could be derived from an application of Theorem DT employing the transpose of the matrix.

Now explain the third row operation. Here we go.

Theorem DRCMA
Determinant for Row or Column Multiples and Addition
Suppose that A is a square matrix. Let B be the square matrix obtained from A by multiplying a row by the scalar α and then adding it to another row, or by multiplying a column by the scalar α and then adding it to another column. Then \mathop{ det} \left (B\right ) =\mathop{ det} \left (A\right ).

Proof   Suppose that A is a square matrix of size n. Form the matrix B by multiplying row s by α and adding it to row t. Let C be the auxiliary matrix where we replace row t of A by row s of A. Notice that A\left (t|j\right ) = B\left (t|j\right ) = C\left (t|j\right ) for all 1 ≤ j ≤ n. We compute the determinant of B by expansion about row t.

\eqalignno{ \mathop{ det} \left (B\right ) & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{t+j}{\left [B\right ]}_{ tj}\mathop{ det} \left (B\left (t|j\right )\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#theorem.DER")Theorem DER@(/a)} & & & & \cr & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{t+j}\left (α{\left [A\right ]}_{ sj} +{ \left [A\right ]}_{tj}\right )\mathop{ det} \left (B\left (t|j\right )\right ) & &\text{Hypothesis} & & & & \cr & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{t+j}α{\left [A\right ]}_{ sj}\mathop{ det} \left (B\left (t|j\right )\right ) & & & & \cr &\quad \quad +{ \mathop{∑ }}_{j=1}^{n}{(−1)}^{t+j}{\left [A\right ]}_{ tj}\mathop{ det} \left (B\left (t|j\right )\right ) & & & & \cr & = α{\mathop{∑ }}_{j=1}^{n}{(−1)}^{t+j}{\left [A\right ]}_{ sj}\mathop{ det} \left (B\left (t|j\right )\right ) & & & & \cr &\quad \quad +{ \mathop{∑ }}_{j=1}^{n}{(−1)}^{t+j}{\left [A\right ]}_{ tj}\mathop{ det} \left (B\left (t|j\right )\right ) & & & & \cr & = α{\mathop{∑ }}_{j=1}^{n}{(−1)}^{t+j}{\left [C\right ]}_{ tj}\mathop{ det} \left (C\left (t|j\right )\right ) & & & & \cr &\quad \quad +{ \mathop{∑ }}_{j=1}^{n}{(−1)}^{t+j}{\left [A\right ]}_{ tj}\mathop{ det} \left (A\left (t|j\right )\right ) & & & & \cr & = α\mathop{ det} \left (C\right ) +\mathop{ det} \left (A\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#theorem.DER")Theorem DER@(/a)} & & & & \cr & = α\kern 1.95872pt 0 +\mathop{ det} \left (A\right ) =\mathop{ det} \left (A\right ) & &\text{@(a href="#theorem.DERC")Theorem DERC@(/a)} & & & & }

The proof for the case of adding a multiple of a column is entirely similar, or could be derived from an application of Theorem DT employing the transpose of the matrix.

Is this what you expected? We could argue that the third row operation is the most popular, and yet it has no effect whatsoever on the determinant of a matrix! We can exploit this, along with our understanding of the other two row operations, to provide another approach to computing a determinant. We’ll explain this in the context of an example.

Example DRO
Determinant by row operations
Suppose we desire the determinant of the 4 × 4 matrix

A = \left [\array{ 2 &0& 2 &3\cr 1 &3 &−1 &1 \cr −1&1&−1&2\cr 3 &5 & 4 &0 } \right ]

We will perform a sequence of row operations on this matrix, shooting for an upper triangular matrix, whose determinant will be simply the product of its diagonal entries. For each row operation, we will track the effect on the determinant via Theorem DRCS, Theorem DRCM, Theorem DRCMA.

\eqalignno{ \mathop{\longrightarrow}\limits_{}^{{R}_{1} ↔ {R}_{2}}{A}_{1} & = \left [\array{ 1 &3&−1&1\cr 2 &0 & 2 &3 \cr −1&1&−1&2\cr 3 &5 & 4 &0 } \right ]&\mathop{ det} \left (A\right )& = −\mathop{ det} \left ({A}_{1}\right ) &&\text{@(a href="#theorem.DRCS")Theorem DRCS@(/a)} &&&&&& \cr \mathop{\longrightarrow}\limits_{}^{ − 2{R}_{1} + {R}_{2}}{A}_{2} & = \left [\array{ 1 & 3 &−1&1\cr 0 &−6 & 4 &1 \cr −1& 1 &−1&2\cr 3 & 5 & 4 &0 } \right ]& & = −\mathop{ det} \left ({A}_{2}\right ) &&\text{@(a href="#theorem.DRCMA")Theorem DRCMA@(/a)}&&&&&& \cr \mathop{\longrightarrow}\limits_{}^{1{R}_{1} + {R}_{3}}{A}_{3} & = \left [\array{ 1& 3 &−1&1\cr 0&−6 & 4 &1 \cr 0& 4 &−2&3\cr 3& 5 & 4 &0 } \right ]& & = −\mathop{ det} \left ({A}_{3}\right ) &&\text{@(a href="#theorem.DRCMA")Theorem DRCMA@(/a)}&&&&&& \cr \mathop{\longrightarrow}\limits_{}^{ − 3{R}_{1} + {R}_{4}}{A}_{4} & = \left [\array{ 1& 3 &−1& 1\cr 0&−6 & 4 & 1 \cr 0& 4 &−2& 3\cr 0&−4 & 7 &−3 } \right ]& & = −\mathop{ det} \left ({A}_{4}\right ) &&\text{@(a href="#theorem.DRCMA")Theorem DRCMA@(/a)}&&&&&& \cr \mathop{\longrightarrow}\limits_{}^{1{R}_{3} + {R}_{2}}{A}_{5} & = \left [\array{ 1& 3 &−1& 1\cr 0&−2 & 2 & 4 \cr 0& 4 &−2& 3\cr 0&−4 & 7 &−3 } \right ]& & = −\mathop{ det} \left ({A}_{5}\right ) &&\text{@(a href="#theorem.DRCMA")Theorem DRCMA@(/a)}&&&&&& \cr \mathop{\longrightarrow}\limits_{}^{ −{1\over 2}{R}_{2}}{A}_{6} & = \left [\array{ 1& 3 &−1& 1\cr 0& 1 &−1 &−2 \cr 0& 4 &−2& 3\cr 0&−4 & 7 &−3 } \right ]& & = 2\mathop{ det} \left ({A}_{6}\right ) &&\text{@(a href="#theorem.DRCM")Theorem DRCM@(/a)} &&&&&& \cr \mathop{\longrightarrow}\limits_{}^{ − 4{R}_{2} + {R}_{3}}{A}_{7} & = \left [\array{ 1& 3 &−1& 1\cr 0& 1 &−1 &−2 \cr 0& 0 & 2 &11\cr 0&−4 & 7 &−3 } \right ]& & = 2\mathop{ det} \left ({A}_{7}\right ) &&\text{@(a href="#theorem.DRCMA")Theorem DRCMA@(/a)}&&&&&& \cr \mathop{\longrightarrow}\limits_{}^{4{R}_{2} + {R}_{4}}{A}_{8} & = \left [\array{ 1&3&−1& 1\cr 0&1 &−1 & −2 \cr 0&0& 2 & 11\cr 0&0 & 3 &−11 } \right ]& & = 2\mathop{ det} \left ({A}_{8}\right ) &&\text{@(a href="#theorem.DRCMA")Theorem DRCMA@(/a)}&&&&&& \cr \mathop{\longrightarrow}\limits_{}^{ − 1{R}_{3} + {R}_{4}}{A}_{9} & = \left [\array{ 1&3&−1& 1\cr 0&1 &−1 & −2 \cr 0&0& 2 & 11\cr 0&0 & 1 &−22 } \right ]& & = 2\mathop{ det} \left ({A}_{9}\right ) &&\text{@(a href="#theorem.DRCMA")Theorem DRCMA@(/a)}&&&&&& \cr \mathop{\longrightarrow}\limits_{}^{ − 2{R}_{4} + {R}_{3}}{A}_{10}& = \left [\array{ 1&3&−1& 1\cr 0&1 &−1 & −2 \cr 0&0& 0 & 55\cr 0&0 & 1 &−22 } \right ]& & = 2\mathop{ det} \left ({A}_{10}\right ) &&\text{@(a href="#theorem.DRCMA")Theorem DRCMA@(/a)}&&&&&& \cr \mathop{\longrightarrow}\limits_{}^{{R}_{3} ↔ {R}_{4}}{A}_{11} & = \left [\array{ 1&3&−1& 1\cr 0&1 &−1 & −2 \cr 0&0& 1 &−22\cr 0&0 & 0 & 55 } \right ]& & = −2\mathop{ det} \left ({A}_{11}\right ) &&\text{@(a href="#theorem.DRCS")Theorem DRCS@(/a)} &&&&&& \cr \mathop{\longrightarrow}\limits_{}^{ {1\over 55}{R}_{4}}{A}_{12} & = \left [\array{ 1&3&−1& 1\cr 0&1 &−1 & −2 \cr 0&0& 1 &−22\cr 0&0 & 0 & 1 } \right ]& & = −110\mathop{ det} \left ({A}_{12}\right )&&\text{@(a href="#theorem.DRCM")Theorem DRCM@(/a)} &&&&&& \cr & & & && & }

The matrix {A}_{12} is upper triangular, so expansion about the first column (repeatedly) will result in \mathop{ det} \left ({A}_{12}\right ) = (1)(1)(1)(1) = 1 (see Example DUTM) and thus, \mathop{ det} \left (A\right ) = −110(1) = −110.

Notice that our sequence of row operations was somewhat ad hoc, such as the transformation to {A}_{5}. We could have been even more methodical, and strictly followed the process that converts a matrix to reduced row-echelon form (Theorem REMEF), eventually achieving the same numerical result with a final matrix that equaled the 4 × 4 identity matrix. Notice too that we could have stopped with {A}_{8}, since at this point we could compute \mathop{ det} \left ({A}_{8}\right ) by two expansions about first columns, followed by a simple determinant of a 2 × 2 matrix (Theorem DMST).

The beauty of this approach is that computationally we should already have written a procedure to convert matrices to reduced-row echelon form, so all we need to do is track the multiplicative changes to the determinant as the algorithm proceeds. Further, for a square matrix of size n this approach requires on the order of {n}^{3} multiplications, while a recursive application of expansion about a row or column (Theorem DER, Theorem DEC) will require in the vicinity of (n − 1)(n!) multiplications. So even for very small matrices, a computational approach utilizing row operations will have superior run-time. Tracking, and controlling, the effects of round-off errors is another story, best saved for a numerical linear algebra course.

Subsection DROEM: Determinants, Row Operations, Elementary Matrices

As a final preparation for our two most important theorems about determinants, we prove a handful of facts about the interplay of row operations and matrix multiplication with elementary matrices with regard to the determinant. But first, a simple, but crucial, fact about the identity matrix.

Theorem DIM
Determinant of the Identity Matrix
For every n ≥ 1, \mathop{ det} \left ({I}_{n}\right ) = 1.

Proof   It may be overkill, but this is a good situation to run through a proof by induction on n (Technique I). Is the result true when n = 1? Yes,

\eqalignno{ \mathop{ det} \left ({I}_{1}\right ) & ={ \left [{I}_{1}\right ]}_{11} & &\text{@(a href="fcla-jsmath-2.23li44.html#definition.DM")Definition DM@(/a)} & & & & \cr & = 1 & &\text{@(a href="fcla-jsmath-2.23li21.html#definition.IM")Definition IM@(/a)} & & & & \cr & & & & }

Now assume the theorem is true for the identity matrix of size n − 1 and investigate the determinant of the identity matrix of size n with expansion about row 1,

\eqalignno{ \mathop{ det} \left ({I}_{n}\right ) & ={ \mathop{∑ }}_{j=1}^{n}{(−1)}^{1+j}{\left [{I}_{ n}\right ]}_{1j}\mathop{ det} \left ({I}_{n}\left (1|j\right )\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#definition.DM")Definition DM@(/a)} & & & & \cr & = {(−1)}^{1+1}{\left [{I}_{ n}\right ]}_{11}\mathop{ det} \left ({I}_{n}\left (1|1\right )\right ) & & & & \cr &\quad \quad +{ \mathop{∑ }}_{j=2}^{n}{(−1)}^{1+j}{\left [{I}_{ n}\right ]}_{1j}\mathop{ det} \left ({I}_{n}\left (1|j\right )\right ) & & & & \cr & = 1\mathop{ det} \left ({I}_{n−1}\right ) +{ \mathop{∑ }}_{j=2}^{n}{(−1)}^{1+j}\kern 1.95872pt 0\kern 1.95872pt \mathop{ det} \left ({I}_{ n}\left (1|j\right )\right ) & &\text{@(a href="fcla-jsmath-2.23li21.html#definition.IM")Definition IM@(/a)} & & & & \cr & = 1(1) +{ \mathop{∑ }}_{j=2}^{n}\kern 1.95872pt 0 = 1 & &\text{Induction Hypothesis} & & & & \cr & & & & }

Theorem DEM
Determinants of Elementary Matrices
For the three possible versions of an elementary matrix (Definition ELEM) we have the determinants,

  1. \mathop{ det} \left ({E}_{i,j}\right ) = −1
  2. \mathop{ det} \left ({E}_{i}\left (α\right )\right ) = α
  3. \mathop{ det} \left ({E}_{i,j}\left (α\right )\right ) = 1

Proof   Swapping rows i and j of the identity matrix will create {E}_{i,j} (Definition ELEM), so

\eqalignno{ \mathop{ det} \left ({E}_{i,j}\right ) & = −\mathop{ det} \left ({I}_{n}\right ) & &\text{@(a href="#theorem.DRCS")Theorem DRCS@(/a)} & & & & \cr & = −1 & &\text{@(a href="#theorem.DIM")Theorem DIM@(/a)} & & & & \cr & & & & }

Multiplying row i of the identity matrix by α will create {E}_{i}\left (α\right ) (Definition ELEM), so

\eqalignno{ \mathop{ det} \left ({E}_{i}\left (α\right )\right ) & = α\mathop{ det} \left ({I}_{n}\right ) & &\text{@(a href="#theorem.DRCM")Theorem DRCM@(/a)} & & & & \cr & = α(1) = α & &\text{@(a href="#theorem.DIM")Theorem DIM@(/a)} & & & & \cr & & & & }

Multiplying row i of the identity matrix by α and adding to row j will create {E}_{i,j}\left (α\right ) (Definition ELEM), so

\eqalignno{ \mathop{ det} \left ({E}_{i,j}\left (α\right )\right ) & =\mathop{ det} \left ({I}_{n}\right ) & &\text{@(a href="#theorem.DRCMA")Theorem DRCMA@(/a)} & & & & \cr & = 1 & &\text{@(a href="#theorem.DIM")Theorem DIM@(/a)} & & & & \cr & & & & }

Theorem DEMMM
Determinants, Elementary Matrices, Matrix Multiplication
Suppose that A is a square matrix of size n and E is any elementary matrix of size n. Then

\mathop{det} \left (EA\right ) =\mathop{ det} \left (E\right )\mathop{ det} \left (A\right )

Proof   The proof procedes in three parts, one for each type of elementary matrix, with each part very similar to the other two. First, let B be the matrix obtained from A by swapping rows i and j,

\eqalignno{ \mathop{ det} \left ({E}_{i,j}A\right ) & =\mathop{ det} \left (B\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#theorem.EMDRO")Theorem EMDRO@(/a)} & & & & \cr & = −\mathop{ det} \left (A\right ) & &\text{@(a href="#theorem.DRCS")Theorem DRCS@(/a)} & & & & \cr & =\mathop{ det} \left ({E}_{i,j}\right )\mathop{ det} \left (A\right ) & &\text{@(a href="#theorem.DEM")Theorem DEM@(/a)} & & & & }

Second, let B be the matrix obtained from A by multiplying row i by α,

\eqalignno{ \mathop{ det} \left ({E}_{i}\left (α\right )A\right ) & =\mathop{ det} \left (B\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#theorem.EMDRO")Theorem EMDRO@(/a)} & & & & \cr & = α\mathop{ det} \left (A\right ) & &\text{@(a href="#theorem.DRCM")Theorem DRCM@(/a)} & & & & \cr & =\mathop{ det} \left ({E}_{i}\left (α\right )\right )\mathop{ det} \left (A\right ) & &\text{@(a href="#theorem.DEM")Theorem DEM@(/a)} & & & & }

Third, let B be the matrix obtained from A by multiplying row i by α and adding to row j,

\eqalignno{ \mathop{ det} \left ({E}_{i,j}\left (α\right )A\right ) & =\mathop{ det} \left (B\right ) & &\text{@(a href="fcla-jsmath-2.23li44.html#theorem.EMDRO")Theorem EMDRO@(/a)} & & & & \cr & =\mathop{ det} \left (A\right ) & &\text{@(a href="#theorem.DRCMA")Theorem DRCMA@(/a)} & & & & \cr & =\mathop{ det} \left ({E}_{i,j}\left (α\right )\right )\mathop{ det} \left (A\right ) & &\text{@(a href="#theorem.DEM")Theorem DEM@(/a)} & & & & }

Since the desired result holds for each variety of elementary matrix individually, we are done.

Subsection DNMMM: Determinants, Nonsingular Matrices, Matrix Multiplication

If you asked someone with substantial experience working with matrices about the value of the determinant, they’d be likely to quote the following theorem as the first thing to come to mind.

Theorem SMZD
Singular Matrices have Zero Determinants
Let A be a square matrix. Then A is singular if and only if \mathop{ det} \left (A\right ) = 0.

Proof   Rather than jumping into the two halves of the equivalence, we first establish a few items. Let B be the unique square matrix that is row-equivalent to A and in reduced row-echelon form (Theorem REMEF, Theorem RREFU). For each of the row operations that converts B into A, there is an elementary matrix {E}_{i} which effects the row operation by matrix multiplication (Theorem EMDRO). Repeated applications of Theorem EMDRO allow us to write

A = {E}_{s}{E}_{s−1}\mathop{\mathop{…}}{E}_{2}{E}_{1}B

Then

\eqalignno{ \mathop{ det} \left (A\right )& =\mathop{ det} \left ({E}_{s}{E}_{s−1}\mathop{\mathop{…}}{E}_{2}{E}_{1}B\right ) && && \cr & =\mathop{ det} \left ({E}_{s}\right )\mathop{ det} \left ({E}_{s−1}\right )\mathop{\mathop{…}}\mathop{ det} \left ({E}_{2}\right )\mathop{ det} \left ({E}_{1}\right )\mathop{ det} \left (B\right )&&\text{@(a href="#theorem.DEMMM")Theorem DEMMM@(/a)}&&&& }

From Theorem DEM we can infer that the determinant of an elementary matrix is never zero (note the ban on α = 0 for {E}_{i}\left (α\right ) in Definition ELEM). So the product on the right is composed of nonzero scalars, with the possible exception of \mathop{ det} \left (B\right ). More precisely, we can argue that \mathop{ det} \left (A\right ) = 0 if and only if \mathop{ det} \left (B\right ) = 0. With this established, we can take up the two halves of the equivalence.

(⇒) If A is singular, then by Theorem NMRRI, B cannot be the identity matrix. Because (1) the number of pivot columns is equal to the number of nonzero rows, (2) not every column is a pivot column, and (3) B is square, we see that B must have a zero row. By Theorem DZRC the determinant of B is zero, and by the above, we conclude that the determinant of A is zero.

(⇐) We will prove the contrapositive (Technique CP). So assume A is nonsingular, then by Theorem NMRRI, B is the identity matrix and Theorem DIM tells us that \mathop{ det} \left (B\right ) = 1\mathrel{≠}0. With the argument above, we conclude that the determinant of A is nonzero as well.

For the case of 2 × 2 matrices you might compare the application of Theorem SMZD with the combination of the results stated in Theorem DMST and Theorem TTMI.

Example ZNDAB
Zero and nonzero determinant, Archetypes A and B
The coefficient matrix in Archetype A has a zero determinant (check this!) while the coefficient matrix Archetype B has a nonzero determinant (check this, too). These matrices are singular and nonsingular, respectively. This is exactly what Theorem SMZD says, and continues our list of contrasts between these two archetypes.

Since Theorem SMZD is an equivalence (Technique E) we can expand on our growing list of equivalences about nonsingular matrices. The addition of the condition \mathop{ det} \left (A\right )\mathrel{≠}0 is one of the best motivations for learning about determinants.

Theorem NME7
Nonsingular Matrix Equivalences, Round 7
Suppose that A is a square matrix of size n. The following are equivalent.

  1. A is nonsingular.
  2. A row-reduces to the identity matrix.
  3. The null space of A contains only the zero vector, N\kern -1.95872pt \left (A\right ) = \left \{0\right \}.
  4. The linear system ℒS\kern -1.95872pt \left (A,\kern 1.95872pt b\right ) has a unique solution for every possible choice of b.
  5. The columns of A are a linearly independent set.
  6. A is invertible.
  7. The column space of A is {ℂ}^{n}, C\kern -1.95872pt \left (A\right ) = {ℂ}^{n}.
  8. The columns of A are a basis for {ℂ}^{n}.
  9. The rank of A is n, r\left (A\right ) = n.
  10. The nullity of A is zero, n\left (A\right ) = 0.
  11. The determinant of A is nonzero, \mathop{ det} \left (A\right )\mathrel{≠}0.

Proof   Theorem SMZD says A is singular if and only if \mathop{ det} \left (A\right ) = 0. If we negate each of these statements, we arrive at two contrapositives that we can combine as the equivalence, A is nonsingular if and only if \mathop{ det} \left (A\right )\mathrel{≠}0. This allows us to add a new statement to the list found in Theorem NME6.

Computationally, row-reducing a matrix is the most efficient way to determine if a matrix is nonsingular, though the effect of using division in a computer can lead to round-off errors that confuse small quantities with critical zero quantities. Conceptually, the determinant may seem the most efficient way to determine if a matrix is nonsingular. The definition of a determinant uses just addition, subtraction and multiplication, so division is never a problem. And the final test is easy: is the determinant zero or not? However, the number of operations involved in computing a determinant by the definition very quickly becomes so excessive as to be impractical.

Now for the coup de grâce. We will generalize Theorem DEMMM to the case of any two square matrices. You may recall thinking that matrix multiplication was defined in a needlessly complicated manner. For sure, the definition of a determinant seems even stranger. (Though Theorem SMZD might be forcing you to reconsider.) Read the statement of the next theorem and contemplate how nicely matrix multiplication and determinants play with each other.

Theorem DRMM
Determinant Respects Matrix Multiplication
Suppose that A and B are square matrices of the same size. Then \mathop{ det} \left (AB\right ) =\mathop{ det} \left (A\right )\mathop{ det} \left (B\right ).

Proof   This proof is constructed in two cases. First, suppose that A is singular. Then \mathop{ det} \left (A\right ) = 0 by Theorem SMZD. By the contrapositive of Theorem NPNT, AB is singular as well. So by a second application ofTheorem SMZD, \mathop{ det} \left (AB\right ) = 0. Putting it all together

\eqalignno{ \mathop{ det} \left (AB\right ) = 0 = 0\kern 1.95872pt \mathop{ det} \left (B\right ) =\mathop{ det} \left (A\right )\mathop{ det} \left (B\right ) & & }

as desired.

For the second case, suppose that A is nonsingular. By Theorem NMPEM there are elementary matrices {E}_{1},\kern 1.95872pt {E}_{2},\kern 1.95872pt {E}_{3},\kern 1.95872pt \mathop{\mathop{…}},\kern 1.95872pt {E}_{s} such that A = {E}_{1}{E}_{2}{E}_{3}\mathop{\mathop{…}}{E}_{s}. Then

\eqalignno{ \mathop{ det} \left (AB\right )& =\mathop{ det} \left ({E}_{1}{E}_{2}{E}_{3}\mathop{\mathop{…}}{E}_{s}B\right ) && && \cr & =\mathop{ det} \left ({E}_{1}\right )\mathop{ det} \left ({E}_{2}\right )\mathop{ det} \left ({E}_{3}\right )\mathop{\mathop{…}}\mathop{ det} \left ({E}_{s}\right )\mathop{ det} \left (B\right )&&\text{@(a href="#theorem.DEMMM")Theorem DEMMM@(/a)}&&&& \cr & =\mathop{ det} \left ({E}_{1}{E}_{2}{E}_{3}\mathop{\mathop{…}}{E}_{s}\right )\mathop{ det} \left (B\right ) &&\text{@(a href="#theorem.DEMMM")Theorem DEMMM@(/a)}&&&& \cr & =\mathop{ det} \left (A\right )\mathop{ det} \left (B\right ) && && }

It is amazing that matrix multiplication and the determinant interact this way. Might it also be true that \mathop{ det} \left (A + B\right ) =\mathop{ det} \left (A\right ) +\mathop{ det} \left (B\right )? (See Exercise PDM.M30.)

Subsection READ: Reading Questions

  1. Consider the two matrices below, and suppose you already have computed \mathop{ det} \left (A\right ) = −120. What is \mathop{ det} \left (B\right )? Why?
    \eqalignno{ A & = \left [\array{ 0 & 8 & 3 &−4\cr −1 & 2 &−2 & 5 \cr −2& 8 & 4 & 3\cr 0 &−4 & 2 &−3 } \right ] &B & = \left [\array{ 0 & 8 & 3 &−4\cr 0 &−4 & 2 &−3 \cr −2& 8 & 4 & 3\cr −1 & 2 &−2 & 5 } \right ] & & & & }
  2. State the theorem that allows us to make yet another extension to our NMEx series of theorems.
  3. What is amazing about the interaction between matrix multiplication and the determinant?

Subsection EXC: Exercises

C30 Each of the archetypes below is a system of equations with a square coefficient matrix, or is a square matrix itself. Compute the determinant of each matrix, noting how Theorem SMZD indicates when the matrix is singular or nonsingular.
Archetype A
Archetype B
Archetype F
Archetype K
Archetype L
 
Contributed by Robert Beezer

M20 Construct a 3 × 3 nonsingular matrix and call it A. Then, for each entry of the matrix, compute the corresponding cofactor, and create a new 3 × 3 matrix full of these cofactors by placing the cofactor of an entry in the same location as the entry it was based on. Once complete, call this matrix C. Compute A{C}^{t}. Any observations? Repeat with a new matrix, or perhaps with a 4 × 4 matrix.  
Contributed by Robert Beezer Solution [1218]

M30 Construct an example to show that the following statement is not true for all square matrices A and B of the same size: \mathop{ det} \left (A + B\right ) =\mathop{ det} \left (A\right ) +\mathop{ det} \left (B\right ).  
Contributed by Robert Beezer

T10 Theorem NPNT says that if the product of square matrices AB is nonsingular, then the individual matrices A and B are nonsingular also. Construct a new proof of this result making use of theorems about determinants of matrices.  
Contributed by Robert Beezer

T15 Use Theorem DRCM to prove Theorem DZRC as a corollary. (See Technique LC.)  
Contributed by Robert Beezer

T20 Suppose that A is a square matrix of size n and α ∈ {ℂ}^{} is a scalar. Prove that \mathop{ det} \left (αA\right ) = {α}^{n}\mathop{ det} \left (A\right ).  
Contributed by Robert Beezer

T25 Employ Theorem DT to construct the second half of the proof of Theorem DRCM (the portion about a multiple of a column).  
Contributed by Robert Beezer

Subsection SOL: Solutions

M20 Contributed by Robert Beezer Statement [1216]
The result of these computations should be a matrix with the value of \mathop{ det} \left (A\right ) in the diagonal entries and zeros elsewhere. The suggestion of using a nonsingular matrix was partially so that it was obvious that the value of the determinant appears on the diagonal.

This result (which is true in general) provides a method for computing the inverse of a nonsingular matrix. Since A{C}^{t} =\mathop{ det} \left (A\right ){I}_{ n}, we can multiply by the reciprocal of the determinant (which is nonzero!) and the inverse of A (it exists!) to arrive at an expression for the matrix inverse:

{A}^{−1} = {1\over \mathop{det} \left (A\right )}{C}^{t}

Annotated Acronyms D: Determinants

Theorem EMDRO
The main purpose of elementary matrices is to provide a more formal foundation for row operations. With this theorem we can convert the notion of “doing a row operation” into the slightly more precise, and tractable, operation of matrix multiplication by an elementary matrix. The other big results in this chapter are made possible by this connection and our previous understanding of the behavior of matrix multiplication (such as results in Section MM).

Theorem DER
We define the determinant by expansion about the first row and then prove you can expand about any row (and with Theorem DEC, about any column). Amazing. If the determinant seems contrived, these results might begin to convince you that maybe something interesting is going on.

Theorem DRMM
Theorem EMDRO connects elementary matrices with matrix multiplication. Now we connect determinants with matrix multiplication. If you thought the definition of matrix multiplication (as exemplified by Theorem EMP) was as outlandish as the definition of the determinant, then no more. They seem to play together quite nicely.

Theorem SMZD
This theorem provides a simple test for nonsingularity, even though it is stated and titled as a theorem about singularity. It’ll be helpful, especially in concert with Theorem DRMM, in establishing upcoming results about nonsingular matrices or creating alternative proofs of earlier results. You might even use this theorem as an indicator of how often a matrix is singular. Create a square matrix at random — what are the odds it is singular? This theorem says the determinant has to be zero, which we might suspect is a rare occurrence. Of course, we have to be a lot more careful about words like “random,” “odds,” and “rare” if we want precise answers to this question.