Section RREF Reduced Row-Echelon Form

From A First Course in Linear Algebra
Version 2.99
© 2004.
Licensed under the GNU Free Documentation License.
http://linear.ups.edu/

After solving a few systems of equations, you will recognize that it doesn’t matter so much what we call our variables, as opposed to what numbers act as their coefficients. A system in the variables {x}_{1},\kern 1.95872pt {x}_{2},\kern 1.95872pt {x}_{3} would behave the same if we changed the names of the variables to a,\kern 1.95872pt b,\kern 1.95872pt c and kept all the constants the same and in the same places. In this section, we will isolate the key bits of information about a system of equations into something called a matrix, and then use this matrix to systematically solve the equations. Along the way we will obtain one of our most important and useful computational tools.

Subsection MVNSE: Matrix and Vector Notation for Systems of Equations

Definition M
Matrix
An m × n matrix is a rectangular layout of numbers from {ℂ}^{} having m rows and n columns. We will use upper-case Latin letters from the start of the alphabet (A,\kern 1.95872pt B,\kern 1.95872pt C,\mathop{\mathop{…}}) to denote matrices and squared-off brackets to delimit the layout. Many use large parentheses instead of brackets — the distinction is not important. Rows of a matrix will be referenced starting at the top and working down (i.e. row 1 is at the top) and columns will be referenced starting from the left (i.e. column 1 is at the left). For a matrix A, the notation {\left [A\right ]}_{ij} will refer to the complex number in row i and column j of A.

Be careful with this notation for individual entries, since it is easy to think that {\left [A\right ]}_{ij} refers to the whole matrix. It does not. It is just a number, but is a convenient way to talk about the individual entries simultaneously. This notation will get a heavy workout once we get to Chapter M.

is a matrix with m = 3 rows and n = 4 columns. We can say that {\left [B\right ]}_{2,3} = −6 while {\left [B\right ]}_{3,4} = −2. ⊠

Some mathematical software is very particular about which types of numbers (integers, rationals, reals, complexes) you wish to work with. See: Computation R.SAGE A calculator or computer language can be a convenient way to perform calculations with matrices. But first you have to enter the matrix. See: Computation ME.MMA Computation ME.TI86 Computation ME.TI83 Computation ME.SAGE When we do equation operations on system of equations, the names of the variables really aren’t very important. {x}_{1}, {x}_{2}, {x}_{3}, or a, b, c, or x, y, z, it really doesn’t matter. In this subsection we will describe some notation that will make it easier to describe linear systems, solve the systems and describe the solution sets. Here is a list of definitions, laden with notation.

Definition CV
Column Vector
A column vector of size m is an ordered list of m numbers, which is written in order vertically, starting at the top and proceeding to the bottom. At times, we will refer to a column vector as simply a vector. Column vectors will be written in bold, usually with lower case Latin letter from the end of the alphabet such as u, v, w, x, y, z. Some books like to write vectors with arrows, such as \vec{u}. Writing by hand, some like to put arrows on top of the symbol, or a tilde underneath the symbol, as in \mathop{u}\limits_{ ∼}. To refer to the entry or component that is number i in the list that is the vector v we write {\left [v\right ]}_{i}.

Be careful with this notation. While the symbols {\left [v\right ]}_{i} might look somewhat substantial, as an object this represents just one component of a vector, which is just a single complex number.

Definition ZCV
Zero Column Vector
The zero vector of size m is the column vector of size m where each entry is the number zero,

\eqalignno{ 0 = \left [\array{ 0\cr 0 \cr 0\cr \mathop{\mathop{⋮}} \cr 0 } \right ] & & }

\eqalignno{ {a}_{11}{x}_{1} + {a}_{12}{x}_{2} + {a}_{13}{x}_{3} + \mathrel{⋯} + {a}_{1n}{x}_{n} & = {b}_{1} & & \cr {a}_{21}{x}_{1} + {a}_{22}{x}_{2} + {a}_{23}{x}_{3} + \mathrel{⋯} + {a}_{2n}{x}_{n} & = {b}_{2} & & \cr {a}_{31}{x}_{1} + {a}_{32}{x}_{2} + {a}_{33}{x}_{3} + \mathrel{⋯} + {a}_{3n}{x}_{n} & = {b}_{3} & & \cr \mathop{\mathop{⋮}} & & & \cr {a}_{m1}{x}_{1} + {a}_{m2}{x}_{2} + {a}_{m3}{x}_{3} + \mathrel{⋯} + {a}_{mn}{x}_{n} & = {b}_{m} & & }

The solution vector may do double-duty on occasion. It might refer to a list of variable quantities at one point, and subsequently refer to values of those variables that actually form a particular solution to that system.

Definition MRLS
Matrix Representation of a Linear System
If A is the coefficient matrix of a system of linear equations and b is the vector of constants, then we will write ℒS\kern -1.95872pt \left (A,\kern 1.95872pt b\right ) as a shorthand expression for the system of linear equations, which we will refer to as the matrix representation of the linear system.

\eqalignno{ 2{x}_{1} + 4{x}_{2} − 3{x}_{3} + 5{x}_{4} + {x}_{5} & = 9 & & \cr 3{x}_{1} + {x}_{2} + \quad \quad {x}_{4} − 3{x}_{5} & = 0 & & \cr − 2{x}_{1} + 7{x}_{2} − 5{x}_{3} + 2{x}_{4} + 2{x}_{5} & = −3 & & }

and so will be referenced as ℒS\kern -1.95872pt \left (A,\kern 1.95872pt b\right ). ⊠

Definition AM
Augmented Matrix
Suppose we have a system of m equations in n variables, with coefficient matrix A and vector of constants b. Then the augmented matrix of the system of equations is the m × (n + 1) matrix whose first n columns are the columns of A and whose last column (number n + 1) is the column vector b. This matrix will be written as \left [\left .A\kern 1.95872pt \right \vert \kern 1.95872pt b\right ].

The augmented matrix represents all the important information in the system of equations, since the names of the variables have been ignored, and the only connection with the variables is the location of their coefficients in the matrix. It is important to realize that the augmented matrix is just that, a matrix, and not a system of equations. In particular, the augmented matrix does not have any “solutions,” though it will be useful for finding solutions to the system of equations that it is associated with. (Think about your objects, and review Technique L.) However, notice that an augmented matrix always belongs to some system of equations, and vice versa, so it is tempting to try and blur the distinction between the two. Here’s a quick example.

\eqalignno{ {x}_{1} − {x}_{2} + 2{x}_{3} & = 1 & & \cr 2{x}_{1} + {x}_{2} + {x}_{3} & = 8 & & \cr {x}_{1} + {x}_{2} & = 5 & & }

\eqalignno{ \left [\array{ 1&−1&2&1\cr 2& 1 &1 &8 \cr 1& 1 &0&5 } \right ] & & }

Subsection RO: Row Operations

An augmented matrix for a system of equations will save us the tedium of continually writing down the names of the variables as we solve the system. It will also release us from any dependence on the actual names of the variables. We have seen how certain operations we can perform on equations (Definition EO) will preserve their solutions (Theorem EOPSS). The next two definitions and the following theorem carry over these ideas to augmented matrices.

Definition RO
Row Operations
The following three operations will transform an m × n matrix into a different matrix of the same size, and each is known as a row operation.

Definition REM
Row-Equivalent Matrices
Two matrices, A and B, are row-equivalent if one can be obtained from the other by a sequence of row operations. △

\eqalignno{ A = \left [\array{ 2&−1& 3 &4\cr 5& 2 &−2 &3 \cr 1& 1 & 0 &6 } \right ] & &B = \left [\array{ 1& 1 & 0 & 6\cr 3& 0 &−2 &−9 \cr 2&−1& 3 & 4 } \right ] & & & & }

\eqalignno{ \left [\array{ 2&−1& 3 &4\cr 5& 2 &−2 &3 \cr 1& 1 & 0 &6 } \right ]\mathop{\longrightarrow}\limits_{}^{{R}_{1} ↔ {R}_{3}}\left [\array{ 1& 1 & 0 &6\cr 5& 2 &−2 &3 \cr 2&−1& 3 &4 } \right ] &\mathop{\longrightarrow}\limits_{}^{ − 2{R}_{1} + {R}_{2}}\left [\array{ 1& 1 & 0 & 6\cr 3& 0 &−2 &−9 \cr 2&−1& 3 & 4 } \right ] & & }

Notice that each of the three row operations is reversible (Exercise RREF.T10), so we do not have to be careful about the distinction between “A is row-equivalent to B” and “B is row-equivalent to A.” (Exercise RREF.T11) The preceding definitions are designed to make the following theorem possible. It says that row-equivalent matrices represent systems of linear equations that have identical solution sets.

Theorem REMES
Row-Equivalent Matrices represent Equivalent Systems
Suppose that A and B are row-equivalent augmented matrices. Then the systems of linear equations that they represent are equivalent systems. □

Proof If we perform a single row operation on an augmented matrix, it will have the same effect as if we did the analogous equation operation on the corresponding system of equations. By exactly the same methods as we used in the proof of Theorem EOPSS we can see that each of these row operations will preserve the set of solutions for the corresponding system of equations. ■

So at this point, our strategy is to begin with a system of equations, represent it by an augmented matrix, perform row operations (which will preserve solutions for the corresponding systems) to get a “simpler” augmented matrix, convert back to a “simpler” system of equations and then solve that system, knowing that its solutions are those of the original system. Here’s a rehash of Example US as an exercise in using our new tools.

Example USR
Three equations, one solution, reprised
We solve the following system using augmented matrices and row operations. This is the same system of equations solved in Example US using equation operations.

\eqalignno{ {x}_{1} + 2{x}_{2} + 2{x}_{3} & = 4 & & \cr {x}_{1} + 3{x}_{2} + 3{x}_{3} & = 5 & & \cr 2{x}_{1} + 6{x}_{2} + 5{x}_{3} & = 6 & & }

\eqalignno{ A = \left [\array{ 1&2&2&4\cr 1&3 &3 &5 \cr 2&6&5&6 } \right ] & & }

\eqalignno{ \mathop{\longrightarrow}\limits_{}^{ − 1{R}_{1} + {R}_{2}} &\left [\array{ 1&2&2&4\cr 0&1 &1 &1 \cr 2&6&5&6 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 2{R}_{1} + {R}_{3}}\left [\array{ 1&2&2& 4\cr 0&1 &1 & 1 \cr 0&2&1&−2 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 2{R}_{2} + {R}_{3}} &\left [\array{ 1&2& 2 & 4\cr 0&1 & 1 & 1 \cr 0&0&−1&−4 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 1{R}_{3}}\left [\array{ 1&2&2&4\cr 0&1 &1 &1 \cr 0&0&1&4 } \right ] & & }

is row equivalent to A and by Theorem REMES the system of equations below has the same solution set as the original system of equations.

\eqalignno{ {x}_{1} + 2{x}_{2} + 2{x}_{3} & = 4 & & \cr {x}_{2} + {x}_{3} & = 1 & & \cr {x}_{3} & = 4 & & }

Solving this “simpler” system is straightforward and is identical to the process in Example US. ⊠

Subsection RREF: Reduced Row-Echelon Form

The preceding example amply illustrates the definitions and theorems we have seen so far. But it still leaves two questions unanswered. Exactly what is this “simpler” form for a matrix, and just how do we get it? Here’s the answer to the first question, a definition of reduced row-echelon form.

Definition RREF
Reduced Row-Echelon Form
A matrix is in reduced row-echelon form if it meets all of the following conditions:

A row of only zero entries will be called a zero row and the leftmost nonzero entry of a nonzero row will be called a leading 1. The number of nonzero rows will be denoted by r.

A column containing a leading 1 will be called a pivot column. The set of column indices for all of the pivot columns will be denoted by D = \left \{{d}_{1},\kern 1.95872pt {d}_{2},\kern 1.95872pt {d}_{3},\kern 1.95872pt \mathop{\mathop{…}},\kern 1.95872pt {d}_{r}\right \} where {d}_{1} < {d}_{2} < {d}_{3} < \mathrel{⋯} < {d}_{r}, while the columns that are not pivot columns will be denoted as F = \left \{{f}_{1},\kern 1.95872pt {f}_{2},\kern 1.95872pt {f}_{3},\kern 1.95872pt \mathop{\mathop{…}},\kern 1.95872pt {f}_{n−r}\right \} where {f}_{1} < {f}_{2} < {f}_{3} < \mathrel{⋯} < {f}_{n−r}.

The principal feature of reduced row-echelon form is the pattern of leading 1’s guaranteed by conditions (2) and (4), reminiscent of a flight of geese, or steps in a staircase, or water cascading down a mountain stream.

There are a number of new terms and notation introduced in this definition, which should make you suspect that this is an important definition. Given all there is to digest here, we will mostly save the use of D and F until Section TSS. However, one important point to make here is that all of these terms and notation apply to a matrix. Sometimes we will employ these terms and sets for an augmented matrix, and other times it might be a coefficient matrix. So always give some thought to exactly which type of matrix you are analyzing.

\eqalignno{ C & = \left [\array{ 1&−3&0&6&0&0&−5& 9\cr 0& 0 &0 &0 &1 &0 & 3 &−7 \cr 0& 0 &0&0&0&1& 7 & 3\cr 0& 0 &0 &0 &0 &0 & 0 & 0 \cr 0& 0 &0&0&0&0& 0 & 0 } \right ] & & }

This matrix has two zero rows and three leading 1’s. So r = 3. Columns 1, 5, and 6 are pivot columns, so D = \left \{1,\kern 1.95872pt 5,\kern 1.95872pt 6\right \} and then F = \left \{2,\kern 1.95872pt 3,\kern 1.95872pt 4,\kern 1.95872pt 7,\kern 1.95872pt 8\right \}. ⊠

\eqalignno{ E & = \left [\array{ 1&0&−3&0&6&0&7&−5& 9\cr 0&0 & 0 &5 &0 &1 &0 & 3 &−7 \cr 0&0& 0 &0&0&0&0& 0 & 0\cr 0&1 & 0 &0 &0 &0 &0 &−4 & 2 \cr 0&0& 0 &0&0&0&1& 7 & 3\cr 0&0 & 0 &0 &0 &0 &0 & 0 & 0 } \right ] & & }

Our next theorem has a “constructive” proof. Learn about the meaning of this term in Technique C.

Proof Suppose that A has m rows and n columns. We will describe a process for converting A into B via row operations. This procedure is known as Gauss–Jordan elimination. Tracing through this procedure will be easier if you recognize that i refers to a row that is being converted, j refers to a column that is being converted, and r keeps track of the number of nonzero rows. Here we go.

The result of this procedure is that the matrix A is converted to a matrix in reduced row-echelon form, which we will refer to as B. We need to now prove this claim by showing that the converted matrix has the requisite properties of Definition RREF. First, the matrix is only converted through row operations (Step 6, Step 7, Step 8), so A and B are row-equivalent (Definition REM).

It is a bit more work to be certain that B is in reduced row-echelon form. We claim that as we begin Step 2, the first j columns of the matrix are in reduced row-echelon form with r nonzero rows. Certainly this is true at the start when j = 0, since the matrix has no columns and so vacuously meets the conditions of Definition RREF with r = 0 nonzero rows.

In Step 2 we increase j by 1 and begin to work with the next column. There are two possible outcomes for Step 3. Suppose that every entry of column j in rows r + 1 through m is zero. Then with no changes we recognize that the first j columns of the matrix has its first r rows still in reduced-row echelon form, with the final m − r rows still all zero.

Suppose instead that the entry in row i of column j is nonzero. Notice that since r + 1 ≤ i ≤ m, we know the first j − 1 entries of this row are all zero. Now, in Step 5 we increase r by 1, and then embark on building a new nonzero row. In Step 6 we swap row r and row i. In the first j columns, the first r − 1 rows remain in reduced row-echelon form after the swap. In Step 7 we multiply row r by a nonzero scalar, creating a 1 in the entry in column j of row i, and not changing any other rows. This new leading 1 is the first nonzero entry in its row, and is located to the right of all the leading 1’s in the preceding r − 1 rows. With Step 8 we insure that every entry in the column with this new leading 1 is now zero, as required for reduced row-echelon form. Also, rows r + 1 through m are now all zeros in the first j columns, so we now only have one new nonzero row, consistent with our increase of r by one. Furthermore, since the first j − 1 entries of row r are zero, the employment of the third row operation does not destroy any of the necessary features of rows 1 through r − 1 and rows r + 1 through m, in columns 1 through j − 1.

So at this stage, the first j columns of the matrix are in reduced row-echelon form. When Step 2 finally increases j to n + 1, then the procedure is completed and the full n columns of the matrix are in reduced row-echelon form, with the value of r correctly recording the number of nonzero rows. ■

The procedure given in the proof of Theorem REMEF can be more precisely described using a pseudo-code version of a computer program, as follows:

input m, n and A
r ← 0
for j ← 1 to n
    i ← r + 1
    while i ≤ m and {\left [A\right ]}_{ij} = 0
     i ← i + 1
    if i\mathrel{≠}m + 1
     r ← r + 1
     swap rows i and r of A (row op 1)
     scale entry in row r, column j of A to a leading 1 (row op 2)
     for k ← 1\text{ to }m, k\mathrel{≠}r
      zero out entry in row k, column j of A (row op 3 using row r)
output r and A

Notice that as a practical matter the “and” used in the conditional statement of the while statement should be of the “short-circuit” variety so that the array access that follows is not out-of-bounds.

So now we can put it all together. Begin with a system of linear equations (Definition SLE), and represent the system by its augmented matrix (Definition AM). Use row operations (Definition RO) to convert this matrix into reduced row-echelon form (Definition RREF), using the procedure outlined in the proof of Theorem REMEF. Theorem REMEF also tells us we can always accomplish this, and that the result is row-equivalent (Definition REM) to the original augmented matrix. Since the matrix in reduced-row echelon form has the same solution set, we can analyze the row-reduced version instead of the original matrix, viewing it as the augmented matrix of a different system of equations. The beauty of augmented matrices in reduced row-echelon form is that the solution sets to their corresponding systems can be easily determined, as we will see in the next few examples and in the next section.

We will see through the course that almost every interesting property of a matrix can be discerned by looking at a row-equivalent matrix in reduced row-echelon form. For this reason it is important to know that the matrix B guaranteed to exist by Theorem REMEF is also unique.

Two proof techniques are applicable to the proof. First, head out and read two proof techniques: Technique CD and Technique U.

Theorem RREFU
Reduced Row-Echelon Form is Unique
Suppose that A is an m × n matrix and that B and C are m × n matrices that are row-equivalent to A and in reduced row-echelon form. Then B = C. □

Proof We need to begin with no assumptions about any relationships between B and C, other than they are both in reduced row-echelon form, and they are both row-equivalent to A.

If B and C are both row-equivalent to A, then they are row-equivalent to each other. Repeated row operations on a matrix combine the rows with each other using operations that are linear, and are identical in each column. A key observation for this proof is that each individual row of B is linearly related to the rows of C. This relationship is different for each row of B, but once we fix a row, the relationship is the same across columns. More precisely, there are scalars {δ}_{ik}, 1 ≤ i,k ≤ m such that for any 1 ≤ i ≤ m, 1 ≤ j ≤ n,

\eqalignno{ {\left [B\right ]}_{ij} & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ ik}{\left [C\right ]}_{kj} & & }

You should read this as saying that an entry of row i of B (in column j) is a linear function of the entries of all the rows of C that are also in column j, and the scalars ({δ}_{ik}) depend on which row of B we are considering (the i subscript on {δ}_{ik}), but are the same for every column (no dependence on j in {δ}_{ik}). This idea may be complicated now, but will feel more familiar once we discuss “linear combinations” (Definition LCCV) and moreso when we discuss “row spaces” (Definition RSM). For now, spend some time carefully working Exercise RREF.M40, which is designed to illustrate the origins of this expression. This completes our exploitation of the row-equivalence of B and C.

We now repeatedly exploit the fact that B and C are in reduced row-echelon form. Recall that a pivot column is all zeros, except a single one. More carefully, if R is a matrix in reduced row-echelon form, and {d}_{ℓ} is the index of a pivot column, then {\left [R\right ]}_{k{d}_{ℓ}} = 1 precisely when k = ℓ and is otherwise zero. Notice also that any entry of R that is both below the entry in row ℓ and to the left of column {d}_{ℓ} is also zero (with below and left understood to include equality). In other words, look at examples of matrices in reduced row-echelon form and choose a leading 1 (with a box around it). The rest of the column is also zeros, and the lower left “quadrant” of the matrix that begins here is totally zeros.

Assuming no relationship about the form of B and C, let B have r nonzero rows and denote the pivot columns as D = \left \{{d}_{1},\kern 1.95872pt {d}_{2},\kern 1.95872pt {d}_{3},\kern 1.95872pt \mathop{\mathop{…}},\kern 1.95872pt {d}_{r}\right \}. For C let {r}^{′} denote the number of nonzero rows and denote the pivot columns as {D}^{′} = \left \{\kern 1.95872pt {{d}^{′}}_{ 1},\kern 1.95872pt {{d}^{′}}_{ 2},\kern 1.95872pt {{d}^{′}}_{ 3},\kern 1.95872pt \mathop{\mathop{…}},\kern 1.95872pt {{d}^{′}}_{{ r}^{′}}\right \} (Notation RREFA). There are four steps in the proof, and the first three are about showing that B and C have the same number of pivot columns, in the same places. In other words, the “primed” symbols are a necessary fiction.

\eqalignno{ 1 & ={ \left [B\right ]}_{1{d}_{1}} & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ 1k}{\left [C\right ]}_{k{d}_{1}} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ 1k}(0) & &{d}_{1} < {d}_{1}^{′} & & & & \cr & = 0 & & & & }

The entries of C are all zero since they are left and below of the leading 1 in row 1 and column {d}_{1}^{′} of C. This is a contradiction, so we know that {d}_{1} ≥ {d}_{1}^{′}. By an entirely similar argument, reversing the roles of B and C, we could conclude that {d}_{1} ≤ {d}_{1}^{′}. Together this means that {d}_{1} = {d}_{1}^{′}.

Second Step. Suppose that we have determined that {d}_{1} = {d}_{1}^{′}, {d}_{2} = {d}_{2}^{′}, {d}_{3} = {d}_{3}^{′}, …, {d}_{p} = {d}_{p}^{′}. Let’s now show that {d}_{p+1} = {d}_{p+1}^{′}. Working towards a contradiction, suppose that {d}_{p+1} < {d}_{p+1}^{′}. For 1 ≤ ℓ ≤ p,

\eqalignno{ 0 & ={ \left [B\right ]}_{p+1,{d}_{ℓ}} & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ p+1,k}{\left [C\right ]}_{k{d}_{ℓ}} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ p+1,k}{\left [C\right ]}_{k{d}_{ℓ}^{′}} & & & & \cr & = {δ}_{p+1,ℓ}{\left [C\right ]}_{ℓ{d}_{ℓ}^{′}} +{ \mathop{∑ }}_{\begin{array}{c}k=1 \\ k\mathrel{≠}ℓ \end{array}}^{m}{δ}_{ p+1,k}{\left [C\right ]}_{k{d}_{ℓ}^{′}} & &\text{@(a href="fcla-jsmath-2.99li69.html#property.CACN")Property CACN@(/a)} & & & & \cr & = {δ}_{p+1,ℓ}(1) +{ \mathop{∑ }}_{\begin{array}{c}k=1 \\ k\mathrel{≠}ℓ \end{array}}^{m}{δ}_{ p+1,k}(0) & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & = {δ}_{p+1,ℓ} & & & & }

\eqalignno{ 1 & ={ \left [B\right ]}_{p+1,{d}_{p+1}} & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ p+1,k}{\left [C\right ]}_{k{d}_{p+1}} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{p}{δ}_{ p+1,k}{\left [C\right ]}_{k{d}_{p+1}} +{ \mathop{∑ }}_{k=p+1}^{m}{δ}_{ p+1,k}{\left [C\right ]}_{k{d}_{p+1}} & &\text{@(a href="fcla-jsmath-2.99li69.html#property.AACN")Property AACN@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{p}(0){\left [C\right ]}_{ k{d}_{p+1}} +{ \mathop{∑ }}_{k=p+1}^{m}{δ}_{ p+1,k}{\left [C\right ]}_{k{d}_{p+1}} & & & & \cr & ={ \mathop{∑ }}_{k=p+1}^{m}{δ}_{ p+1,k}{\left [C\right ]}_{k{d}_{p+1}} & & & & \cr & ={ \mathop{∑ }}_{k=p+1}^{m}{δ}_{ p+1,k}(0) & &{d}_{p+1} < {d}_{p+1}^{′} & & & & \cr & = 0 & & & & }

This contradiction shows that {d}_{p+1} ≥ {d}_{p+1}^{′}. By an entirely similar argument, we could conclude that {d}_{p+1} ≤ {d}_{p+1}^{′}, and therefore {d}_{p+1} = {d}_{p+1}^{′}.

Third Step. Now we establish that r = {r}^{′}. Suppose that {r}^{′} < r. By the arguments above, we know that {d}_{1} = {d}_{1}^{′}, {d}_{2} = {d}_{2}^{′}, {d}_{3} = {d}_{3}^{′}, …, {d}_{{r}^{′}} = {d}_{{r}^{′}}^{′}. For 1 ≤ ℓ ≤ {r}^{′} < r,

\eqalignno{ 0 & ={ \left [B\right ]}_{r{d}_{ℓ}} & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ rk}{\left [C\right ]}_{k{d}_{ℓ}} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{{r}^{′} }{δ}_{rk}{\left [C\right ]}_{k{d}_{ℓ}} +{ \mathop{∑ }}_{k={r}^{′}+1}^{m}{δ}_{ rk}{\left [C\right ]}_{k{d}_{ℓ}} & &\text{@(a href="fcla-jsmath-2.99li69.html#property.AACN")Property AACN@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{{r}^{′} }{δ}_{rk}{\left [C\right ]}_{k{d}_{ℓ}} +{ \mathop{∑ }}_{k={r}^{′}+1}^{m}{δ}_{ rk}(0) & &\text{@(a href="fcla-jsmath-2.99li69.html#property.AACN")Property AACN@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{{r}^{′} }{δ}_{rk}{\left [C\right ]}_{k{d}_{ℓ}} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{{r}^{′} }{δ}_{rk}{\left [C\right ]}_{k{d}_{ℓ}^{′}} & & & & \cr & = {δ}_{rℓ}{\left [C\right ]}_{ℓ{d}_{ℓ}^{′}} +{ \mathop{∑ }}_{\begin{array}{c}k=1 \\ k\mathrel{≠}ℓ \end{array}}^{{r}^{′} }{δ}_{rk}{\left [C\right ]}_{k{d}_{ℓ}^{′}} & &\text{@(a href="fcla-jsmath-2.99li69.html#property.CACN")Property CACN@(/a)} & & & & \cr & = {δ}_{rℓ}(1) +{ \mathop{∑ }}_{\begin{array}{c}k=1 \\ k\mathrel{≠}ℓ \end{array}}^{{r}^{′} }{δ}_{rk}(0) & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & = {δ}_{rℓ} & & & & }

\eqalignno{ {\left [B\right ]}_{rj} & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ rk}{\left [C\right ]}_{kj} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{{r}^{′} }{δ}_{rk}{\left [C\right ]}_{kj} +{ \mathop{∑ }}_{k={r}^{′}+1}^{m}{δ}_{ rk}{\left [C\right ]}_{kj} & &\text{@(a href="fcla-jsmath-2.99li69.html#property.CACN")Property CACN@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{{r}^{′} }{δ}_{rk}{\left [C\right ]}_{kj} +{ \mathop{∑ }}_{k={r}^{′}+1}^{m}{δ}_{ rk}(0) & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{{r}^{′} }{δ}_{rk}{\left [C\right ]}_{kj} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{{r}^{′} }(0){\left [C\right ]}_{kj} & & & & \cr & = 0 & & & & }

So row r is a totally zero row, contradicting that this should be the bottommost nonzero row of B. So {r}^{′} ≥ r. By an entirely similar argument, reversing the roles of B and C, we would conclude that {r}^{′} ≤ r and therefore r = {r}^{′}. Thus, combining the first three steps we can say that D = {D}^{′}. In other words, B and C have the same pivot columns, in the same locations.

Fourth Step. In this final step, we will not argue by contradiction. Our intent is to determine the values of the {δ}_{ij}. Notice that we can use the values of the {d}_{i} interchangeably for B and C. Here we go,

\eqalignno{ 1 & ={ \left [B\right ]}_{i{d}_{i}} & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ ik}{\left [C\right ]}_{k{d}_{i}} & & & & \cr & = {δ}_{ii}{\left [C\right ]}_{i{d}_{i}} +{ \mathop{∑ }}_{\begin{array}{c}k=1 \\ k\mathrel{≠}i \end{array}}^{m}{δ}_{ ik}{\left [C\right ]}_{k{d}_{i}} & &\text{@(a href="fcla-jsmath-2.99li69.html#property.CACN")Property CACN@(/a)} & & & & \cr & = {δ}_{ii}(1) +{ \mathop{∑ }}_{\begin{array}{c}k=1 \\ k\mathrel{≠}i \end{array}}^{m}{δ}_{ ik}(0) & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & = {δ}_{ii} & & & & }

\eqalignno{ 0 & ={ \left [B\right ]}_{i{d}_{ℓ}} & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ ik}{\left [C\right ]}_{k{d}_{ℓ}} & & & & \cr & = {δ}_{iℓ}{\left [C\right ]}_{ℓ{d}_{ℓ}} +{ \mathop{∑ }}_{\begin{array}{c}k=1 \\ k\mathrel{≠}ℓ \end{array}}^{m}{δ}_{ ik}{\left [C\right ]}_{k{d}_{ℓ}} & &\text{@(a href="fcla-jsmath-2.99li69.html#property.CACN")Property CACN@(/a)} & & & & \cr & = {δ}_{iℓ}(1) +{ \mathop{∑ }}_{\begin{array}{c}k=1 \\ k\mathrel{≠}ℓ \end{array}}^{m}{δ}_{ ik}(0) & &\text{@(a href="#definition.RREF")Definition RREF@(/a)} & & & & \cr & = {δ}_{iℓ} & & & & }

Finally, having determined the values of the {δ}_{ij}, we can show that B = C. For 1 ≤ i ≤ m, 1 ≤ j ≤ n,

\eqalignno{ {\left [B\right ]}_{ij} & ={ \mathop{∑ }}_{k=1}^{m}{δ}_{ ik}{\left [C\right ]}_{kj} & & & & \cr & = {δ}_{ii}{\left [C\right ]}_{ij} +{ \mathop{∑ }}_{\begin{array}{c}k=1 \\ k\mathrel{≠}i \end{array}}^{m}{δ}_{ ik}{\left [C\right ]}_{kj} & &\text{@(a href="fcla-jsmath-2.99li69.html#property.CACN")Property CACN@(/a)} & & & & \cr & = (1){\left [C\right ]}_{ij} +{ \mathop{∑ }}_{\begin{array}{c}k=1 \\ k\mathrel{≠}i \end{array}}^{m}(0){\left [C\right ]}_{ kj} & & & & \cr & ={ \left [C\right ]}_{ij} & & & & }

We will now run through some examples of using these definitions and theorems to solve some systems of equations. From now on, when we have a matrix in reduced row-echelon form, we will mark the leading 1’s with a small box. In your work, you can box ’em, circle ’em or write ’em in a different color — just identify ’em somehow. This device will prove very useful later and is a very good habit to start developing right now.

Example SAB
Solutions for Archetype B
Let’s find the solutions to the following system of equations,

\eqalignno{ − 7{x}_{1} − 6{x}_{2} − 12{x}_{3} & = −33 & & \cr 5{x}_{1} + 5{x}_{2} + 7{x}_{3} & = 24 & & \cr {x}_{1} + 4{x}_{3} & = 5 & & }

\eqalignno{ \left [\array{ −7&−6&−12&−33\cr 5 & 5 & 7 & 24 \cr 1 & 0 & 4 & 5 } \right ] & & }

\eqalignno{ \mathop{\longrightarrow}\limits_{}^{{R}_{1} ↔ {R}_{3}} &\left [\array{ 1 & 0 & 4 & 5\cr 5 & 5 & 7 & 24 \cr −7&−6&−12&−33 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 5{R}_{1} + {R}_{2}}\left [\array{ 1 & 0 & 4 & 5\cr 0 & 5 &−13 & −1 \cr −7&−6&−12&−33 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{7{R}_{1} + {R}_{3}} &\left [\array{ \text{1}& 0 & 4 & 5\cr 0& 5 &−13 &−1 \cr 0&−6& 16 & 2 } \right ] & & \text{Now, with $j = 2$,} \cr \mathop{\longrightarrow}\limits_{}^{{1\over 5}{R}_{2}} &\left [\array{ \text{1}& 0 & 4 & 5 \cr 0& 1 &{−13\over 5} &{−1\over 5} \cr 0&−6& 16 & 2 } \right ]\mathop{\longrightarrow}\limits_{}^{6{R}_{2} + {R}_{3}}\left [\array{ \text{1}&0& 4 & 5 \cr 0&\text{1}&{−13\over 5} &{−1\over 5} \cr 0&0& {2\over 5} & {4\over 5} } \right ] & & \text{And finally, with $j = 3$,} \cr \mathop{\longrightarrow}\limits_{}^{{5\over 2}{R}_{3}} &\left [\array{ \text{1}&0& 4 & 5 \cr 0&\text{1}&{−13\over 5} &{−1\over 5} \cr 0&0& 1 & 2 } \right ]\mathop{\longrightarrow}\limits_{}^{{13\over 5} {R}_{3} + {R}_{2}}\left [\array{ \text{1}&0&4&5\cr 0&\text{1 } &0 &5 \cr 0&0&1&2 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 4{R}_{3} + {R}_{1}} &\left [\array{ \text{1}&0&0&−3\cr 0&\text{1 } &0 & 5 \cr 0&0&\text{1}& 2 } \right ] & & }

This is now the augmented matrix of a very simple system of equations, namely {x}_{1} = −3, {x}_{2} = 5, {x}_{3} = 2, which has an obvious solution. Furthermore, we can see that this is the only solution to this system, so we have determined the entire solution set,

\eqalignno{ S & = \left \{\left [\array{ −3\cr 5 \cr 2 } \right ]\right \} & & }

Archetypes A and B are meant to contrast each other in many respects. So let’s solve Archetype A now.

Example SAA
Solutions for Archetype A
Let’s find the solutions to the following system of equations,

\eqalignno{ {x}_{1} − {x}_{2} + 2{x}_{3} & = 1 & & \cr 2{x}_{1} + {x}_{2} + {x}_{3} & = 8 & & \cr {x}_{1} + {x}_{2} & = 5 & & }

\eqalignno{ \left [\array{ 1&−1&2&1\cr 2& 1 &1 &8 \cr 1& 1 &0&5 } \right ] & & }

\eqalignno{ \mathop{\longrightarrow}\limits_{}^{ − 2{R}_{1} + {R}_{2}} &\left [\array{ 1&−1& 2 &1\cr 0& 3 &−3 &6 \cr 1& 1 & 0 &5 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 1{R}_{1} + {R}_{3}}\left [\array{ \text{1}&−1& 2 &1\cr 0& 3 &−3 &6 \cr 0& 2 &−2&4 } \right ] & & \text{Now, with $j = 2$,} \cr \mathop{\longrightarrow}\limits_{}^{{1\over 3}{R}_{2}} &\left [\array{ \text{1}&−1& 2 &1\cr 0& 1 &−1 &2 \cr 0& 2 &−2&4 } \right ]\mathop{\longrightarrow}\limits_{}^{1{R}_{2} + {R}_{1}}\left [\array{ \text{1}&0& 1 &3\cr 0&1 &−1 &2 \cr 0&2&−2&4 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 2{R}_{2} + {R}_{3}} &\left [\array{ \text{1}&0& 1 &3\cr 0&\text{1 } &−1 &2 \cr 0&0& 0 &0 } \right ] & & }

The system of equations represented by this augmented matrix needs to be considered a bit differently than that for Archetype B. First, the last row of the matrix is the equation 0 = 0, which is always true, so it imposes no restrictions on our possible solutions and therefore we can safely ignore it as we analyze the other two equations. These equations are,

\eqalignno{ {x}_{1} + {x}_{3} & = 3 & & \cr {x}_{2} − {x}_{3} & = 2. & & }

While this system is fairly easy to solve, it also appears to have a multitude of solutions. For example, choose {x}_{3} = 1 and see that then {x}_{1} = 2 and {x}_{2} = 3 will together form a solution. Or choose {x}_{3} = 0, and then discover that {x}_{1} = 3 and {x}_{2} = 2 lead to a solution. Try it yourself: pick any value of {x}_{3} you please, and figure out what {x}_{1} and {x}_{2} should be to make the first and second equations (respectively) true. We’ll wait while you do that. Because of this behavior, we say that {x}_{3} is a “free” or “independent” variable. But why do we vary {x}_{3} and not some other variable? For now, notice that the third column of the augmented matrix does not have any leading 1’s in its column. With this idea, we can rearrange the two equations, solving each for the variable that corresponds to the leading 1 in that row.

\eqalignno{ {x}_{1} & = 3 − {x}_{3} & & \cr {x}_{2} & = 2 + {x}_{3} & & }

\eqalignno{ S & = \left \{\left .\left [\array{ 3 − {x}_{3} \cr 2 + {x}_{3} \cr {x}_{3} } \right ]\right \vert {x}_{3} ∈ {ℂ}^{}\right \} & & }

We’ll learn more in the next section about systems with infinitely many solutions and how to express their solution sets. Right now, you might look back at Example IS. ⊠

Example SAE
Solutions for Archetype E
Let’s find the solutions to the following system of equations,

\eqalignno{ 2{x}_{1} + {x}_{2} + 7{x}_{3} − 7{x}_{4} & = 2 & & \cr − 3{x}_{1} + 4{x}_{2} − 5{x}_{3} − 6{x}_{4} & = 3 & & \cr {x}_{1} + {x}_{2} + 4{x}_{3} − 5{x}_{4} & = 2 & & }

\eqalignno{ \left [\array{ 2 &1& 7 &−7&2\cr −3 &4 &−5 &−6 &3 \cr 1 &1& 4 &−5&2 } \right ] & & }

\eqalignno{ \mathop{\longrightarrow}\limits_{}^{{R}_{1} ↔ {R}_{3}} &\left [\array{ 1 &1& 4 &−5&2\cr −3 &4 &−5 &−6 &3 \cr 2 &1& 7 &−7&2 } \right ]\mathop{\longrightarrow}\limits_{}^{3{R}_{1} + {R}_{2}}\left [\array{ 1&1&4& −5 &2\cr 0&7 &7 &−21 &9 \cr 2&1&7& −7 &2 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 2{R}_{1} + {R}_{3}} &\left [\array{ \text{1}& 1 & 4 & −5 & 2\cr 0& 7 & 7 &−21 & 9 \cr 0&−1&−1& 3 &−2 } \right ] & & \text{Now, with $j = 2$,} \cr \mathop{\longrightarrow}\limits_{}^{{R}_{2} ↔ {R}_{3}} &\left [\array{ \text{1}& 1 & 4 & −5 & 2\cr 0&−1 &−1 & 3 &−2 \cr 0& 7 & 7 &−21& 9 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 1{R}_{2}}\left [\array{ \text{1}&1&4& −5 &2\cr 0&1 &1 & −3 &2 \cr 0&7&7&−21&9 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 1{R}_{2} + {R}_{1}} &\left [\array{ \text{1}&0&3& −2 &0\cr 0&1 &1 & −3 &2 \cr 0&7&7&−21&9 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 7{R}_{2} + {R}_{3}}\left [\array{ \text{1}&0&3&−2& 0\cr 0&\text{1 } &1 &−3 & 2 \cr 0&0&0& 0 &−5 } \right ] & & \text{And finally, with $j = 4$,} \cr \mathop{\longrightarrow}\limits_{}^{ −{1\over 5}{R}_{3}} &\left [\array{ \text{1}&0&3&−2&0\cr 0&\text{1 } &1 &−3 &2 \cr 0&0&0& 0 &1 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 2{R}_{3} + {R}_{2}}\left [\array{ \text{1}&0&3&−2&0\cr 0&\text{1 } &1 &−3 &0 \cr 0&0&0& 0 &\text{1} } \right ] & & }

Let’s analyze the equations in the system represented by this augmented matrix. The third equation will read 0 = 1. This is patently false, all the time. No choice of values for our variables will ever make it true. We’re done. Since we cannot even make the last equation true, we have no hope of making all of the equations simultaneously true. So this system has no solutions, and its solution set is the empty set, ∅ = \left \{\ \right \} (Definition ES).

Notice that we could have reached this conclusion sooner. After performing the row operation − 7{R}_{2} + {R}_{3}, we can see that the third equation reads 0 = −5, a false statement. Since the system represented by this matrix has no solutions, none of the systems represented has any solutions. However, for this example, we have chosen to bring the matrix fully to reduced row-echelon form for the practice. ⊠

These three examples (Example SAB, Example SAA, Example SAE) illustrate the full range of possibilities for a system of linear equations — no solutions, one solution, or infinitely many solutions. In the next section we’ll examine these three scenarios more closely.

Definition RR
Row-Reducing
To row-reduce the matrix A means to apply row operations to A and arrive at a row-equivalent matrix B in reduced row-echelon form. △

So the term row-reduce is used as a verb. Theorem REMEF tells us that this process will always be successful and Theorem RREFU tells us that the result will be unambiguous. Typically, the analysis of A will proceed by analyzing B and applying theorems whose hypotheses include the row-equivalence of A and B.

Subsection READ: Reading Questions

Subsection EXC: Exercises

For problems C10–C19, find all solutions to the system of linear equations. Use your favorite computing device to row-reduce the augmented matrices for the systems, and write the solutions as a set, using correct set notation.
C10

\eqalignno{ 2{x}_{1} − 3{x}_{2} + {x}_{3} + 7{x}_{4} & = 14 & & \cr 2{x}_{1} + 8{x}_{2} − 4{x}_{3} + 5{x}_{4} & = −1 & & \cr {x}_{1} + 3{x}_{2} − 3{x}_{3} & = 4 & & \cr − 5{x}_{1} + 2{x}_{2} + 3{x}_{3} + 4{x}_{4} & = −19 & & }

\eqalignno{ 3{x}_{1} + 4{x}_{2} − {x}_{3} + 2{x}_{4} & = 6 & & \cr {x}_{1} − 2{x}_{2} + 3{x}_{3} + {x}_{4} & = 2 & & \cr 10{x}_{2} − 10{x}_{3} − {x}_{4} & = 1 & & }

\eqalignno{ 2{x}_{1} + 4{x}_{2} + 5{x}_{3} + 7{x}_{4} & = −26 & & \cr {x}_{1} + 2{x}_{2} + {x}_{3} − {x}_{4} & = −4 & & \cr − 2{x}_{1} − 4{x}_{2} + {x}_{3} + 11{x}_{4} & = −10 & & }

\eqalignno{ {x}_{1} + 2{x}_{2} + 8{x}_{3} − 7{x}_{4} & = −2 & & \cr 3{x}_{1} + 2{x}_{2} + 12{x}_{3} − 5{x}_{4} & = 6 & & \cr − {x}_{1} + {x}_{2} + {x}_{3} − 5{x}_{4} & = −10 & & }

\eqalignno{ 2{x}_{1} + {x}_{2} + 7{x}_{3} − 2{x}_{4} & = 4 & & \cr 3{x}_{1} − 2{x}_{2} + 11{x}_{4} & = 13 & & \cr {x}_{1} + {x}_{2} + 5{x}_{3} − 3{x}_{4} & = 1 & & }

\eqalignno{ 2{x}_{1} + 3{x}_{2} − {x}_{3} − 9{x}_{4} & = −16 & & \cr {x}_{1} + 2{x}_{2} + {x}_{3} & = 0 & & \cr − {x}_{1} + 2{x}_{2} + 3{x}_{3} + 4{x}_{4} & = 8 & & }

\eqalignno{ 2{x}_{1} + 3{x}_{2} + 19{x}_{3} − 4{x}_{4} & = 2 & & \cr {x}_{1} + 2{x}_{2} + 12{x}_{3} − 3{x}_{4} & = 1 & & \cr − {x}_{1} + 2{x}_{2} + 8{x}_{3} − 5{x}_{4} & = 1 & & }

\eqalignno{ − {x}_{1} + 5{x}_{2} & = −8 & & \cr − 2{x}_{1} + 5{x}_{2} + 5{x}_{3} + 2{x}_{4} & = 9 & & \cr − 3{x}_{1} − {x}_{2} + 3{x}_{3} + {x}_{4} & = 3 & & \cr 7{x}_{1} + 6{x}_{2} + 5{x}_{3} + {x}_{4} & = 30 & & }

\eqalignno{ {x}_{1} + 2{x}_{2} − 4{x}_{3} − {x}_{4} & = 32 & & \cr {x}_{1} + 3{x}_{2} − 7{x}_{3} − {x}_{5} & = 33 & & \cr {x}_{1} + 2{x}_{3} − 2{x}_{4} + 3{x}_{5} & = 22 & & \cr & & }

\eqalignno{ 2{x}_{1} + {x}_{2} & = 6 & & \cr − {x}_{1} − {x}_{2} & = −2 & & \cr 3{x}_{1} + 4{x}_{2} & = 4 & & \cr 3{x}_{1} + 5{x}_{2} & = 2 & & }

For problems C30–C33, row-reduce the matrix without the aid of a calculator, indicating the row operations you are using at each step using the notation of Definition RO.
C30

\eqalignno{ \left [\array{ 2& 1 & 5 &10\cr 1&−3 &−1 &−2 \cr 4&−2& 6 &12 } \right ] & & }

\eqalignno{ \left [\array{ 1 & 2 &−4\cr −3 &−1 &−3 \cr −2& 1 &−7 } \right ] & & }

\eqalignno{ \left [\array{ 1 & 1 & 1\cr −4 &−3 &−2 \cr 3 & 2 & 1 } \right ] & & }

\eqalignno{ \left [\array{ 1 & 2 &−1&−1\cr 2 & 4 &−1 & 4 \cr −1&−2& 3 & 5 } \right ] & & }

\eqalignno{ B & = \left [\array{ 1 & 3 &−2& 2\cr −1 &−2 &−1 &−1 \cr −1&−5& 8 &−3 } \right ] &C & = \left [\array{ 1 & 2 & 1 &2\cr 1 & 1 & 4 &0 \cr −1&−1&−4&1 } \right ] & & & & }

(a) Row-reduce each matrix and determine that the reduced row-echelon forms of B and C are identical. From this argue that B and C are row-equivalent.

(b) In the proof of Theorem RREFU, we begin by arguing that entries of row-equivalent matrices are related by way of certain scalars and sums. In this example, we would write that entries of B from row i that are in column j are linearly related to the entries of C in column j from all three rows

\eqalignno{ {\left [B\right ]}_{ij} & = {δ}_{i1}{\left [C\right ]}_{1j} + {δ}_{i2}{\left [C\right ]}_{2j} + {δ}_{i3}{\left [C\right ]}_{3j} &1 & ≤ j ≤ 4 & & & & }

For each 1 ≤ i ≤ 3 find the corresponding three scalars in this relationship. So your answer will be nine scalars, determined three at a time.
Contributed by Robert Beezer Solution [138]

M45 You keep a number of lizards, mice and peacocks as pets. There are a total of 108 legs and 30 tails in your menagerie. You have twice as many mice as lizards. How many of each creature do you have?
Contributed by Chris Black Solution [141]

M50 A parking lot has 66 vehicles (cars, trucks, motorcycles and bicycles) in it. There are four times as many cars as trucks. The total number of tires (4 per car or truck, 2 per motorcycle or bicycle) is 252. How many cars are there? How many bicycles?
Contributed by Robert Beezer Solution [142]

T10 Prove that each of the three row operations (Definition RO) is reversible. More precisely, if the matrix B is obtained from A by application of a single row operation, show that there is a single row operation that will transform B back into A.
Contributed by Robert Beezer Solution [143]

T11 Suppose that A, B and C are m × n matrices. Use the definition of row-equivalence (Definition REM) to prove the following three facts.

A relationship that satisfies these three properties is known as an equivalence relation, an important idea in the study of various algebras. This is a formal way of saying that a relationship behaves like equality, without requiring the relationship to be as strict as equality itself. We’ll see it again in Theorem SER.
Contributed by Robert Beezer

T12 Suppose that B is an m × n matrix in reduced row-echelon form. Build a new, likely smaller, k × ℓ matrix C as follows. Keep any collection of k adjacent rows, k ≤ m. From these rows, keep columns 1 through ℓ, ℓ ≤ n. Prove that C is in reduced row-echelon form.
Contributed by Robert Beezer

T13 Generalize Exercise RREF.T12 by just keeping any k rows, and not requiring the rows to be adjacent. Prove that any such matrix C is in reduced row-echelon form.
Contributed by Robert Beezer

Subsection SOL: Solutions

This augmented matrix represents the linear system {x}_{1} = 1, {x}_{2} = −3, {x}_{3} = −4, {x}_{4} = 1, which clearly has only one possible solution. We can write this solution set then as

\eqalignno{ S & = \left \{\left [\array{ 1\cr −3 \cr −4\cr 1 } \right ]\right \} & & }

Row 3 represents the equation 0 = 1, which is patently false, so the original system has no solutions. We can express the solution set as the empty set, ∅ = \left \{\right \}.

In the spirit of Example SAA, we can express the infinitely many solutions of this system compactly with set notation. The key is to express certain variables in terms of others. More specifically, each pivot column number is the index of a variable that can be written in terms of the variables whose indices are non-pivot columns. Or saying the same thing: for each i in D, we can find an expression for {x}_{i} in terms of the variables without their index in D. Here D = \left \{1,\kern 1.95872pt 3\right \}, so

\eqalignno{ {x}_{1} & = 2 − 2{x}_{2} + 4{x}_{4} & & \cr {x}_{3} & = −6\quad \quad − 3{x}_{4} & & }

Row 3 represents the equation 0 = 1, which is patently false, so the original system has no solutions. We can express the solution set as the empty set, ∅ = \left \{\right \}.

\eqalignno{ \left [\array{ −1& 5 &0&0&−8\cr −2 & 5 &5 &2 & 9 \cr −3&−1&3&1& 3\cr 7 & 6 &5 &1 & 30 } \right ] &\mathop{\longrightarrow}\limits_{}^{\text{RREF}}\left [\array{ \text{1}&0&0&0& 3\cr 0&\text{1 } &0 &0 &−1 \cr 0&0&\text{1}&0& 2\cr 0&0 &0 &\text{1 } & 5 } \right ] & & }

This augmented matrix represents the linear system {x}_{1} = 3, {x}_{2} = −1, {x}_{3} = 2, {x}_{4} = 5, which clearly has only one possible solution. We can write this solution set then as

\eqalignno{ S & = \left \{\left [\array{ 3\cr −1 \cr 2\cr 5 } \right ]\right \} & & }

\eqalignno{ \left [\array{ 1&2&−4&−1& 0 &32\cr 1&3 &−7 & 0 &−1 &33 \cr 1&0& 2 &−2& 3 &22 } \right ] &\mathop{\longrightarrow}\limits_{}^{\text{RREF}}\left [\array{ \text{1}&0& 2 &0& 5 & 6\cr 0&\text{1 } &−3 &0 &−2 & 9 \cr 0&0& 0 &\text{1}& 1 &−8 } \right ] & & }

\eqalignno{ {x}_{1} + 2{x}_{3} + 5{x}_{5} = 6\quad & →\quad {x}_{1} = 6 − 2{x}_{3} − 5{x}_{5} & & \cr {x}_{2} − 3{x}_{3} − 2{x}_{5} = 9\quad & →\quad {x}_{2} = 9 + 3{x}_{3} + 2{x}_{5} & & \cr {x}_{4} + {x}_{5} = −8\quad & →\quad {x}_{4} = −8 − {x}_{5} & & }

\eqalignno{ S & = \left \{\left .\left [\array{ 6 − 2{x}_{3} − 5{x}_{5} \cr 9 + 3{x}_{3} + 2{x}_{5} \cr {x}_{3} \cr −8 − {x}_{5} \cr {x}_{5} } \right ]\right \vert {x}_{3},\kern 1.95872pt {x}_{5} ∈ ℂ\right \} & & }

\eqalignno{ &\left [\array{ 2 & 1 & 6\cr −1 &−1 &−2 \cr 3 & 4 & 4\cr 3 & 5 & 2 } \right ] & & }

\eqalignno{ &\left [\array{ \text{1}&0& 4\cr 0&\text{1 } &−2 \cr 0&0& 0\cr 0&0 & 0 } \right ] & & }

This augmented matrix represents the linear system {x}_{1} = 4, {x}_{2} = −2, 0 = 0, 0 = 0, which clearly has only one possible solution. We can write this solution set then as

\eqalignno{ S & = \left \{\left [\array{ 4\cr −2 } \right ]\right \} & & }

\eqalignno{ &\left [\array{ 2& 1 & 5 &10\cr 1&−3 &−1 &−2 \cr 4&−2& 6 &12 } \right ]\mathop{\longrightarrow}\limits_{}^{{R}_{1} ↔ {R}_{2}}\left [\array{ 1&−3&−1&−2\cr 2& 1 & 5 & 10 \cr 4&−2& 6 &12 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 2{R}_{1} + {R}_{2}} &\left [\array{ 1&−3&−1&−2\cr 0& 7 & 7 & 14 \cr 4&−2& 6 &12 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 4{R}_{1} + {R}_{3}}\left [\array{ 1&−3&−1&−2\cr 0& 7 & 7 & 14 \cr 0&10&10&20 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{{1\over 7}{R}_{2}} &\left [\array{ 1&−3&−1&−2\cr 0& 1 & 1 & 2 \cr 0&10&10&20 } \right ]\mathop{\longrightarrow}\limits_{}^{3{R}_{2} + {R}_{1}}\left [\array{ 1& 0 & 2 & 4\cr 0& 1 & 1 & 2 \cr 0&10&10&20 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 10{R}_{2} + {R}_{3}} &\left [\array{ \text{1}&0&2&4\cr 0&\text{1 } &1 &2 \cr 0&0&0&0 } \right ] & & }

\eqalignno{ &\left [\array{ 1 & 2 &−4\cr −3 &−1 &−3 \cr −2& 1 &−7 } \right ]\mathop{\longrightarrow}\limits_{}^{3{R}_{1} + {R}_{2}}\left [\array{ 1 &2& −4\cr 0 &5 &−15 \cr −2&1& −7 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{2{R}_{1} + {R}_{3}} &\left [\array{ 1&2& −4\cr 0&5 &−15 \cr 0&5&−15 } \right ]\mathop{\longrightarrow}\limits_{}^{{1\over 5}{R}_{2}}\left [\array{ 1&2& −4\cr 0&1 & −3 \cr 0&5&−15 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 2{R}_{2} + {R}_{1}} &\left [\array{ 1&0& 2\cr 0&1 & −3 \cr 0&5&−15 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 5{R}_{2} + {R}_{3}}\left [\array{ \text{1}&0& 2\cr 0&\text{1 } &−3 \cr 0&0& 0 } \right ] & & }

\eqalignno{ \left [\array{ 1 & 1 & 1\cr −4 &−3 &−2 \cr 3 & 2 & 1 } \right ]\mathop{\longrightarrow}\limits_{}^{4{R}_{1} + {R}_{2}} &\left [\array{ 1&1&1\cr 0&1 &2 \cr 3&2&1} \right ]\mathop{\longrightarrow}\limits_{}^{ − 3{R}_{1} + {R}_{3}}\left [\array{ \text{1}& 1 & 1\cr 0& 1 & 2 \cr 0&−1&−2 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 1{R}_{2} + {R}_{1}} &\left [\array{ \text{1}& 0 &−1\cr 0& 1 & 2 \cr 0&−1&−2 } \right ]\mathop{\longrightarrow}\limits_{}^{1{R}_{2} + {R}_{3}}\left [\array{ \text{1}&0&−1\cr 0&\text{1 } & 2 \cr 0&0& 0 } \right ] & & }

\eqalignno{ &\left [\array{ 1 & 2 &−1&−1\cr 2 & 4 &−1 & 4 \cr −1&−2& 3 & 5 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 2{R}_{1} + {R}_{2}}\left [\array{ 1 & 2 &−1&−1\cr 0 & 0 & 1 & 6 \cr −1&−2& 3 & 5 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{1{R}_{1} + {R}_{3}} &\left [\array{ \text{1}&2&−1&−1\cr 0&0 & 1 & 6 \cr 0&0& 2 & 4 } \right ]\mathop{\longrightarrow}\limits_{}^{1{R}_{2} + {R}_{1}}\left [\array{ \text{1}&2&0&5\cr 0&0 &1 &6 \cr 0&0&2&4 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 2{R}_{2} + {R}_{3}} &\left [\array{ \text{1}&2&0& 5\cr 0&0 &\text{1 } & 6 \cr 0&0&0&−8 } \right ]\mathop{\longrightarrow}\limits_{}^{ −{1\over 8}{R}_{3}}\left [\array{ \text{1}&2&0&5\cr 0&0 &\text{1 } &6 \cr 0&0&0&1 } \right ] & & \cr \mathop{\longrightarrow}\limits_{}^{ − 6{R}_{3} + {R}_{2}} &\left [\array{ \text{1}&2&0&5\cr 0&0 &\text{1 } &0 \cr 0&0&0&1 } \right ]\mathop{\longrightarrow}\limits_{}^{ − 5{R}_{3} + {R}_{1}}\left [\array{ \text{1}&2&0&0\cr 0&0 &\text{1 } &0 \cr 0&0&0&\text{1} } \right ] & & }

M40 Contributed by Robert Beezer Statement [120]
(a) Let R be the common reduced row-echelon form of B and C. A sequence of row operations converts B to R and a second sequence of row operations converts C to R. If we “reverse” the second sequence’s order, and reverse each individual row operation (see Exercise RREF.T10) then we can begin with B, convert to R with the first sequence, and then convert to C with the reversed sequence. Satisfying Definition REM we can say B and C are row-equivalent matrices.

(b) We will work this carefully for the first row of B and just give the solution for the next two rows. For row 1 of B take i = 1 and we have

\eqalignno{ {\left [B\right ]}_{1j} & = {δ}_{11}{\left [C\right ]}_{1j} + {δ}_{12}{\left [C\right ]}_{2j} + {δ}_{13}{\left [C\right ]}_{3j} &1 & ≤ j ≤ 4 & & & & }

If we substitute the four values for j we arrive at four linear equations in the three unknowns {δ}_{11},{δ}_{12},{δ}_{13},

\eqalignno{ (j = 1)&&{\left [B\right ]}_{11}& = {δ}_{11}{\left [C\right ]}_{11} + {δ}_{12}{\left [C\right ]}_{21} + {δ}_{13}{\left [C\right ]}_{31}&& ⇒&1 & = {δ}_{11}(1) + {δ}_{12}(1) + {δ}_{13}(−1)&&&&&&&& \cr (j = 2)&&{\left [B\right ]}_{12}& = {δ}_{11}{\left [C\right ]}_{12} + {δ}_{12}{\left [C\right ]}_{22} + {δ}_{13}{\left [C\right ]}_{32}&& ⇒&3 & = {δ}_{11}(2) + {δ}_{12}(1) + {δ}_{13}(−1)&&&&&&&& \cr (j = 3)&&{\left [B\right ]}_{13}& = {δ}_{11}{\left [C\right ]}_{13} + {δ}_{12}{\left [C\right ]}_{23} + {δ}_{13}{\left [C\right ]}_{33}&& ⇒& − 2& = {δ}_{11}(1) + {δ}_{12}(4) + {δ}_{13}(−4)&&&&&&&& \cr (j = 4)&&{\left [B\right ]}_{14}& = {δ}_{11}{\left [C\right ]}_{14} + {δ}_{12}{\left [C\right ]}_{24} + {δ}_{13}{\left [C\right ]}_{34}&& ⇒&2 & = {δ}_{11}(2) + {δ}_{12}(0) + {δ}_{13}(1) &&&&&&&& }

We form the augmented matrix of this system and row-reduce to find the solutions,

\eqalignno{ \left [\array{ 1&1&−1& 1\cr 2&1 &−1 & 3 \cr 1&4&−4&−2\cr 2&0 & 1 & 2 } \right ] &\mathop{\longrightarrow}\limits_{}^{\text{RREF}}\left [\array{ \text{1}&0&0& 2\cr 0&\text{1 } &0 &−3 \cr 0&0&\text{1}&−2\cr 0&0 &0 & 0 } \right ] & & }

So the unique solution is {δ}_{11} = 2, {δ}_{12} = −3, {δ}_{13} = −2. Entirely similar work will lead you to

\eqalignno{ {δ}_{21} & = −1 &{δ}_{22} & = 1 &{δ}_{23} & = 1 & & & & & & \text{and} \cr {δ}_{31} & = −4 &{δ}_{32} & = 8 &{δ}_{33} & = 5 & & & & & &

}

M45 Contributed by Chris Black Statement [122]
Let l,m,p denote the number of lizards, mice and peacocks. Then the statements from the problem yield the equations:

\eqalignno{ 4l + 4m + 2p & = 108 & & \cr l + m + p & = 30 & & \cr 2l − m & = 0 & & }

\eqalignno{ \left [\array{ 4& 4 &2&108\cr 1& 1 &1 & 30 \cr 2&−1&0& 0 } \right ] &\mathop{\longrightarrow}\limits_{}^{\text{RREF}}\left [\array{ \text{1}&0&0& 8\cr 0&\text{1 } &0 &16 \cr 0&0&\text{1}& 6 } \right ] & & }

From the row-reduced matrix, we see that we have an equivalent system l = 8, m = 16, and p = 6, which means that you have 8 lizards, 16 mice and 6 peacocks.

M50 Contributed by Robert Beezer Statement [122]
Let c,\kern 1.95872pt t,\kern 1.95872pt m,\kern 1.95872pt b denote the number of cars, trucks, motorcycles, and bicycles. Then the statements from the problem yield the equations:

\eqalignno{ c + t + m + b & = 66 & & \cr c − 4t & = 0 & & \cr 4c + 4t + 2m + 2b & = 252 & & }

\eqalignno{ \left [\array{ 1& 1 &1&1& 66\cr 1&−4 &0 &0 & 0 \cr 4& 4 &2&2&252 } \right ] &\mathop{\longrightarrow}\limits_{}^{\text{RREF}}\left [\array{ \text{1}&0&0&0&48\cr 0&\text{1 } &0 &0 &12 \cr 0&0&\text{1}&1& 6 } \right ] & & }

The first row of the matrix represents the equation c = 48, so there are 48 cars. The second row of the matrix represents the equation t = 12, so there are 12 trucks. The third row of the matrix represents the equation m + b = 6 so there are anywhere from 0 to 6 bicycles. We can also say that b is a free variable, but the context of the problem limits it to 7 integer values since you cannot have a negative number of motorcycles.

T10 Contributed by Robert Beezer Statement [122]
If we can reverse each row operation individually, then we can reverse a sequence of row operations. The operations that reverse each operation are listed below, using our shorthand notation. Notice how requiring the scalar α to be non-zero makes the second operation reversible.