From A First Course in Linear Algebra
Version 2.11
© 2004.
Licensed under the GNU Free Documentation License.
http://linear.ups.edu/
After solving a few systems of equations, you will recognize that
it doesn’t matter so much what we call our variables, as opposed to
what numbers act as their coefficients. A system in the variables
{x}_{1},\kern 1.95872pt {x}_{2},\kern 1.95872pt {x}_{3}
would behave the same if we changed the names of the variables to
a,\kern 1.95872pt b,\kern 1.95872pt c and
kept all the constants the same and in the same places. In this section, we will
isolate the key bits of information about a system of equations into something
called a matrix, and then use this matrix to systematically solve the equations.
Along the way we will obtain one of our most important and useful computational
tools.
Definition M
Matrix
An m × n
matrix is a rectangular layout of numbers from
{ℂ}^{} having
m rows
and n
columns. We will use upper-case Latin letters from the start of the alphabet
(A,\kern 1.95872pt B,\kern 1.95872pt C,\mathop{\mathop{…}})
to denote matrices and squared-off brackets to delimit the layout.
Many use large parentheses instead of brackets — the distinction is
not important. Rows of a matrix will be referenced starting at the
top and working down (i.e. row 1 is at the top) and columns will be
referenced starting from the left (i.e. column 1 is at the left). For a matrix
A, the notation
{\left [A\right ]}_{ij} will refer to the
complex number in row i
and column j
of A.
(This definition contains Notation M.)
(This definition contains Notation MC.) △
Be careful with this notation for individual entries, since it is easy to think that {\left [A\right ]}_{ij} refers to the whole matrix. It does not. It is just a number, but is a convenient way to talk about the individual entries simultaneously. This notation will get a heavy workout once we get to Chapter M.
Example AM
A matrix
B = \left [\array{
−1&2& 5 & 3\cr
1 &0 &−6 & 1
\cr
−4&2& 2 &−2 } \right ]
|
is a matrix with m = 3 rows and n = 4 columns. We can say that {\left [B\right ]}_{2,3} = −6 while {\left [B\right ]}_{3,4} = −2. ⊠
Some mathematical software is very particular about which types of numbers (integers, rationals, reals, complexes) you wish to work with. See: Computation R.SAGE A calculator or computer language can be a convenient way to perform calculations with matrices. But first you have to enter the matrix. See: Computation ME.MMA Computation ME.TI86 Computation ME.TI83 Computation ME.SAGE When we do equation operations on system of equations, the names of the variables really aren’t very important. {x}_{1}, {x}_{2}, {x}_{3}, or a, b, c, or x, y, z, it really doesn’t matter. In this subsection we will describe some notation that will make it easier to describe linear systems, solve the systems and describe the solution sets. Here is a list of definitions, laden with notation.
Definition CV
Column Vector
A column vector of size m
is an ordered list of m
numbers, which is written in order vertically, starting at the top and
proceeding to the bottom. At times, we will refer to a column vector
as simply a vector. Column vectors will be written in bold, usually
with lower case Latin letter from the end of the alphabet such as
u,
v,
w,
x,
y,
z.
Some books like to write vectors with arrows, such as
\vec{u}. Writing
by hand, some like to put arrows on top of the symbol, or a tilde underneath the symbol,
as in \mathop{u}\limits_{ ∼}.
To refer to the entry or component that is number
i in the list that
is the vector v
we write {\left [v\right ]}_{i}.
(This definition contains Notation CV.)
(This definition contains Notation CVC.) △
Be careful with this notation. While the symbols {\left [v\right ]}_{i} might look somewhat substantial, as an object this represents just one component of a vector, which is just a single complex number.
Definition ZCV
Zero Column Vector
The zero vector of size m is
the column vector of size m
where each entry is the number zero,
or defined much more compactly, {\left [0\right ]}_{i} = 0 for 1 ≤ i ≤ m.
(This definition contains Notation ZCV.) △
Definition CM
Coefficient Matrix
For a system of linear equations,
the coefficient matrix is the m × n matrix
A = \left [\array{
{a}_{11} & {a}_{12} & {a}_{13} &\mathop{\mathop{…}}\kern 1.95872pt &{a}_{1n}
\cr
{a}_{21} & {a}_{22} & {a}_{23} &\mathop{\mathop{…}}\kern 1.95872pt &{a}_{2n}
\cr
{a}_{31} & {a}_{32} & {a}_{33} &\mathop{\mathop{…}}\kern 1.95872pt &{a}_{3n}
\cr
\mathop{\mathop{⋮}} &\cr
{a}_{
m1}&{a}_{m2}&{a}_{m3}&\mathop{\mathop{…}}\kern 1.95872pt &{a}_{mn}\cr
} \right ]
|
Definition VOC
Vector of Constants
For a system of linear equations,
the vector of constants is the column vector of size m
b = \left [\array{
{b}_{1}
\cr
{b}_{2}
\cr
{b}_{3}
\cr
\mathop{\mathop{⋮}}\cr
{b}_{
m}\cr
} \right ]
|
Definition SOLV
Solution Vector
For a system of linear equations,
the solution vector is the column vector of size n
x = \left [\array{
{x}_{1}
\cr
{x}_{2}
\cr
{x}_{3}
\cr
\mathop{\mathop{⋮}}\cr
{x}_{
n}\cr
} \right ]
|
The solution vector may do double-duty on occasion. It might refer to a list of variable quantities at one point, and subsequently refer to values of those variables that actually form a particular solution to that system.
Definition MRLS
Matrix Representation of a Linear System
If A
is the coefficient matrix of a system of linear equations and
b is the vector of constants,
then we will write ℒS\kern -1.95872pt \left (A,\kern 1.95872pt b\right )
as a shorthand expression for the system of linear equations, which we will refer to
as the matrix representation of the linear system.
(This definition contains Notation MRLS.) △
Example NSLE
Notation for systems of linear equations
The system of linear equations
has coefficient matrix
A = \left [\array{
2 &4&−3&5& 1\cr
3 &1 & 0 &1 &−3
\cr
−2&7&−5&2& 2 } \right ]
|
and vector of constants
b = \left [\array{
9\cr
0
\cr
−3 } \right ]
|
and so will be referenced as ℒS\kern -1.95872pt \left (A,\kern 1.95872pt b\right ). ⊠
Definition AM
Augmented Matrix
Suppose we have a system of m
equations in n variables,
with coefficient matrix A
and vector of constants b.
Then the augmented matrix of the system of equations is the
m × (n + 1) matrix whose first
n columns are the columns
of A and whose last column
(number n + 1) is the column
vector b. This matrix
will be written as \left [\left .A\kern 1.95872pt \right \vert \kern 1.95872pt b\right ].
(This definition contains Notation AM.) △
The augmented matrix represents all the important information in the system of equations, since the names of the variables have been ignored, and the only connection with the variables is the location of their coefficients in the matrix. It is important to realize that the augmented matrix is just that, a matrix, and not a system of equations. In particular, the augmented matrix does not have any “solutions,” though it will be useful for finding solutions to the system of equations that it is associated with. (Think about your objects, and review Technique L.) However, notice that an augmented matrix always belongs to some system of equations, and vice versa, so it is tempting to try and blur the distinction between the two. Here’s a quick example.
Example AMAA
Augmented matrix for Archetype A
Archetype A is the following system of 3 equations in 3 variables.
Here is its augmented matrix.
An augmented matrix for a system of equations will save us the tedium of continually writing down the names of the variables as we solve the system. It will also release us from any dependence on the actual names of the variables. We have seen how certain operations we can perform on equations (Definition EO) will preserve their solutions (Theorem EOPSS). The next two definitions and the following theorem carry over these ideas to augmented matrices.
Definition RO
Row Operations
The following three operations will transform an
m × n
matrix into a different matrix of the same size, and each is known as a row
operation.
We will use a symbolic shorthand to describe these row operations:
(This definition contains Notation RO.) △
Definition REM
Row-Equivalent Matrices
Two matrices, A
and B,
are row-equivalent if one can be obtained from the other by a sequence of row
operations. △
Example TREM
Two row-equivalent matrices
The matrices
are row-equivalent as can be seen from
We can also say that any pair of these three matrices are row-equivalent. ⊠
Notice that each of the three row operations is reversible (Exercise RREF.T10), so we do not have to be careful about the distinction between “A is row-equivalent to B” and “B is row-equivalent to A.” (Exercise RREF.T11) The preceding definitions are designed to make the following theorem possible. It says that row-equivalent matrices represent systems of linear equations that have identical solution sets.
Theorem REMES
Row-Equivalent Matrices represent Equivalent Systems
Suppose that A
and B
are row-equivalent augmented matrices. Then the systems of
linear equations that they represent are equivalent systems.
□
Proof If we perform a single row operation on an augmented matrix, it will have the same effect as if we did the analogous equation operation on the corresponding system of equations. By exactly the same methods as we used in the proof of Theorem EOPSS we can see that each of these row operations will preserve the set of solutions for the corresponding system of equations. ■
So at this point, our strategy is to begin with a system of equations, represent it by an augmented matrix, perform row operations (which will preserve solutions for the corresponding systems) to get a “simpler” augmented matrix, convert back to a “simpler” system of equations and then solve that system, knowing that its solutions are those of the original system. Here’s a rehash of Example US as an exercise in using our new tools.
Example USR
Three equations, one solution, reprised
We solve the following system using augmented matrices and row operations.
This is the same system of equations solved in Example US using equation
operations.
Form the augmented matrix,
and apply row operations,
So the matrix
B = \left [\array{
1&2&2&4\cr
0&1 &1 &1
\cr
0&0&1&4 } \right ]
|
is row equivalent to A and by Theorem REMES the system of equations below has the same solution set as the original system of equations.
Solving this “simpler” system is straightforward and is identical to the process in Example US. ⊠
The preceding example amply illustrates the definitions and theorems we have seen so far. But it still leaves two questions unanswered. Exactly what is this “simpler” form for a matrix, and just how do we get it? Here’s the answer to the first question, a definition of reduced row-echelon form.
Definition RREF
Reduced Row-Echelon Form
A matrix is in reduced row-echelon form if it meets all of the following
conditions:
A row of only zero entries will be called a zero row and the leftmost nonzero entry of a nonzero row will be called a leading 1. The number of nonzero rows will be denoted by r.
A column containing a leading 1 will be called a pivot column. The set of column indices for all of the pivot columns will be denoted by D = \left \{{d}_{1},\kern 1.95872pt {d}_{2},\kern 1.95872pt {d}_{3},\kern 1.95872pt \mathop{\mathop{…}},\kern 1.95872pt {d}_{r}\right \} where {d}_{1} < {d}_{2} < {d}_{3} < \mathrel{⋯} < {d}_{r}, while the columns that are not pivot columns will be denoted as F = \left \{{f}_{1},\kern 1.95872pt {f}_{2},\kern 1.95872pt {f}_{3},\kern 1.95872pt \mathop{\mathop{…}},\kern 1.95872pt {f}_{n−r}\right \} where {f}_{1} < {f}_{2} < {f}_{3} < \mathrel{⋯} < {f}_{n−r}.
(This definition contains Notation RREFA.) △
The principal feature of reduced row-echelon form is the pattern of leading 1’s guaranteed by conditions (2) and (4), reminiscent of a flight of geese, or steps in a staircase, or water cascading down a mountain stream.
There are a number of new terms and notation introduced in this definition, which should make you suspect that this is an important definition. Given all there is to digest here, we will save the use of D and F until Section TSS. However, one important point to make here is that all of these terms and notation apply to a matrix. Sometimes we will employ these terms and sets for an augmented matrix, and other times it might be a coefficient matrix. So always give some thought to exactly which type of matrix you are analyzing.
Example RREF
A matrix in reduced row-echelon form
The matrix C
is in reduced row-echelon form.
This matrix has two zero rows and three leading 1’s. r = 3. Columns 1, 5, and 6 are pivot columns. ⊠
Example NRREF
A matrix not in reduced row-echelon form
The matrix E
is not in reduced row-echelon form, as it fails each of the four requirements
once.
Our next theorem has a “constructive” proof. Learn about the meaning of this term in Technique C.
Theorem REMEF
Row-Equivalent Matrix in Echelon Form
Suppose A is a matrix.
Then there is a matrix B
so that
Proof Suppose that A has m rows and n columns. We will describe a process for converting A into B via row operations. This procedure is known as Gauss–Jordan elimination. Tracing through this procedure will be easier if you recognize that i refers to a row that is being converted, j refers to a column that is being converted, and r keeps track of the number of nonzero rows. Here we go.
The result of this procedure is that the matrix A is converted to a matrix in reduced row-echelon form, which we will refer to as B. We need to now prove this claim by showing that the converted matrix has the requisite properties of Definition RREF. First, the matrix is only converted through row operations (Step 6, Step 7, Step 8), so A and B are row-equivalent (Definition REM).
It is a bit more work to be certain that B is in reduced row-echelon form. We claim that as we begin Step 2, the first j columns of the matrix are in reduced row-echelon form with r nonzero rows. Certainly this is true at the start when j = 0, since the matrix has no columns and so vacuously meets the conditions of Definition RREF with r = 0 nonzero rows.
In Step 2 we increase j by 1 and begin to work with the next column. There are two possible outcomes for Step 3. Suppose that every entry of column j in rows r + 1 through m is zero. Then with no changes we recognize that the first j columns of the matrix has its first r rows still in reduced-row echelon form, with the final m − r rows still all zero.
Suppose instead that the entry in row i of column j is nonzero. Notice that since r + 1 ≤ i ≤ m, we know the first j − 1 entries of this row are all zero. Now, in Step 5 we increase r by 1, and then embark on building a new nonzero row. In Step 6 we swap row r and row i. In the first j columns, the first r − 1 rows remain in reduced row-echelon form after the swap. In Step 7 we multiply row r by a nonzero scalar, creating a 1 in the entry in column j of row i, and not changing any other rows. This new leading 1 is the first nonzero entry in its row, and is located to the right of all the leading 1’s in the preceding r − 1 rows. With Step 8 we insure that every entry in the column with this new leading 1 is now zero, as required for reduced row-echelon form. Also, rows r + 1 through m are now all zeros in the first j columns, so we now only have one new nonzero row, consistent with our increase of r by one. Furthermore, since the first j − 1 entries of row r are zero, the employment of the third row operation does not destroy any of the necessary features of rows 1 through r − 1 and rows r + 1 through m, in columns 1 through j − 1.
So at this stage, the first j columns of the matrix are in reduced row-echelon form. When Step 2 finally increases j to n + 1, then the procedure is completed and the full n columns of the matrix are in reduced row-echelon form, with the value of r correctly recording the number of nonzero rows. ■
The procedure given in the proof of Theorem REMEF can be more precisely described using a pseudo-code version of a computer program, as follows:
input m,
n
and A
r ← 0
for j ← 1
to n
i ← r + 1
while i ≤ m
and {\left [A\right ]}_{ij} = 0
i ← i + 1
if i\mathrel{≠}m + 1
r ← r + 1
swap rows i
and r
of A
(row op 1)
scale entry in row r,
column j
of A
to a leading 1 (row op 2)
for k ← 1\text{ to }m,
k\mathrel{≠}r
zero out entry in row k,
column j
of A
(row op 3 using row r)
output r
and A
Notice that as a practical matter the “and” used in the conditional statement of the while statement should be of the “short-circuit” variety so that the array access that follows is not out-of-bounds.
So now we can put it all together. Begin with a system of linear equations (Definition SLE), and represent the system by its augmented matrix (Definition AM). Use row operations (Definition RO) to convert this matrix into reduced row-echelon form (Definition RREF), using the procedure outlined in the proof of Theorem REMEF. Theorem REMEF also tells us we can always accomplish this, and that the result is row-equivalent (Definition REM) to the original augmented matrix. Since the matrix in reduced-row echelon form has the same solution set, we can analyze the row-reduced version instead of the original matrix, viewing it as the augmented matrix of a different system of equations. The beauty of augmented matrices in reduced row-echelon form is that the solution sets to their corresponding systems can be easily determined, as we will see in the next few examples and in the next section.
We will see through the course that almost every interesting property of a matrix can be discerned by looking at a row-equivalent matrix in reduced row-echelon form. For this reason it is important to know that the matrix B guaranteed to exist by Theorem REMEF is also unique.
Two proof techniques are applicable to the proof. First, head out and read two proof techniques: Technique CD and Technique U.
Theorem RREFU
Reduced Row-Echelon Form is Unique
Suppose that A
is an m × n matrix
and that B
and C are
m × n matrices that are row-equivalent
to A and in reduced
row-echelon form. Then B = C.
□
Proof We need to begin with no assumptions about any relationships between B and C, other than they are both in reduced row-echelon form, and they are both row-equivalent to A.
If B and C are both row-equivalent to A, then they are row-equivalent to each other. Repeated row operations on a matrix combine the rows with each other using operations that are linear, and are identical in each column. A key observation for this proof is that each individual row of B is linearly related to the rows of C. This relationship is different for each row of B, but once we fix a row, the relationship is the same across columns. More precisely, there are scalars {δ}_{ik}, 1 ≤ i,k ≤ m such that for any 1 ≤ i ≤ m, 1 ≤ j ≤ n,
You should read this as saying that an entry of row i of B (in column j) is a linear function of the entries of all the rows of C that are also in column j, and the scalars ({δ}_{ik}) depend on which row of B we are considering (the i subscript on {δ}_{ik}), but are the same for every column (no dependence on j in {δ}_{ik}). This idea may be complicated now, but will feel more familiar once we discuss “linear combinations” (Definition LCCV) and moreso when we discuss “row spaces” (Definition RSM). For now, spend some time carefully working Exercise RREF.M40, which is designed to illustrate the origins of this expression. This completes our exploitation of the row-equivalence of B and C.
We now repeatedly exploit the fact that B and C are in reduced row-echelon form. Recall that a pivot column is all zeros, except a single one. More carefully, if R is a matrix in reduced row-echelon form, and {d}_{ℓ} is the index of a pivot column, then {\left [R\right ]}_{k{d}_{ℓ}} = 1 precisely when k = ℓ and is otherwise zero. Notice also that any entry of R that is both below the entry in row ℓ and to the left of column {d}_{ℓ} is also zero (with below and left understood to include equality). In other words, look at examples of matrices in reduced row-echelon form and choose a leading 1 (with a box around it). The rest of the column is also zeros, and the lower left “quadrant” of the matrix that begins here is totally zeros.
Assuming no relationship about the form of B and C, let B have r nonzero rows and denote the pivot columns as D = \left \{{d}_{1},\kern 1.95872pt {d}_{2},\kern 1.95872pt {d}_{3},\kern 1.95872pt \mathop{\mathop{…}},\kern 1.95872pt {d}_{r}\right \}. For C let {r}^{′} denote the number of nonzero rows and denote the pivot columns as {D}^{′} = \left \{{{d}^{′}}_{ 1},\kern 1.95872pt {{d}^{′}}_{ 2},\kern 1.95872pt {{d}^{′}}_{ 3},\kern 1.95872pt \mathop{\mathop{…}},\kern 1.95872pt {{d}^{′}}_{{ r}^{′}}\right \} (Notation RREFA). There are four steps in the proof, and the first three are about showing that B and C have the same number of pivot columns, in the same places. In other words, the “primed” symbols are a necessary fiction.
First Step. Suppose that {d}_{1} < {d}_{1}^{′}. Then
The entries of C are all zero since they are left and below of the leading 1 in row 1 and column {d}_{1}^{′} of C. This is a contradiction, so we know that {d}_{1} ≥ {d}_{1}^{′}. By an entirely similar argument, reversing the roles of B and C, we could conclude that {d}_{1} ≤ {d}_{1}^{′}. Together this means that {d}_{1} = {d}_{1}^{′}.
Second Step. Suppose that we have determined that {d}_{1} = {d}_{1}^{′}, {d}_{2} = {d}_{2}^{′}, {d}_{3} = {d}_{3}^{′}, …, {d}_{p} = {d}_{p}^{′}. Let’s now show that {d}_{p+1} = {d}_{p+1}^{′}. Working towards a contradiction, suppose that {d}_{p+1} < {d}_{p+1}^{′}. For 1 ≤ ℓ ≤ p,
Now,
This contradiction shows that {d}_{p+1} ≥ {d}_{p+1}^{′}. By an entirely similar argument, we could conclude that {d}_{p+1} ≤ {d}_{p+1}^{′}, and therefore {d}_{p+1} = {d}_{p+1}^{′}.
Third Step. Now we establish that r = {r}^{′}. Suppose that {r}^{′} < r. By the arguments above know that {d}_{1} = {d}_{1}^{′}, {d}_{2} = {d}_{2}^{′}, {d}_{3} = {d}_{3}^{′}, …, {d}_{{r}^{′}} = {d}_{{r}^{′}}^{′}. For 1 ≤ ℓ ≤ {r}^{′} < r,
Now examine the entries of row r of B,
So row r is a totally zero row, contradicting that this should be the bottommost nonzero row of B. So {r}^{′} ≥ r. By an entirely similar argument, reversing the roles of B and C, we would conclude that {r}^{′} ≤ r and therefore r = {r}^{′}. Thus, combining the first three steps we can say that D = {D}^{′}. In other words, B and C have the same pivot columns, in the same locations.
Fourth Step. In this final step, we will not argue by contradiction. Our intent is to determine the values of the {δ}_{ij}. Notice that we can use the values of the {d}_{i} interchangeably for B and C. Here we go,
and for ℓ\mathrel{≠}i
Finally, having determined the values of the {δ}_{ij}, we can show that B = C. For 1 ≤ i ≤ m, 1 ≤ j ≤ n,
So B and C have equal values in every entry, and so are the same matrix. ■
We will now run through some examples of using these definitions and theorems to solve some systems of equations. From now on, when we have a matrix in reduced row-echelon form, we will mark the leading 1’s with a small box. In your work, you can box ’em, circle ’em or write ’em in a different color — just identify ’em somehow. This device will prove very useful later and is a very good habit to start developing right now.
Example SAB
Solutions for Archetype B
Let’s find the solutions to the following system of equations,
First, form the augmented matrix,
and work to reduced row-echelon form, first with i = 1,
Now, with i = 2,
And finally, with i = 3,
This is now the augmented matrix of a very simple system of equations, namely {x}_{1} = −3, {x}_{2} = 5, {x}_{3} = 2, which has an obvious solution. Furthermore, we can see that this is the only solution to this system, so we have determined the entire solution set,
You might compare this example with the procedure we used in Example US. ⊠
Archetypes A and B are meant to contrast each other in many respects. So let’s solve Archetype A now.
Example SAA
Solutions for Archetype A
Let’s find the solutions to the following system of equations,
First, form the augmented matrix,
and work to reduced row-echelon form, first with i = 1,
Now, with i = 2,
The system of equations represented by this augmented matrix needs to be considered a bit differently than that for Archetype B. First, the last row of the matrix is the equation 0 = 0, which is always true, so it imposes no restrictions on our possible solutions and therefore we can safely ignore it as we analyze the other two equations. These equations are,
While this system is fairly easy to solve, it also appears to have a multitude of solutions. For example, choose {x}_{3} = 1 and see that then {x}_{1} = 2 and {x}_{2} = 3 will together form a solution. Or choose {x}_{3} = 0, and then discover that {x}_{1} = 3 and {x}_{2} = 2 lead to a solution. Try it yourself: pick any value of {x}_{3} you please, and figure out what {x}_{1} and {x}_{2} should be to make the first and second equations (respectively) true. We’ll wait while you do that. Because of this behavior, we say that {x}_{3} is a “free” or “independent” variable. But why do we vary {x}_{3} and not some other variable? For now, notice that the third column of the augmented matrix does not have any leading 1’s in its column. With this idea, we can rearrange the two equations, solving each for the variable that corresponds to the leading 1 in that row.
To write the set of solution vectors in set notation, we have
We’ll learn more in the next section about systems with infinitely many solutions and how to express their solution sets. Right now, you might look back at Example IS. ⊠
Example SAE
Solutions for Archetype E
Let’s find the solutions to the following system of equations,
First, form the augmented matrix,
and work to reduced row-echelon form, first with i = 1,
Now, with i = 2,
And finally, with i = 3,
Let’s analyze the equations in the system represented by this augmented matrix. The third equation will read 0 = 1. This is patently false, all the time. No choice of values for our variables will ever make it true. We’re done. Since we cannot even make the last equation true, we have no hope of making all of the equations simultaneously true. So this system has no solutions, and its solution set is the empty set, ∅ = \left \{\ \right \} (Definition ES).
Notice that we could have reached this conclusion sooner. After performing the row operation − 7{R}_{2} + {R}_{3}, we can see that the third equation reads 0 = −5, a false statement. Since the system represented by this matrix has no solutions, none of the systems represented has any solutions. However, for this example, we have chosen to bring the matrix fully to reduced row-echelon form for the practice. ⊠
These three examples (Example SAB, Example SAA, Example SAE) illustrate the full range of possibilities for a system of linear equations — no solutions, one solution, or infinitely many solutions. In the next section we’ll examine these three scenarios more closely.
Definition RR
Row-Reducing
To row-reduce the matrix A
means to apply row operations to A
and arrive at a row-equivalent matrix
B in reduced
row-echelon form. △
So the term row-reduce is used as a verb. Theorem REMEF tells us that this process will always be successful and Theorem RREFU tells us that the result will be unambiguous. Typically, the analysis of A will proceed by analyzing B and applying theorems whose hypotheses include the row-equivalence of A and B.
After some practice by hand, you will want to use your favorite computing device to do the computations required to bring a matrix to reduced row-echelon form (Exercise RREF.C30). See: Computation RR.MMA Computation RR.TI86 Computation RR.TI83 Computation RR.SAGE
\left [\array{
1&5&0&6&8\cr
0&0 &1 &2 &0
\cr
0&0&0&0&1} \right ]
|
\left [\array{
2 &1& 8\cr
−1 &1 &−1
\cr
−2&5& 4} \right ]
|
C05 Each archetype below is a system of equations. Form the augmented
matrix of the system of equations, convert the matrix to reduced row-echelon form
by using equation operations and then describe the solution set of the original
system of equations.
Archetype A
Archetype B
Archetype C
Archetype D
Archetype E
Archetype F
Archetype G
Archetype H
Archetype I
Archetype J
Contributed by Robert Beezer
For problems C10–C19, find all solutions to the system of linear equations. Use
your favorite computing device to row-reduce the augmented matrices for the
systems, and write the solutions as a set, using correct set notation.
C10
Contributed by Robert Beezer Solution [126]
Contributed by Robert Beezer Solution [127]
Contributed by Robert Beezer Solution [127]
Contributed by Robert Beezer Solution [128]
Contributed by Robert Beezer Solution [129]
Contributed by Robert Beezer Solution [131]
Contributed by Robert Beezer Solution [132]
Contributed by Robert Beezer Solution [133]
Contributed by Robert Beezer Solution [134]
Contributed by Robert Beezer Solution [135]
For problems C30–C33, row-reduce the matrix without the aid of a calculator,
indicating the row operations you are using at each step using the notation of
Definition RO.
C30
Contributed by Robert Beezer Solution [137]
Contributed by Robert Beezer Solution [137]
Contributed by Robert Beezer Solution [138]
Contributed by Robert Beezer Solution [139]
M40 Consider the two 3 × 4 matrices below
(a) Row-reduce each matrix and determine that the reduced row-echelon forms of B and C are identical. From this argue that B and C are row-equivalent.
(b) In the proof of Theorem RREFU, we begin by arguing that entries of row-equivalent matrices are related by way of certain scalars and sums. In this example, we would write that entries of B from row i that are in column j are linearly related to the entries of C in column j from all three rows
For each 1 ≤ i ≤ 3
find the corresponding three scalars in this relationship. So your answer will be
nine scalars, determined three at a time.
Contributed by Robert Beezer Solution [139]
M45 You keep a number of lizards, mice and peacocks as pets. There are a total
of 108 legs and 30 tails in your menagerie. You have twice as many mice as
lizards. How many of each creature do you have?
Contributed by Chris Black Solution [142]
M50 A parking lot has 66 vehicles (cars, trucks, motorcycles and bicycles) in it.
There are four times as many cars as trucks. The total number of tires (4 per car
or truck, 2 per motorcycle or bicycle) is 252. How many cars are there? How many
bicycles?
Contributed by Robert Beezer Solution [143]
T10 Prove that each of the three row operations (Definition RO) is reversible. More precisely,
if the matrix B is
obtained from A
by application of a single row operation, show that there is a single row operation that will
transform B
back into A.
Contributed by Robert Beezer Solution [145]
T11 Suppose that A, B and C are m × n matrices. Use the definition of row-equivalence (Definition REM) to prove the following three facts.
A relationship that satisfies these three properties is known as an equivalence
relation, an important idea in the study of various algebras. This is a formal way
of saying that a relationship behaves like equality, without requiring the
relationship to be as strict as equality itself. We’ll see it again in Theorem SER.
Contributed by Robert Beezer
T12 Suppose that B
is an m × n
matrix in reduced row-echelon form. Build a new, likely smaller,
k × ℓ matrix
C as follows. Keep any
collection of k adjacent
rows, k ≤ m. From these
rows, keep columns 1
through ℓ,
ℓ ≤ n. Prove
that C
is in reduced row-echelon form.
Contributed by Robert Beezer
T13 Generalize Exercise RREF.T12 by just keeping any
k rows,
and not requiring the rows to be adjacent. Prove that any such matrix
C is in
reduced row-echelon form.
Contributed by Robert Beezer
C10 Contributed by Robert Beezer Statement [114]
The augmented matrix row-reduces to
\left [\array{
\text{1}&0&0&0& 1\cr
0&\text{1 } &0 &0 &−3
\cr
0&0&\text{1}&0&−4\cr
0&0 &0 &\text{1 } & 1 } \right ]
|
and we see from the locations of the leading 1’s that the system is consistent (Theorem RCLS) and that n − r = 4 − 4 = 0 and so the system has no free variables (Theorem CSRN) and hence has a unique solution. This solution is
C11 Contributed by Robert Beezer Statement [115]
The augmented matrix row-reduces to
\left [\array{
\text{1}&0& 1 & 4∕5 &0
\cr
0&\text{1}&−1&−1∕10&0
\cr
0&0& 0 & 0 &\text{1} } \right ]
|
and a leading 1 in the last column tells us that the system is inconsistent (Theorem RCLS). So the solution set is ∅ = \left \{\right \}.
C12 Contributed by Robert Beezer Statement [115]
The augmented matrix row-reduces to
\left [\array{
\text{1}&2&0&−4& 2\cr
0&0 &\text{1 } & 3 &−6
\cr
0&0&0& 0 & 0 } \right ]
|
(Theorem RCLS) and (Theorem CSRN) tells us the system is consistent and the solution set can be described with n − r = 4 − 2 = 2 free variables, namely {x}_{2} and {x}_{4}. Solving for the dependent variables (D = \left \{{x}_{1},\kern 1.95872pt {x}_{3}\right \}) the first and second equations represented in the row-reduced matrix yields,
As a set, we write this as
\left \{\left [\array{
2 − 2{x}_{2} + 4{x}_{4}
\cr
{x}_{2}
\cr
−6 − 3{x}_{4}
\cr
{x}_{4} } \right ]\mathrel{∣}{x}_{2},\kern 1.95872pt {x}_{4} ∈ {ℂ}^{}\right \}
|
C13 Contributed by Robert Beezer Statement [116]
The augmented matrix of the system of equations is
\left [\array{
1 &2& 8 &−7& −2\cr
3 &2 &12 &−5 & 6
\cr
−1&1& 1 &−5&−10 } \right ]
|
which row-reduces to
\left [\array{
\text{1}&0&2& 1 &0\cr
0&\text{1 } &3 &−4 &0
\cr
0&0&0& 0 &\text{1} } \right ]
|
With a leading one in the last column Theorem RCLS tells us the system of equations is inconsistent, so the solution set is the empty set, ∅.
C14 Contributed by Robert Beezer Statement [116]
The augmented matrix of the system of equations is
\left [\array{
2& 1 &7&−2& 4\cr
3&−2 &0 & 11 &13
\cr
1& 1 &5&−3& 1 } \right ]
|
which row-reduces to
\left [\array{
\text{1}&0&2& 1 & 3\cr
0&\text{1 } &3 &−4 &−2
\cr
0&0&0& 0 & 0 } \right ]
|
Then D = \left \{1, 2\right \} and F = \left \{3, 4, 5\right \}, so the system is consistent (5∉D) and can be described by the two free variables {x}_{3} and {x}_{4}. Rearranging the equations represented by the two nonzero rows to gain expressions for the dependent variables {x}_{1} and {x}_{2}, yields the solution set,
S = \left \{\left [\array{
3 − 2{x}_{3} − {x}_{4}
\cr
−2 − 3{x}_{3} + 4{x}_{4}
\cr
{x}_{3}
\cr
{x}_{4} } \right ]\mathrel{∣}{x}_{3},\kern 1.95872pt {x}_{4} ∈ {ℂ}^{}\right \}
|
C15 Contributed by Robert Beezer Statement [117]
The augmented matrix of the system of equations is
\left [\array{
2 &3&−1&−9&−16\cr
1 &2 & 1 & 0 & 0
\cr
−1&2& 3 & 4 & 8 } \right ]
|
which row-reduces to
\left [\array{
\text{1}&0&0& 2 & 3\cr
0&\text{1 } &0 &−3 &−5
\cr
0&0&\text{1}& 4 & 7 } \right ]
|
Then D = \left \{1, 2, 3\right \} and F = \left \{4, 5\right \}, so the system is consistent (5∉D) and can be described by the one free variable {x}_{4}. Rearranging the equations represented by the three nonzero rows to gain expressions for the dependent variables {x}_{1}, {x}_{2} and {x}_{3}, yields the solution set,
S = \left \{\left [\array{
3 − 2{x}_{4}
\cr
−5 + 3{x}_{4}
\cr
7 − 4{x}_{4}
\cr
{x}_{4} } \right ]\mathrel{∣}{x}_{4} ∈ {ℂ}^{}\right \}
|
C16 Contributed by Robert Beezer Statement [117]
The augmented matrix of the system of equations is
\left [\array{
2 &3&19&−4&2\cr
1 &2 &12 &−3 &1
\cr
−1&2& 8 &−5&1 } \right ]
|
which row-reduces to
\left [\array{
\text{1}&0&2& 1 &0\cr
0&\text{1 } &5 &−2 &0
\cr
0&0&0& 0 &\text{1} } \right ]
|
With a leading one in the last column Theorem RCLS tells us the system of equations is inconsistent, so the solution set is the empty set, ∅ = \left \{\right \}.
C17 Contributed by Robert Beezer Statement [118]
We row-reduce the augmented matrix of the system of equations,
The reduced row-echelon form of the matrix is the augmented matrix of the system {x}_{1} = 3, {x}_{2} = −1, {x}_{3} = 2, {x}_{4} = 5, which has a unique solution. As a set of column vectors, the solution set is
C18 Contributed by Robert Beezer Statement [118]
We row-reduce the augmented matrix of the system of equations,
With no leading 1 in the final column, we recognize the system as consistent (Theorem RCLS). Since the system is consistent, we compute the number of free variables as n − r = 5 − 3 = 2 (), and we see that columns 3 and 5 are not pivot columns, so {x}_{3} and {x}_{5} are free variables. We convert each row of the reduced row-echelon form of the matrix into an equation, and solve it for the lone dependent variable, as in expression in the two free variables.
These expressions give us a convenient way to describe the solution set, S.
C19 Contributed by Robert Beezer Statement [119]
We form the augmented matrix of the system,
which row-reduces to
With no leading 1 in the final column, this system is consistent (Theorem RCLS). There are n = 2 variables in the system and r = 2 non-zero rows in the row-reduced matrix. By Theorem FVCS, there are n − r = 2 − 2 = 0 free variables and we therefore know the solution is unique. Forming the system of equations represented by the row-reduced matrix, we see that {x}_{1} = 4 and {x}_{2} = −2. Written as set of column vectors,
C30 Contributed by Robert Beezer Statement [120]
C31 Contributed by Robert Beezer Statement [120]
C32 Contributed by Robert Beezer Statement [121]
Following the algorithm of Theorem REMEF, and working to create pivot
columns from left to right, we have
C33 Contributed by Robert Beezer Statement [121]
Following the algorithm of Theorem REMEF, and working to create pivot
columns from left to right, we have
M40 Contributed by Robert Beezer Statement [121]
(a) Let R
be the common reduced row-echelon form of
B and
C. A sequence of row
operations converts B
to R
and a second sequence of row operations converts
C to
R. If
we “reverse” the second sequence’s order, and reverse each individual
row operation (see Exercise RREF.T10) then we can begin with
B, convert to
R with the first sequence,
and then convert to C
with the reversed sequence. Satisfying Definition REM we can say
B and
C are
row-equivalent matrices.
(b) We will work this carefully for the first row of B and just give the solution for the next two rows. For row 1 of B take i = 1 and we have
If we substitute the four values for j we arrive at four linear equations in the three unknowns {δ}_{11},{δ}_{12},{δ}_{13},
We form the augmented matrix of this system and row-reduce to find the solutions,
So the unique solution is {δ}_{11} = 2, {δ}_{12} = −3, {δ}_{13} = −2. Entirely similar work will lead you to
}
M45 Contributed by Chris Black Statement [123]
Let l,m,p
denote the number of lizards, mice and peacocks. Then the statements from the
problem yield the equations:
The augmented matrix for this system is
\left [\array{
4& 4 &2&108\cr
1& 1 &1 & 30
\cr
2&−1&0& 0 } \right ]
|
which row-reduces to
\left [\array{
\text{1}&0&0& 8\cr
0&\text{1 } &0 &16
\cr
0&0&\text{1}& 6 } \right ]
|
From the row-reduced matrix, we see that we have an equivalent system l = 8, m = 16, and p = 6, which means that you have 8 lizards, 16 mice and 6 peacocks.
M50 Contributed by Robert Beezer Statement [123]
Let c,\kern 1.95872pt t,\kern 1.95872pt m,\kern 1.95872pt b
denote the number of cars, trucks, motorcycles, and bicycles. Then the statements
from the problem yield the equations:
The augmented matrix for this system is
\left [\array{
1& 1 &1&1& 66\cr
1&−4 &0 &0 & 0
\cr
4& 4 &2&2&252 } \right ]
|
which row-reduces to
\left [\array{
\text{1}&0&0&0&48\cr
0&\text{1 } &0 &0 &12
\cr
0&0&\text{1}&1& 6 } \right ]
|
The first row of the matrix represents the equation c = 48, so there are 48 cars. The second row of the matrix represents the equation t = 12, so there are 12 trucks. The third row of the matrix represents the equation m + b = 6 so there are anywhere from 0 to 6 bicycles. We can also say that b is a free variable, but the context of the problem limits it to 7 integer values since you cannot have a negative number of motorcycles.
T10 Contributed by Robert Beezer Statement [123]
If we can reverse each row operation individually, then we can reverse a sequence
of row operations. The operations that reverse each operation are listed below,
using our shorthand notation,