Section F Fields

We have chosen to present introductory linear algebra in the Core (Part C) using scalars from the set of complex numbers, ℂ. We could have instead chosen to use scalars from the set of real numbers, {ℝ}^{}. This would have presented certain difficulties when we encountered characteristic polynomials with complex roots (Definition CP) or when we needed to be sure every matrix had at least one eigenvalue (Theorem EMHE). However, much of the basics would be unchanged. The definition of a vector space would not change, nor would the ideas of linear independence, spanning, or bases. Linear transformations would still behave the same and we would still obtain matrix representations, though our ideas about canonical forms would have to be adjusted slightly.

The real numbers and the complex numbers are both examples of what are called fields, and we can “do” linear algebra in just a bit more generality by letting our scalars take values from some unspecified field. So in this section we will describe exactly what constitutes a field, give some finite examples, and discuss another connection between fields and vector spaces. Vector spaces over finite fields are very important in certain applications, so this is partially background for other topics. As such, we will not prove every claim we make.

Subsection F: Fields

Like a vector space, a field is a set along with two binary operations. The distinction is that both operations accept two elements of the set, and then produce a new element of the set. In a vector space we have two sets — the vectors and the scalars, and scalar multiplication mixes one of each to produce a vector. Here is the careful definition of a field.

Definition F
Field
Suppose that F is a set upon which we have defined two operations: (1) addition, which combines two elements of F and is denoted by “+”, and (2) multiplication, which combines two elements of F and is denoted by juxtaposition. Then F, along with the two operations, is a field if the following properties hold.

Mostly this definition says that all the good things you might expect, really do happen in a field. The one technicality is that the special element, 0, the additive identity element, does not have a multiplicative inverse. In other words, no dividing by zero.

This definition should remind you of Theorem PCNA, and indeed, Theorem PCNA provides the justification for the statement that the complex numbers form a field. Another example of field is the set of rational numbers

\eqalignno{ ℚ = \left \{{p\over q}\mathrel{∣}p,\kern 1.95872pt q\text{ are integers, }q\mathrel{≠}0\right \} & & }

Of course, the real numbers, {ℝ}^{}, also form a field. It is this field that you probably studied for many years. You began studying the integers (“counting”), then the rationals (“fractions”), then the reals (“algebra”), along with some excursions in the complex numbers (“imaginary numbers”). So you should have seen three fields already in your previous studies.

Our first observation about fields is that we can go back to our definition of a vector space (Definition VS) and replace every occurrence of ℂ by some general, unspecified field, F, and all our subsequent definitions and theorems are still true, so long as we avoid roots of polynomials (or equivalently, factoring polynomials). So if you consult more advanced texts on linear algebra, you will see this sort of approach. You might study some of the first theorems we proved about vector spaces in Subsection VS.VSP and work through their proofs in the more general setting of an arbitrary field. This exercise should convince you that very little changes when we move from ℂ to an arbitrary field F. (See Exercise F.T10.)

Subsection FF: Finite Fields

It may sound odd at first, but there exist finite fields, and even finite vector spaces. We will find certain of these important in subsequent applications, so we collect some ideas and properties here.

Definition IMP
Integers Modulo a Prime
Suppose that p is a prime number. Let {ℤ}_{p} = \left \{0,\kern 1.95872pt 1,\kern 1.95872pt 2,\kern 1.95872pt \mathop{\mathop{…}},\kern 1.95872pt p − 1\right \}. Add and multiply elements of {ℤ}_{p} as integers, but whenever a result lies outside of the set {ℤ}_{p}, find its remainder after division by p and replace the result by this remainder. △

Example IM11
Integers mod 11
{ℤ}_{11} is a field by Theorem FIMP. Here we provide some sample calculations.

\eqalignno{ 8 + 5 & = 2 & − 8 & = 3 &5 − 9 & = 7 & & & & & & \cr 5(7) & = 2 &{1\over 7} & = 8 &{6\over 5} & = 10 & & & & & & \cr {2}^{5} & = 10 & − 1 & = 10 &{1\over 0} & = \text{?} & & & & & & }

Example VSIM5
Vector space over integers mod 5
Let {\left ({ℤ}_{5}\right )}^{3} be the set of all column vectors of length 3 with entries from {ℤ}_{5}. Use {ℤ}_{5} as the set of scalars. Define addition and multiplication the usual way. We exhibit a few sample calculations.

\eqalignno{ \left [\array{ 2 \cr 3 \cr 4 } \right ] + \left [\array{ 4 \cr 1 \cr 3 } \right ] & = \left [\array{ 1 \cr 4 \cr 2 } \right ] &3\left [\array{ 2 \cr 0 \cr 4 } \right ] & = \left [\array{ 1 \cr 0 \cr 2 } \right ] & & & & }

\eqalignno{ 2\left [\array{ 1 \cr 3 \cr 0 } \right ] − 4\left [\array{ 2 \cr 1 \cr 1 } \right ] + \left [\array{ 1 \cr 2 \cr 4 } \right ] & = \left [\array{ 0 \cr 4 \cr 0 } \right ] & & }

\eqalignno{ \left \{\left [\array{ 1 \cr 3 \cr 1 } \right ],\kern 1.95872pt \left [\array{ 2 \cr 2 \cr 0 } \right ]\right \} & & }

\eqalignno{ \left \{\left [\array{ 1 \cr 3 \cr 1 } \right ],\kern 1.95872pt \left [\array{ 2 \cr 2 \cr 0 } \right ],\kern 1.95872pt \left [\array{ 4 \cr 3 \cr 2 } \right ]\right \} & & }

is linearly dependent, as can be seen from the relation of linear dependence formed by the scalars {a}_{1} = 2, {a}_{2} = 1 and {a}_{3} = 4. To find these scalars, one would take the same approach as Example LDS, but in performing row operations to solve a homogeneous system, you would need to take care that all scalar (field) operations are performed over {ℤ}_{5}, especially when multiplying a row by a scalar to make a leading entry equal to 1. One more observation about this example — the set

\eqalignno{ \left \{\left [\array{ 1 \cr 0 \cr 0 } \right ],\kern 1.95872pt \left [\array{ 1 \cr 1 \cr 0 } \right ],\kern 1.95872pt \left [\array{ 1 \cr 1 \cr 1 } \right ]\right \} & & }

is a basis for {\left ({ℤ}_{5}\right )}^{3}, since it is both linearly independent and spans {\left ({ℤ}_{5}\right )}^{3}. ⊠

In applications to computer science or electrical engineering, {ℤ}_{2} is the most important field, since it can be used to describe the binary nature of logic, circuitry, communications and their intertwined relationships. The vector space of column vectors with entries from {ℤ}_{2}, {\left ({ℤ}_{2}\right )}^{n}, with scalars taken from {ℤ}_{2} is the natural extension of this idea. Notice that {ℤ}_{2} has the minimum number of elements to be a field, since any field must contain a zero and a one (Property ZF, Property OF).

Example SM2Z7
Symmetric matrices of size 2 over {ℤ}_{7}
We can employ the field of integers modulo a prime to build other examples of vector spaces with novel fields of scalars. Define

\eqalignno{ {S}_{22}\left ({ℤ}_{7}\right ) & = \left \{\left [\array{ a&b \cr b&c } \right ]\mathrel{∣}a,\kern 1.95872pt b,\kern 1.95872pt c ∈ {ℤ}_{7}\right \} & & }

which is the set of all 2 × 2 symmetric matrices with entries from {ℤ}_{7}. Use the field {ℤ}_{7} as the set of scalars, and define vector addition and scalar multiplication in the natural way. The result will be a vector space.

Notice that the field of scalars is finite, as is the vector space, since there are {7}^{3} = 343 matrices in {S}_{22}\left ({ℤ}_{7}\right ). The set

\eqalignno{ \left \{\left [\array{ 1&0 \cr 0&0 } \right ],\kern 1.95872pt \left [\array{ 0&1 \cr 1&0 } \right ],\kern 1.95872pt \left [\array{ 0&0 \cr 0&1 } \right ]\right \} & & }

is a basis, so \mathop{ dim}\nolimits \left ({S}_{22}\left ({ℤ}_{7}\right )\right ) = 3. ⊠

In a more advanced algebra course it is possible to prove that the number of elements in a finite field must be of the form {p}^{n}, where p is a prime. We can’t go so far afield as to prove this here, but we can demonstrate an example.

Example FF8
Finite field of size 8
Define the set F as F = \left \{a + bt + c{t}^{2}\mathrel{∣}a,\kern 1.95872pt b,\kern 1.95872pt c ∈ {ℤ}_{ 2}\right \}. Add and multiply these quantities as polynomials in the variable t, but replace any occurrence of {t}^{3} by t + 1.

This defines a set, and the two operations on elements of that set. Do not be concerned with what t “is,” because it isn’t. t is just a handy device that makes the example a field. We’ll say a bit more about t when we finish. But first, some examples. Remember that 1 + 1 = 0 in {ℤ}_{2}. Addition is quite simple, for example,

\eqalignno{ \left (1 + t + {t}^{2}\right ) + \left (1 + {t}^{2}\right ) & = (1 + 1) + (1 + 0)t + (1 + 1){t}^{2} = t & & \cr & & }

\eqalignno{ \left (1 + t + {t}^{2}\right )\left (1 + {t}^{2}\right ) & = 1 + {t}^{2} + t + {t}^{3} + {t}^{2} + {t}^{4} & & \cr & = 1 + t + (1 + 1){t}^{2} + {t}^{3}\left (1 + t\right ) & & \cr & = 1 + t + \left (1 + t\right )\left (1 + t\right ) & & \cr & = 1 + t + 1 + t + t + {t}^{2} & & \cr & = (1 + 1) + (1 + 1 + 1)t + {t}^{2} & & \cr & = t + {t}^{2} & & \cr & & }

Every element has a multiplicative inverse (Property MIF). What is the inverse of t + {t}^{2}? Check that

\eqalignno{ \left (t + {t}^{2}\right )\left (1 + t\right ) & = t + {t}^{2} + {t}^{2} + {t}^{3} & & \cr & = t + (1 + 1){t}^{2} + \left (1 + t\right ) & & \cr & = t + 1 + t & & \cr & = 1 + (1 + 1)t & & \cr & = 1 & & }

So we can write {1\over t+{t}^{2}} = 1 + t. So that you may experiment, we give you the complete addition and multiplication tables for this field. Addition is simple, while multiplication is more interesting, so verify a few entries of each table. Because of the commutativity of addition and multiplication (Property CAF, Property CMF), we have just listed half of each table.

\array{ + &0&1&t &{t}^{2} &t + 1 &{t}^{2} + t &{t}^{2} + t + 1&{t}^{2} + 1 ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ \cr 0 &0&1&t &{t}^{2} &t + 1 &{t}^{2} + t &{t}^{2} + t + 1&{t}^{2} + 1 \cr 1 & &0&t + 1&{t}^{2} + 1&t &{t}^{2} + t + 1&{t}^{2} + t &{t}^{2} \cr t & & &0 &{t}^{2} + t&1 &{t}^{2} &{t}^{2} + 1 &{t}^{2} + t + 1 \cr {t}^{2} & & & &0 &{t}^{2} + t + 1&t &t + 1 &1 \cr t + 1 & & & & &0 &{t}^{2} + 1 &{t}^{2} &{t}^{2} + t \cr {t}^{2} + t & & & & & &0 &1 &t + 1 \cr {t}^{2} + t + 1& & & & & & &0 &t \cr {t}^{2} + 1 & & & & & & & &0}

\array{ ⋅ &0&1&t &{t}^{2} &t + 1 &{t}^{2} + t &{t}^{2} + t + 1&{t}^{2} + 1 ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ ̲ \cr 0 &0&0&0 &0 &0 &0 &0 &0 \cr 1 & &1&t &{t}^{2} &t + 1 &{t}^{2} + t &{t}^{2} + t + 1&{t}^{2} + 1 \cr t & & &{t}^{2}&t + 1 &{t}^{2} + t &{t}^{2} + t + 1&{t}^{2} + 1 &1 \cr {t}^{2} & & & &{t}^{2} + t&{t}^{2} + t + 1&{t}^{2} + 1 &1 &t \cr t + 1 & & & & &{t}^{2} + 1 &1 &t &{t}^{2} \cr {t}^{2} + t & & & & & &t &{t}^{2} &t + 1 \cr {t}^{2} + t + 1& & & & & & &1 + t &{t}^{2} + t \cr {t}^{2} + 1 & & & & & & & &{t}^{2} + t + 1}

Note that every element of F is a linear combination (with scalars from {ℤ}_{2}) of the polynomials 1, t, {t}^{2}. So B = \left \{1,\kern 1.95872pt t,\kern 1.95872pt {t}^{2}\right \} is a spanning set for F. Further, B is linearly independent since there is no nontrivial relation of linear dependence, and B is a basis. So \mathop{ dim}\nolimits \left (F\right ) = 3. Of course, this paragraph presumes that F is also a vector space over {ℤ}_{2} (which it is). ⊠

The defining relation for t ({t}^{3} = t + 1) in Example FF8 arises from the polynomial {t}^{3} + t + 1, which has no factorization with coefficients from {ℤ}_{2}. This is an example of an irreducible polynomial, which involves considerable theory to fully understand. In the exercises, we provide you with a few more irreducible polynomials to experiment with. See the suggested readings if you would like to learn more.

Trivially, every field (finite or otherwise) is a vector space. Suppose we begin with a field F. From this we know F has two binary operations defined on it. We need to somehow create a vector space from F, in a general way. First we need a set of vectors. That’ll be F. We also need a set of scalars. That’ll be F as well. How do we define the addition of two vectors? By the same rule that we use to add them when they are in the field. How do we define scalar multiplication? Since a scalar is an element of F, and a vector is an element of F, we can define scalar multiplication to be the same rule that we use to multiply the two elements as members of the field. With these definitions, F will be a vector space (Exercise F.T20). This is something of a trivial situation, since the set of vectors and the set of scalars are identical. In particular, do not confuse this with Example FF8 where the set of vectors has eight elements, and the set of scalars has just two elements.

Further Reading
Robert J. McEliece, Finite Fields for Scientists and Engineers. Kluwer Academic Publishers, 1987.

Rudolpf Lidl, Harald Niederreiter, Introduction to Finite Fields and Their Applications, Revised Edition. Cambridge University Press, 1994.

Subsection EXC: Exercises

C60 Consider the vector space {\left ({ℤ}_{5}\right )}^{4} composed of column vectors of size 4 with entries from {ℤ}_{5}. The matrix A is a square matrix composed of four such column vectors.

\eqalignno{ A & = \left [\array{ 3&3&0&3 \cr 1&2&3&0 \cr 1&1&0&2 \cr 4&2&2&1 } \right ] & & }

Find the inverse of A. Use this to find a solution to ℒS\kern -1.95872pt \left (A,\kern 1.95872pt b\right ) when

\eqalignno{ b & = \left [\array{ 3 \cr 3 \cr 2 \cr 0 } \right ] & & }

M10 Suppose we relax the restriction in Definition IMP to allow p to not be a prime. Will the construction given still be a field? Is {ℤ}_{6} a field? Can you generalize?
Contributed by Robert Beezer

\eqalignno{ F & = \left \{a + bt\mathrel{∣}a,\kern 1.95872pt b ∈ {ℤ}_{3}\right \} & & }

where {t}^{2} is consistently replaced by 2t + 1 in any intermediate results obtained with polynomial multiplication. Compute the first nine powers of t ({t}^{0} through {t}^{8}). Use this information to aid you in the construction of the multiplication table for this field. What is the multiplicative inverse of 2t?
Contributed by Robert Beezer

\eqalignno{ F & = \left \{a + bt\mathrel{∣}a,\kern 1.95872pt b ∈ {ℤ}_{5}\right \} & & }

where {t}^{2} is consistently replaced by t + 3 in any intermediate results obtained with polynomial multiplication. Compute the first 25 powers of t ({t}^{0} through {t}^{24}). Use this information to aid you in computing in this field. What is the multiplicative inverse of 2t? What is the multiplicative inverse of 4? What is the multiplicative inverse of 1 + 4t?

Find a basis for F as a vector space with {ℤ}_{5} used as the set of scalars.
Contributed by Robert Beezer

\eqalignno{ F & = \left \{a + bt + c{t}^{2} + d{t}^{3}\mathrel{∣}a,\kern 1.95872pt b,\kern 1.95872pt c,\kern 1.95872pt d ∈ {ℤ}_{ 2}\right \} & & }

where {t}^{4} is consistently replaced by t + 1 in any intermediate results obtained with polynomial multiplication. Compute the first 16 powers of t ({t}^{0} through {t}^{15}). Consider the set G = \left \{0,\kern 1.95872pt 1,\kern 1.95872pt {t}^{5},\kern 1.95872pt {t}^{10}\right \}. Then G will also be a finite field, a subfield of F. Construct the addition and multiplication tables for G. Notice that since both G and F are vector spaces over {ℤ}_{2}, and G ⊆ F, by Definition S, G is a subspace of F.
Contributed by Robert Beezer

T10 Give a new proof of Theorem ZVSM for a vector space whose scalars come from an arbitrary field F.
Contributed by Robert Beezer

T20 By applying Definition VS, prove that every field is also a vector space. (See the construction at the end of this section.)
Contributed by Robert Beezer

Subsection SOL: Solutions

C60 Contributed by Robert Beezer Statement [2456]
Remember that every computation must be done with arithmetic in the field, reducing any intermediate number outside of \left \{0,\kern 1.95872pt 1,\kern 1.95872pt 2,\kern 1.95872pt 3,\kern 1.95872pt 4\right \} to its remainder after division by 5.

The matrix inverse can be found with Theorem CINM (and we discover along the way that A is nonsingular). The inverse is

\eqalignno{ {A}^{−1} & = \left [\array{ 1&1&3&1 \cr 3&4&1&4 \cr 1&4&0&2 \cr 3&0&1&0 } \right ] & & }

Then by an application of Theorem SNCM the (unique) solution to the system will be

\eqalignno{ {A}^{−1}b & = \left [\array{ 1&1&3&1 \cr 3&4&1&4 \cr 1&4&0&2 \cr 3&0&1&0 } \right ]\left [\array{ 3 \cr 3 \cr 2 \cr 0 } \right ] & = \left [\array{ 2 \cr 3 \cr 0 \cr 1 } \right ] & & & & }