7
The Axiom of Choice (Revisited)

In Chapter 3 (see page 66), we first introduced the following principle, which is applied in many areas of mathematics.

Axiom of Choice. Let be an indexed function with nonempty terms. Then there is an indexed function such that , for all .

The above function , where for all , is called a choice function for . The axiom of choice is used to prove many theorems in mathematics. In real analysis, for example, the axiom of choice is (tacitly) applied to prove that a real valued function is continuous at a point if and only if for every sequence that converges to . It is thus said that continuity is equivalent to sequential continuity. The axiom of choice is also used to prove that every vector space has a basis.

In Chapter 3, it was shown that the axiom of choice implies the following theorem, which asserts the existence of another choice function .

Theorem 3.3.24 (AC). Let be a set of nonempty sets. Then there is a function such that for all .

It was then left as an exercise to show that Theorem 3.3.24 implies the axiom of choice. Hence, the theorem is actually equivalent to the axiom. Moreover, the axiom of choice is equivalent to a number of seemingly unrelated theorems. In this chapter, we will show that the axiom implies two other results that are also equivalent to it; but before we do this, we mention an application of the axiom of choice to topology, which concerns “product sets.”

Let be an indexed function with nonempty terms. The product set, denoted by , is defined by

(7.1)

Typically, when is an infinite set, the only way to conclude that this product set is nonempty is to appeal to the axiom of choice as follows: Let be a choice function such that for all . Define by . Then , and thus, the product is nonempty.

Product sets are used to create a variety of topological spaces and to prove Tychonoff’s compactness theorem, a classic result in topology. In Chapter 9 (see Exercise 19 on page 226), one is asked to complete a proof of a theorem on product sets due to Julius König.

Many important consequences of the axiom of choice can be proved from the following weaker version.

Countable Axiom of Choice. Suppose that is an indexed function with nonempty terms. If is countable, then there is a function such that , for all .

Thus, if is a countable set of nonempty sets, then the countable axiom of choice implies there is a function such that for all . The axiom of choice, of course, implies the countable axiom of choice, but not conversely.

Finally, we remark that the replacement axiom shall not be applied in this chapter; however, it will be used in Chapters 8 and 9.

7.1 Zorn’s Lemma

Zorn’s Lemma, also known as the maximum principle, is an important theorem about partially ordered sets that is normally used to prove the existence of a mathematical object when it cannot be explicitly produced. According to Max Zorn, he first formulated his principle at Hamburg in 1933. Zorn published this maximum principle in 1935. Amil Artin then used Zorn’s principle to establish theorems in algebra. Artin later proved that the principle implies the axiom of choice.

In 1935, Zorn proposed adding his maximum principle to ZF, the axioms of set theory. Although Zorn was not the first to suggest such a principle, he demonstrated how useful his formulation was in applications, particularly in topology, abstract algebra, and real analysis. Zorn also asserted (but did not prove) that his lemma and the axiom of choice are equivalent. To review the concepts of a maximal element and a chain, see Definitions 3.4.6 and 3.4.14.

Zorn’s Lemma 7.1.1 (AC). Let be a partially ordered set. If every chain in has an upper bound, then contains a maximal element.

Our proof of Zorn’s Lemma 7.1.1 will apply “proof by contradiction”; that is, we shall let be a partially ordered set in which every chain has an upper bound. We will then assume that has no maximal elements and thereby derive a contradiction. The proof will depend on a few lemmas and definitions, which we now present.

Let be a partially ordered set and . Recalling Definition 3.4.8, an element is called an upper bound for when satisfies . We will say that is a proper upper bound if, in addition, we have that .

Definition 7.1.2. Let be a partially ordered set in which is a chain. Then is the set of proper upper bounds for .

If is a poset and is a chain, then

Thus, is nonempty if and only if has proper upper bounds in .

Definition 7.1.3. Let be a chain in , where is a partially ordered set. We shall say that is cofinal in when and for all , there is a such that .

Lemma 7.1.4. Suppose that is a poset and let be a chain. If is cofinal in , then .

Lemma 7.1.5. Let be a partially ordered set in which every chain has an upper bound. If has no maximal elements, then every chain in has a proper upper bound, and hence, is nonempty.

Proof. Assume that has no maximal elements. Let be a chain in . Since every chain in has an upper bound, let be an upper bound for . If , then is a proper upper bound for . Suppose . So, because is not a maximal element, there is a such that . Thus, and is a proper upper bound for .

We now present a proof of Zorn’s Lemma.

Proof (of Zorn’s Lemma 7.1.1). Let be a partially ordered set in which every chain has an upper bound in . We will prove that there is an such that is a maximal element. Assume, for a contradiction, that has no maximal elements. By Lemma 7.1.5 and the axiom of choice, there is a function such that whenever is a chain in . As , we see that

(7.2)

A chain in shall be called a ladder if whenever and , then is the smallest element in (see Definition 3.4.11 on page 73). In other words, if and has a proper upper bound in , then and is the smallest element in that is a proper upper bound for .

Ladders exist. To confirm this, note that is (vacuously) a ladder and that . For another example, let . Since is a chain, we have that . One can show that the set is also a ladder.

Claim 1. Suppose where and are ladders. Let and . If , then .

Proof. Let and are as in the claim. Let and . Assume that . Suppose, for a contradiction, that . Thus, . Let be defined by

(7.3)

So, . Since , we also have that . Clearly, is a proper upper bound for . Because is a ladder, it follows that

(7.4)

Now, because and , the set also has a proper upper bound in . Thus, . Since and , (7.4) implies that . Hence, by (7.3), and this contradicts (7.2). (Claim 1)  

Claim 2. Suppose that and are ladders. Then either or .

Proof. Let and be ladders. Assume, for a contradiction, that and . Hence, there is an so that and a so that . Let

(7.5)

Clearly, and . Because and is a proper upper bound for , it follows that and . Similarly, as and is a proper upper bound for , we see that and . So, in summary, we have

So by (1), and by (2). Since and , we conclude from (7.5) that , contradicting (7.2). (Claim 2)  

Now let . By the power set and subset axioms, we see that is a set. Therefore, is the set of all ladders.

Claim 3. is a ladder.

Proof. To see that is a chain, let and be in . Hence, for some , and for some . By Claim 2, either or . So, we can assume that and are in . As is a chain, either or . Thus, is a chain. Suppose has a proper upper bound in , say, . We shall show that and . Since , we have that for some . We will now show that . Let . So, . Because , it follows that for some . By Claim 2, either or . If , then . If , then (as and ) Claim 1 implies that . Hence, , and is a proper upper bound for . Since is a ladder, and . Consequently, and . Therefore, is a ladder. (Claim 3)  

Let . As is the set of all ladders, () whenever is a ladder. Claim 3 states that is a ladder. Since is a chain, we have by assumption that is nonempty. Thus, . Let .

Claim 4. is a ladder.

Proof. Since is a chain and for each , it follows that is a chain. Let have a proper upper bound in . Thus, and . If , then and , because is a ladder. If , then , and there are two cases to consider.

CASE 1: is not cofinal in . Thus, has a proper upper bound in . Hence, and as is a ladder. Therefore, and because .

CASE 2: is cofinal in . Lemma 7.1.4 implies that . So we clearly have that and . (Claim 4)  

Since is a ladder, () implies that . As , we conclude that , and this contradicts (7.2). (Zorn’s Lemma)

Hence, the axiom of choice implies Zorn’s Lemma 7.1.1. Moreover, one can also prove that Zorn’s Lemma implies the axiom of choice (see Exercise 10). A different proof of Zorn’s Lemma is summarized in Exercise 14 on page 193; however, this alternative proof relies on ordinals and the replacement axiom.

Corollary 7.1.6 (AC). Suppose that is a partially ordered set in which every chain has an upper bound. For each , there exists a maximal element such that .

Proof. Let be a poset in which every chain has an upper bound, and let . Define and . Thus, is a partially ordered set (see Exercise 14 on page 76) in which every chain has an upper bound. By Zorn’s Lemma, has a maximal element . So and is a maximal element in .

7.1.1 Two Applications of Zorn’s Lemma

Zorn’s Lemma is frequently applied to posets of the form where is a set (of sets) and is the subset relation (see Problem 1 on page 70). In such applications, one proceeds as described in Exercise 5. We will now present two theorems whose proofs apply Zorn’s Lemma as outlined by Exercise 5.

The first theorem, often called the comparability theorem, shows that one can compare the cardinalities of any two sets (see Definition 5.4.14). First, we shall state and prove a relevant lemma.

Lemma 7.1.7. Let . If is a maximal element in the poset , then either or .

Proof. Let be the poset defined in the statement of the lemma, and let be a maximal element in . Clearly, and . We shall prove the either or . Suppose, for a contradiction, that and . Let and . Now let . Clearly, . Because and , it follows that is a one-to-one function. Thus, . Since and , this contradicts the fact that is maximal. Therefore, either or .

We can now state and prove the comparability theorem, which is actually equivalent to the axiom of choice (see Exercise 23, page 218).

Theorem 7.1.8 (AC). For any two sets and , either or .

Proof. Let and be sets, and let . We will show that every chain in the poset has an upper bound. To do this, suppose that is a chain. Thus, for all and , we have that either or . Clearly, . Exercise 11 on page 68 shows that is a one-to-one function. So and for each . We conclude that is an upper bound for . Hence, by Zorn’s Lemma, the poset has a maximal element . Lemma 7.1.7 asserts that either or . If , then because is one-to-one. If , then as is one-to-one by Corollary 3.3.15 and Theorem 3.2.7.

Our second theorem, identified as the order-extension principle, shows that any partial order on a set can be extended to a total order on the same set. A proof of this theorem was first published by Edward Marczewski in 1930. Our proof will apply Zorn’s Lemma to a partial order on a set of partial orders. We first state and prove a useful lemma, which shows that if a partial order is not a total order, then it can be extended to a larger partial order.

Lemma 7.1.9. If is a partial order on that is not a total order on , then there is a partial order on such that and .

Proof. Let is a partial order on that is not a total order on . Since is not a total order, there exist and such that () and . Let be defined by

Because is reflexive on , we see that and . Thus, . Let . So . Also, as and by (). We will now show that is a partial order on . Since , we have that

(7.6)

Let . Since , we see from (7.6) that . So is reflexive on . To show that is antisymmetric, suppose that and . We shall show that . Since and , the equivalence in (7.6) yields the following four cases (three of which, as we will see, cannot hold):

1. and . Since is antisymmetric, we conclude that .
2. and . So () , () , () . Conditions () and () imply that . Hence, () implies that , which contradicts ().
3. and . As in the previous case 2, this is impossible.
4. and . Thus, () , () , () , () . Clearly, () and () imply that , contradicting ().

Because case 1 is the only possibility, we conclude that is antisymmetric. Finally, we now show that is transitive. Assume and . We must show that . As and , the equivalence (7.6) again yields four cases to consider:

1. and . Since is transitive, we conclude that .
2. and . Thus, () , () , () . As () and () imply that , we see from () that and . Hence, .
3. and . So () , () , () . Since () and () imply that , we conclude from () that and . Thus, .
4. and . Hence, () , () , () , () . Clearly, () and () imply that , contradicting ().

In each of the valid cases, we have either or . Thus, by (7.6), . Hence, is a partial order on such that and .

We can now show that Zorn’s Lemma implies the order-extension principle.

Theorem 7.1.10 (AC). Let be a partial order on . Then there exists a total order on such that .

Proof. Let be a partial order on , and let

Hence, . We now show that every chain in the poset has an upper bound. Let be a chain. Thus, for all and , either or . Clearly, . Exercises 17 and 18 on page 76 imply that is a partial order on . So and for all . Hence, is an upper bound for . Hence, by Corollary 7.1.6, there is a maximal element such that . Since is maximal, Lemma 7.1.9 implies that is a total order on .

Exercises 7.1

3. Let , and let denote . Suppose that and for all , there is a such that . By Exercise 2 and the axiom of choice, there is an such that for all . Using Theorem 4.2.1, show that there is a function such that for all .
4. Prove Lemma 7.1.4.
9. Using Theorem 7.1.10, show there is a total order on such that for all and in , if , then . Show that on .

Exercise Notes: The conclusion of Exercise 3 is called the axiom of dependent choices (DC). For Exercise 6(a), apply Exercise 5. Exercise 7 shows that Zorn’s Lemma implies the Hausdorff maximal principle, which proclaims that every partially ordered set contains a maximal chain. This principle is also equivalent to the axiom of choice. For Exercise 10(a), see Exercise 11(a) on page 68.

7.2 Filters and Ultrafilters

We now introduce the concept of a filter, which has a variety of applications in set theory and in topology. Filters were introduced by Henri Cartan in 1937 and subsequently used by Bourbaki in the book Topologie générale.

The motivation for our next definition is to give meaning to the intuitive notion of a “large” subset of .

Definition 7.2.1. Let be nonempty and let . Then is a filter on  if it satisfies the following three conditions:

So if and is a filter, then behaves like a “large” set.

If is a filter on and , then . To prove this, suppose that . Thus, by Definition 7.2.1(2), , which contradicts Definition 7.2.1(1).

Clearly, the singleton set is a filter on any nonempty set . For another example, let be a nonempty subset of and let . One can verify (see Exercise 1) that satisfies the conditions of Definition 7.2.1. Hence, is a filter on , and it is called the filter on generated by . When is a singleton, then shall be called a principal filter. For example, if , then the principal filter generated by can be expressed by

(7.7)
Definition 7.2.2. Let be a filter on . We say that is a maximal filter if for all filters on , if , then .

If , then the principal filter generated by is a maximal filter. To see why it is maximal, let be a filter on so that . Let . To prove that , assume . Thus, and . So , and hence, . Since , we conclude from Definition 7.2.1(1) that , a contradiction. So and . On the other hand, for and where , one can verify that the filter generated by is not maximal.

We now present an example of a nonprincipal filter. Let be an infinite set. Let be defined by

One can show that is a filter on . For example, to verify Definition 7.2.1(2), let and be in . Since , we conclude that is finite, as and are finite. Thus, is in . A set is said to be a cofinite subset of , if and is finite. So the filter is called the filter of all cofinite subsets of . To show that is not a principal filter, let and . Consider the set . Then (because is finite) and . Therefore, is not generated by any singleton.

Definition 7.2.3. Let be a filter on . Then is said to be an ultrafilter on if for all , either or .

Let . The principle filter generated by (see (7.7)) is an ultrafilter. So there are principal ultrafilters. Do nonprincipal ultrafilters exist? Let be an infinite set and let be the filter of all cofinite subsets of . Our goal in this section is to prove a result that implies the existence of an ultrafilter such that . Exercise 9 then implies that is nonprincipal. Thus, nonprincipal ultrafilters do, in fact, exist.

The following lemma identifies two important properties that an ultrafilter possesses.

Lemma 7.2.4. Let be an ultrafilter on .

Proof. The proofs of (1) and (2) are left for Exercises 7 and 8, respectively.

Lemma 7.2.4(2) shows that an ultrafilter is maximal. Furthermore, if a filter is maximal, then it is an ultrafilter (see Exercise 15).

Definition 7.2.5. Let and be nonempty. Then has the finite intersection property if whenever is nonempty and finite, then .

Lemma 7.2.6. Let and be nonempty. Suppose that has the finite intersection property. Then there is a filter on such that .

Proof. Let . Assume has the finite intersection property. Define

Clearly, , , and . We shall prove that is a filter; that is, we will establish items (2) and (3) of Definition 7.2.1. Let and . So and where and are nonempty and finite. Thus, . Exercise 12 on page 40 implies that . Hence, . Since is a finite subset of (see Exercise 15 on page 116), we have that . Thus, (2) of Definition 7.2.1 holds. To prove (3), let and . Since for some nonempty finite , it follows that . Therefore, is a filter.

Definition 7.2.7. Let and be nonempty. Then is closed under finite intersections if whenever is nonempty and finite, then .

One can prove by induction that a filter is closed under finite intersections, and therefore, a filter has the finite intersection property (see Exercise 2).

Lemma 7.2.8. Let be a filter on . Suppose that is such that and . Then the set has the finite intersection property.

Proof. Let be a filter on , and suppose that is such that and (. Because is a filter, () implies that .

Claim. For every , we have that .

Proof. Assume, for a contradiction that for some . Since and , it follows that . Thus, because is a filter, we have that , which contradicts (). (Claim)  

Let be finite and nonempty. So either or . If , then . As has the finite intersection property, it follows that . If , then for a finite . If , then , and thus, . If , then one can show that . Since is closed under finite intersections, . Hence, by the above claim. Thus, and has the finite intersection property. (Lemma)

Lemmas 7.2.6 and 7.2.8, together with Zorn’s Lemma, imply the following theorem, which has many applications in mathematics.

Ultrafilter Theorem 7.2.9 (AC). If is a filter on , then there is an ultrafilter on such that .

Proof. See Exercise 14.

7.2.1 Ideals

Filters allowed us to investigate the concept of a “large” subset of a set . Ideals allow us to pursue the concept of a “small” subset of . As we will see, given a filter on , one can construct an ideal on , and conversely.

Definition 7.2.10. Let be nonempty and let . We say that is an ideal on if it satisfies the following three conditions:
(1) and .
(2) If and , then .
(3) If and , then .

The singleton is an ideal on any nonempty set . Let be a proper subset and let

The set is an ideal, since it clearly satisfies the conditions of Definition 7.2.10. This ideal is called the ideal on generated by .

The following lemma shows that there is an intimate connection between filters and ideals.

Lemma 7.2.11. Let be nonempty. Then the following hold:

Proof. See Exercises 19 and 20.

Let and be as in Lemma 7.2.11. Then is called the dual ideal, and is called the dual filter. It is easy to show that and .

Definition 7.2.12. Let be an ideal on . Then is said to be a prime ideal on if for all , either or .

Theorem 7.2.13. Let be a nonempty set. Then is an ultrafilter on if and only if is a prime ideal on .

Proof. See Exercise 21.

Finally, let be an ideal on . Then is said to be a maximal ideal if for all ideals , if , then . One can prove that a prime ideal is maximal (see Exercise 22).

Exercises 7.2

Exercise Notes: For Exercise 6, find a set and a set that violate Lemma 7.2.4(1). For Exercise 12(a), let be a finite set such that for all , if is finite then . Using Lemma 7.2.4(1), deduce that is a singleton. For Exercise 12(b), use Exercise 11. For Exercise 18, see Exercise 9 on page 68.

7.3 Well-Ordering Theorem

The following theorem is due to Ernst Zermelo and states that every set can be well-ordered. Therefore, the powerful principle of transfinite recursion can be applied to any set.

Well-Ordering Theorem 7.3.1 (AC). Each set has a well-ordering.

Proof. Assume the axiom of choice. Let be a set. We shall use Zorn’s Lemma to prove that there is a well-ordering on . Let be the following set:

(Theorem 2.1.3 implies that is a set). Define the continuation relation on by

(7.8)

where and in (7.8)(ii) are arbitrary. The relationship is illustrated by the following diagram (7.9) where the order continues the order ; that is, the order can only add new elements that are greater than all of the elements ordered by (see Exercise 3):

(7.9)

The structure is a partially ordered set (see Exercise 4). Let be a chain. Thus, for all and , either or . Now let be the relation with field , which is a subset of .

Claim 1. is a total order on its field.

Proof. Exercise 18 on page 76 implies that is a partial order on its field. To show that is a total order, let and be in the field of . Thus, and for some and in . Since is a chain, it follows from (7.8) that either or . Without loss of generality, let us assume that . Hence, and . As is a total order, we see that either or . Since , we conclude that either or . Therefore, is a total order. (Claim 1)  

Claim 2. is a well-ordering on its field.

Proof. Suppose that is nonempty. So let . Hence, () for some . We now show that . Since , we see that . Assume . So . Thus, for some . We must verify that . Since , , and is a chain, we have either or .

CASE 1: . So (7.8)(ii) holds. We know that . Since by (), we have that because is reflexive. Thus, and . From (7.8)(ii), we conclude that , and so .

CASE 2: . Item (i) of (7.8) implies that . Because , we infer that , and therefore, .

So . Thus, () . Since , the set is nonempty. As is a well-ordering, there is a -least element . Because , () implies that is the -least element in . So is a well-ordering on its field. (Claim 2)  

Claim 3. is in and is an upper bound for .

Proof. Claim 2 implies that is in . Let . We prove that . Clearly, . Thus, (i) of (7.8) holds. For (ii), suppose that and . Since , there is a such that . As is a chain, we have either or . The arguments used in case 1 and case 2 of the proof of Claim 2 show that . Therefore, . (Claim 3)  

By Zorn’s Lemma, there exists a maximal element in . Hence, is a well-ordering on a set and for all . We now prove that . Suppose, for a contradiction, that there is an such that . Let . One can easily show (see Exercise 5) that is a well-ordering on and therefore, . In addition, one can also show that , which contradicts the maximality of . Hence, and is a well-ordering on . (Theorem)

The proof of Theorem 7.3.1 now shows that the axiom of choice implies that every set can be well-ordered. Moreover, in Exercise 1 you are asked to show that the well-ordering theorem implies the axiom of choice. Another proof of Theorem 7.3.1 is outlined in Exercise 15 on page 194. This alternative proof depends on the replacement axiom.

Exercises 7.3

1. Let be an indexed function such that for all . By Theorem 7.3.1, the set has a well-ordering . Define a function such that and . Thus, is a choice function.
2. Let be a total order on . For each , let . Suppose that for every and , if is nonempty, then has a -least element. Prove that is a well-ordering on .

Exercise Notes: For Exercise 6, see Exercise 19 on page 76.