1 Introduction

1
Introduction

Georg Cantor devoted much of his mathematical career to the development of a new branch of mathematics, namely, Set Theory. Little did he know that his pioneering work would eventually lead to a unifying theory for mathematics. In his earlier work, Cantor took a set of real numbers $\text{[math]}$ and then formed the derived set $\text{[math]}$ of all limit points of $\text{[math]}$ . After iterating this operation, Cantor obtained further derived sets $\text{[math]}$ , $\text{[math]}$ . These derived sets enabled him to prove an important theorem on trigonometric series. This work led Cantor to investigate sets in a more general setting and to develop an abstract theory of sets that would dramatically change the course of mathematics.

The basic concepts in set theory are now applied in virtually every branch of mathematics. Furthermore, set theory serves as the basis for the definition and explanation of the most fundamental mathematical concepts: functions, relations, algebraic structures, function spaces, etc. Thus, it is often said that set theory provides a foundation for mathematics.

1.1 Elementary Set Theory

The set concept is one that pervades all of mathematics. We shall not attempt to give a precise definition of a set; however, we will give an informal description of a set and identify some important properties of sets.

A set is a collection of objects. The objects in such a collection are called the elements of the set. We write $\text{[math]}$ to assert that $\text{[math]}$ is an element, or a member, of the set $\text{[math]}$ . We write $\text{[math]}$ when $\text{[math]}$ is not an element of the set $\text{[math]}$ . A set is merely the result of collecting objects of interest, and it is usually identified by enclosing its elements with braces (curly brackets). For example, the collection $\text{[math]}$ is a set that contains the four elements $\text{[math]}$ . So $\text{[math]}$ , and $\text{[math]}$ .

Sets are exceedingly important in mathematics; in fact, most mathematical objects (e.g., numbers, functions) can be defined in terms of sets. When one first learns about sets, it appears that one can naively define a set to be any collection of objects. In Section 1.4, we will see that such a naive approach can create serious problems.

Certain sets routinely appear in mathematics. In particular, the sets of natural numbers, integers, rational, and real numbers are regularly discussed. These sets are usually denoted by:

1. $\text{[math]}$ is the set of natural numbers.

2. $\text{[math]}$ is the set of integers.

3. $\text{[math]}$ is the set of rational numbers. Thus, $\text{[math]}$ .

4. $\text{[math]}$ is the set of real numbers, and so $\text{[math]}$ .

Basic Definitions of Set Theory

In this section, we discuss the basic notation and concepts that are used in set theory. An object $\text{[math]}$ may or may not belong to a given set $\text{[math]}$ ; that is, either $\text{[math]}$ or $\text{[math]}$ , but not both.

Definition 1.1.1. The following set terminology is used extensively throughout mathematics:

1. Let $\text{[math]}$ and $\text{[math]}$ be sets. We write $\text{[math]}$ when both sets have exactly the same elements.

2. For sets $\text{[math]}$ and $\text{[math]}$ we write $\text{[math]}$ to mean that the set $\text{[math]}$ is a subset of the set $\text{[math]}$ , that is, every element of $\text{[math]}$ is also an element of $\text{[math]}$ .

3. We say that the set $\text{[math]}$ is a proper subset of the set $\text{[math]}$ , denoted by $\text{[math]}$ , when $\text{[math]}$ and $\text{[math]}$ , that is, when every element of $\text{[math]}$ is an element of $\text{[math]}$ but there is at least one element in $\text{[math]}$ that is not in $\text{[math]}$ .

4. We write $\text{[math]}$ for the empty set, or the null set. The set $\text{[math]}$ has no elements.

5. Two sets $\text{[math]}$ and $\text{[math]}$ are disjoint if they have no elements in common.

It follows that $\text{[math]}$ and $\text{[math]}$ , for any set $\text{[math]}$ . To see why $\text{[math]}$ , suppose that $\text{[math]}$ . Then there exists an $\text{[math]}$ such that $\text{[math]}$ . As there is no $\text{[math]}$ such that $\text{[math]}$ , we arrive at a contradiction. Therefore, we must have that $\text{[math]}$ .

A Venn diagram is a configuration of geometric shapes, which is commonly used to depict a particular relationship that holds between two or more sets. In Figure 1.1(a), we present a Venn diagram that illustrates the subset relation. Figure 1.1(b) portrays two sets that are disjoint.

Figure 1.1. Two set relationships

A property is a statement that asserts something about one or more variables (for more detail, see Section 1.3). For example, the two statements “ $\text{[math]}$ is a real number” and “ $\text{[math]}$ and $\text{[math]}$ ” are clearly properties that assert something, respectively, about $\text{[math]}$ and $\text{[math]}$ . One way to construct a subset is called the method of separation. Let $\text{[math]}$ be a set. Given a property $\text{[math]}$ about the variable $\text{[math]}$ , one can construct the set of objects $\text{[math]}$ that satisfy the property $\text{[math]}$ ; namely, we can form the truth set $\text{[math]}$ . Thus, we can separate the elements in $\text{[math]}$ that satisfy the property from those elements that do not satisfy the property.

Problem 1. Evaluate each of the truth sets:

1. $\text{[math]}$ .

2. $\text{[math]}$ .

3. $\text{[math]}$ .

Solution. $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ . $\text{[math]}$

An interval is a set consisting of all the real numbers that lie between two given real numbers $\text{[math]}$ and $\text{[math]}$ , where $\text{[math]}$ :

1. The open interval $\text{[math]}$ is defined to be $\text{[math]}$ .

2. The closed interval $\text{[math]}$ is defined to be $\text{[math]}$ .

3. The left-closed interval $\text{[math]}$ is defined to be $\text{[math]}$ .

4. The right-closed interval $\text{[math]}$ is defined to be $\text{[math]}$ .

For each real number $\text{[math]}$ , we can also define intervals called rays or half-lines:

1. The interval $\text{[math]}$ is defined to be $\text{[math]}$ .

2. The interval $\text{[math]}$ is defined to be $\text{[math]}$ .

3. The interval $\text{[math]}$ is defined to be $\text{[math]}$ .

4. The interval $\text{[math]}$ is defined to be $\text{[math]}$ .

The symbol $\text{[math]}$ denotes “infinity” and this symbol does not represent a number. The notation $\text{[math]}$ is often used to represent an interval “without a right endpoint.” Similarly, the mathematical notation $\text{[math]}$ is used to denote an interval “having no left endpoint.”

Definition 1.1.2. Let $\text{[math]}$ be a set. The power set of $\text{[math]}$ , denoted by $\text{[math]}$ , is the set whose elements are all of the subsets of $\text{[math]}$ . That is, $\text{[math]}$ .

Thus, $\text{[math]}$ if and only if $\text{[math]}$ . If $\text{[math]}$ is a finite set with $\text{[math]}$ elements, then one can show that the set $\text{[math]}$ has $\text{[math]}$ elements. The set $\text{[math]}$ has three elements, and so $\text{[math]}$ has eight elements, namely,

Set Operations

For a pair of sets $\text{[math]}$ and $\text{[math]}$ , there are three fundamental operations that we can perform on these sets. The union operation unites, into one set, the elements that belong either to $\text{[math]}$ or to $\text{[math]}$ . The intersection operation forms the set of elements that belong both to $\text{[math]}$ and to $\text{[math]}$ . The difference between $\text{[math]}$ and $\text{[math]}$ (in that order) is the set of all elements that are in $\text{[math]}$ and not in $\text{[math]}$ .

Definition 1.1.3. Given sets $\text{[math]}$ and $\text{[math]}$ , we can build new sets using the following set operations:

(a) $\text{[math]}$ is the union of $\text{[math]}$ and $\text{[math]}$ .¹

(b) $\text{[math]}$ is the intersection of $\text{[math]}$ and $\text{[math]}$ .

(c) $\text{[math]}$ is the set difference of $\text{[math]}$ and $\text{[math]}$ (also stated in English as $\text{[math]}$ “minus” $\text{[math]}$ ).

The set operations in Definition 1.1.3 are illustrated in Figures 1.2(a), 1.2(b), and 1.2(c). Shading is used to identify the result of a particular set operation. For example, in Figure 1.2(c) the shaded area represents the set $\text{[math]}$ .

Figure 1.2. Three set operations

When the elements of sets $\text{[math]}$ and $\text{[math]}$ are clearly presented, then one can easily evaluate the operations of union, intersection, and difference.

Example 2. Let $\text{[math]}$ and $\text{[math]}$ . Then

$\text{[math]}$ and $\text{[math]}$ .

$\text{[math]}$ and $\text{[math]}$ .

$\text{[math]}$ .

Exercises 1.1

Let $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ be sets.

1. If $\text{[math]}$ and $\text{[math]}$ , show that $\text{[math]}$ .

2. Show that if $\text{[math]}$ , then $\text{[math]}$ .

3. Suppose $\text{[math]}$ . Show that $\text{[math]}$ .

4. Suppose $\text{[math]}$ and $\text{[math]}$ . Show that $\text{[math]}$ .

5. Suppose $\text{[math]}$ and $\text{[math]}$ . Show that $\text{[math]}$ .

6. Show that $\text{[math]}$ .

7. Show that if $\text{[math]}$ and $\text{[math]}$ , then $\text{[math]}$ .

8. Let $\text{[math]}$ be the property $\text{[math]}$ . Are the assertions $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ true or false?

9. Show that each of the following sets can be expressed as an interval:

(a) $\text{[math]}$ .

(b) $\text{[math]}$ .

10. Express the following sets as truth sets:

(a) $\text{[math]}$ .

(b) $\text{[math]}$ .

11. Show that each of the following sets can be expressed as an interval:

(a) $\text{[math]}$ .

(b) $\text{[math]}$ .

12. Evaluate the truth sets:

(a) $\text{[math]}$ .

(b) $\text{[math]}$ .

(d) $\text{[math]}$ .

Exercise Notes: For Exercise 6, $\text{[math]}$ means that $\text{[math]}$ or $\text{[math]}$ .

1.2 Logical Notation

Before introducing the fundamentals of set theory, it will be useful to identify some relevant aspects of language and logic. The importance of correct logical notation to set theory, and to mathematics, cannot be overstated. Formal logical notation has one important advantage: statements can be expressed much more concisely and much more precisely. Set theory often expresses many of its important concepts using logical notation. With this in mind, we now discuss the basics of logic.

Propositions and Logical Connectives

A proposition is a declarative sentence that is either true or false, but not both. When discussing the logic of propositional statements, we shall use symbols to represent these statements. Capital letters, for instance, $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , are used to symbolize propositional statements and are called propositional components. Using the five logical connectives $\text{[math]}$ together with the components, we can form new logical sentences called compound sentences. For example,

1. $\text{[math]}$ (means “ $\text{[math]}$ and $\text{[math]}$ ” and is called conjunction).

2. $\text{[math]}$ (means “ $\text{[math]}$ or $\text{[math]}$ ” and is called disjunction).

3. $\text{[math]}$ (means “not $\text{[math]}$ ” and is called negation).

4. $\text{[math]}$ (means “if $\text{[math]}$ , then $\text{[math]}$ ” and is called a conditional).

5. $\text{[math]}$ (means “ $\text{[math]}$ if and only if $\text{[math]}$ ” and is called a biconditional).

Using propositional components as building blocks and the logical connectives as mortar, we can construct more complex compound sentences, for example, $\text{[math]}$ . Parentheses ensure that our compound sentences will be clear and readable; however, we shall be using the following conventions:

1. The outermost parentheses need not be explicitly written; that is, one can write $\text{[math]}$ to denote $\text{[math]}$ .

2. The negation symbol shall apply to as little as possible. We can therefore use $\text{[math]}$ to denote $\text{[math]}$ .

Truth Tables

The truth value of a compound sentence in propositional logic can be evaluated from the truth values of its components. The logical connectives $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ yield the natural truth values given in Table 1.1, where $\text{[math]}$ means “true” and $\text{[math]}$ means “false.”

Table 1.1. Basic truth tables


$\text{[math]}$ $\text{[math]}$ $\text{[math]}$

$\text{[math]}$ $\text{[math]}$

Table 1.1(1) has four rows (not including the header). The columns beneath $\text{[math]}$ and $\text{[math]}$ list all the possible pairs of truth values that can be assigned to the components $\text{[math]}$ and $\text{[math]}$ . For each such pair, the corresponding truth value for $\text{[math]}$ appears to the right. For example, consider the third pair of truth values in this table, $\text{[math]}$ . Thus, if the propositional components $\text{[math]}$ and $\text{[math]}$ are assigned the respective truth values $\text{[math]}$ and $\text{[math]}$ , we see that the truth value of $\text{[math]}$ is $\text{[math]}$ .

Table 1.1(2) shows that if $\text{[math]}$ and $\text{[math]}$ are assigned the respective truth values $\text{[math]}$ and $\text{[math]}$ , then the truth value of $\text{[math]}$ is $\text{[math]}$ . Moreover, when $\text{[math]}$ and $\text{[math]}$ are assigned the truth values $\text{[math]}$ and $\text{[math]}$ , then the truth value of $\text{[math]}$ is also $\text{[math]}$ . In mathematics, the connective “or” has the same meaning as “and/or”; that is, $\text{[math]}$ is true if and only if either $\text{[math]}$ is true or $\text{[math]}$ is true, or both $\text{[math]}$ and $\text{[math]}$ are true. Table 1.1(3) shows that the negation of a statement reverses the truth value of the statement.

Table 1.1(4) states that when $\text{[math]}$ and $\text{[math]}$ are assigned the respective truth values $\text{[math]}$ and $\text{[math]}$ , then the truth value of $\text{[math]}$ is $\text{[math]}$ ; otherwise, it is $\text{[math]}$ . In particular, when $\text{[math]}$ is false, we shall say that $\text{[math]}$ is vacuously true. Table 1.1(5) shows that $\text{[math]}$ is true when $\text{[math]}$ and $\text{[math]}$ are assigned the same truth value; when $\text{[math]}$ and $\text{[math]}$ have different truth values, then the biconditional is false.

Using the truth tables for the sentences $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ , we will now discuss how to build truth tables for more complicated compound sentences. Given a compound sentence, we identify the “outside” connective to be the “last connective that one needs to evaluate.” Once the outside connective has been identified, one can break up the sentence into its “parts.” For example, in the compound sentence $\text{[math]}$ we see that $\text{[math]}$ is the outside connective with two parts $\text{[math]}$ and $\text{[math]}$ .

Problem 1. Construct a truth table for the sentence $\text{[math]}$ .

Solution. The components $\text{[math]}$ and $\text{[math]}$ will each need a column in our truth table. Since there are two components, there are four possible combinations of truth values for $\text{[math]}$ and $\text{[math]}$ . We will enter these combinations in the two left most columns in the same order as that in Table 1.1(1). The outside connective of the propositional sentence $\text{[math]}$ is $\text{[math]}$ . We can break this sentence into the two parts $\text{[math]}$ and $\text{[math]}$ . So these parts will also need a column in our truth table. As we can break the sentences $\text{[math]}$ and $\text{[math]}$ only into components (namely, $\text{[math]}$ and $\text{[math]}$ ), we obtain the following truth table:

We will describe in steps how one obtains the truth values in the above table. STEP 1: Specify all of the truth values that can be assigned to the components. STEP 2: In each row, use the truth value assigned to the component $\text{[math]}$ to obtain the corresponding truth value for $\text{[math]}$ , using Table 1.1(3). STEP 3: In each row, use the truth values assigned to $\text{[math]}$ and $\text{[math]}$ to determine the corresponding truth value in the column under $\text{[math]}$ via Table 1.1(1). STEP 4: In each row, use the truth values in the columns under $\text{[math]}$ and $\text{[math]}$ to evaluate the matching truth value for the final column under the sentence $\text{[math]}$ , employing Table 1.1(4). $\text{[math]}$

Tautologies and Contradictions

After constructing a truth table for a compound sentence, suppose that every entry in the final column is true. The sentence is thus true no matter what truth values are assigned to its components. Such a sentence is called a tautology.

Definition 1.2.1. A compound sentence is a tautology when its truth value is true regardless of the truth values of its components.

So a compound sentence is a tautology if it is always true. One can clearly see from the following truth table that the sentence $\text{[math]}$ is a tautology:

Definition 1.2.2. A compound sentence is a contradiction when its truth value is false regardless of the truth values of its components.

Therefore, a compound sentence is a contradiction if it is always false. One can easily show that the sentence $\text{[math]}$ is a contradiction.

Logical Equivalence

A propositional sentence is either a compound sentence or just a component. The next definition describes when two propositional sentences are logically equivalent, that is, when they mean the same thing. Mathematicians frequently take advantage of logical equivalence to simplify their proofs, and we shall do the same in this book. In this section, we will use Greek letters (e.g., $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ ; see page xiii) to represent propositional sentences.

Definition 1.2.3. Let $\text{[math]}$ and $\text{[math]}$ be propositional sentences. We will say that $\text{[math]}$ and $\text{[math]}$ are logically equivalent, denoted by $\text{[math]}$ , whenever the following holds: For every truth assignment applied to the components of $\text{[math]}$ and $\text{[math]}$ , the resulting truth values of $\text{[math]}$ and $\text{[math]}$ are identical.

Problem 2. Show that $\text{[math]}$ .

Solution. After constructing truth tables for the two statements $\text{[math]}$ and $\text{[math]}$ , we obtain the following:

$\text{[math]}$

As the final columns in the truth tables for $\text{[math]}$ and $\text{[math]}$ are identical, we can conclude from Definition 1.2.3 that they are logically equivalent. $\text{[math]}$

Whenever $\text{[math]}$ and $\text{[math]}$ are logically equivalent, we shall say that $\text{[math]}$ is a logic law. We will now present two important logic laws that are often used in mathematical proofs. These laws were first identified by Augustus De Morgan.

De Morgan’s Laws (DML)

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Let $\text{[math]}$ and $\text{[math]}$ be propositional sentences. If one can apply a truth assignment to the components of $\text{[math]}$ and $\text{[math]}$ such that the resulting truth values of $\text{[math]}$ and $\text{[math]}$ disagree, then $\text{[math]}$ and $\text{[math]}$ are not logically equivalent. We will use this fact in our next problem, which shows that the placement of parentheses in a compound sentence is very important. Note: A regrouping can change the meaning of the sentence.

Problem 3. Show that sentences $\text{[math]}$ and $\text{[math]}$ are not logically equivalent.

Solution. We shall use the truth table

$\text{[math]}$

Since their final columns are not identical, we conclude that the propositional sentences $\text{[math]}$ and $\text{[math]}$ are not equivalent. $\text{[math]}$

Propositional Logic Laws

If a propositional component appears in a logic law and each occurrence of this component is replaced with a specific propositional sentence, then the result is also a logic law. Thus, in the above De Morgan’s Law

if we replace $\text{[math]}$ and $\text{[math]}$ , respectively, with propositional sentences $\text{[math]}$ and $\text{[math]}$ , then we obtain the logic law

which is also referred to as De Morgan’s Law.

Listed below are some important laws of logic, where $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ represent any propositional sentences. These particular logic laws are frequently applied in mathematical proofs. They will also allow us to derive theorems concerning certain set operations.

De Morgan’s Laws (DML)

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Commutative Laws

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Associative Laws

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Idempotent Laws

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Distributive Laws

1. $\text{[math]}$ .

2. $\text{[math]}$ .

3. $\text{[math]}$ .

4. $\text{[math]}$ .

Double Negation Law (DNL)

1. $\text{[math]}$ .

Tautology Law

1. $\text{[math]}$ .

Contradiction Law

1. $\text{[math]}$ .

Conditional Laws (CL)

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Contrapositive Law

1. $\text{[math]}$ .

Biconditional Law

1. $\text{[math]}$ .

The Tautology Law and Contradiction Law can be easily illustrated. Observe that $\text{[math]}$ is a tautology. From the Tautology Law we obtain the following logical equivalence: $\text{[math]}$ . On the other hand, because $\text{[math]}$ is a contradiction, it follows that $\text{[math]}$ by the Contradiction Law.

Let $\text{[math]}$ and $\text{[math]}$ be two propositional sentences that are logically equivalent. Now, suppose that $\text{[math]}$ appears in a given propositional sentence $\text{[math]}$ . If we replace occurrences of $\text{[math]}$ in $\text{[math]}$ with $\text{[math]}$ , then the resulting new sentence will be logically equivalent to $\text{[math]}$ . To illustrate this substitution principle, suppose that we have the propositional sentence $\text{[math]}$ and we also know that $\text{[math]}$ . Then we can conclude that $\text{[math]}$ . Now, using this substitution principle and the propositional logic laws, we will establish a new logic law without the use of truth tables.

Problem 4. Show that $\text{[math]}$ , using logic laws.

Solution. We first start with the more complicated side $\text{[math]}$ and derive the simpler side as follows:

Therefore, $\text{[math]}$ . $\text{[math]}$

Using a list of propositional components, say $\text{[math]}$ , and the logical connectives $\text{[math]}$ , we can form a variety of propositional sentences. For example,

The logical connectives are also used to tie together a variety of mathematical statements. A good understanding of these connectives and propositional logic will allow us to more easily understand and define set-theoretic concepts. The following problem and solution illustrate this observation.

Problem 5. Let $\text{[math]}$ and $\text{[math]}$ be any two sets. Show that $\text{[math]}$ is equivalent to the statement $\text{[math]}$ or $\text{[math]}$ .

Solution. We shall show that $\text{[math]}$ as follows:

Therefore, $\text{[math]}$ is equivalent to the assertion $\text{[math]}$ . $\text{[math]}$

Exercises 1.2

1. Using truth tables, show that $\text{[math]}$ .

2. Construct truth tables to show that $\text{[math]}$ .

3. Using truth tables, show that $\text{[math]}$ .

4. Using truth tables, show that $\text{[math]}$ .

5. Show that $\text{[math]}$ , using logic laws.

6. Show that $\text{[math]}$ , using logic laws.

7. Using propositional logic laws, show that $\text{[math]}$ .

8. Show that $\text{[math]}$ and $\text{[math]}$ are not logically equivalent.

1.3 Predicates and Quantifiers

Variables, for instance, $\text{[math]}$ and $\text{[math]}$ , are used throughout mathematics to represent unspecified values. They are employed when we are interested in “properties” that may be true or false, depending on the values represented by the variables. A predicate is simply a statement that proclaims that certain variables satisfy a property. For example, “ $\text{[math]}$ is a number” is a predicate, and we can symbolize this predicate by $\text{[math]}$ . Of course, the truth or falsity of the expression $\text{[math]}$ can be determined only when a value for $\text{[math]}$ is given. For example, the expression $\text{[math]}$ , which means “ $\text{[math]}$ is a number,” is clearly true.

When our attention is to be focused on just the elements in a particular set, we shall then refer to that set as our universe of discourse. For example, if we were just talking about real numbers, then our universe of discourse would be the set of real numbers $\text{[math]}$ . Furthermore, every statement made in a specific universe of discourse applies to just the elements in that universe.

Given a statement $\text{[math]}$ , which says something about the variable $\text{[math]}$ , we often want to assert that every element $\text{[math]}$ in the universe of discourse satisfies $\text{[math]}$ . Moreover, there will be times when we want to express the fact that at least one element $\text{[math]}$ in the universe makes $\text{[math]}$ true. We will thus form sentences using the quantifiers $\text{[math]}$ and $\text{[math]}$ . The quantifier $\text{[math]}$ means “for all” and is called the universal quantifier. The quantifier $\text{[math]}$ means “there exists,” and it is identified as the existential quantifier. For example, we can form the sentences

1. $\text{[math]}$ [means “for all $\text{[math]}$ , $\text{[math]}$ ”].

2. $\text{[math]}$ [means “there exists an $\text{[math]}$ such that $\text{[math]}$ ”].

Any statement of the form $\text{[math]}$ is called a universal statement. A statement having the form $\text{[math]}$ is called an existential statement. Quantifiers offer us a valuable tool for clear thinking in mathematics, where many concepts begin with the expression “for every” or “there exists.” Of course, the truth or falsity of a quantified statement depends on the universe of discourse.

Suppose that a variable $\text{[math]}$ appears in an assertion $\text{[math]}$ . In the two statements $\text{[math]}$ and $\text{[math]}$ , we say that $\text{[math]}$ is a bound variable because $\text{[math]}$ is bound by a quantifier. In other words, when a variable in a statement is immediately used by a quantifier, then that variable is referred to as being a bound variable. If a variable in a statement is not bound by a quantifier, then we shall say that the variable is a free variable. When a variable is free, then substitution may take place, that is, one can replace a free variable with any particular value from the universe of discourse–perhaps $\text{[math]}$ or $\text{[math]}$ . For example, the assertion $\text{[math]}$ has the one free variable $\text{[math]}$ . Therefore, we can perform a substitution to obtain $\text{[math]}$ . In a given context, if all of the free variables in a statement are replaced with values, then one can determine the truth or falsity of the resulting statement.

There are times in mathematics when one is required to prove that there is exactly one value that satisfies a property. There is another quantifier that is sometimes used, though not very often. It is called the uniqueness quantifier. This quantifier is written as $\text{[math]}$ , and it means that “there exists a unique $\text{[math]}$ satisfying $\text{[math]}$ .” This is in contrast with $\text{[math]}$ , which simply means that “at least one $\text{[math]}$ satisfies $\text{[math]}$ .”

As already noted, the quantifier $\text{[math]}$ is rarely used. One reason for this is that the assertion $\text{[math]}$ can be expressed in terms of the other quantifiers $\text{[math]}$ and $\text{[math]}$ . In particular, the statement $\text{[math]}$ is equivalent to

The above statement is equivalent to $\text{[math]}$ because it means that “there is an $\text{[math]}$ such that $\text{[math]}$ holds, and any individuals $\text{[math]}$ and $\text{[math]}$ that satisfy $\text{[math]}$ and $\text{[math]}$ must be the same individual.”

In addition to the quantifiers $\text{[math]}$ and $\text{[math]}$ , bounded set quantifiers are often used when one wants to restrict a quantifier to a specific set of values. For example, to state that every real number $\text{[math]}$ satisfies a property $\text{[math]}$ , we can simply write $\text{[math]}$ . Similarly, to say that some real number $\text{[math]}$ satisfies $\text{[math]}$ , we can write $\text{[math]}$ .

Definition 1.3.1. (Bounded Set Quantifiers) For each set $\text{[math]}$ , we shall write $\text{[math]}$ to mean that for every $\text{[math]}$ in $\text{[math]}$ , $\text{[math]}$ is true. Similarly, we will write $\text{[math]}$ to signify that for some $\text{[math]}$ in $\text{[math]}$ , $\text{[math]}$ is true.

The assertion $\text{[math]}$ means that for every $\text{[math]}$ , if $\text{[math]}$ , then $\text{[math]}$ is true. Similarly, the statement $\text{[math]}$ means that there is an $\text{[math]}$ such that $\text{[math]}$ and $\text{[math]}$ is true. Thus, we have the logical equivalences:

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Quantifier Negation Laws (QNL)

We now introduce logic laws that involve the negation of a quantified assertion. Let $\text{[math]}$ be any predicate. The statement $\text{[math]}$ means that “for every $\text{[math]}$ , $\text{[math]}$ is true.” Thus, the assertion $\text{[math]}$ means that “it is not the case that every $\text{[math]}$ makes $\text{[math]}$ true.” Therefore, $\text{[math]}$ means there is an $\text{[math]}$ that does not make $\text{[math]}$ true, which can be expressed as $\text{[math]}$ . This reasoning is reversible as we will now show. The assertion $\text{[math]}$ means that “there is an $\text{[math]}$ that makes $\text{[math]}$ false.” Hence, $\text{[math]}$ is not true for every $\text{[math]}$ ; that is, $\text{[math]}$ . Therefore, $\text{[math]}$ and $\text{[math]}$ are logically equivalent. Similar reasoning will show that $\text{[math]}$ and $\text{[math]}$ are also equivalent. We now formally state these important logic laws that connect quantifiers with negation.

Quantifier Negation Laws 1.3.2. For any predicate $\text{[math]}$ , we have the logical equivalences:

1. $\text{[math]}$ .

2. $\text{[math]}$ .

The above reasoning used to justify the quantifier negation laws can also be used to verify two negation laws for bounded set quantifiers. Thus, given a set $\text{[math]}$ and predicate $\text{[math]}$ , the following two logic laws show us how statements of the form $\text{[math]}$ and $\text{[math]}$ interact with negation. Notice that when you push the negation symbol through a bounded set quantifier, the quantifier changes and the negation symbol passes over “ $\text{[math]}$ .”

Bounded Quantifier Negation Laws 1.3.3. For every predicate $\text{[math]}$ , we have the logical equivalences:

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Quantifier Interchange Laws (QIL)

Adjacent quantifiers have the form $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ . In this section, we will see how to interpret statements that contain adjacent quantifiers. When a statement contains adjacent quantifiers, one should address the quantifiers, one at a time, in the order in which they are presented.

Problem 1. Let the universe of discourse be a group of people and let $\text{[math]}$ mean “ $\text{[math]}$ likes $\text{[math]}$ .” What do the following formulas mean?

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Solution. Note that “ $\text{[math]}$ likes $\text{[math]}$ ” also means that “ $\text{[math]}$ is liked by $\text{[math]}$ .” We will now translate each of these formulas from “left to right” as follows:

1. $\text{[math]}$ means “there is a person $\text{[math]}$ such that $\text{[math]}$ ,” that is, “there is a person $\text{[math]}$ who likes some person $\text{[math]}$ .” Therefore, $\text{[math]}$ means that “someone likes someone.”

2. $\text{[math]}$ states that “there is a person $\text{[math]}$ such that $\text{[math]}$ ,” that is, “there is a person $\text{[math]}$ who is liked by some person $\text{[math]}$ .” Thus, $\text{[math]}$ means that “someone is liked by someone.”

Hence, the statements $\text{[math]}$ and $\text{[math]}$ mean the same thing. $\text{[math]}$

Problem 2. Let the universe be a group of people and $\text{[math]}$ mean “ $\text{[math]}$ likes $\text{[math]}$ .” What do the following formulas mean in English?

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Solution. We will work again from “left to right” as follows:

1. $\text{[math]}$ means “for every person $\text{[math]}$ , we have that $\text{[math]}$ ,” that is, “for every person $\text{[math]}$ , we have that $\text{[math]}$ likes every person $\text{[math]}$ .” Hence, $\text{[math]}$ means that “everyone likes everyone.”

2. $\text{[math]}$ proclaims that “for each person $\text{[math]}$ , we have that $\text{[math]}$ ,” that is, “for each person $\text{[math]}$ , we have that $\text{[math]}$ is liked by every person $\text{[math]}$ .” Thus, $\text{[math]}$ means “everyone is liked by everyone.”

So the statements $\text{[math]}$ and $\text{[math]}$ mean the same thing. $\text{[math]}$

Adjacent quantifiers of a different type are referred to as mixed quantifiers.

Problem 3. Let the universe be a group of people and $\text{[math]}$ mean “ $\text{[math]}$ likes $\text{[math]}$ .” What do the following mixed quantifier formulas mean in English?

1. $\text{[math]}$ .

2. $\text{[math]}$ .

Solution. We will translate the formulas as follows:

1. $\text{[math]}$ asserts that “for every person $\text{[math]}$ we have that $\text{[math]}$ ,” that is, “for every person $\text{[math]}$ there is a person $\text{[math]}$ such that $\text{[math]}$ likes $\text{[math]}$ .” Thus, $\text{[math]}$ means that “everyone likes someone.”

2. $\text{[math]}$ states that “there is a person $\text{[math]}$ such that $\text{[math]}$ ,” that is, “there is a person $\text{[math]}$ who is liked by every person $\text{[math]}$ .” In other words, $\text{[math]}$ means “someone is liked by everyone.”

We conclude that the mixed quantifier statements $\text{[math]}$ and $\text{[math]}$ are not logically equivalent, that is, they do not mean the same thing. $\text{[math]}$

To clarify the conclusion obtained in our solution of Problem 3, consider the universe $\text{[math]}$ consisting of just four individuals with names as given. For this universe, Figure 1.3 identifies a world where $\text{[math]}$ is true, where we portray the property $\text{[math]}$ using the “arrow notation” $\text{[math]}$ . Figure 1.3 illustrates a world where there is an individual who is very popular because everyone likes this person; that is, “someone is liked by everyone.”

Figure 1.3. A world where $\text{[math]}$ is true, since someone is liked by everyone.

Figure 1.4 presents a slightly different world in which $\text{[math]}$ is true. So, in this new world, “everyone likes someone.”

Figure 1.4. A world where $\text{[math]}$ is true, because everyone likes someone.

Let us focus our attention on Figure 1.4. Clearly, the statement $\text{[math]}$ is true in the world depicted in this figure. Moreover, notice that $\text{[math]}$ is actually false in this world. Thus, $\text{[math]}$ is true and $\text{[math]}$ is false in the world presented in Figure 1.4. We can now conclude that $\text{[math]}$ and $\text{[math]}$ do not mean the same thing.

Our solution to Problem 1 shows that $\text{[math]}$ and $\text{[math]}$ both mean “someone likes someone.” This supports the true logical equivalence:

Similarly, Problem 2 confirms the true logical equivalence:

Therefore, interchanging adjacent quantifiers of the same kind does not change the meaning. Problem 3, however, verifies that the two statements $\text{[math]}$ and $\text{[math]}$ are not logically equivalent. We conclude this discussion with a summary of the above observations:

Adjacent quantifiers of the same type are interchangeable.
Adjacent quantifiers of a different type may not be interchangeable.

We offer another example, involving the real numbers, which shows that the interchange of mixed quantifiers can change the meaning of a statement.

Example 4. Let the universe of discourse be $\text{[math]}$ , the set of real numbers.

1. $\text{[math]}$ means that for every real number $\text{[math]}$ there is a real number $\text{[math]}$ such that $\text{[math]}$ . We see that the sentence $\text{[math]}$ is true.

2. $\text{[math]}$ states there is a $\text{[math]}$ such that $\text{[math]}$ . This is false.

Quantifier Interchange Laws 1.3.4. For every predicate $\text{[math]}$ , the following three statements are valid:

1. $\text{[math]}$ .

2. $\text{[math]}$ .

3. $\text{[math]}$ .

We will be using the arrow $\text{[math]}$ as an abbreviation for the word “implies.” The conditional connective $\text{[math]}$ shall be reserved for formal logical formulas. It should be noted that the implication in item 3 cannot, in general, be reversed.

The quantifier interchange laws also hold for bounded set quantifiers; for example, we have that

Quantifier Distribution Laws (QDL)

A quantifier can sometimes “distribute” over a particular logical connective. The quantifier distribution laws, given below, capture relationships that hold between a quantifier and the two logical connectives $\text{[math]}$ and $\text{[math]}$ . In particular, the existential quantifier distributes over disjunction (see 1.3.5(1)), and the universal quantifier distributes over conjunction (see 1.3.6(1)). The following quantifier distribution laws can be useful when proving certain set identities.

Existential Quantifier Distribution Laws 1.3.5. For any predicates $\text{[math]}$ and $\text{[math]}$ we have the following distribution laws:

1. $\text{[math]}$ .

2. $\text{[math]}$ .

3. $\text{[math]}$ .

4. $\text{[math]}$ .

If $\text{[math]}$ is a statement that does not involve the variable $\text{[math]}$ , then we have:

5. $\text{[math]}$ .

6. $\text{[math]}$ .

Universal Quantifier Distribution Laws 1.3.6. For any predicates $\text{[math]}$ and $\text{[math]}$ we have the following equivalences:

1. $\text{[math]}$ .

2. $\text{[math]}$ .

3. $\text{[math]}$ .

If $\text{[math]}$ is a statement that does not involve the variable $\text{[math]}$ , then we have:

4. $\text{[math]}$ .

5. $\text{[math]}$ .

1.4 A Formal Language for Set Theory

Cantor employed an informal approach in his development of set theory. For example, Cantor regularly used the Comprehension Principle: The collection of all objects that share a property forms a set. Thus, given a property $\text{[math]}$ , the comprehension principle asserts that the collection $\text{[math]}$ is a set. Using this principle, one can construct the intersection of two sets $\text{[math]}$ and $\text{[math]}$ via the property “ $\text{[math]}$ and $\text{[math]}$ ”; namely, the intersection of $\text{[math]}$ and $\text{[math]}$ is the set $\text{[math]}$ . Similarly, we can form the union of $\text{[math]}$ and $\text{[math]}$ to be the set $\text{[math]}$ . In addition, we obtain the power set of $\text{[math]}$ , denoted by $\text{[math]}$ , which is the set whose elements are all of the subsets of $\text{[math]}$ ; that is, $\text{[math]}$ . The comprehension principle allowed Cantor to establish the existence of many important sets. Today Cantor’s approach to set theory is referred to as naive set theory.

Cantor’s set theory soon became an indispensable tool for the development of new mathematics. For example, using fundamental set theoretic concepts, the mathematicians Émile Borel, René-Louis Baire, and Henri Lebesgue in the early 1900s created modern measure theory and function theory. The work of these mathematicians (and others) demonstrated the great mathematical utility of set theory.

Relying on Cantor’s naive set theory, mathematicians discovered and proved many significant theorems. Then a devastating contradiction was announced by Bertrand Russell. This contradiction is now called Russell’s paradox. Consider the property $\text{[math]}$ , where $\text{[math]}$ is understood to represent a set. The comprehension principle would allow us to conclude that $\text{[math]}$ is a set. Therefore,

Clearly, either $\text{[math]}$ or $\text{[math]}$ . Suppose $\text{[math]}$ . Then, as noted in $\text{[math]}$ , $\text{[math]}$ must satisfy the property $\text{[math]}$ , which is a contradiction. Suppose $\text{[math]}$ . Since $\text{[math]}$ satisfies $\text{[math]}$ , we infer from $\text{[math]}$ that $\text{[math]}$ , which is also a contradiction.

Russell’s paradox thus threatened the very foundations of mathematics and set theory. If one can deduce a contradiction from the comprehension principle, then one can derive anything; in particular, one can prove that $\text{[math]}$ . Cantor’s set theory is therefore inconsistent, and the validity of the very important work of Borel and Lebesgue then became questionable. It soon became clear that the comprehension principle needed to be restricted in some way and the following question needed to be addressed: How can one correctly construct a set?

Ernst Zermelo resolved the problems discovered with the comprehension principle by producing a collection of axioms for set theory. Shortly afterward, Abraham Fraenkel amended Zermelo’s axioms to obtain the Zermelo–Fraenkel axioms that have now become the accepted formulation of Cantor’s ideas about the nature of sets. In particular, these axioms will allow us to construct a power set and to form the intersection and union of two sets. These axioms also offer a highly versatile tool for exploring deeper topics in mathematics, such as infinity and the nature of infinite sets.

Before presenting the axioms of set theory, we must first describe a formal language for set theory. This formal language involves the logical connectives $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ together with the quantifier symbols $\text{[math]}$ and $\text{[math]}$ . In addition, this formal language uses the relation symbols $\text{[math]}$ and $\text{[math]}$ (also $\text{[math]}$ and $\text{[math]}$ ).

What is a formula in the language of set theory? An atomic formula is one that has the form $\text{[math]}$ or $\text{[math]}$ , where $\text{[math]}$ can be replaced with any other variables, say, $\text{[math]}$ . We say that $\text{[math]}$ is a formula (in the language of set theory) if $\text{[math]}$ is an atomic formula, or it can be constructed from atomic formulas by repeatedly applying the following recursive rule: If $\text{[math]}$ and $\text{[math]}$ are formulas, then the next seven items are also formulas:

Hence, $\text{[math]}$ is a formula in the language of set theory because it can be constructed from the atomic formulas $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ and repeated applications of the above recursive rule. Figure 1.5 illustrates this construction, where the statement $\text{[math]}$ is used to abbreviate $\text{[math]}$ .

Figure 1.5. Construction of the formula $\text{[math]}$

Formulas are viewed as “grammatically correct” statements in the language of set theory. Moreover, the expression $\text{[math]}$ is not a formula because it cannot be constructed from the atomic formulas and the above recursive rule. In practice, we shall use parentheses so that our formulas are clear and readable. We will also be using, for any formulas $\text{[math]}$ and $\text{[math]}$ , the following three conventions:

1. The outermost parentheses need not be explicitly written; that is, one can write $\text{[math]}$ to denote $\text{[math]}$ .

2. The negation symbol will apply to as little as possible. We can therefore use $\text{[math]}$ to denote $\text{[math]}$ .

3. Bounded set quantifiers shall be used. Thus, we can abbreviate the formula $\text{[math]}$ by the more readable $\text{[math]}$ .

We will also use symbols that are designed to make things easier to understand. For example, we may write $\text{[math]}$ rather than $\text{[math]}$ .

Throughout the book, we will be using the notation $\text{[math]}$ to identify $\text{[math]}$ as being free variables (see page 14) that appear in the formula $\text{[math]}$ . If the variables $\text{[math]}$ are free, then substitution may take place. Thus, we can replace all occurrences of $\text{[math]}$ , appearing in $\text{[math]}$ , with a particular set $\text{[math]}$ and obtain $\text{[math]}$ . Moreover, a formula $\text{[math]}$ may contain parameters, that is, free variables other than $\text{[math]}$ that represent unspecified (arbitrary) sets. Parameters denote “unassigned fixed sets.” For an example, let $\text{[math]}$ be the formula

So, $\text{[math]}$ has $\text{[math]}$ as an identified free variable, $\text{[math]}$ as a constant, and a parameter $\text{[math]}$ (an unassigned set). To replace a parameter $\text{[math]}$ in a formula $\text{[math]}$ with an specific set $\text{[math]}$ means that every occurrence of $\text{[math]}$ , in $\text{[math]}$ , is replaced with $\text{[math]}$ .

We will now explore the expressive power of this set theoretic language. For example, the formula $\text{[math]}$ asserts that the set $\text{[math]}$ is nonempty. Moreover, $\text{[math]}$ states that “it is not the case that there is a set that contains all sets as elements.” In addition, one can translate statements in English, which concern sets, into the language of set theory. Consider the English sentence “the set $\text{[math]}$ contains at least two elements.” This sentence can be translated into the language of set theory by $\text{[math]}$ .

Let $\text{[math]}$ be a formula with free variable $\text{[math]}$ and let $\text{[math]}$ be a set. The sentence “there is a set $\text{[math]}$ whose members are just those $\text{[math]}$ ’s that satisfy $\text{[math]}$ and $\text{[math]}$ ,” is represented by the formula $\text{[math]}$ .

Let $\text{[math]}$ and $\text{[math]}$ be formulas. Now consider the relationship

: (1.1)

This relationship can be translated into the language of set theory by

: (1.2)

Let $\text{[math]}$ be the formula in (1.2). One can verify that $\text{[math]}$ holds if and only if (1.1) holds. Note that for all $\text{[math]}$ there is a unique $\text{[math]}$ such that $\text{[math]}$ .

Exercises 1.4

1. What does the formula $\text{[math]}$ say in English?

2. What does the formula $\text{[math]}$ say in English?

3. What does the formula $\text{[math]}$ say in English?

4. What does the formula $\text{[math]}$ say in English?

5. What does the formula $\text{[math]}$ say in English?

6. Let $\text{[math]}$ be a formula. What does $\text{[math]}$ assert?

7. Translate each of the following into the language of set theory.

(a) $\text{[math]}$ is the union of $\text{[math]}$ and $\text{[math]}$ .

(b) $\text{[math]}$ is not a subset of $\text{[math]}$ .

(d) $\text{[math]}$ and $\text{[math]}$ have no elements in common.

8. Let $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ be sets. Show that the relationship

can be translated into the language of set theory.

1.5 The Zermelo–Fraenkel Axioms

The axiomatic approach to mathematics was pioneered by the Greeks well over 2000 years ago. The Greek mathematician Euclid formally introduced, in the Elements, an axiomatic system for proving theorems in plane geometry. Ever since Euclid’s success, mathematicians have developed a variety of axiomatic systems. The axiomatic method has now been applied in virtually every branch of mathematics. In this book, we will show how the axiomatic method can be applied to prove theorems in set theory.

We shall now present the Zermelo–Fraenkel axioms. Each of these axioms is first stated in English and then written in logical form. After the presentation, we will then discuss these axioms and some of their consequences; however, throughout the book we shall more carefully examine each of these axioms, beginning in Chapter 2. While reading these axioms, keep in mind that in set theory everything is a set, including the elements of a set. Also, recall that the notation $\text{[math]}$ means that $\text{[math]}$ are free variables in the formula $\text{[math]}$ and that $\text{[math]}$ is allowed to contain parameters (free variables other than $\text{[math]}$ ).

1. Extensionality Axiom. Two sets are equal if and only if they have the same elements.

2. Empty Set Axiom. There is a set with no elements.

3. Subset Axiom. Let $\text{[math]}$ be a formula. For every set $\text{[math]}$ there is a set $\text{[math]}$ that consists of all the elements $\text{[math]}$ such that $\text{[math]}$ holds. ²

4. Pairing Axiom. For every $\text{[math]}$ and $\text{[math]}$ there is a set that consists of just $\text{[math]}$ and $\text{[math]}$ .

5. Union Axiom. For every set $\text{[math]}$ there exists a set $\text{[math]}$ that consists of all the elements that belong to at least one set in $\text{[math]}$ .

6. Power Set Axiom. For every set $\text{[math]}$ there is a set $\text{[math]}$ that consists of all the sets that are subsets of $\text{[math]}$ .

7. Infinity Axiom. There is a set $\text{[math]}$ that contains the empty set as an element and whenever $\text{[math]}$ , then $\text{[math]}$ .

8. Replacement Axiom. Let $\text{[math]}$ be a formula. For every set $\text{[math]}$ , if for each $\text{[math]}$ there is a unique $\text{[math]}$ such that $\text{[math]}$ , then there is a set $\text{[math]}$ that consists of all the elements $\text{[math]}$ such that $\text{[math]}$ for some $\text{[math]}$ . (See endnote 2.)

9. Regularity Axiom. Every nonempty set $\text{[math]}$ has an element that is disjoint from $\text{[math]}$ .

The extensionality axiom simply states that two sets are equal if and only if they have exactly the same elements (see Definition 1.1.1(1)). The empty set axiom asserts that there exists a set with no elements. Since the extensionality axiom implies that this set is unique, we let $\text{[math]}$ denote the empty set.

The subset axiom proclaims that any definable subcollection of a set is itself a set. In other words, whenever we have a formula $\text{[math]}$ and a set $\text{[math]}$ , we can then conclude that $\text{[math]}$ is a set. Clearly, the subset axiom is a restricted form of the comprehension principle, but it does not lead to the contradiction that we encountered in Russell’s paradox. The subset axiom, also called the axiom of separation (see page 3), is described as an axiom schema, because it yields infinitely many axioms–one for each formula $\text{[math]}$ . Similarly, the replacement axiom is also referred to as an axiom schema.

The pairing axiom states that for any two given sets, there is a set consisting of just those two sets. Therefore, for all sets $\text{[math]}$ and $\text{[math]}$ , the set $\text{[math]}$ exists. Since $\text{[math]}$ , it follows that the set $\text{[math]}$ also exists for each $\text{[math]}$ .

The union axiom asserts that for any set $\text{[math]}$ , there is a set $\text{[math]}$ whose elements are precisely those elements that belong to at least one member of $\text{[math]}$ . More specifically, the union axiom proclaims that the union of any set $\text{[math]}$ exists; that is, there is a set $\text{[math]}$ so that $\text{[math]}$ if and only if $\text{[math]}$ for some $\text{[math]}$ . As we will see, the set $\text{[math]}$ is denoted by $\text{[math]}$ .

The infinity axiom declares that there is a set $\text{[math]}$ such that $\text{[math]}$ and whenever $\text{[math]}$ , then $\text{[math]}$ . Since $\text{[math]}$ , we thus conclude that $\text{[math]}$ . Now, as $\text{[math]}$ , we also have that $\text{[math]}$ . Continuing in this manner, we see that the set $\text{[math]}$ must contain all of the following sets:

Observe, by the extensionality axiom, that $\text{[math]}$ . One can also show that any two of the sets in the above list are distinct. Therefore, the set $\text{[math]}$ contains an infinite number of elements; that is, $\text{[math]}$ is an infinite set.

The replacement axiom plays a crucial role in modern set theory (see [8]). Let $\text{[math]}$ be a set and let $\text{[math]}$ be a formula. Suppose that for each $\text{[math]}$ , there is a unique $\text{[math]}$ such that $\text{[math]}$ . Thus, we shall say that $\text{[math]}$ is “uniquely connected” to $\text{[math]}$ . The replacement axiom can now be interpreted as asserting the following: If for each $\text{[math]}$ there is an element $\text{[math]}$ that is uniquely connected to $\text{[math]}$ , then we can replace each $\text{[math]}$ with its unique connection $\text{[math]}$ and the result forms a new set. In the words of Paul Halmos [7], “anything intelligent that one can do to the elements of a set yields a set.”

Given any nonempty set $\text{[math]}$ , the regularity axiom asserts the $\text{[math]}$ for some $\text{[math]}$ . Can a set belong to itself? The regularity axiom rules out this possibility (see Exercise 3).

The formulas in the subset and replacement axioms may contain parameters. We will soon be proving theorems about formulas that may possess parameters. Because parameters represent arbitrary sets, any axiom/theorem that concerns a generic formula with parameters is applicable whenever the parameters are replaced with identified sets. As a result, such an axiom/theorem can be applied when a formula contains fixed sets, as these sets can be viewed as ones that have replaced parameters. For example, the subset axiom concerns a generic formula $\text{[math]}$ . So this axiom can be applied when specific sets appear in $\text{[math]}$ .

This completes our preliminary examination of the set-theoretic axioms that were first introduced by Ernst Zermelo and Abraham Fraenkel; however, we will more fully examine each of these axioms in the remainder of the book. Furthermore, before we make our first appeal to a particular axiom, it shall be reintroduced prior to its initial application. In addition, we will not invoke an axiom before its time; that is, if we are able to prove a theorem without appealing to a specific axiom, then we shall do so. Accordingly, we will not be using the regularity axiom to prove a theorem until the last section of Chapter 8.

It is a most remarkable fact that essentially all mathematical objects can be defined as sets. For example, the natural numbers and the real numbers can be constructed within set theory. Consequently, the theorems of mathematics can be viewed as statements about sets. These theorems can also be proven using the axioms of set theory. Thus, “mathematics can be embedded into set theory.”

Exercises 1.5

1. Let $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ be sets. By the pairing axiom, the sets $\text{[math]}$ and $\text{[math]}$ exist. Using the pairing and union axioms, show that the set $\text{[math]}$ exists.

2. Let $\text{[math]}$ be a set. Show that the pairing axiom implies that the set $\text{[math]}$ exists.

3. Let $\text{[math]}$ be a set. The pairing axiom implies that the set $\text{[math]}$ exists. Using the regularity axiom, show that $\text{[math]}$ . Conclude that $\text{[math]}$ .

4. For sets $\text{[math]}$ and $\text{[math]}$ , the set $\text{[math]}$ exists by the pairing axiom. Let $\text{[math]}$ . Using the regularity axiom, show that $\text{[math]}$ , and thus $\text{[math]}$ .

5. Let $\text{[math]}$ , $\text{[math]}$ , and $\text{[math]}$ be sets. Suppose that $\text{[math]}$ and $\text{[math]}$ . Using the regularity axiom, show that $\text{[math]}$ . [Hint: Consider the set $\text{[math]}$ .]

6. Let $\text{[math]}$ and $\text{[math]}$ be sets. Using the subset and power set axioms, show that the set $\text{[math]}$ exists.

7. Let $\text{[math]}$ and $\text{[math]}$ be sets. Using the subset axiom, show that the set $\text{[math]}$ exists.

8. Show that no two of the sets $\text{[math]}$ , $\text{[math]}$ , $\text{[math]}$ are equal to each other.

9. Let $\text{[math]}$ be a set with no elements. Show that for all $\text{[math]}$ , we have that $\text{[math]}$ if and only if $\text{[math]}$ . Using the extensionality axiom, conclude that $\text{[math]}$ .

10. Let $\text{[math]}$ be the formula $\text{[math]}$ which asserts that $\text{[math]}$ . As noted on page 25, for all $\text{[math]}$ the set $\text{[math]}$ exists. So $\text{[math]}$ . Let $\text{[math]}$ be a set. Show that the collection $\text{[math]}$ is a set.