In this chapter, it shall be shown that number theory can be embedded within set theory. One consequence of this embedding is that all of the theorems in number theory can be proven from the axioms of set theory.
In order to show that number theory is, in fact, a special branch of set theory, we must first represent each natural number as a set. How can one define the natural numbers in set theory? To answer this question, we next consider a set theoretic construction that makes sense for any set.
Definition 4.0.1. For each set , the successor
is the set that is
obtained by adjoining
to the elements of
, namely,
.
We note the following three properties concerning the successor of a
set :
Using the successor operation, we can now construct, in set theory, the first few natural numbers as follows:
We note some interesting properties that these “natural numbers” possess (see the above bulleted list):
The existence of an infinite set is crucial in modern set theory and mathematics. Zermelo realized that one cannot prove that an infinite set exists and thus found a fairly simple way to assert the existence of an infinite set.
Infinity Axiom. There is a set that contains the empty set as an
element and whenever
, then
.
Using the infinity axiom, we shall prove that there is a set consisting of
only the natural numbers; this set is denoted by . First, we define a
property of a set that will ensure that each natural number belongs to such a
set. A set that satisfies this property will contain the empty set and will be
closed under the successor operation.
Definition 4.1.1. A set is said to be inductive if and only if
An inductive set contains and also contains the successor of each one
of its elements. Hence, an inductive set must contain all of the “natural
numbers.” Clearly, the infinity axiom declares that there is an inductive set.
Therefore, is a natural number if and only if
is in every
inductive set. We now want to prove that there is a set consisting
of just the natural numbers. To do this, we need to prove that the
class
is a set. By the infinity axiom, we know that there is an inductive set
. Hence, for each
, if
is in every inductive set, then
.
Theorem 2.1.3 therefore implies that (4.1) defines a set, and this set consists
of the natural numbers.
Of course, is the set
, but we will identify
as the set
that satisfies
Theorem 4.1.4. The set is inductive and
is a subset of every
inductive set. Thus,
is the smallest inductive set.
Proof. We first prove that is an inductive set; that is, we shall prove
that
Since is in every inductive set, (4.2) implies that
. To prove (2), let
. We thus conclude that
is in every inductive set. So
is also in
every inductive set. Hence,
and (2) holds. Now let
be an
inductive set and let
. The above (4.2) implies that
is in every
inductive set. Therefore,
and thus,
. ☐
We now prove the following simple corollary.
Proof. Let be an inductive set such that
. Since
is an
inductive set, Theorem 4.1.4 implies that
. Hence,
. ☐
Corollary 4.1.5 is, in fact, a restatement of the principle of mathematical
induction. Suppose that is some property. To prove by induction
that
is true, let and so,
. If we can prove
that
then is inductive and thus,
by Corollary 4.1.5. Therefore,
holds. Item (1) is often called the base step, and (2) is
commonly refereed to as the inductive step. We apply this method of proof to
establish our next theorem.
Proof. Let . Clearly,
. Let
. So either
or
for some
. Therefore,
or
where
. Thus,
. Hence, by Corollary 4.1.5,
. ☐
Therefore, for any set we have that
Example 1. The empty set is, vacuously, transitive. Each of the following
sets is also transitive:
On the other hand, the set is not transitive because
and
yet
. Similarly, the set
is not transitive.
Remark 4.1.8. Regrettably, we are now using the word “transitive” to
refer to two different concepts. A relation on a set
is transitive
provided that for all
,
,
in
, if
and
, then
(see Definition 3.2.11). A set
is transitive, if whenever
,
then
. If a set
is transitive, then one cannot infer that the
membership relation
on
is a transitive relation; that is, one
cannot always conclude that if
and
, then
, for all
,
,
in
.
Proof. Suppose that is a transitive set. Then
Therefore, . ☐
Proof. We prove that by induction. Let
Clearly, , as
is a transitive set. Let
. So
is
transitive. To prove that
is transitive, we will show that
(see 4.4). By Theorem 4.1.10,
. Since
(see item 3 on
page 83), we conclude that
. Hence,
is a transitive set and
thus,
. By Corollary 4.1.5, we conclude that
, and this
completes the proof. ☐
Theorem 4.1.11 implies that if and
, then
, whenever
is a natural number. We now prove that the successor function on
is
one-to-one.
Proof. Let and
. It follows from Theorem 4.1.11 that
and
are transitive sets. Assume that
. Thus,
.
Therefore,
by Theorem 4.1.10. ☐
Proof. We shall prove that by mathematical induction. Let
. Clearly,
, because
and
. Let
.
So (IH)1
. We shall prove that
. To do this, let
. We must
prove that
. Since
, we conclude that
or
. If
, then
by (IH). If
, then
because
. Hence,
. Therefore,
. ☐
Exercise Notes: For Exercise 7, apply Theorem 4.1.12 for the inductive
step. For Exercise 9, let and let
.
Prove that
is inductive. Assume
. To prove
, let
.
So
or
. If
, then
implies that
and
thus,
. If
, then
by Exercise 8.
Induction is frequently applied in mathematical proofs. Moreover,
one often defines a function by induction (recursion). A recursively
defined function is one that is defined in terms of “previously evaluated
values of the function.” A function on
is defined recursively if
its value at
is first specified and all of the remaining values are
then defined using an earlier value, together with a given function.
For example, let
be a function. Suppose that the initial
value of a function
is given to be
for a fixed
,
and to get the next value
we use the initial value to obtain
. Similarly, to get the next value
we must
evaluate
. By continuing in this manner, we can evaluate
more and more values of the function
. A more succinct way of
describing this function
is to first define (i)
and then
define the remaining values by (ii)
. Thus, one can
evaluate
,
, …,
. Just because we can evaluate a finite
number of values, do we really know that there is such a function
with domain
? The existence of such a function seems quite
reasonable. In mathematics, however, the acceptance of an existential
statement requires a valid argument, that is, a proof. The proof of our
next theorem shows that such a function does exist and that it is
unique.
Recursion Theorem 4.2.1. Let be a set and
. Suppose that
is a function. Then there exists a unique function
such that
The function in the recursion theorem, satisfying (1) and (2), is
said to be defined by recursion. If a function has been defined by
recursion, then proofs of statements about this function often use “proof
by induction.” Before we prove the recursion theorem, we make an
observation. First, let us assume that the function
exists. When
viewing
as a set of ordered pairs, conditions (1) and (2) imply that
In our proof, we will show that there exists a function that satisfies the
above bulleted conditions.
Proof. Let be a set,
, and let
be a function. Let us
call a relation
suitable if the following two conditions
hold:
Our goal is to construct a suitable relation that is also a function.
Since every suitable relation is a subset of , Theorem 2.1.3 implies
that the class
is, in fact, a set. Because
is a
suitable relation,
is nonempty. Let
. Clearly,
, and
so
is a relation. We now prove that
is suitable and that it is a
function.
Proof. We need to show that satisfies items (i) and (ii). Every
relation in
is suitable. So
is an element in every relation
in
. Thus,
; that is,
. To prove (ii), assume
that
. As
, it follows that
belongs to
every relation in
. Let
. Since
and
is
suitable, item (ii) implies that
. So
belongs
to every relation in
, and hence,
. Thus,
is
suitable. (Claim 1) ☐
We now must prove that is a function from
to
.
Proof. To prove that is a function from
to
, we must show
that for each
, there is exactly one
such that
.
Let
be defined by
We now prove that is an inductive set. To do this, we must first
prove that
. We know that
. To prove that
,
we need to show there is no
where
. Suppose, for a
contradiction, that
where
. Consider the relation
Clearly, ()
. As
is suitable, it easily follows that
is also suitable (because
for all
). Hence,
.
Since
, we conclude that
, and thus,
, which
contradicts (
). Consequently,
is the only element in
such that
. Therefore,
.
Let ; that is, assume that (IH) there is a unique
such that
. Because
is suitable, we conclude that
. To show that
, we must show there is
no
where
. Assume, for a contradiction, that
where
. Consider the relation
Clearly, ()
. We now show that
is suitable. Since
is suitable and
, we have only to show that if
,
then
. To do this, assume that
. Thus,
, and so
. Since
,
,
and
, we conclude from (IH) that
, and thus,
. Because
and
, we infer
that
. So
is suitable. Hence,
. Since
, we conclude that
, and thus,
, which
contradicts (
). Therefore,
is the only element in
such that
. So
, and thus,
is inductive.
Corollary 4.1.5 implies that
, and as a result,
is a function
from
to
. (Claim 2) ☐
Because the function is suitable, it satisfies conditions (1)
and (2) given in the statement of the theorem. To prove that
is unique, suppose that
also satisfies properties (1)
and (2). Thus,
is a suitable relation, and hence,
. Since
, we have that
. Exercise 10 on page 68 implies that
. Therefore,
is unique. (Theorem) ☐
Example 1. Let and let
. Theorem 4.2.1 implies
that there is a function
such that
and
, for all
.
When can we be assured that the function in Theorem 4.2.1 is
one-to-one? Our next theorem provides an answer.
Theorem 4.2.2. Let be a set,
,
, and
be
a function. Suppose that
satisfies
If is one-to-one, then
is also one-to-one.
Proof. Let be one-to-one,
,
, and let
satisfy (1) and (2). We prove that for all
and all
, if
, then
. Let
be defined by
We shall show that is an inductive set. To prove that
, we must
show that
So let and assume that
. By (1), we have
.
Hence,
. Suppose that
. Then
for some
by
Theorem 4.1.6. Since
, we conclude that
. The above
item (2) now implies that
. Thus,
, which
contradicts the fact that
. Hence,
. Therefore, (4.5) holds,
and so
.
Let . In other words, assume the induction hypothesis
We shall prove that ; that is, we will prove that
Let and suppose that (
)
. Because
, the
above (4.5) and (
) imply that
. So
for some
.
Hence,
. Item (2) implies that
. Since
is one-to-one, we conclude that
. Our induction hypothesis (IH)
implies that
, and thus,
; that is,
. Hence,
,
and so
is an inductive set. Therefore,
and
is a one-to-one
function. ☐
The Peano postulates offer an axiomatic foundation for the natural numbers. These postulates were first presented by the nineteenth-century Italian mathematician Giuseppe Peano. In 1888, Richard Dedekind proposed a set of axioms about the natural numbers. Peano then published a more precisely formulated version of these axioms in his 1889 book The principles of arithmetic presented by a new method [13]. The essence of the Peano postulates shall be summarized by means of the following two definitions.
Let denote an ordered triple, that is,
.
The above condition (3) is referred to as the induction postulate. Figure 4.1 illustrates a Peano system.
Theorem 4.1.4 shows that is inductive. Thus, we can define the
function
by
for all
. We can now prove that a
Peano system exists.
Proof. Since is inductive, we have that
and that
for
all
. Clearly,
for all
, as
and
.
Thus,
. Theorem 4.1.12 shows that
is one-to-one.
Finally, we need to show that
satisfies the induction postulate.
Let
be such that
and
is closed under
. So
is inductive. Corollary 4.1.5 implies that
. Therefore,
is a Peano system because it satisfies conditions (1)–(3) of
Definition 4.2.4. ☐
Definition 4.2.6. Let and
be Peano systems. Then
is isomorphic to
when there is a bijection
such that
The function is said to be an isomorphism from
onto
.
Two Peano systems are isomorphic if they are essentially the same
structure. We now prove that is isomorphic to any Peano system.
Thus, the set of natural numbers
is “the same” as the set
that is used
in calculus.
Proof. Let be a Peano system. So
is one-to-one,
, and
. Since
, Theorem 4.2.1 implies that
there is a function
such that
As and
is one-to-one, Theorem 4.2.2 implies that
is
one-to-one. To show that
is onto
, we use the induction postulate
(Definition 4.2.4(3)). Clearly,
. Since
, we see that
. Let
. Thus,
for some
. Hence,
. By (2), we conclude that
. So
, and
is thereby closed under
. The induction
postulate implies that
, that is,
is onto
. Therefore,
is
an isomorphism from
onto
. ☐
Theorem 4.2.7 implies that the Peano postulates characterize the system of natural numbers (up to isomorphism). Theorem 4.2.7 also implies that any two Peano systems are isomorphic to each other (see Exercises 7–8).
Theorem. Let be a set and let
. Suppose that
is a function. Then there exists a unique function
such
that
Assume that is one-to-one and onto
. Prove that
is
one-to-one.
Let be as stated in Theorem 4.2.1. Prove that
for all
.
Now address the following:
Exercise Notes: Exercises 4–5 do not involve Theorem 4.2.1, but
they do apply some of the ideas in its proof. For Exercise 4(e) and
for Exercise 5(e), in the direction (), prove that
.
For Exercise 6, item (a) implies (c); to prove (d), use (b) and (c)
together with the results of Exercise 5(c)(d) to show that
and
.
In this section, the recursion theorem will be used to define the operations of addition, multiplication, and exponentiation on the natural numbers. We will also prove the properties of arithmetic that we were taught as children. We first establish the following result.
Theorem 4.3.1. Let and
be functions.
Then there is a unique function
such that for each
,
Proof. For each , by the recursion theorem (viewing
as a
constant), there is a unique function
such that
Since is unique for each
, the connection
thus produces
a function with domain
. We can now define the function
by
. The function
easily satisfies conditions (1) and (2)
for all
, and it is unique. ☐
The function in Theorem 4.3.1 is defined in terms of two known
functions. Thus, before one can apply Theorem 4.3.1, one must first
introduce the two relevant functions
and
.
Recalling Definition 3.5.3, a function of the form is called
a binary operation on
. Our next definition is an application of
Theorem 4.3.1; that is, using the functions
and
,
we define addition on the natural numbers.
for all natural numbers and
. The function
shall be referred to
as addition, and we define the binary operation
on
to be
for all and
in
.
Proof. Let and
be elements in
. Recalling that
,
we see that (1) and (2) in Definition 4.3.2 immediately produce (A1) and
(A2). ☐
The next proposition shows that is equal to
, the
successor of
. After we prove this proposition, we will show that
.
Proof. Let . Since
, we obtain
and thus, . ☐
We can now prove that as follows:
Now that we have the operation of addition, we can apply
Theorem 4.3.1 to define multiplication on the natural numbers using the
functions
and
.
for all natural numbers and
. The function
is called
multiplication, and we define the binary operation
on
to be
for all and
in
.
Proof. Let and
be elements in
. Recalling that
,
we see that (1) and (2) in Definition 4.3.5 yields (M1) and (M2). ☐
Having presented the set-theoretic definitions of addition and multiplication, we will now show that some of the familiar laws of arithmetic are provable within set theory. Afterwards, we shall define exponentiation.
Proof. Let and
be elements in
, and let
We shall show that is an inductive set. To show that
, observe
that
and thus, , and so
. Assume that
.
Then
Therefore, , and hence,
. ☐
For , we know that
by (A1). Since we have not
proven that addition is commutative, we cannot immediately infer that
, and thus, we must prove that this latter equation holds for all
.
Proof. We prove that by induction. Let
We prove that is inductive. Since (A1) implies that
, we have
that
. Let
. So
. We prove that
as
follows:
Therefore, , and this completes the proof. ☐
Item (A2) states that for natural numbers
and
. We will now prove that we also have that
. Then
we will be able to prove that addition is commutative.
Proof. Let be arbitrary. We prove that
by induction. Let
To prove the lemma, we just need to show that is inductive.
First, we verify that
by showing that
as
follows:
Thus, . Suppose that
. Then
and hence, . Therefore,
and
is
inductive. ☐
Lemmas 4.3.8 and 4.3.9 will be used to prove that addition is commutative.
Proof. Let be any element in
and let
We prove that is an inductive set. Observe that
by
Lemma 4.3.8 and (A1). So
. Assume that
. Then
Hence, , and therefore,
. ☐
Using proof by induction, we will now show that the familiar distributive law of arithmetic holds. We will thus have additional evidence to support the assertion “Mathematics can be embedded in set theory.”
Proof. Let and
be in
. Now let
We will show that is an inductive set. To verify that
, we
have
Thus, , and so
. Now, suppose that
.
Then
Hence, , and thus,
. ☐
Our next two theorems verify two fundamental properties of multiplication.
Proof. Let and
be elements in
and let
We shall show that is an inductive set. To show that
, observe
that
and thus, , and so
. Assume that
.
Then
Therefore, , and hence,
. ☐
Proof. See Exercise 6. ☐
We will now apply Theorem 4.3.1 to define exponentiation on the natural
numbers, using the functions and
.
Definition 4.3.14. Let be the unique function satisfying
for all natural numbers and
. The function
is called exponentiation.
We define the binary operation, denoted by
, to be
for all and
in
.
Exercise Notes: For Exercise 1, suppose and then use Theorem 4.1.6.
For Exercise 2, assume that
and use Exercise 1 to prove that
. For Exercise 3, Theorem 4.1.12 can be used in the inductive
step.
We now want to develop a theory of order on the natural numbers. First, we define the “less than” relation on the natural numbers. Then we prove that this relation satisfies the trichotomy law (see Theorem 4.4.9) as well as the standard connections with addition and multiplication. How can we get such a relation in set theory?
Recall that for each , we defined the successor of
to be the
natural number
, and thus,
. As a result, we have the
following infinite membership list:
which is very much like the usual relation on the natural numbers; that
is,
It turns out that the order relation that we need is just “,” and, in fact,
we will prove that this is the “less than” relation that we were looking
for.
In this section, we will view “” as the “less than” relation on
the natural numbers, and we shall no longer use the symbol
in our initial discussion of ordering on
. We will show that the
relation
on
satisfies the trichotomy law, and we will show
that
is preserved under the operations of addition and nonzero
multiplication.
Theorem 4.1.11 and Theorem 4.1.13 assert, respectively, that each
natural number is a transitive set and that
is a transitive set. Hence,
we have the following result.
Proposition 4.4.2. Let be a natural number. Then
Item (2) proclaims that is a transitive relation on
. One can also
prove, without using the regularity axiom, that no natural number is “less
than” itself.
Using Definition 4.4.1, we can now easily define the “less than or equal” relation on the natural numbers.
Since is a transitive relation on
, it thus follows that
is also a
transitive relation on
. Clearly,
is reflexive. We will now show that it
is antisymmetric (see Definition 3.4.1).
Proof. Let and
satisfy
and
. Thus,
and
by Proposition 4.4.2(1). Therefore,
. ☐
Hence, is a partial order on
. Is the relation
a total order on
? To show that it is, we shall prove that the relation
satisfies the
trichotomy law on
. First, we establish two lemmas.
Proof. We shall prove that the set
is inductive. Clearly, . Assume
. Hence,
. So either
or
. If
, then, as
, we see that
. If
, then we have
and
. By transitivity (see
Proposition 4.4.2(2)), we conclude that
. Thus, in either case,
and so,
. Therefore,
is inductive and
. ☐
Proof. Let . We prove that
by
induction. Let
Clearly, as
is true vacuously. Let
.
To prove that
, assume that
. We shall show that
. Observe that (
)
. Because
, either
or
. If
, then
as
. Since
,
we see that (
) and transitivity imply that
. If
,
then
. Thus, (
) implies that
. Therefore,
. ☐
Proof. Assume that and
. By Lemma 4.4.7, we just need
to prove if
, then
. So let
. Recall that (
)
. Since
, we have either
or
. If
, then
by (
) and transitivity. Also, if
, then
(
) implies that
. ☐
The following theorem shows that is a total order on
; that is, for
all
and
in
, either
or
.
Proof. First, we show that any two of the relations ,
,
cannot hold at the same time, whenever
and
are natural
numbers. To do this, we just have to show that the following two
conjunctions do not hold:
when and
. Conjunction (a) implies that
, which
contradicts Lemma 4.4.3. Conjunction (b), by transitivity, also implies that
. Hence, both (a) and (b) do not hold.
We now prove that . To
prove this statement, let
and let
We will show that is inductive. Lemma 4.4.6 implies that
. Let
, that is, assume that
We must show that
To do this, we consider each of the three cases in (IH) separately. If ,
then
by the fact that
and transitivity. If
, then
since
. If
, then Lemma 4.4.7 implies that
. Thus, either
or
. So in each of these three
cases in (IH), we obtain one of the conjuncts in (4.6). Therefore,
,
and the proof is complete. ☐
The trichotomy law will now allow us to do the following:
Proof. Let and
. Assume that
. Because
is
transitive, we have that
. Since
, Theorem 4.4.9 implies that
. So
. To prove the converse, assume
. Thus,
.
From Theorem 4.4.9, we have either
or
. If
, then we
would have that
, as
. This contradicts Theorem 4.4.3. Thus,
. ☐
Corollary 4.4.10 implies that for all natural numbers and
,
When and
are natural numbers, we can now define
, which is the larger of the two natural numbers
and
(see Exercise 5).
Our next theorem shows that the expected properties of “inequality” hold for the natural numbers.
Theorem 4.4.11 (Properties of Inequality). For all natural numbers ,
,
and
, the following hold:
Proof. Item (1) follows from Exercises 6 and 7 of this section. Similarly, (2) is established via Exercises 8 and 9. ☐
Corollary 4.4.12 (Cancellation Laws). For all natural numbers ,
, and
, the following hold:
Proof. See Exercise 10. ☐
Corollary 4.1.5 (page 85) confirms the fact that the principle of
mathematical induction is provable from the first seven axioms of
Zermelo–Fraenkel (ZF) set theory. Virtually every undergraduate
text in mathematics (e.g., number theory, real analysis, and abstract
algebra) just cites this principle and does not attempt to prove it
directly. Our next theorem shows that these seven axioms also imply the
well-ordering principle: Every nonempty subset of has a least
element.
Theorem 4.4.13 (Well-Ordering Principle). Let be a nonempty
subset of
. Then
has a least element; that is, there is an
such that
for all
.
Proof. Let be a nonempty subset of
. Suppose, for a contradiction,
that
has no least element. Let
. So
is the set of
that are less than or equal to every element in
. Note
that (
)
because if
, then
and
would be
a least element in
.
We will now show that is inductive. Clearly,
, by Lemma 4.4.6.
Let
; that is, suppose that
for all
. Since
has no
least element, it follows that
. Therefore,
for all
.
Hence,
for all
by Exercise 3. So
and
is
inductive. Thus,
and (
) now implies that
.
Nevertheless, because
and
is nonempty, we also conclude that
. This contradiction completes the proof. ☐
We end this section with a proof of the strong induction principle on
.
Proof. Let satisfy (4.7). Suppose, to the contrary, that there is a
natural number that is not in
. Thus, the set
is nonempty.
Theorem 4.4.13 now implies that
has a least element
. Hence,
and if
, then
. Therefore,
. As
satisfies
(4.7), we infer that
, a contradiction. ☐
We can now define a special relation symbol with which we are all
familiar. Let and
be in
. We define
if and only if
,
and define
if and only if
. So in the future there may times
when we shall use the symbols
and
, rather than
and
, if it
seems appropriate.
Remark 4.4.15. Using the system of natural numbers ,
one can give set-theoretic constructions of the set of integers, the set
of rational numbers, and then the set of real numbers. A method for
constructing the set of integers is suggested in Exercise 9 on page 82.
In [5], Enderton gives a more detailed description of this method and
constructs the set of rational numbers and the set of real numbers.
Exercise Notes: For Exercise 3, apply Theorem 4.4.9. For Exercise 4, apply
Exercise 3. For Exercise 6, use induction. For Exercise 7, apply
Exercise 6 and Theorem 4.4.9. For Exercise 8, use induction. For
Exercise 9, use Exercise 8 and Theorem 4.4.9. For Exercise 13,
use Exercises 11–12. For Exercise 14, let and prove that
is inductive. For Exercise 15, one can
apply Theorem 4.4.13 and proof by contradiction: Assume that there
is such function
. Let
. Let
be the
least element in
. Thus,
for a
. For Exercise 16,
use the well-ordering principle and Exercise 2. For Exercise 17, use
induction and review 3.3.8(3) on page 60. For Exercise 18, prove that
is an inductive set. For Exercise 19, in the inductive step one can use
Exercise 3. Exercises 19–20 yield a proof the Division Algorithm: Let
and
be natural numbers, where
. Then there exist unique natural
numbers
and
such that
and
.