The Irrationals

Continued Fractions Revisited

If you change the way you look at things, the things you look at change.

Max Planck

In chapter 3 we considered the application of continued fractions in the eighteenth century, as they were used to grind out the first proofs of the irrationality of e and π. Their role in the study of irrational numbers is far greater though, with a number of the results of previous chapters implicitly dependent on them; here we make some of that dependence explicit.

We shall continue to restrict our interest to simple continued fractions and so to finite or infinite expressions of the form

and we will adopt the notation that the nth convergent of the continued fraction is written

We will first dispose of the finite case, and in doing so rational numbers, with the following two results.

1. If two simple continued fractions are equal,

[a₀; a₁, a₂, a₃, . . . , a_n] = [b₀; b₁, b₂, b₃, . . . , b_m],

and a_n, b_m > 1 (a technicality needed to cope with a non-serious ambiguity in the final digit of the representation), then m = n and a₁ = b₁, a₂ = b₂, a₃ = b₃, . . . , a_n = b_m.

2. A number is rational if and only if its simple continued fraction expansion is finite.

These we will not prove, but their veracity is easily established by simple proofs that are part of elementary theory and easily found.

With these we move to irrational numbers and so to the infinite continued fractions that can be used to represent them.

First, a result of Euler in the eighteenth century and Lagrange in the nineteenth combine to an important categorization, which again we choose not to prove:

The continued fraction representation of a number is periodic if and only if the number is a quadratic surd.

We discussed quadratic surds in chapter 3. Periodic continued fractions, just as with periodic decimals, are those that at some stage repeat themselves. For example,

We shall return to quadratic irrationals soon, but first we will look at the general case, wherein we have the necessarily infinite continued fraction representation of an irrational number α, without regard to any pattern in its digits.

The Hierarchy of the Convergents

First, the continued fraction process can be conveniently represented¹ using matrix multiplication in that, with our established notation,

which means that

and this means that

from which we can see that both numerator and denominator of the convergents are increasing with n.

These relationships can be developed into the interrelationships between the p_ns and q_ns in that

Our interest is with the first of the two, which we will all but prove.

The left side, as a function of n, can write in terms of n − 1 in that

Observe now that p₀ = a₀, q₀ = 1, p₁ = a₀a₁ + 1, q₁ = a₁ and we can quickly check the initial truth of the statement and we have all we need for an inductive proof; alternatively, the use of the multiplicative nature of determinants can be utilized to give the result.

This result discloses three things:

1. The greatest common divisors,

[p_n, q_n] = [p_n+1, p_n] = [q_n+1, q_n] = 1.

2. The form

shows that the convergents approximate α alternatively from above and below.

3. Adding this form gives

and the series on the left cascades to

and so we have

From this last, and with the q_ns known to be increasing, we conclude

and it is clear that we can continue this to

Similarly with the odd convergents

which continues to

Finally, we link the odd and the even convergents with

to get

Table 8.1.

and we have the full hierarchy

So the even-numbered convergents form a monotone increasing sequence converging to α and the odd-numbered convergents form a monotone decreasing sequence converging to α. Table 8.1 demonstrates the behaviour for α = π.

Note particularly that the path left−right−left . . . provides an oscillating sequence and that the biggest even convergent is smaller than the smallest odd convergent − and that π lies between them, as it does between every consecutive pair of them.

Rational Approximation Revisited

In chapter 6 we repeatedly pressed for ever greater accuracy of the rational approximation to a given irrational number, and we judged an approximation p/q to be better than p′/q′ provided that |qα − p| < |q′ α − p′|. As we squeezed the tolerance so we sieved the irrationals, finishing with the Lagrange Spectrum of ‘break-points’, which is intimately connected to the exotic Markov Spectrum of numbers. Our proofs were non-existential and disguised the fact that continued fractions usually lay at their heart, and we will here take the opportunity to revisit some of them, couched in this alternative language.

The convergents p_n/q_n are the best possible approximations to a number in the sense that if p and q are integers with q < q_n+1 then |qα − p| |q_nα − p_n|.

We will prove this and do so by assuming that, on the contrary, |qα − p| < |q_nα − p_n|, where q < q_n+1.

If we define the two numbers u and v by the equations

p = up_n + vp_n+1 and q = uq_n + vq_n+1.

Then u and v are integers, since,

and so

which means that

Furthermore, v ≠ 0, since if it were, p = up_n, q = uq_n and we would have the contradiction

Also, u ≠ 0, since if it were, q = |v|q_n+1, which contradicts q < q_n+1.

Further, u and v must be of opposite signs since, if v < 0 up_n = p − vp_n+1 ⇒ u > 0, whereas, if v > 0, q < vq_n+1 ⇒ uq_n < 0 ⇒ u < 0.

Also note that

must be of opposite signs because, from observation (3) on page 213, α lies between p_n/q_n and p_n+1/q_n+1. Combine these results to the observation that these two must have the same sign:

But then

which is, once again, a contradiction.

We also argued that any irrational number can be approximated so that |α−p/q| < 1/(2q²). These next two results show precisely how.

If |α − p/q| < 1/(2q²) then p/q is a convergent of the continued fraction of α.

To see this, let p_n/q_n be some convergent of α and, with the best approximation result above, consider

Write the spectrum of denominators of the convergents of α as {q₁, q₂, q₃, . . .} and choose n so that q_n q < q_n+1, in which case, 1/q 1/q_n and we have

which means that the positive integer |pq_n − qp_n| < 1 and so it must be that p/q = p_n/q_n.

For any irrational number α, and for any two consecutive convergents, p_n/q_n, p_n+1/q_n+1 to α, either or or both.

Now we have

Using the result that α lies between two consecutive convergents, we have

But

(a − b)² = a² + b² − 2ab > 0

Take a = 1/q_n and b = 1/q_n+1, then

this means that either or both of these inequalities hold

As we pressed further we met the result of Hurwitz, which in part showed that the 2 can be increased to . Again, the (and his) use of partial fractions reveals why, with the result:

For any 3 consecutive convergents of the irrational number α, p_n/q_n, p_n+1/q_n+1, p_n+2/q_n+2 at least one must satisfy |α − p/q| < 1/(q²).

For, suppose not, then

Write = q_n+1/q_n and we have + 1/ . Since is rational we have + 1/ < . So,

This means that and in particular which makes

Write μ = q_n+2/q_n+1 and . But q_n+2 = a_n+2q_n+1 + q_n so q_n+2/q_n+1 = a_n+2 + q_n/q_n+1 1 + q_n/q_n+1:

A contradiction.

Now let us see how the use of continued fractions and this result of Hurwitz can be combined to give a measure of irrationality.

The World’s Most Irrational Number

We know from this version of Hurwitz’s theorem that for every irrational number there are an infinite number of rational approximants satisfying |α − p/q| < 1/(q²) and that we should look to the continued fraction expansion of α for the best among them. Among this infinite number of ever more accurate approximants we can seek any for which the Hurwitz bound of the error is large compared with the actual error of the approximation. Consider, for example, table 8.2, which lists the first four convergents to π which are within the Hurwitz-prescribed error margin: and note the second row of data. We see that the error in the approximation is far smaller than the guaranteed error margin and we can quantify this discrepancy using the ratio of the two, which appears in the final column of the table. With this we can make precise the idea one number being ‘more irrational’ than another:

Table 8.2.

Table 8.3.

α is deemed more irrational than β if the minimum of the E/H ratio for α is greater than that for β. We judge that approximations with E/H near 0 are tighter than we might expect, whereas those near 1 are barely creeping into compliance with the Hurwitz result. Table 8.3 gives the data for e and table 8.4 that for the Golden Ratio.

It is looking as though e might be harder to approximate than π − and that the Golden Ratio might be more resistant still, with the relative error barely keeping below the upper limit of 1 and seemingly approaching it. In the front matter we hinted that the Golden Ratio is, in fact, the most irrational number: with the stage now set, we will prove it so. To be precise we will show that, with the continued fraction approximations,

Table 8.4.

or, put another way,

To begin the proof, we know that

And also we know the Binet (or Euler or De Moivre) formula for the nth Fibonnaci number in terms of :

Now consider

Since and so

recalling that the defining equation for is ² = + 1 we have (1 − ) = −1 and so

and we are done.

That Markov Spectrum

The second part of Hurwitz’s result showed that an essential boundary had been reached with, for example, being incapable of being approximated to a finer tolerance; from this came the cascade of boundary irrational numbers and from them came the mysterious spectrum of numbers (and those equivalent to them) investigated by Markov. This list began

We know from an earlier result that the continued fraction form of these quadratic irrationals will be recurring; let us observe these first four:

Table 8.5.

and a pattern is discerned which is absent in their surd form: periodic they certainly are, but it is this special nature of the periodicity that attracts the eye. Table 8.5 lists these and a few more in their continued fraction form, together with the associated boundary numbers of each: for brevity we adopt the standard convention that, for example, [0; 1, 1, 1, 1, 2, 2, 1, 1, 1, 1, 2, 2, 1, 1, 1, 12, 2, . . . ] = .

With this we are closer to the mystery of the Markov numbers but revealing it fully remains beyond our reach. We can, though, make one further incursion into the theory. The boundary numbers were not isolated but grouped into equivalence classes with the equivalence relation:2

α and α′ are equivalent if there are integers such that α′ = (aα + b)/(cα + d), where |ad − bc| = 1.

For example, the following are equivalent:

It takes but a second to notice that the last two expansions look to be identical with the first after the first few terms. In fact, the continued fraction statement is

α ~ α′ if and only if their continued fraction expansions are identical from some point on

and, in addition,

a/c and b/d are successive convergents in the continued expansion expression for α′ provided that α > 1 and c > d > 0.

For example, the list of convergents in the last case begins

There is more − much more − to expose in the study of irrational numbers through the use of continued fractions, but we hope that the reader will have gained some feeling for this most fruitful interaction. In the next chapter we will move to quite a different aspect of irrationality − for which continued fractions show a clear limitation.

¹The result is susceptible to proof by induction.

²See Appendix E on page 289.