appendix D

The Solution of Transcendental and Polynomial Equations

1 INTRODUCTION

In applied mathematics the need frequently arises for solving numerically transcendental or higher-degree algebraic equations; for example, in determining the natural frequencies of oscillation of a uniform prismatic bar built in at one end and with the other end supported, it is necessary to solve the transcendental equation

images

Transcendental equations occur very frequently in determining the natural frequencies and modes of oscillation of electrical and mechanical systems. Higher-degree polynomial equations also arise frequently in practice. Algebraic formulas exist for the solution of the general quadratic, cubic, and quartic equations with literal coefficients. However, no formulas exist for the solution of a general algebraic equation with literal coefficients if it is of higher degree than the fourth. The formulas for the cubic and quartic equations are sometimes laborious to apply to given cases. The importance of being able to solve numerically equations of this type is very great, and this appendix will be devoted to a brief discussion of possible methods of solution.

2 GRAPHICAL SOLUTION OF TRANSCENDENTAL EQUATIONS

As an example, let us solve the transcendental equation

images

This equation occurs in determining the natural frequencies of oscillation of a clamped cantilever beam.

Equation (2.1) is satisfied by an infinite number of values of x. Let us write Eq. (2.1) in the form

images

If we plot the curves

images

and

images

the roots of (2.1) are given by the abscissas of the points of intersection of these two curves as shown in Fig. 2.1.

From the figure, we find the first three roots to be approximately

images

Since

images

the higher roots are given with satisfactory accuracy by the equation

images

images

Fig. 2.1

This example illustrates the general principle involved in the graphical solution of transcendental equations. That is, if we wish to solve the equation

images

we write it in the form

images

This may usually be done in many ways. We then draw the curves

images

The real roots of F(x) = 0 are evidently the abscissas of the points of intersection of these curves. The larger the scale of the graph and the more carefully the drawing is performed, the greater the accuracy of the roots. Having once found the approximate location of the roots, the accuracy may be improved by an iterative process called the Newton-Raphson method.

3 THE NEWTON-RAPHSON METHOD

Let us consider the function

images

Let us draw this curve as in Fig. 3.1. The point x = A is a root of the equation

images

Let us draw the tangent to the curve at the point P. This tangent will intersect the x axis at a point x1.

From the figure we have

images

Hence

images

images

Fig. 3.1

If we now set up the sequence

images

it is apparent from the figure that this sequence tends to the root A. If we start on the other side of A where the arc of the curve is convex to the x axis, the first step carries us to the opposite side of A where the arc is concave to the x axis; after this the sequence tends to the root as before.

This discussion is based on the following two assumptions:

1.    That the slope of the curve does not become zero along the arc Q1, P.

2.    That the curve has no inflection point along Q1, P.

More precisely, we can say that, if F(x) has only one root between two points Q1 and P while F′(x) and (x) are never zero between these two points, then the Newton-Raphson process will succeed if we begin it at one of the points for which F′(x) and (x) have the same sign.

It is sometimes more convenient to use the formula

images

instead of (3.5).

This means that in the successive steps of the process we replace the tangents calculated at x1, x2, etc., by lines parallel to the tangent at P. This saves the trouble of calculating F′(xr) at each stage.

As an example of the method, let it be required to determine the solution of the equation

images

To obtain a rough estimate of the root, we draw the curves

images

as in Fig. 3.2.

images

Fig. 3.2

From the graph, we obtain

images

as a rough estimate of the root. With this value for x0, we begin the iterative process. Here we have

images

images

The first approximation is

images

Now

images

images

Hence

images

This is a very good approximation to the root. If more significant figures are desired, the iterative process of (3.6) may be repeated.

4 SOLUTION OF CUBIC EQUATIONS

In the study of the natural frequencies of undamped electrical and mechanical systems with three degrees of freedom, we have to determine the solution of the equation

images

We may eliminate the Z2 term by the substitution

images

We then obtain

images

This may be written in the form

images

There are two principal cases to consider.

a THE CASE WHERE 27r2 > 4q3

In this case the cubic has one real root and two complex roots. Then, if q and r are both positive, we find ϕ such that

images

Then the real root is given by the equation

images

Dividing (4.4) by xx0, we reduce the equation to a quadratic and obtain the pair of complex roots by solving the resulting quadratic.

If q is negative and r is positive, we find ϕ such that

images

Then the real root is given by the equation

images

It may be noted that we may always suppose that r is positive since if we change the sign of r we merely change the sign of the roots.

b THE CASE WHERE 27r2 < 4q3

In this case the cubic equation has three real roots. We now find the smallest positive angle ϕ such that

images

Then the real roots are given by

images

For example, let it be required to solve

images

In this case we remove the Z2 term by writing

images

and the equation becomes

images

We now have

images

Hence this equation comes under case b, and the roots are all real. We now compute

images

and, substituting into (4.10), we obtain

images

Hence the roots of (4.11) are

images

5 GRAEFFE’S ROOT-SQUARING METHOD

In the last section we considered some algebraic formulas for the solution of the cubic equation. There also exists a formula solution for the quartic equation. These formulas are, in general, laborious to use in numerical computations. No formulas exist for the solution of a general algebraic equation with literal coefficients if it is of higher degree than the fourth.

In this section we shall discuss a method for the numerical solution of an algebraic equation of any degree. Before considering this method, it is well to recall the following properties concerning the nature of algebraic equations:

1. The equation

images

where the coefficients ar are real numbers, has exactly n roots. Some of the roots may be repeated roots.

2. If n is a positive odd integer, the equation always has one real root.

3. The number of positive roots either is equal to the number of variations of signs of the a’s or is less than this number of variations by an even integer (Descartes’ rule of signs).

4. The complex roots occur in conjugate complex pairs.

The method we shall consider was suggested by Dandelin in 1826 and independently by Graeffe in 1837. It is of great use, especially in the case of equations possessing complex roots. The fundamental principle of the method is to form a new equation whose roots are some high power of the roots of the given equation. That is, if the roots of the original equation are images the roots of the new equation are x1, x2, . . . , xn., If s is large, then the high powers of the roots will be widely separated. If the roots are very widely separated, they may be obtained by a simple process.

Let the roots of Eq. (5.1) be – r1, – r2, . . . , – rn. These values are the roots of the equation with the signs reversed and are called the Encke roots. (We shall assume at present that they are real and unequal.) Since the r quantities are the roots of the equation with the signs reversed, it may be factored in the form

images

If we use the convenient notation

images

then on multiplying the various factors of (5.2), we obtain

images

SQUARING THE ROOTS

A simple device by which one may obtain a new equation whose Encke roots are the squares of the Encke roots of the original equation will now be explained. Let us write

images

Then

images

and

images

If in (5.7) we let

images

we obtain

images

Now the roots of (5.9) are images and hence the Encke roots are images and hence they are the squares of the Encke roots of (5.5). If we now write F(x) in the form

images

then

images

Carrying out the multiplication and writing

images

we obtain

images

This may be written in the form

images

We notice the coefficients of (5.9a) are found from the coefficients of the original equation (5.6a) by the following simple rule:

The coefficient of any power of y is formed by adding to the square of the corresponding coefficient in the original equation the doubled product of every pair of coefficients which stand equally far from it on either side. These products are taken with signs alternately negative and positive.

If a power of x is absent, then it is taken with a coefficient equal to zero. To facilitate the process, a table is constructed in the following manner:

images

The b’s are the coefficients of the equation whose Encke roots are the squares of the Encke roots of the original equation.

If we now repeat this process several times, we finally arrive at an equation whose Encke roots are the mth powers of the Encke roots of the original equation. This equation has the form

images

If now

images

then

images

That is, if the roots differ in magnitude in the manner (5.12), then the mth powers of the roots, where m is a large number, are widely separated. Hence

images

for a sufficiently large m. We also have

images

Hence from (5.14) we obtain

images

and from (5.15) we have

images

Equation (5.16) determines the absolute value of the largest root r1, and Eq. (5.17) determines the absolute value of the second largest root r2, and so on.

In the solution of equations by this method it is very necessary to know when to stop the root-squaring process. The time to stop is when another doubling of m produces new coefficients images that are practically the squares of the corresponding coefficients images in the equation already obtained. To illustrate the general theory, let us consider the solution of the equation

images

which has three real roots.

We construct the following table:

images

In this table p2 denotes the equation whose Encke roots are the squares of the Encke roots of p, etc.

If we stop at this stage, we have

images

images

and hence

images

This is the magnitude of the numerically greatest root. We also have

images

Hence

images

Therefore

images

Finally, we have

images

and

images

and therefore

images

It is not possible to determine the signs of the roots by this process. Descartes’ rule of signs shows that all the roots are negative, and they are therefore

images

COMPLEX ROOTS

We shall now discuss briefly the modifications which must be introduced in the above procedure in case the equation under consideration has complex roots. To illustrate the general procedure, let the equation under consideration be of the fifth degree, and let it have the following Encke roots:

images

where

images

We carry out the root-squaring process as before and obtain an equation whose Encke roots are the mth powers of the Encke roots of the original equation. For a sufficiently large m we have

images

as before, since by hypothesis r1 is the dominant root numerically. Hence this root may be determined as before. We now have

images

If we let

images

we then have

images

This shows that the coefficient of the x3 term will fluctuate in sign as the number m takes in succession a set of increasing values because of the cosine term. We also have

images

Proceeding in this manner, we see that when m is large enough so that only the dominant part of the coefficient of each power of x is retained the equation whose Encke roots are the mth powers of the roots of the original equation is

images

From this we may find r1, r2, r3, and R. To obtain the angle of the complex root, we write

images

But we know that the sum of the Encke roots of the original equation is given by

images

Hence

images

or

images

In the above example we saw that the fluctuation of the coefficient of the x3 term indicated the presence of a pair of complex roots. If two coefficients fluctuate in sign, the presence of two pairs of complex roots may be inferred and the analysis modified accordingly. Rather than consider any fixed rules, it is well to consider the general nature of the root-squaring process in solving equations by this method.

THE CASE OF REPEATED ROOTS

The nature of the process in case the equation has repeated roots may be illustrated by the consideration of a special case. As before, let the Encke roots of the equation be denoted by r1, r2, r3, . . . , rm. Let the root r2 be equal to the root r3,

images

That is, the equation has the repeated root r2. In this case again the equation whose Encke roots are the mth powers of those of the given equation is

images

If m is sufficiently large so that we may retain only the dominant term in each coefficient, we have

images

We notice that the coefficient of the term xn–2 does not follow the usual law that when m is doubled the coefficient is approximately squared. In this case, when m is doubled, the new coefficient is approximately half the square of the old one. This is an indication of a repeated root. To compute r2, let

images

If we divide b3 by b1, we obtain

images

and hence

images

The rest of the roots are computed as before. The foregoing discussion is the general theory of Graeffe’s root-squaring method. In general, it is better to keep the basic concept of the method of root squaring in mind rather than to formulate elaborate rules for special cases.

PROBLEMS

1. Determine the roots of the equation tanh x = tan x.

2. Determine the roots of the equation tan x = x.

3. Find the roots of ex = 5x.

4. Solve the equation x3 – 2x – 1 = 0.

5. Solve the equation x3 – 97 x – 202 = 0.

6. By using Graeffe’s method, solve the equation x2 – 2x + 2 = 0.

7. Solve the equation x3 + x2 + x + 1 = 0.

8. Find the roots of the equation x3 – 5x2 + 6x – 1 = 0:

(a) By the formula for the solution of a cubic equation.

(b) By Graeffe’s method.

9. Solve tan x = 10/x.

10. The equation x3 – 2x – 5 = 0 has a real root between 2 and 3. Determine the magnitude of this root by the use of the Newton-Raphson method.

11. Compute to four decimal places the real root of the equation x2 + 4 sin x = 0 by the use of the Newton-Raphson method.

12. Determine the roots of the equation x tan x = 10 to three decimal places.

13. Find the smallest positive root of the equation xx + 2x = 6 to three decimal places.

REFERENCES

1918. Burnside, R., and F. Panton: “Theory of Equations,” 8th ed., vol. 1, Hodges & Figgis & Co., Dublin.

1924. Whittaker, E. T., and G. Robinson: “The Calculus of Observations,” Blackie & Son, Ltd., Glasgow.

1934. Cowley, W. L.: “Advanced Practical Mathematics,” Pitman Publishing Corporation, New York.

1936. Doherty, R. E., and G. Keller: “Mathematics of Modern Engineering,” John Wiley & Sons, Inc., New York.

1949. Milne, W. E.: “Numerical Calculus,” Princeton University Press, Princeton, N.J.

1950. Scarborough, J. B.: “Numerical Mathematical Analysis,” Oxford University Press, New York.

1952. Hartree, D. R.: “Numerical Analysis,” Oxford University Press, New York.

1954. Soroka, W. W.: “Analog Methods in Computation and Simulation,” McGraw-Hill Book Company, New York.

See, for example, E. R. Dickson, “Elementary Theory of Equations,” p. 31, John Wiley & Sons, Inc., New York, 1922.