Methods of Mathematics Applied to Calculus, Probability, and Statistics

14 The Logarithm Function

14.1 INTRODUCTION—A NEW FUNCTION

We have developed the calculus in terms of the simple algebraic functions, and this has limited the class of problems you can do. Part III of the text is devoted to extending the range of functions you can handle, and consequently the range of problems you can do. The main ideas and applications will be repeated in terms of the new functions. Thus there is a great deal of repetition of ideas in this part, the spiral reinforcement of what you have just learned. It is often said that you learn each mathematics course when you use it regularly in the next course. We deliberately and regularly use old ideas, at first reminding you where they came from but gradually assuming that you know them.

We also introduce some new mathematical ideas. The first one is how to study a situation for which you have no already prepared solution. For example, in Part II you had the indefinite integral

for all n ≠ – 1, but when n = – 1 this formula cannot apply (note the indicated division by 0). This integral

is a new function.

The integrand does not exist for x = 0, so it is convenient to start the definite integral at x = 1. We will call this the In function (pronounced “the ell en function,” which will soon turn out to be a logarithm function). It is defined by

It is often convenient to write this as

when no confusion can arise (it amounts to dropping the constant of integration C).

We know from the existence of the integral (Section 11.5) that since 1/x is continuous for x ≠ 0, the ln x function exists. Are you sure you cannot find some function whose derivative is 1/x? It is evident, after some trial differentiations, that fractional exponents always remain fractional exponents, so you are confined to integer exponents. After some trials, and finding that you cannot find such a function, you are naturally driven to the idea that there is no rational function whose derivative is 1/x, that the ln x is not a rational function in x. In mathematical symbols, this is saying

where N(x) and D(x) are polynomials of some degrees and the fraction is in lowest terms (meaning no common factors in both the numerator and denominator). You wish to show the nonequality is true, that there can be no two polynomials N(x) and D(x) with the required property. As for most impossibility theorems, the proof is by contradiction.

You may recall from Section 4.6 that polynomials are very much like the integers, and this reminds you that you have seen a similar sort of impossibility problem for the in the example of Section 3.4. There you had

as the impossibility; here you have

What did you do there? You first assumed that there was such an equality, and then squared both sides to get rid of the operation of square root; this suggests that you assume the equality and then differentiate both sides to get rid of the integration. The result is

Continuing to model this proof on the earlier one, you remove the fractions

Next was the divisibility argument. Clearly, D(x) is divisible by x; indeed it may be divisible by x several times. Let k be the highest power of x that divides D(x); that is, write

D₀(0) ≠ 0 says that there is no factor x in D₀(x). The result is

Now x^k can be divided out to get

Here each term except the last one is divisible by x, so the last one must be, too. But D₀(x) is not divisible by x by its definition; therefore, N(x) must be divisible by x, the same sort of contradiction as before. You assumed that the original expression is in lowest form, and then you found that there is a common factor! Hence the ln x is not a rational function. The proof is a good example of the extension of a method of proof to a similar situation, once you have seen some of the similarities between rational numbers and rational functions.

The function ln x will turn out to be the logarithm of x to the base e = 2.7182818284 … , which is a transcendental number. Hence now is the time for a brief review of the topic of logarithms.

If y is the logarithm of x to the base b, we write this as

By definition (recall that a logarithm is the power to which the base must be raised to give the original number), it follows that

Let us take the log to the base a of this equation. We get

and, since by definition y = log_bx, we have

This says, any system of logs is proportional to any other system. If you have a table of logs to any base (the common logs use base 10), then dividing these entries in the table by the value (from the same table) of the log of the new base, you get the corresponding table in the new base.

If you set x = a in the above equation, you get the useful relationship

A pair of numbers that are needed in converting from base 10 to base e and back are

The three relationships you need involving logs are (for any base)

In the last equation, y is any real number. In general, when taking logs, you must start with positive numbers, since the logs of negative numbers are not defined in terms of real numbers. The base is conventionally a positive number greater than 1.

Remember that these are properties of logs, and that we have not yet shown that the integral

is a logarithm, let alone the actual base involved.

14.2* ln x IS NOT AN ALGEBRAIC FUNCTION

It is slightly less easy to prove that ln x is not an algebraic function. In mathematical terms, this statement means that there is no polynomial in x and y of the form

P(x, y) = 0

such that

is a solution. The conics of Chapter 6 were algebraic functions where the polynomial was of degree 2. The above assertion is that there can be no polynomial of any finite degree for which y = ln x is a solution. In more detailed mathematical notation this statement is as follows: There can be no polynomial in y with polynomial coefficients in x for which

for all x in some interval [where the P_k(x) are polynomials in x of arbitrary degrees]. The proof is, as usual, by contradiction. We assume that there is such an equation, and that the degree n is the lowest that is possible in y = ln x. There cannot be more than one polynomial of lowest degree that has the ln x as a solution, since if there were two then we could eliminate the highest power of ln x (by the multiplication of each equation by the coefficient of the highest power of the other followed by a subtraction of the two equations). As a result we would have a still lower degree polynomial (degree in ln x of course), which is not possible since we assumed that we had the lowest.

Using the calculus, we can differentiate this equation. Remember that by the fundamental theorem of the calculus, Section 11.6,

We obtain another equation

of the same degree in ln x. Next, by using suitable multipliers and a subtraction, we could eliminate the highest power of ln x to get a lower-degree polynomial and hence a contradiction, provided the resulting equation is not identically zero! We must take care of this possibility. This suggests an alternative approach to the theorem; we should divide through by P_n(x) before the differentiation. You then get, after differentiation,

The coefficient of the highest power of ln x is

and is exactly what we just showed (Section 14.1) could not be identically zero. Since the derived polynomial for which ln x is a solution is of lower degree than the one you assumed was of lowest degree, you have a contradiction.

Thus the function ln x is definitely something new, and it is not an algebraic expression in some disguise. We have shown that not only is ln x linearly independent of the powers of x, but that any finite number of integer powers of In Jt are linearly independent, using polynomials in x as coefficients of the powers of ln x. This will be useful later in the book.

EXERCISES 14.2

1. Carry out the first indicated proof in this section.

2. Prove that In x ≠ N(x)/D(x) using an argument based on the degrees of N(x) and D(x).

14.3 PROPERTIES OF THE FUNCTION ln x

We will investigate this new function using the techniques that have been developed in a field called functional analysis. First, you got rid of the arbitrary constant of the indefinite integral by taking the definite integral form with a fixed lower limit. You did not pick 0 as the lower limit, since at x = 0 the integrand 1/x does not exist. The next natural choice is at x = 1. You have, therefore, the definition (again)

where it is convenient to use the dummy variable t to avoid confusion with the upper limit of x. Suppose you replace t by a new variable

for any real n ≠ 0, 1. The integral becomes

Hence we have the functional equation

(Normal equations define sets of numbers; functional equations define whole functions.)

Inspection of the functional equation shows that it has the solution

for any k ≠ 0.

Are there any other solutions to this functional equation? Suppose that g (x) is any solution; hence

and consider the difference

For any x₀ ≠ 1, set

Then at x = x₀ the left-hand side is zero; hence

For any real z > 0, there is an n such that

namely,

Thus for all real z > 0 we have shown that

and the solution is essentially unique (within a constant k). The constant k determines the base of the system of logarithms.

The next step is, therefore, to determine the base. What is it that you know? The main property of logs that you know which involves the base b is

If you integrate numerically, using Simpson’s formula (Section 11.11) from 1 to 3 by double steps of , you can get an estimate of where the area under the curve 1/x from 1 to x has the value 1. At this point, x will equal the logarithm base. Table 14.3-1 (see also Figure 14.3-1) shows that the base, commonly called e, is approximately

e = 2.718

TABLE 14.3-1

images

A much more accurate value is

e = 2.71828 18284 59045 …

We will later investigate this number e more closely. It is conventional in mathematics to write logs to the base e as ln x. Logs to the base e are called natural logarithms. They are also called Naperian logs after John Napier (1550–1617), who discovered logarithms, although his were not to the base e in spite of what you may read in some books. We have already used the notation log_bx for other bases, with log x meaning base 10.

Figure 14.3-1 Estimate of e

EXERCISES 14.3

Write the value of the following in terms of In 2 and In 3:

1. ln 9

2. ln 12

3. ln (3³)

4. ln

5. (ln )²

6. ln 24

7. 3 ln

8. ln (3/2)³

9. 4 ln 1/4

10.

What are the following?

11. ln₂ 4

12. ln₄ 8

13. (ln )²

14. ln₁₀ 100

15. ln_a

14.4 AN ALTERNATIVE DERIVATION—COMPOUND INTEREST

Since we have regularly preached going back to basics, how would the direct derivation of the derivative of ln x go? We apply the four-step rule (Section 7.5) to the log_a x function. At the third step we have

images

We need to examine the limit of

as x → 0. Set x/x = t. The limit is now the same as

as t → ∞. We could instead set t = 1/u and have the limit

as u → 0. The three limits are clearly equivalent. From the following table of numbers, it seems as if the limit is the same number e = 2.71828 … as before.

images

Thus we again have

For a = e, this becomes

The three limits occur frequently.

Example 14.4-1

Compound interest. This limit e occurs in compound interest problems. Suppose you have an interest rate r of 6%, 12%, or 18% as given in the headings of the columns of Table 14.4-2. The left-hand column has the number of compounding periods. The first line gives the growth per year on 1 unit of money (or other quantity such as biological size). The entry in the body of the table is simply the total amount, interest plus original amount for one period. The second line is for compounding each quarter, the rate per quarter is one-quarter as much, but you get the interest four times plus the interest on the interest. The formula is, for rate r,

The first two terms account for the principal and the original simple interest, and the later terms are the interest on the interest due to the compounding.

Next, suppose it is compounded monthly. The effective rate is r/12 per month, and it is compounded 12 times. Finally, consider compounding daily. The daily rate is r/365, and it is done 365 times.

TABLE 14.4-2

images

We see that the numbers approach limits, and we see that well before we are at 365 we are close to the position where we could use the number e. We have, corresponding to (14.4-2), the limit

Using this, estimates are written at the bottom of the columns of the table. These values correspond to continuous compounding.

This is another example of the fact that, although the phenomenon you are studying may be discrete (a finite number of fixed sized events), nevertheless the continuous model of the calculus provides a very convenient tool for getting good approximations.

EXERCISES 14.4

1. Show that 1/x dx = 2 ln 10.

2. Show that, for any function f(x) and Hint: Use the reciprocal substitution.

3. Compute the estimate in Table 14.4-2 for 8% interest.

4. Find a formula for the doubling period at any given rate of compounding.

5. Make a table of the doubling periods and compare with the following classic rule: Divide 72 by the rate (expressed as a percent) to get the number of years to double.

14.5 FORMAL DIFFERENTIATION AND INTEGRATION INVOLVING ln x

We need to look at the differentiation and integration of this new function. As the first few examples show, there are often ways of simplifying the expression before differentiation.

Example 14.5-1

Compute

images

Find dy/dx where

Take the In of both sides before differentiating (and use the function of a function idea):

We see that each term is to be differentiated separately, and this makes things much easier.

It is important to note that if x is replaced by –x in the integral, then the dx is replaced by – dx, and the quantity

does not change! Thus in truth

We simply cannot tell which sign to use when we are given the derivative. Usually, the problem indicates which sign to use, but as protection we must remember that all we get is the absolute size, not the algebraic size.

Example 14.5-2

Differentiate

y = ln(1 + x²)

We get immediately (U = 1 + x²; note that we have shifted from the lower case u to the upper case U)

Example 14.5-3

Differentiate

This gives

Example 14.5-4

Differentiate

This gives

Example 14.5-5

Integrate

If we think of the denominator as U = 1 + x, then the dU = dx, and we have (1/U) dU to integrate. The result is

Example 14.5-6

Integrate

This time we pick U = 1 + x² and need dU = 2x dx. We lack a factor 2, so we supply it inside the integral and compensate by a factor of in front. Thus

Differentiation will verify it easily.

Example 14.5-7

Integrate

Here we appear to be in trouble since there is no x in the numerator for the derivative of the denominator. After some thought, we notice that a² – x² = (a – x)(a + x), and that maybe the term came from the sum of two terms of the form

We do not know the actual numerators, so we apply the method of undetermined coefficients and write

with A and B the undetermined coefficients. Cleared of fractions,

From the linear independence of the powers of x, we may equate like powers of x. We get form the x terms

A = B

Then from the constant terms we get

2aA = 1

Hence

It is easy to check this result by adding the two terms together. The integral is, therefore,

This can be verified by differentiating.

Example 14.5-8

Integrate

Set U = ln x, and we have the form U dU, which leads to U²/2:

Example 14.5-9

Integrate

The ln x in the denominator looks difficult to handle. How could it have gotten there from a differentiation? Well, a log does that, so we finally think of

ln |ln x| + C

as the answer. Differentiation checks our guess.

Example 14.5-10.

Integrate

This lacks a 3 in the numerator of being just right, dU/U. Thus the integral must be

which is easily verified by differentiation.

EXERCISES 14.5

Differentiate the following on sight:

1. y =

2. y = x² ln x

3. y = ln (x³ – 3x)

4. y = xⁿ x xⁿ/n

Integrate the following:

5. ∫ 1/(1 + x) dx

6. ∫ 1/(a + bx) dx

7. ∫ 1/(a + bx)² dx

8. ∫ x/(1 + x²) dx

9. ∫ x/(a² + x²) dx

10. ∫ x/(a² + x²)² dx

11. ∫ (ln x)²/x dx = |ln x|³/3 + C

12. ∫ (ln₃ x)/x dx

13. ∫ (ln_a x)/x dx

14.

15. ∫ (1/x)(1 + ln x)dx

16. ∫ 1/{x (1 + ln x)}dx

14.6 APPLICATIONS

The curve y = ln x rises smoothly but increasingly slowly as you go from x = 1 (y = 0) to the right (Figure 14.6-1). Each time you go a distance e = 2.71828 … times as far out, you rise by another unit; the curve becomes very flat, but continues to rise indefinitely (since y′ = 1/x > 0 for x > 0). There is no upper bound of the ln x function. The second derivative is –1/x² and is always negative, so the curve is always concave downward. Since ln 1/x = –ln x, the values going to the left of x = 1 head toward minus infinity as x → 0.

Figure 14.6-1 The ln x function

Example 14.6-1

Graph the function

(which occurs frequently). To study it near the origin where one factor is 0 and the other is arbitrarily large, we compute Table 14.6-1.

TABLE 14.6.1

x	x ln x
0.1	−0.23026
0.01	−0.04605
0.001	−0.00691
0.0001	−0.00092

We see why we think that the limit is zero. We will later prove in a more mathematical setting (Section 15.5) that the limit is zero. At the origin, the zero from the term x dominates the infinity from the term ln x. Thus the function

y = x ln x

has a pair of zeros (Figure 14.6-2), one at x = 0 and the other at x = 1, and between them the function is negative. Where does the minimum occur? Differentiate, set equal to zero, and solve.

Figure 14.6-2 y = x ln x

ln x = –1

Hence the minimum occurs at x = 1/e. At this point

Since y″ = 1/x > 0, this indicates that the curve is always (for x > 0) convex upward.

Example 14.6-2

The entropy function, which is defined for any discrete probability distribution p_i is given by the expression

where ∑ p_i = 1 and P represents the probability distribution, or if you wish the random variable. This function occurs in many places, particularly in information and coding theory; and as is customary there, we have taken 2 as the base of the logarithm system (also to give you practice in other bases than e and 10). In words, the entropy of a distribution is the probability-weighted average of the negative logarithms of the probabilities of the distribution.

It is natural to ask for the set of probabilities p_t for which the entropy function H_n(P) is maximum. By a study of the way each individual term of the function behaves (see Figure 14.6-2), and recognizing that since ∑ p_i = 1, then any increase in one term must decrease some other term, you soon come to the conclusion that the symmetric case must give the maximum, all the

Alternatively, you can eliminate one of the probabilities, say p_n, by using the relation

You get for H₂(P)

and when you differentiate with respect to p_i (for each i = 1, … , n – 1), you get for each derivative equated to zero [recall (14.4-4) and write 1 – p₁ – p₂ – … – p_n–1 = p_n after the differentiation]

Thus

and each p_i = p_n (because the log function is monotone and therefore there is only one solution for a given value of the function). Hence, since the sum of all the p_i is 1, it follows that each p_i = 1/n. What is the maximum value? Putp_i = 1/n in each term and you get n terms, each of which is

so the sum of all n terms gives the maximum

Example 14.6-3

The Gibbs inequality (used extensively in information theory and probability). Given two discrete probability distributions

show that

To prove this we need a simple lemma.

Lemma. For all x >0

and the equality holds only at x = 1.

The proof is easy; we simply fit the tangent line to the curve y = ln x at the point x = 1 (Figure 14.6-1) and note that the curve is below the tangent line. The derivative of ln x is 1/x, and at x = 1 this is 1. The tangent line, therefore, is

y – 0 = 1(x − 1)

The second derivative of ln x is –1/x², which is always negative, and this leads to the statement of the lemma (14.6-6).

To prove the main result, we simply apply this lemma to each term of the sum (14.6-5):

For the equality condition to hold, it must hold for each term of the sum; that is, each q_i = pi for all i. This is known as the Gibbs (1839–1903) inequality. The inequality applies for any base b > 1.

EXERCISES 14.6

1. Sketch H₂(P) = p log₂ (1/p) + (1 – p) log₂ 1/(1 – p) when the distribution has only two probabilities, p and 1 – p. Note carefully the slope at p = 0 and p = 1.

2. From the Gibbs inequality, show directly that a maximum of the entropy function is ln n. Hint: Set q_i = 1/n for all i.

3. Prove from Exercise 2 that the maximum entropy occurs when all the p_i are equal.

14.7 INTEGRATION BY PARTS

One of the most useful tools for integration is the method of integration by parts. It is easily derived from the formula for the product of two functions (in differential form)

d(uv) = u dv + v du

We merely take the indefinite integral of both sides (and neglect the constant C of integration). We get

or in the standard form (where we shift to the conventional uppercase letters)

The lost constant of integration can be associated with the second integral if you are worried about it.

The reason that the method is so important is that when you see an awkward function under the integral sign then calling it U means that you will later see it only as the derivative (which is often simpler than the function!). We will illustrate this observation by a series of examples.

Example 14.7-1.

Consider the integral

At first you are probably stunned by the appearance of the In x. You probably do not know an integral of it. But the above remark suggests that you can get rid of the ln x if you pick U = ln x and the rest as the dV. You have

(It is usually convenient to pick the constant of integrating C when integration dV as 0, as we have done, but occasionally other choices are useful.) Using the formula for integration by parts (14.7-1), you have for the given integral

images

You can verify this easily by differentiating the answer. You get

which is indeed the integrand.

Example 14.7-2

Next consider the integral

How would you proceed? Pick U as x ln x? That would still leave a ln x after the differentiation to get the dU. This could be done by another application of integration by parts, but perhaps

images

would be better. So you try it. You get

and the second integral can now be done. You get, finally,

which you then verify by differentiating back to check that the integrand comes out.

Generalization 14.7-3

Generalize the previous two examples. One possible generalization is

To get rid of the unwanted ln x function, you pick it as

U = ln x

That leaves the rest as the dV:

You have

and therefore

Again you verify by direct differentiation. With this simple check there is no excuse for giving a wrong answer; the only possible excuse is that you could not get started on the problem correctly! Note that this generalization provides a check on the two previous examples, n = 0 and n = 1. The formula is true for all real n ≠ – 1.

Example 14.7-4

Consider

Obviously, you pick U = (ln x)² = ln² x, and dV = dx. You get

The x terms cancel in the second integrand, and you have

You are now reduced to an earlier case, Example 14.7-1, which will yield to another integration by parts. Thus you have, finally,

To check the answer, you differentiate the result. You get from differentiating the right-hand side

and all the terms cancel, as they should, except the first one. This kind of cancellation is very typical of results obtained by repeated integration by parts.

Example 14.7-5

An example that usually bothers the student the first time it occurs is

This is the missing case n = –1 in Generalization 14.7-3. There are many ways of doing it. There is the direct way used in Section 14.5 (not integration by parts)

which gives, as before,

If you try integration by parts, you get

images

Notice the advantage of the fixed pattern for writing the terms you are going to use. The use of the same technique every time frees the mind for more important things. When you do the integration by parts, you get

The integral on the right end is the same as the original integral! A moment’s thought shows that you can transpose it to the left-hand side, divide through by 2 (and remember to attach the constant C) to get

as you should.

There are often several ways of doing the same integral. We have apparently neglected to handle the absolute values in doing the integration; this leaves you a problem to discuss for your own peace of mind. The results must, of course, be the same within an additive constant C. Note that ln 6x + C and ln x + C are the same (although the two values of the constant C differ by the number ln 6).

EXERCISES 14.7

Integrate the following:

1. ∫ x³ ln x dx

2. ∫ P(x) ln x dx, where P(x) is an arbitrary polynomial. Hint: write P(x) = Σ^N₀ c_kx^k.

3. ∫ lnⁿ x dx

4. Carry out one stage of the general case J(m,n) = ∫ x^m lnⁿ x dx to get a reduction formula lowering the degree of n to n – 1. J(m, n) = x^m⁺¹(lnⁿ x)/(m + 1) – n/(m + 1)J(m, n – 1).

5. Discuss why any polynomial in x and ln x can be integrated in principle.

6. In the above exercises, when can the implied condition of integer exponents be relaxed?

7. Why do we not use the derivative of a quotient formula as the basis for two corresponding integration formulas?

14.8* THE DISTRIBUTION OF NUMBERS

The widely used scientific notation for representing numbers places the decimal point after the first nonzero digit and follows the number by an appropriate power of 10; for example

The velocity of light is approximately 299,792.5 kilometers per second, and is written as

c = 2.997 925 × 10⁵

Likewise 1/1000 is written as

1 × 10^–3

The number part is often carelessly called the mantissa in analogy with the mantissa of logarithms.

You might think that these mantissas are uniformly distributed between 1 and 10, but simple experiments do not bear this out. Rather, for most (but not all) sources of numbers the probability density is close to

where b is the number base, in the usual case 10, but in computers it may be 2, 8, or 16. This distribution is often called the reciprocal distribution because p (x) is proportional to the reciprocal of x. The cumulative probability distribution for the random variable X is (1 ≤ x < b), P(X ≤ x) = P(x)

In computers it is usual to place the decimal point before the first digit, thus making the exponent 1 greater. The difference is a trivial nuisance, but it is probably too late to get the computer people to reform. In any case, the density distribution is essentially the same, (ln fax)/ln fa. We will use the conventional scientific notation and the interval (sample space) 1 ≤ x < 10.

It is easy to verify that the total probability, using base 10, is

The cumulative distribution for the random variable X is P(x) = ln x/ln 10 = logio x. What is the probability of a random number beginning with a given decimal digit? For the digit N, it is the difference

As an experimental test of the distribution, we took the 50 physical constants listed in a standard table and counted the number of entries beginning with each decimal digit. The result is given in Table 14.8-1,

TABLE 14.8-1

images

which seems to verify the model for physical constants. The distribution also applies to such widely diverse sources of numbers as the binomial coefficients, the factorials, the powers of any number whose logio is not a rational number, and so on.

The nonuniform distribution of the numbers has a number of consequences. For example, the effect of roundoff propagating through a sequence of multiplications depends, on the average, on the distribution of the leading digits, since if we have two numbers with corresponding small errors

and form their product, we get

x₁x₂ + x₁₂ + x₁₁ + ₁₂

Neglecting the last term as being too small to matter, we see that the expected value of the xi gives the effective multiplier of each propagated error for randomly chosen numbers.

Example 14.8-1

Find the expected value of a number from the reciprocal distribution. We have

which is a good deal less than the 5.5 that is given by the model of a uniform distribution.

EXERCISES 14.8

1. Make a table of the distribution of the first digit of the function 2ⁿ for n = 1, … ,25.

2. Same for y = 3ⁿ.

3. Same for y = 5ⁿ.

4. Compute the distributionof leading digits forthe base 8.

5. Compute the expected propagationcoefficient for base b and examine the results for the bases 2, 4, 8, 10, and 16.

6. Find the variance of the reciprocal distribution.

14.9 IMPROPER INTEGRALS

Up to this point we have insisted that the integral have two properties. First, that the range of integration, typically from a to b, be finite. Second, that in the range of integration the integrand be finite. Without these two conditions the derivations we made in connection with the development of the concept of the integral would not be valid.

Figure 14.9-1

However, frequently problems arise where one or the other (or both) of these conditions is violated. As an example of an infinite range, consider the area under the curve

from x = 1 to x = ∞ (see Figure 14.9-1). An example of an infinite integrand is the area below the curve

y = –ln x

above the x-axis and between x = 0 and x = 1 (see Figure 14.9-2). We treat both cases by the following very reasonable device. We ask, “What is the integral when the range is chosen to be slightly less?” and then we examine the answer as the range is increased to the limit, either (1) for infinite range or (2) to where the integrand becomes infinite.

Figure 14.9-2 y = —ln x

This process of handling the infinite by dropping back to the finite, working out the result, and then taking the limit is a basic approach, which we have already used many times. When in doubt, it is a sound rule: “Do the finite case, and then examine how it behaves as you approach infinity.” If the result appears strange, then you are in a position to see whether it is your intuition or an error that bothers you.

Example 14.9-1

We first consider the case of an infinite range. In this example we begin by considering the integral over the finite range

(Figure 14.9-1). When we get the answer in terms of N, we will then let N approach infinity. To do the integration, we use integration by parts with

images

The indefinite integral becomes

images

We have now to put in the limits 1 and N. We get the definite integral

We are now ready to let N approach infinity. The first term, – (In N)/N, requires attention. As before, the denominator N dominates the slower growing numerator In N, and the quotient goes to 0. The final answer is, therefore,

J(∞) = 1

The area under the curve y = (In x)/x² from x = 1 to infinity is exactly 1.

Example 14.9-2

An example of the infinite integrand is the area between

y = –ln x

the x-axis, and in the interval x = 0 to x = 1 (see Figure 14.9-2). We have to creep down to the origin, so we start with the integral (with a > 0):

We now let a approach 0. Again the a dominates the In a as a approaches 0. The final result is

J(0) = 1

There is one unit of area below the -In x curve and above the x-axis while lying between x = 0 and x = 1.

Example 14.9-3

Suppose the curve y = –ln x for (0 < x ≤ 1) is rotated about the y-axis to form a volume of rotation (see Figure 14.9-3). What is the volume? Again, there is an improper integral to evaluate. From the figure, the elements (of volume) we are summing are cylinders, each of volume

2πxy dx

Thus we have (and we eliminate, except in our minds, all the details of letting the lower value approach 0)

We get

for the volume of rotation. The direct substitution of the value 0 masks the careful process of examining the value obtained. This can at times lead to troubles, for example, when terms from two different places both cancel before putting in the limit. You should be prepared to follow the longer path of carefully filling in all the details in any doubtful situation.

images

Figure 14.9-3 Method of cylinders

Example 14.9-4

The same problem can be done using discs instead of cylinders (Figure 14.9-4). We have the element of volume

πx² dy

so we have the integral

We will change the variable of integration from dy to dx using the equation

The limits, of course, also change. The infinity for y becomes x = 0, and y = 0 becomes x = 1. We have, therefore (remember the minus sign),

images

as before.

images

Figure 14.9-4 Method of discs

Example 14.9-5

Sometimes (as in Example 14.9-4) a change of variable will eliminate an improper integral. Another example is, given the integral

the change of variable x = 1/y, dx = –1/y² dy will give

EXERCISES 14.9

Evaluate the following:

4. Find the area under H(P) = p ln 1/p + (1 – p) between 0 and 1.

5. Integrate

6. Eliminate the improper integral by the substitution x = 1/y.

14.10 SYSTEMATIC INTEGRATION

As you approach the problem of integration, it usually appears as a vast morass of special cases, some of which can be done by one trick, some by another, and some which cannot be done within the framework of a given set of functions. For example, the integral

cannot be done in closed form (a finite combination of elementary functions). On the other hand, the integral

But we have preached the necessity of the larger view, the uniformization of special tricks, the need for abstraction to master the sea of details, and the corresponding hopelessness of going down the path of special tricks for the endless number of special cases. It is necessary, therefore, to look a bit more abstractly at the problem of integration of functions involving the function ln x. In particular, what broad classes of integrals can be integrated in closed form? There will still be special isolated cases that require tricks to integrate, but at least we will know some of the general cases we can hope to handle.

From the Generalization 14.7-3 that

we have already done

where P(x) is any polynomial of degree n in x, can be integrated. We see that the form of the result is

If we apply the fundamental theorem of the calculus to this, we have (differentiate)

But we proved (Section 14.1) the important result that the ln x cannot be represented by a rational function (indeed we proved in Section 14.2* that the powers of ln x are linearly independent when the coefficients are polynomials). Therefore, we may equate all the terms in ln x to zero and the terms free of ln x to zero. We get

from which we get

images

We picked the lower limit in the first integral to be zero so that in the next integrand Q(x)/x is a polynomial.

Example 14.10-1

Integrate

We try the form

Upon differentiating both expressions, we get

Equating the coefficients, one by one, of the two In terms, we get

Now equating the terms without the ln function, we get, in sequence,

The integral is, therefore,

Upon differentiation, this easily checks.

This method of integration is a general one, but it is usually neglected in the standard textbooks. It is a powerful method worth remembering. First, you guess at the form (as you often do in the method of undetermined coefficients). Second, you apply the fundamental theorem of the calculus (differentiate both sides). Third, you equate the coefficients of the linearly independent functions to zero, thus getting as many conditions as there are unknowns in the form you assumed. As in the method of undetermined coefficients, if you guess the form wrong, you will find out that you are wrong and often get a clue as to where you were wrong. You can then either include more terms as suggested by the failure, or else set about proving that it cannot be done in the general form you are trying (see Sections 14.1 and 14.2*).

The approach depends on your understanding of what terms can possibly lead to the integrand you are given and what terms simply cannot be involved. Thus it should be clear that radicals when differentiated lead to radicals and, therefore, cannot be involved in a problem where the integrand is rational. You can fill in the details of the argument to convince yourself that if you try such a term it will end up with a zero coefficient (when you equate such linearly independent terms to zero).

Evidently, the method that assumes the form is often easier to carry out than the equivalent method of direct integration (using repeated integration by parts in this case).

The method is easily extended to

and then you see that the entire class

can be done, where now P(x, ln x) is the polynomial in two variables x and In x.

Why can these integrals be done? One partial explanation is that the ln x function arises from the integration of a rational function 1/x, and therefore when we either integrate by parts or differentiate in the above method, the In function effectively disappears.

EXERCISES 14.10

1. Write out the form you would assume for the integration of P(x, ln x).

2. Carry out the details for the integrand P(x) ln²x.

3.* Following the pattern of Section 14.2*, show that ∫ 1/ln x dx ≠ N (x, ln x)/D (x, ln x), where N(x, ln x) and D(x, ln x) are polynomials.

14.11 SUMMARY

We introduced and studied a new function ln x. It was defined by an integral, which we proved could not be done in a closed rational (or algebraic) form. We further showed that the powers of ln x were linearly independent, and this is useful in the later stages of the systematic integration of integrals with powers of ln x.

We also examined the use of the new function in a number of old and new applications, including compound interest, the distribution of numbers, and the entropy function.

The powerful method of integration by parts was introduced; it is simply the inverse of the derivative of a product. The method enables you to replace a function by its derivative in the new integral, which is often, when combined with the integral of the other part of the integrand, such that you can carry out an integration. Integration by parts and the method of changing variables are two of the most powerful methods of integration available.

We also extended the idea of an integral to improper integrals by the usual method of examining the finite case and letting it approach infinity. In general, it is not the details but the methods used to develop new applications that are important.

Finally, we introduced the obvious “guess at the solution” method as another illustration of the usefulness of the method of undetermined coefficients. If you guess the correct form, then from the fundamental theorem of calculus you can equate the derivative of your guess to the original integrand. If the result is an identity, for a suitable choice of the undetermined parameters you introduced into the guessed form, then you have the integral. If it is not an identity, then you can guess again, guided by the failure to fit, or else abandon that approach.