6

RIEMANN–STIELTJES INTEGRATION

6.1DEFINITION OF THE INTEGRAL

In calculus you did many computational examples and applications involving the Riemann integral images. In this section we examine the precise definition of a somewhat more general integral. The familiar intuitive notion of the integral as the limit of the sum of a very large number of very small quantities is central to the discussion.

6.1.1Definitions and Comments

Let ƒ and α be functions from [a, b] to R, and let P: a = x0 < x1 < ... < xn = b be a partition of [a, b]. The size of P is defined as the largest subinterval length; that is,

images

We assume throughout the discussion that α is increasing and ƒ is bounded. As in the familiar process of making a rectangular approximation for the purpose of computing the area under a curve, we define

images

(Since ƒ is bounded, Mk and mk are finite.) If tk is a point in [xk1, xk], k = 1, 2, ..., n (the choice of the tk may be regarded as part of the specification of P ), let

images

Also define

images

The upper sum associated with P, ƒ, α is defined as

images

similarly, the lower sum is

images

and the Riemann–Stieltjes sum is

images

If ƒ and α are understood, we write these sums as U(P), L(P), and S(P). When α(x) = x for all x, we have Δαk, = xk − xk1; in this case, we obtain approximating sums to the Riemann integral.

If ƒ is a bounded function such that U(P) and L(P) approach a common finite limit as |P| → 0, we say that ƒ is Riemann–Stieltjes integrable with respect to α or that the Riemann–Stieltjes integral of ƒ with respect to α exists. The integral is denoted by

images

If images is called the Riemann integral of ƒ, denoted by images.

In numerous practical examples, ƒ is a continuous function. Before showing that the Riemann–Stieltjes integral always exists in this case, we need some preliminaries. By Theorem 4.2.4, ƒ is uniformly continuous on [a, b], so that given > 0 there is a δ > 0 such that whenever |xy| < δ we have |f(x) − f(y)| < . Let us prove that if α is increasing and P is a partition of size less than δ,

images

To establish this, observe that the assumption |P| < δ implies that 0 ≤ Mk − mk for all k; hence,

images

Now consider a refinement of P, in other words, keep the points x0, ..., xn defining P and add new points (possibly in each subinterval) to form a partition P′. If a new point y appears in the subinterval [xk−1, xk] of P so that xk1 < y < xk, then the term Mk[α(xk) − α(xk−1)], which appears in U(P), may be written as

images

If images, the corresponding expression in U(P′) is

images

(note also that images; similarly, refinement increases lower sums). The point of all this is to compare U(P) with U(Р′). Since images, we obtain, as in the proof of (1),

images

and, similarly,

images

Finally, it follows immediately from the definitions that

images

It is convenient at this point to make a comment that we need later. Suppose P′ is formed by adding only one pointy to P in the subinterval [xk−1, xk]. The situation we are analyzing involves a fixed y, with the size of P (hence also the size of P′) approaching 0. If ƒ is continuous at y, then Mk, images, and images all approach ƒ(y) as |P| → 0. Thus, |U(P) − U(P′)| can be made less than if |P| is sufficiently small. If α is continuous at y, then α(y) − α(xk−1) and α(xk) − α(y) → 0 as |Ρ| → 0. Since Mk, images, and images are all bounded by supaxb ƒ(x), we can again conclude that |U(P) − U(P′)| can be made less than if |P| is sufficiently small. (A similar analysis can be made for lower sums.) If ƒ and α are discontinuous at the same point, images does not exist, as you should discover when doing Problem 1.

We are now ready for the main result.

6.1.2THEOREM. If f is continuous and α is increasing on [a, b], then the Riemann–Stieltjes integral images exists. In particular, if f is continuous on [a, b], then f is Riemann integrable on [a, b].

    Proof. Given > 0, let δ > 0 be such that |xy| < δ implies |f(x) − f(y)| < . If P1 and P2 are arbitrary partitions with |Ρ1| and |P2| < δ, let P be a common refinement of P1 and P2 (take the union of the points of P1 and the points of P2 to form P). By Eq. (2),

images

and

images

Thus,

images

Similarly,

images

Since may be chosen arbitrarily small, it follows that if {Pn} is any sequence of partitions with |Pn| → 0, the sequences {U(Pn)} and {L(Pn)} are Cauchy, and hence converge. By Eq. (1) they converge to the same limit. The observation that U(P1) and U(P2) must be close to each other for small |P1| and |P2| implies that the limit is the same regardless of the particular sequence {Pn}. ■

Note that by Eq. (3), L(Pn) ≤ S(Pn) ≤ U(Pn) for all n; hence, images as |P| → 0. This may be used as the basis of a generalization of the Riemann–Stieltjes integral in which the monotonicity requirement on α is removed.

6.1.3Definition

Let ƒ : [a, b] → R, α: [a, b] → R, and assume ƒ is bounded. We say that images exists if S (P, ƒ, α) approaches a finite limit A as |P| → 0. In other words, given > 0 there is a δ > 0 such that whenever |P| < δ (regardless of the choice of the tk), we have |S(P, f, α) − A| < . Equivalently, if {Pn} is any sequence of partitions with |Pn| → 0, we have S(Pn, f, α) → A.

We know that if ƒ is continuous and α is increasing, then images exists. The case in which ƒ is continuous and α is decreasing is handled similarly (if we like, we can multiply α by −1 so that −α will be increasing).

Problems for Section 6.1

    1.Give an example in which ƒ is Riemann integrable on [a. b] and α is increasing, but images does not exist.

    2.Let ƒ(x) = 1, x rational; f(x) = 0, x irrational. Show that ƒ is not Riemann integrable on [0,1].

    3.Give an example of a function ƒ that is not Riemann integrable but whose absolute value |f| is Riemann integrable.

    4.Assume ƒ bounded and a increasing on [a, b]. By Eq. (3) if U(P, ƒ, α) and L(P, f, α) approach a common finite limit images as |P| → 0, then images. Prove that, conversely, if images (finite), then U(P, ƒ, α) and L(P, f, α) also approach images. Thus, the characterizations of the integral via upper and lower sums and via Riemann–Stieltjes sums are equivalent.

6.2PROPERTIES OF THE INTEGRAL

A number of basic properties of the integral follow directly from the definition. To save space, we adopt the notation ƒ R(α) to indicate that images exists.

6.2.1THEOREM

            (a)If f1, f2 R(α), then f1 + f2 R(α) and

images

            (b)If f R(α) and c is a constant, then cf R(α) and

images

            Also, f R() and images

            (c)If f R(α1), and f R(α2), then f R1 + α2) and

images

            For the remaining statements, assume α is increasing.

            (d)If f1, f2 R(α) and f1f2 on [a, b], then

images

            (e)If f R(α), then |f| R(α) and

images

            Thus, if |f| ≤ M on [a, b], we have

images

            (f)Assume a < c < b. If f R(α) on [a, c] and f R(α) on [c, b], then f R(α) on [a, b] and

images

            (g)If f is Riemann integrable on [a, b] and g = f except at finitely many points y1, ..., yr, then g is Riemann integrable on [a, b] and images

            (h)If f is piecewise continuous on [a, b], that is, f has only finitely many discontinuities on [a, b], all removable or jumps, then f is Riemann integrable on [a, b].

            Proof

            (a)

images

            (b)

images

            (c)

images

            (d)

images

            (since f1f2 and α is increasing, so Δαk ≥ 0). Let |P| → 0 to obtain the desired result.

            (e)

images

hence,

images

            Let |P| 0 to get images, provided |f| R(α). But |f(y)| = |f(y) − f(x) + f(x)| ≤ |f(y) – f(x)| + |f(x)|, and similarly |ƒ(x)| ≤ |f(y) − f(x)| + |ƒ(y)|. Consequently, ||ƒ(y)| − |f(x)|| ≤ |f(y) − f(x)|, and it follows that replacing ƒ by |ƒ| decreases |U(P) − L(P)|. (Let |ƒ(y)| approximate sup |ƒ|, and let |f(x)| approximate inf |f|.) Consequently, |f| R(α). (See Problem 1.)

            (f)We may assume that either ƒ or α is continuous at c; otherwise, images or images fails to exist (cf. Section 6.1, Problem 1). Given any sequence of partitions P of [a, b] with |P| → 0, and given > 0, we wish to refine P by adding the point c, producing a new partition Q with |U(P, f, α) − U(Q, f, α)| < and |L(P, ƒ, α) − L(Q, ƒ, α)| < . The discussion following Eq. (3) of 6.1.1 shows that this can be accomplished for sufficiently small |P| (in that discussion y is replaced by c). Since Q is formed by a partition of [a, c] followed by a partition of [c, b], U(Q, ƒ, α) and images as |P|, hence |Q|, approaches 0. It follows that for sufficiently small |P|, U(P, ƒ, α) and L(P, f, α) differ from images by less than . Since > 0 is arbitrary, the proof is complete.

            (g)If images and imagesimages, then

images

            (h)The interval [a, b] can be written as [y0, y1] [у1, у2] ... [yr−1, yr], where the discontinuities of ƒ occur at the yi. By (g), ƒ is Riemann integrable on each subinterval, since it becomes continuous when changed at the end points. By (f), ƒ is Riemann integrable on [a, b]. ■

It follows from Theorem 6.2.1(b) and (c) that if ƒ is continuous and α is the difference of two increasing functions, then images exists. We will see in the next section that if α has a continuous derivative α′ on [a, b] then α can be expressed as α1α2, where α1 and α2 are increasing; it follows that ƒ R(α). (In Chapter 5 we omitted any discussion of the “one-sided” derivative at the end points of a closed interval, but the definition is natural:

images

These comments allow us to show that a Riemann–Stieltjes integral may often be reduced to a Riemann integral, as follows.

6.2.2THEOREM. If f is continuous on [a, b] and α has a continuous derivative on [a, b], then

images

    Proof. Let P: a = x0 < x1 < ... < xn = b be a partition of [a, b]. By the Mean Value Theorem 5.1.4, α(xk) − α(хк1) = α′(tk)(xk − Xk1) for some tk (xk1, xk). The points t1, ..., tn yield the following Riemann–Stieltjes sum:

images

Let |P| → 0 to obtain

images

as desired. ■

We are now in a position to evaluate Riemann–Stieltjes integrals in practical cases. In the usual physical situation, α = α1 + α2 where α1 has a continuous derivative and α2 is a jump function having jumps of size cn at the points xn, n = 1, 2, ... N ; α2 is constant between jumps (see Fig. 6.2.1). In our discussion we assume α2 is increasing, but the evaluation formula to be obtained is valid for any jump function α2, since α2 is expressible as the difference of two increasing jump functions.

images

Figure 6.2.1 Jump Function

Consider the behavior of α2 at a jump of size Δα2(c) at x = c (see Fig. 6.2.2). Assuming ƒ continuous, we know that images as |P| → 0, so we are free to choose any convenient sequences of P’s. If we choose P so that [ch, c + h], h > 0, is one of the subintervals of P, we obtain a contribution of the form

images

as h → 0, by continuity of ƒ. In general, a jump of size cn at xn contributes f(xn)cn to images. We have arrived at the following result.

6.2.3Evaluation Formula

Let ƒ be continuous on [a, b], and let α = α1 + α2, where images is continuous on [a, b] and α2 is a jump function. Assume that the jumps of α2 occur at x1, ..., xN, and let images; notice that the actual value of α2(x) at x = cn is irrelevant. Then

images

images

Figure 6.2.2 Behavior at a Jump

The evaluation formula may be extended to the case where ƒ has a finite number of simple discontinuities as long as ƒ and α are not discontinuous at the same point; when simultaneous discontinuities occur, the Riemann–Stieltjes integral does not exist (see Problem 1 of Section 6.1).

A basic application of Riemann–Stieltjes integrals occurs in probability. The function α is the “distribution function” of a “random variable” X. Intuitively, if α has a jump of size cn at xn, then the probability that X will take the value xn in a particular performance of the random experiment is cn. If images, then X has a “continuous component” (as well as a discrete component represented by the set of probabilities cn). The term images represents the probability that X will take a value between x and x + dx, assuming none of the xi, 1 ≤ iN, belong to [x, x + dx]. The evaluation formula then gives the average value of f(X). For example, if f(x) = x2, then the formula gives the average value of the square of the random variable.

Another basic application is to line integrals. For example, if C is a curve in the plane, parametrized by x = x(t), y = y(t), atb, let s be arc length along the curve, so that s(t0) is the length of the portion of the curve given by (x(t), y(t)), att0. The line integral C f(x, y) ds is given by the Riemann–Stieltjes integral

images

In particular, if ƒ is the density of a curved rod, then C f(x, y) ds is the total mass.

In the evaluation formula, the integral images is evaluated by standard calculus techniques. The following result justifies such a computation.

6.2.4Fundamental Theorem of Calculus

Let ƒ be a continuous real–valued function on [a, b].

            (a)If images, then F′ = f on [a, b].

            (b)If G′ = ƒ on [a, b], then images.

            Proof

            (a)Let h > 0 (the argument for h <0 is similar). Then

images

            by continuity of ƒ.

            (b)By (a), (d/dx)(G(x) − F(x)) = f(x) − f(x) = 0 on [a, b], and hence GF is constant on [a, b] (use the Mean Value Theorem). Thus, images

For an intuitive view of Theorem 6.2.4(a), represent F by the cumulative area A under ƒ between a and x. If we change x by a “small” amount dx, the cumulative area changes by (approximately) f(x) dx (see Fig. 6.2.3). Thus dA = f(x) dx, or dA/dx = f(x), which says that F′ = ƒ.

images

Figure 6.2.3 Intuitive View of the Fundamental Theorem of Calculus

Problems for Section 6.2

    1.If α is increasing and ƒ is bounded on [a, b], show that ƒ R(α) iff U(P, f, α) − L(P, f, α) → 0 as |P| → 0.

    2.Assume α increasing and ƒ bounded on [a, b]. If a < c < b and ƒ R(α) on [a, b], show that ƒ R(α) on both [a, c] and [c, b].

    3.Let f(x) = x2, and define α as follows:

images

            Evaluate images

    4.Evaluate images where α(x) = [x] = the largest integer ≤ x.

6.3FUNCTIONS OF BOUNDED VARIATION

We have seen that if ƒ is continuous and a is the difference of two increasing functions, then ƒ R(α). We now look at the question of when a function α can be expressed as α1α2, where α1 and α2 are increasing.

6.3.1Definitions and Comments

Let ƒ: [a, b] → R, and let P: a = x0 < x1 < ... < xn = b be a partition of [a, b]. Define

images

The variation of ƒ on [a, b] is defined by

images

If we wish to emphasize the interval [a, b] on which the computation is carried out, we write V (ƒ; [a, b]) instead of simply V(ƒ).

We say that ƒ is of bounded variation (on [a, b]) if V (ƒ) < ∞.

It follows from the definitions that a monotone function is of bounded variation, with V(ƒ) = |f(b) − f(a)|. Also, for arbitrary ƒ we have V(−ƒ ) = V(ƒ). A brief computation shows that

images

To see this, note that V(f + g, P) ≤ V(f, P) + V(g, P) by the triangle inequality; hence, V(f + g, P) ≤ V(f) + V(g) for any P. Take the sup over P to obtain the desired result. Observe also that since V(−g) = V(g), we have V(fg) ≤ V(ƒ) + V(g). It follows that if α = f − g, where ƒ and g are increasing, then α is of bounded variation.

We adopt the notation ƒ BV to indicate that ƒ is of bounded variation. The main result is that if α BV then α can be expressed as the difference of two increasing functions. It follows that if ƒ is continuous and α BV, then images exists. Before proving this, we examine some basic properties and examples.

6.3.2THEOREM

            (a)If ƒ BV, then f is bounded.

            (b)If f, g BV with |f| ≤ A and |g| ≤ B on [a, b], then V(ƒg) ≤ AV(g ) + BV(ƒ); hence, f g BV.

            (c)If f′ exists and is bounded on [a, b], then f BV.

            (d)If f(x) = x sin(1/x), 0 ≤ xb, ƒ(0) = 0, then f is continuous but not of bounded variation.

            (e) If a < c < b, then V (ƒ; [a, b]) = V(ƒ; [a, c]) + V(ƒ ; [c, b]).

            Proof

        (a)If a < x < b, we have

images

        (b)A typical term in the computation of V (f g, P) is

images

            Since |f(xi)| ≤ A and |g (xi1)| ≤ B, it follows that

images

            Take the sup over P to obtain the desired result.

        (c)By the Mean Value Theorem,

images

            Thus, V(ƒ) ≤ M(b − a) < ∞.

        (d)Let yn = 2/, n = 1, 2, ... Then

images

            Let Pn be the partition formed by 0, y2n+1, y2n, ..., y1. Then

images

        (e)If P is any partition of [a, c] and Q is any partition of [c, b], we have V(f, P) + V(f, Q) ≤ V(ƒ; [a, b]), so V(ƒ; [a, c]) + V(f;[c, b]) ≤ V(f;[a, b]). If P0 is any partition of [a, b], we can refine P0 if necessary to obtain a partition Р′ of [a, c] followed by a partition Q′ of [c, b]. The refining process can only increase the variation, by the triangle inequality. Thus,

images

            Take the sup over P0 to finish the argument. ■

We now prove the main result on functions of bounded variation.

6.3.3THEOREM. If f is of bounded variation on [a, b], define

images

Then F and G are increasing on [a, b], so f is expressible as the difference of two increasing functions.

    Proof. Note that F “follows” ƒ, reflecting decreasing portions about the horizontal to obtain increasing portions; see Fig. 6.3.1. It follows from Theorem 6.3.2(e) that F is increasing. To show that G is increasing, let x < y. Then

images

If ƒ is continuous on [a, b] then so are F and G; see the problems.

Problems for Section 6.3

The purpose of this problem set is to show that, in Theorem 6.3.3, if ƒ is continuous on [a, b] so are F and G.

    1.Let c [a, b), > 0. Show that there is a δ > 0 and a partition P: c = x0+ < x1 < ... < xn = b such that 0 < x1x0 < δ, |f(x1) − ƒ(x0)| < /2, and imagesimages

images

Figure 6.3.1 Expressing a Function of Bounded Variation as the Difference of Two Increasing Functions

    2.Show that F(b) − F(c) − /2 < /2 + F(b) − F(x1), and conclude that F is right-continuous at c; i.e.; limxc;x > c F(x) = F(c).

    3.Let a < cb, > 0. Show that there is a δ > 0 and a partition P : a = x0 < x1 < ... < xn1 < xn = c such that |f(xn) − ƒ(xn−1)| < /2, 0 < xn − xn1 < δ, and imagesimages

    4.Show that F(c) − F(a) /2 < /2 + F(xn1) − F(a), and conclude that F is left-continuous at c; i.e. limx→c; x<c F(x) = F(c). It follows that F, hence G, is continuous on [a, b].

6.4SOME USEFUL INTEGRATION THEOREMS

We close this chapter with some results that are often useful in computations.

6.4.1Integration by Parts

If images exists, then so does images, and

images

    Proof. Consider any Riemann–Stieltjes sum for images, say

images

Now

images

Call this last line A. Then

images

Since xk−1tkxk, this is a Riemann–Stieltjes sum for images corresponding to a partition Р′ refining P. It follows that as |P| → 0, images. Thus, images, as desired. ■

6.4.2Change of Variable Formula

Let ƒ and h be continuous functions from [a, b] to R, and assume h strictly increasing. Let α be the inverse function of h; in other words, if y = h(x), then x = α(y), axb, h(a) ≤ yh(b). Then

images

    Proof. Let P: a = x0 < x1 < ... < xn = b, Q: h(a) = y0 < y1 < ... < yn = h(b), where yk = h(xk), 0 ≤ k ≤ n. (Since h is strictly increasing, so is α.) Then

images

Since h and α are continuous functions defined on closed, bounded (therefore compact) intervals, they are uniformly continuous. (Continuity of a follows from Section 4.2, Problem 5). Thus, if > 0, there is a δ > 0 such that if |xk − хk−1| < δ then |yk − yk−1| < . It follows that if |P| → 0 then |Q| → 0 (and conversely). Now ƒ α is a composition of continuous functions and is therefore continuous. Since α is monotone, images exists. Thus, if we let |P| → 0 in (1), we obtain images, as desired. ■

6.4.3Mean Value Theorem for Integrals

If f is continuous and a is increasing on [a, b], there is a point x0 [a, b] such that images

    Proof. Let M = sup{f(t) : atb} and m = inf {ƒ(t) : atb}. Then images; hence,

images

(Note that if α(a) = α(b) then α is constant on [a, b] and the result is trival.) Thus images for some c [m, M]. Now if, say, f(x1) = m and f(x2) = M, then by the Intermediate Value Theorem there is a point x0 between x1 and x2 such that f(x0) = c. We conclude that images. ■

6.4.4Upper Bounds on Integrals

Assume images exists, and let M = sup{|f(x)| : axb}. Then

images

where V(α) is the variation of a on [a, b]. Thus, if α is increasing, we have

images

If α(x) = x, we obtain a bound on the Riemann integral:

images

    Proof. Consider an arbitrary Riemann–Stieltjes sum images. We have

images

6.4.5Improper Integrals

An integral in which one or both limits of integration are infinite is called improper; it is defined as a limit of ordinary integrals. For example,

images

In many practical cases, integrals of this type are evaluated in the usual way; for example, images. For a formal justification, note that images as b → ∞.

An integral in which the function ƒ is unbounded is also referred to as improper. The theory of improper integrals is best handled within the domain of measure theory. As an example of what might happen, consider the function of Fig. 6.4.1. Here, ƒ is bounded and continuous everywhere, but images does not exist because images oscillates between 0 and 1 for n = 0, 1, 2, ... In general, it turns out that if images exists and is finite—that is, images approaches a finite limit as a → − ∞ and b → ∞—then images exists and is finite also.

images

Figure 6.4.1 Improper Riemann Integral

Problems for Section 6.4

    1.If ƒ and g are continuous and α increasing on [a, b], show that

images

            This is the Cauchy–Schwarz inequality for integrals;, the analogous result for sums is given in Section 3.2, Problem 3.

    2.Prove the integral test for convergence of series: if ƒ is continuous, nonnegative, and decreasing, show that images is finite (that is, images approaches a finite limit as b → ∞) if and only if images.

    3.Give an example of a function ƒ for which the improper Riemann integral images is finite, but images.

    4.Give an example of an unbounded function ƒ on (0,1] such that the improper integral images is

        (a)finite.

        (b)infinite.

REVIEW PROBLEMS FOR CHAPTER 6

    1.Evaluate images, where ƒ(x) = 1/x and

images

    2.True or false, and explain briefly.

        (a)If ƒ is piecewise continuous on [a, b] and α is increasing on [a, b], then images exists.

        (b)Let ƒ be continuous and α increasing on [a, b]. If Q is a refinement of the partition P, then L(Q, f, α) ≥ L(P, ƒ, α).

        (c)If ƒ is increasing and a is continuous on [a, b], then images exists.

    3.Let ƒ be continuous on R, and let g and h be differentiable on R. Show that

images

    4.Give an example of a function ƒ : [0, 1] → R such that ƒ is not Riemann integrable on [0,1] but ƒ2 is Riemann integrable on [0,1].

    5.Let

images

            If f(x) = 3х, evaluate images.

    6.Let ƒ be a continuous, real-valued function on [a, b], and let images. Show that F is of bounded variation on [a, b].

    7.A student evaluates the integral images as follows: Make the change of variable y = x2 to get

images

            because images... is always 0.

            But

images

            Where did the student go wrong?