Suppose that the only functions we have come across are quotients of polynomials and that we are asked to solve the differential equation
for all x > 0, subject to the condition f(1) = 0.
If we try f(x) = P(x)/Q(x), where P and Q are polynomials with no common factors, then we find that
By comparing coefficients we can show that Q(0) = P(0) = 0, which shows that, contrary to our assumptions, both P(x) and Q(x) are divisible by x. Thus, we cannot solve equation (1) in terms of known functions. However, THE FUNDAMENTAL THEOREM OF CALCULUS [I.3 § 5.5] tells us that equation (1) does indeed have a solution, namely
Further study shows that the function F has many useful properties. For example, using the substitution u = t/a, we find that
and, using the formula for differentiating an inverse function, we find that F-1 is the solution of the differential equation
g'(x) = 9(x).
We therefore give the function a name (the logarithm) and add it to our list of standard functions.
At a more advanced level, integration by parts shows that the GAMMA FUNCTION [III.31] (introduced by EULER [VI.19])
defined for all x > 0, has the property that
for all x > 1, and therefore Γ(n) = (n - 1)! for all integers n ≥ 1 (since Γ(1) = 1). As one might expect from its association with factorials, the gamma function turns out to be very useful in number theory and statistics.
In practice, a “special function” is any function that, like the logarithm and the gamma function, has been extensively studied and has turned out to be useful. Some authors use the phrase “special functions” in a more restricted sense, meaning something like “functions that turn up in the solution of physical problems” or “functions other than those generally provided by a pocket calculator,” but these restrictions do not seem to be very useful.
In spite of this apparent generality, the theory of special functions is linked in the minds of many mathematicians to a collection of particular ideas and methods. Indeed, it is often linked to particular books like Whittaker and Watson’s A Course of Modern Analysis (which was first published in 1902 and is still selling well) and Abramowitz and Stegun’s Handbook of Mathematical Functions. These connections may simply be accidents of history, but the phrase “special functions” is often associated with other phrases like “equations of mathematical physics,” “beautiful formulas,” and “sheer ingenuity.” We illustrate this and other themes in the particular case of Legendre polynomials. (The next paragraph involves more advanced mathematics and glosses over several long calculations, but the reader may simply glance over its contents and resume careful reading thereafter.)
Suppose that we wish to examine the gravitational potential ψ of Earth by looking at solutions of LAPLACE’S? EQUATION [1.3 § 5.4]Δψ = 0. Since Earth is an oblate spheroid that is nearly spherical, we use spherical polar coordinates (r, θ, φ) and, noting that Earth is symmetric about its axis of rotation, we may suppose that ψ depends only on r and θ. Under these assumptions, Laplace’s equation takes the form
Following the standard technique of separation of variables, we look for solutions of the form ψ(r, θ) = R(r)θ(θ). After a little calculation, equation (2) yields
Since one side of equation (3) depends on r alone and the other on r alone, both sides must equal some constant k. The equation
has the solution R(r) = rl whenever l(l + 1) = k. The corresponding equation for θ is then
We now make the substitution x = cosθ, y(x) = θ(θ) to convert (4) to Legendre’s equation
Routine equating of coefficients reveals that, if we seek nontrivial solutions of the form f(x) = then, unless l is an integer, f(x) is unbounded as x approaches 1 (that is, as θ approaches 0), so these solutions are not useful physically. However, if l is a positive integer, then there is a polynomial solution of degree l. (If l is a negative integer, the same polynomials reappear.) In fact, we have the following stronger statement: if l is a positive integer, then there exists a unique polynomial pl of degree 1 satisfying Legendre’s equation (5) such that pl(1) = 1. We call pl the lth Legendre polynomial. Returning to our original problem, we see that it has solutions of the form
It is obvious to the physicist, and can be proved by the mathematician, that this is the most general solution if we also demand that θ(r, θ) → 0 as r → ∞. Notice that if r is large, then only the first few terms will contribute much to the final answer.
There are many different ways of obtaining the Legendre polynomials. The reader is invited to verify that, if we define Qn inductively by setting Qo(x) = 1 and Q1(x) = x, and using the “three-term recurrence relation”
then Qn (1) = 1 and Qn is a polynomial that satisfies Legendre’s equation (5) (with l = n), from which it follows that Qn is the Legendre polynomial of degree n.
If we set νn(x) = (x2 - 1)n, then
Differentiating both sides of this equation n + 1 times using Leibniz’s rule, we see that satisfies Legendre’s equation (5) with l = n. Differentiating νn(x = (x - 1)n(x + 1)n n times using Leibniz’s rule and noting that all but one of the resulting terms vanish when x = 1, we see that is a polynomial with (1) = 2nn!. Putting all this information together, we obtain Rodriguez’s formula
Equation (5) is an example of a Sturm-Liouville equation. Setting l = n and y = Pn and rewriting slightly, we obtain the equation
If m and n are positive integers, then, using (6) and integrating by parts, we obtain
Thus, by symmetry,
and, if m ≠ n,
The “orthogonality relation” given by (7) has important consequences. Since Pr is a polynomial of degree exactly r, we know that any polynomial Q of degree n - 1 or less can be written
and so
Thus, Pn is orthogonal to all polynomials of lower degree.
Suppose that the points where Pn(x) changes sign in the interval [-1,1] are α1, . . . , αm. Then, if we write
we know that P(x)Q(x) does not change sign on [-1, 1] and so
By equation (8) this means that the degree m of Q is at least n and so (since a polynomial of degree n can have at most n zeros) Pn must have exactly n distinct zeros and they must all lie in the interval [-1, 1].
GAUSS [VI.26] made use of these facts to obtain a powerful method of numerical integration. Suppose that x1, x2, . . . ,xn+1 are distinct points on [-1, 1]. If we set
then ej(x) is a polynomial of degree n that takes the value 1 when x = xj and 0 when x = xk with k ≠ j. Thus, if R is any polynomial of degree at most n, the polynomial Q given by
has degree at most n, and R - Q vanishes at the n + 1 points xj. It follows that R = Q, so
If we write , then
It is natural to hope that the approximation
which is an exact equality when f is a polynomial of degree n or less, will work well for other well-behaved functions.
Gauss observed that we can make a major improvement by taking the xj to be the n + 1 roots of the (n + 1)st Legendre polynomial. Suppose that P is a polynomial of degree at most 2n + 1. Then we can write
where Q and R are polynomials of degree at most n and Pn+1 is is the (n + 1)st Legendre polynomial. Now Pn + 1 is orthogonal to polynomials over lower degree (and, in particular, to Q), Pn + 1 (xj) = 0 by the definition of xj and the approximation (9) is an equality for R. Thus,
We have shown that the “quadrature formula” (9) is actually exact for all polynomials of degree at most 2n + 1, provided we choose the xj to be the numbers suggested by Gauss. Unsurprisingly, this choice gives an extremely good way of estimating integrals numerically. “Gaussian quadrature” is one of the two main methods used to evaluate integrals on computers today.
We conclude with a brief look at a few other special functions.
Consider de Moivre’s formula
cos nθ + isin nθ = (cos θ + i sin θ)n.
Using the binomial expansion, we see that
and, taking real parts,
Since sin2 θ = 1 – cos2 θ, we have
where Tn is a polynomial of degree n called the nth Chebyshev polynomial. The Chebyshev polynomials play an important role in numerical analysis.
The next collection of functions requires us to calculate with infinite sums. Readers may treat our calculations as plausible or justify them rigorously according to taste. Observe first that
is well-defined for each real x that is not a multiple of π. Note also that h(x + π) = h(x) and = . Set f(x) = h(x) – cosec2(πx). By showing that there are constants K1 and K2 such that
and
for all , we deduce that there is a constant K such that |f(x)| < K for all 0 < x < π. Simple calculations show that
A single application of (10) shows that |f(x)| < K for all 0 < x < π, and repeated applications shows that f(x) = 0. Thus
for all real noninteger x.
If we seek analogues in the complex plane, we are led to functions of the type
Observe that, while the real function cosec2 x satisfies cosec2 (x + π) = cosec2 (x) and is periodic with period π, the complex function F just defined satisfies
F(z + 1) = F(z), F(z + i) = F(z)
and is doubly periodic with periods 1 and i. Functions like F are called elliptic functions and have a theory that parallels that Of the TRIGONOMETRIC FUNCTIONS [III.92].
The function is called the Gaussian or normal function and appears in probability and the study of diffusion processes (see [III.71 §5] and [IV.24]). The partial differential equation
with x corresponding to distance and t to time provides a reasonable model for diffusion. It is easy to check that ϕ(x, t) = ψ(x, t) = (Kt)-1/2E(x(Kt)-1/2) is a solution. By sketching a graph of ψ(x, t) as a function of x for various values of t, readers will see that ψ can be considered as the response to a disturbance at x = 0 when t = 0. By considering the behavior of ψ(x, t) as a function of t for a given value of x, they will see that “the effect at x of a disturbance at the origin becomes noticeable only after a time of the order x1/2.” Living cells depend on diffusion processes and the result just given suggests (correctly) that such processes are very slow over long distances. It is plausible that this sets a limit on the size of a single cell: a large organism must be multi-celled.
Statisticians constantly use the related error function
There is a famous theorem of LIOUNILLE [VI.39] that shows that erf(x) cannot be expressed as a composition of elementary functions (such as quotients of polynomials, trigonometric functions, and EXPONENTIAL FUNCTIONS [III.2 5]).
We have been able to look at only a few properties of a few special functions in this article, but even this small sample shows how much interesting mathematics arises when we study one function or a class of particular functions rather than functions in general.