Solution to Exercise 0.1, page ix
We have .
On the other hand, .
Clearly f(x2) > f(x1), and so the mining operation x1 is preferred to x2 because it incurs a lower cost.
Solutions to the exercises from Chapter 1
Solution to Exercise 1.1, page 7
True. Write the "vector addition" on V = (0, ∞) as x ⊕ y := xy and the "scalar multiplication" as α ⊙ x := x^α. Indeed we have:
(V1) For all x, y, z > 0, x ⊕ (y ⊕ z) = x(yz) = (xy)z = (x ⊕ y) ⊕ z.
(V2) For all x > 0, x ⊕ 1 = x·1 = x = 1·x = 1 ⊕ x.
(So 1 serves as the zero vector in this vector space!)
(V3) If x > 0, then 1/x > 0 too, and x ⊕ (1/x) = x(1/x) = 1 = (1/x)x = (1/x) ⊕ x.
(Thus 1/x acts as the inverse of x with respect to the operation ⊕.)
(V4) For all x, y > 0, x ⊕ y = xy = yx = y ⊕ x.
(V5) For all x > 0, 1 ⊙ x = x^1 = x.
(V6) For all x > 0 and all α, β ∈ R, α ⊙ (β ⊙ x) = (x^β)^α = x^(αβ) = (αβ) ⊙ x.
(V7) For all x > 0 and all α, β ∈ R, (α + β) ⊙ x = x^(α+β) = x^α · x^β = (α ⊙ x) ⊕ (β ⊙ x).
(V8) For all x, y > 0 and all α ∈ R, α ⊙ (x ⊕ y) = (xy)^α = x^α · y^α = (α ⊙ x) ⊕ (α ⊙ y).
We remark that V is isomorphic to the one-dimensional vector space R (with the usual operations): indeed, it can be checked that the maps log : V → R and exp : R → V are linear transformations, and are inverses of each other.
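These axioms can be spot-checked numerically; a minimal sketch (the function names vadd and smul are ours, standing for ⊕ and ⊙):

```python
import math

# Vector-space operations on V = (0, ∞) from Exercise 1.1:
# "addition"  x ⊕ y := xy, "scalar multiplication"  α ⊙ x := x**α.
def vadd(x, y):
    return x * y

def smul(alpha, x):
    return x ** alpha

x, y, alpha = 2.0, 5.0, 3.0
# log turns ⊕ into + and ⊙ into ordinary scalar multiplication in R:
assert math.isclose(math.log(vadd(x, y)), math.log(x) + math.log(y))
assert math.isclose(math.log(smul(alpha, x)), alpha * math.log(x))
# 1 is the zero vector, and 1/x is the additive inverse of x:
assert vadd(x, 1.0) == x and math.isclose(vadd(x, 1.0 / x), 1.0)
```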
Solution to Exercise 1.2, page 7
We prove this by contradiction. Suppose that C[0, 1] has dimension d. Consider functions xn(t) = tn, t ∈ [0, 1], n = 1, ··· , d. Since polynomials are continuous, we have xn ∈ C[0, 1] for all n = 1, ··· , d.
First we prove that xn, n = 1, ··· , d, are linearly independent in C[0, 1]. Suppose not. Then there exist αn ∈ R, n = 1, ··· , d, not all zeros, such that α1 · x1 + ··· + αd · xd = 0. Let m ∈ {1, ··· , d} be the smallest index such that αm ≠ 0. Then for all t ∈ [0, 1], αm·t^m + ··· + αd·t^d = 0. In particular, dividing by t^m, for all t ∈ (0, 1] we have αm + α_{m+1}·t + ··· + αd·t^(d−m) = 0.
Thus for all n ∈ N we have αm + α_{m+1}·(1/n) + ··· + αd·(1/n)^(d−m) = 0.
Passing the limit as n → ∞, we obtain αm = 0, a contradiction. So the functions xn, n = 1, ··· , d, are linearly independent in C[0, 1].
Next, we get the contradiction to C[0, 1] having dimension d. Since any independent set of cardinality d in a d-dimensional vector space is a basis for this vector space, {xn : n = 1, ··· , d} is a basis for C[0, 1]. Since the constant function 1 (taking value 1 everywhere on [0, 1]) belongs to C[0, 1], there exist βn ∈ R, n = 1, ··· , d, such that 1 = β1 · x1 + ··· + βd · xd. In particular, putting t = 0, we obtain the contradiction that 1 = 0: 1 = 1(0) = (β1 · x1 + ··· + βd · xd)(0) = 0.
Solution to Exercise 1.3, page 7
(“If ” part.) Suppose that ya = yb = 0. Then we have:
(S1)If x1, x2 ∈ S, then x1 + x2 ∈ S. As x1, x2 ∈ C1[a, b], also x1 + x2 ∈ C1[a, b]. Moreover, x1(a) + x2(a) = 0 + 0 = 0 = ya and x1(b) + x2(b) = 0 + 0 = 0 = yb.
(S2)If x ∈ S and α ∈ R, then α · x ∈ S. Indeed, as x ∈ C1[a, b], and α ∈ R,
we have α·x ∈ C1[a, b], and (α·x)(a) = α0 = 0 = ya, (α·x)(b) = α0 = 0 = yb.
(S3)0 ∈ S, since 0 ∈ C1[a, b] and 0(a) = 0 = ya = yb = 0(b).
Hence, S is a subspace of the vector space C1[a, b].
(“Only if ” part.) Suppose that S is a subspace of C1[a, b]. Let x ∈ S.
Then 2 · x ∈ S. Therefore, (2 · x)(a) = ya, and so ya = (2 · x)(a) = 2x(a) = 2ya.
Thus ya = 0. Moreover, (2 · x)(b) = yb, and so yb = (2 · x)(b) = 2x(b) = 2yb.
Hence also yb = 0.
Solution to Exercise 1.4, page 10
Solution to Exercise 1.5, page 14
From the triangle inequality, we have ||x|| = ||y + (x − y)|| ≤ ||y|| + ||x − y|| for all x, y ∈ X. So for all x, y ∈ X, ||x|| − ||y|| ≤ ||x − y||.
Interchanging x and y, we get ||y|| − ||x|| ≤ ||y − x|| = ||x − y||.
So for all x, y ∈ X, −(||x|| − ||y||) ≤ ||x − y||.
Combining these two inequalities, we obtain | ||x|| − ||y|| | ≤ ||x − y|| for all x, y ∈ X.
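A quick numerical spot-check of this "reverse" triangle inequality, here for the Euclidean norm on R^5 (the random test vectors are our choice):

```python
import math
import random

def norm2(v):
    """Euclidean norm of a vector given as a list of floats."""
    return math.sqrt(sum(t * t for t in v))

random.seed(0)
for _ in range(1000):
    x = [random.uniform(-10, 10) for _ in range(5)]
    y = [random.uniform(-10, 10) for _ in range(5)]
    diff = [a - b for a, b in zip(x, y)]
    # | ||x|| - ||y|| |  <=  ||x - y||  (small tolerance for rounding)
    assert abs(norm2(x) - norm2(y)) <= norm2(diff) + 1e-12
```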
Solution to Exercise 1.6, page 14
No, since for example (N2) fails if we take x = 1 and α = 2:
Solution to Exercise 1.7, page 15
We verify that (N1), (N2), (N3) are satisfied by || · ||Y:
(N1) For all y ∈ Y, ||y||Y = ||y||X ≥ 0.
If y ∈ Y and ||y||Y = 0, then ||y||X = 0, and so y = 0 in X.
But 0 ∈ Y, and so y = 0 in Y.
(N2) If y ∈ Y and α ∈ R, then α · y ∈ Y and ||α · y||Y = ||α · y||X = |α| ||y||X = |α| ||y||Y.
(N3) If y1, y2 ∈ Y, then y1 + y2 ∈ Y.
Also, ||y1 + y2||Y = ||y1 + y2||X ≤ ||y1||X + ||y2||X = ||y1||Y + ||y2||Y.
Solution to Exercise 1.8, page 15
(1) We first consider the case 1 ≤ p < ∞, and then p = ∞. Let 1 ≤ p < ∞.
(N1) If x = (x1, ··· , xd) ∈ Rd, then ||x||p = (|x1|^p + ··· + |xd|^p)^(1/p) ≥ 0.
If x ∈ Rd and ||x||p = 0, then ||x||p^p = 0, that is, |x1|^p + ··· + |xd|^p = 0.
So |xn| = 0 for 1 ≤ n ≤ d, that is, x = 0.
(N2) Let x = (x1, ··· , xd) ∈ Rd, and α ∈ R.
Then ||α · x||p = (|αx1|^p + ··· + |αxd|^p)^(1/p) = |α| (|x1|^p + ··· + |xd|^p)^(1/p) = |α| ||x||p.
(N3) Let x = (x1, ··· , xd) ∈ Rd and y = (y1, ··· , yd) ∈ Rd.
If p = 1, then we have |xn + yn| ≤ |xn| + |yn| for 1 ≤ n ≤ d.
By adding these, ||x + y||1 ≤ ||x||1 + ||y||1, establishing (N3) for p = 1.
Now consider the case 1 < p < ∞.
If x + y = 0, then ||x + y||p = ||0||p = 0 ≤ ||x||p + ||y||p trivially.
So we assume that x + y ≠ 0. By Hölder's Inequality (with q the conjugate exponent of p, 1/p + 1/q = 1), we have
Σ_{n=1}^d |xn| |xn + yn|^(p−1) ≤ ||x||p (Σ_{n=1}^d |xn + yn|^(q(p−1)))^(1/q) = ||x||p ||x + y||p^(p/q),
where we used q(p − 1) = p in order to obtain the last equality.
Similarly, Σ_{n=1}^d |yn| |xn + yn|^(p−1) ≤ ||y||p ||x + y||p^(p/q). Consequently,
||x + y||p^p = Σ_{n=1}^d |xn + yn|^p ≤ Σ_{n=1}^d (|xn| + |yn|) |xn + yn|^(p−1) ≤ (||x||p + ||y||p) ||x + y||p^(p/q).
Dividing throughout by ||x + y||p^(p/q) ≠ 0, we obtain ||x + y||p^(p−p/q) = ||x + y||p ≤ ||x||p + ||y||p. This completes the proof that (Rd, || · ||p) is a normed space for 1 ≤ p < ∞.
Now we consider the case p = ∞.
(N1) If x = (x1, ··· , xd) ∈ Rd, then ||x||∞ = max{|x1|, ··· , |xd|} ≥ 0.
If x ∈ Rd and ||x||∞ = 0, then max{|x1|, ··· , |xd|} = 0, and so |xn| = 0 for 1 ≤ n ≤ d, that is, x = 0.
(N2) Let x = (x1, ··· , xd) ∈ Rd, and α ∈ R.
Then ||α · x||∞ = max{|αx1|, ··· , |αxd|} = |α| max{|x1|, ··· , |xd|} = |α| ||x||∞.
(N3) Let x = (x1, ··· , xd) ∈ Rd and y = (y1, ··· , yd) ∈ Rd.
We have |xn + yn| ≤ |xn| + |yn| ≤ ||x||∞ + ||y||∞ for 1 ≤ n ≤ d.
So it follows that ||x + y||∞ ≤ ||x||∞ + ||y||∞, establishing (N3) for p = ∞.
(2) See the following pictures.
(3) For x = (a, b) ∈ R2 we have
max{|a|, |b|} ≤ (|a|^p + |b|^p)^(1/p) ≤ (2 max{|a|, |b|}^p)^(1/p) = 2^(1/p) max{|a|, |b|}.
So ||x||∞ ≤ ||x||p ≤ 2^(1/p) ||x||∞. Writing 2^(1/p) = 1 + hp with hp ≥ 0, we have 2 = (1 + hp)^p ≥ 1 + p·hp, giving 0 ≤ hp ≤ 1/p for all p, and so hp → 0, that is, 2^(1/p) → 1 as p → ∞. So it follows by the Sandwich Theorem that
lim_{p→∞} ||x||p = ||x||∞.
The balls Bp(0, 1) grow to B∞(0, 1) as p increases.
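The squeeze ||x||∞ ≤ ||x||p ≤ 2^(1/p) ||x||∞ used above can be observed numerically; a small sketch for x = (3, −4):

```python
def p_norm(x, p):
    """||x||_p for a vector x given as a list of floats, 1 <= p < infinity."""
    return sum(abs(t) ** p for t in x) ** (1.0 / p)

x = [3.0, -4.0]
maxnorm = max(abs(t) for t in x)          # ||x||_inf = 4
ps = (1, 2, 4, 8, 16, 32, 64)
vals = [p_norm(x, p) for p in ps]
# ||x||_p decreases in p and is squeezed between ||x||_inf and 2**(1/p)*||x||_inf:
assert all(a >= b for a, b in zip(vals, vals[1:]))
for p, v in zip(ps, vals):
    assert maxnorm <= v <= 2 ** (1.0 / p) * maxnorm + 1e-12
# by p = 64 the p-norm is already very close to the max norm
assert abs(vals[-1] - maxnorm) < 0.1
```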
Solution to Exercise 1.9, page 16
(1) If x, y ∈ B(0, 1), then for all α ∈ (0, 1), (1 − α) · x + α · y ∈ B(0, 1) too, since
||(1 − α) · x + α · y|| ≤ (1 − α)||x|| + α||y|| < (1 − α) + α = 1.
(3) B(0, 1) is not convex: taking x = (1, 0), y = (0, 1) and α = 1/2, we obtain (1 − α) · x + α · y = (1/2, 1/2), and so
(1 − α) · x + α · y ∉ B(0, 1).
Solution to Exercise 1.10, page 16
We’ll verify that (N1), (N2), (N3) hold.
(N1) If x ∈ C[a, b], then |x(t)| ≥ 0 for all t ∈ [a, b], and so
||x||1 = ∫_a^b |x(t)| dt ≥ 0.
Let x ∈ C[a, b] be such that ||x||1 = 0. If x(t) = 0 for all t ∈ (a, b), then by the continuity of x on [a, b], it follows that x(t) = 0 for all t ∈ [a, b] too, and we are done! So suppose that it is not the case that for all t ∈ (a, b), x(t) = 0. Then there exists a t0 ∈ (a, b) such that x(t0) ≠ 0. As x is continuous at t0, there exists a δ > 0 small enough so that a < t0 − δ, t0 + δ < b, and such that for all t ∈ [a, b] with t0 − δ < t < t0 + δ, |x(t) − x(t0)| < |x(t0)|/2. Then for t0 − δ < t < t0 + δ, we have, using the "reverse" Triangle Inequality from Exercise 1.5, page 14, that |x(t)| ≥ |x(t0)| − |x(t) − x(t0)| > |x(t0)|/2.
So ||x||1 = ∫_a^b |x(t)| dt ≥ ∫_{t0−δ}^{t0+δ} |x(t)| dt ≥ 2δ · |x(t0)|/2 = δ|x(t0)| > 0.
This contradicts ||x||1 = 0. Hence x = 0.
(N2) For x ∈ C[a, b] and α ∈ R, ||α · x||1 = ∫_a^b |αx(t)| dt = |α| ∫_a^b |x(t)| dt = |α| ||x||1.
(N3) Let x, y ∈ C[a, b]. Then ||x + y||1 = ∫_a^b |x(t) + y(t)| dt ≤ ∫_a^b (|x(t)| + |y(t)|) dt = ||x||1 + ||y||1.
Solution to Exercise 1.11, page 17
(N1) For x ∈ Cn[a, b], clearly ||x||n,∞ = ||x||∞ + ||x′||∞ + ··· + ||x^(n)||∞ ≥ 0.
If x ∈ Cn[a, b] is such that ||x||n,∞ = 0, then ||x||∞ + ··· + ||x^(n)||∞ = 0, and since each term in this sum is nonnegative, we have ||x||∞ = 0, and so x = 0.
(N2) Let x ∈ Cn[a, b] and α ∈ R. Then
||α · x||n,∞ = ||α · x||∞ + ··· + ||(α · x)^(n)||∞ = |α| (||x||∞ + ··· + ||x^(n)||∞) = |α| ||x||n,∞.
(N3) Let x, y ∈ Cn[a, b]. For all 0 ≤ k ≤ n, ||x^(k) + y^(k)||∞ ≤ ||x^(k)||∞ + ||y^(k)||∞, by the Triangle Inequality for || · ||∞. Consequently, ||x + y||n,∞ ≤ ||x||n,∞ + ||y||n,∞.
Solution to Exercise 1.12, page 17
(1) Let k1, k2, m1, m2, n1, n2 ∈ Z, with p ∤ m1, m2, n1, n2 and p^k1 m1/n1 = p^k2 m2/n2.
If k1 > k2, then p^(k1−k2) m1n2 = m2n1, which implies that p | m2n1, and as p is prime, this would mean p | m2 or p | n1, a contradiction. Hence k1 ≤ k2. Similarly, we also obtain k2 ≤ k1.
Thus k1 = k2. Consequently, p^(−k1) = p^(−k2), and so | · |p is well-defined.
(2) If 0 ≠ r ∈ Q, then we can express r as r = p^k m/n, with k, m, n ∈ Z and p ∤ m, n.
We see that |r|p = p^(−k) > 0. If r = 0, then |r|p = |0|p = 0 by definition.
Thus |r|p ≥ 0 for all r ∈ Q. Also, if r ≠ 0, then |r|p > 0. Hence |r|p = 0 implies that r = 0.
(3) The claim is obvious if r1 = 0 or r2 = 0. Suppose that r1 ≠ 0 and r2 ≠ 0.
Let r1 = p^k1 m1/n1 and r2 = p^k2 m2/n2, with p ∤ m1, m2, n1, n2.
So r1r2 = p^(k1+k2) (m1m2)/(n1n2). As p ∤ m1, p ∤ m2, and p is prime, we have p ∤ m1m2.
Similarly p ∤ n1n2. Thus
|r1r2|p = p^(−(k1+k2)) = p^(−k1) p^(−k2) = |r1|p |r2|p.
(4) The inequality is trivially true if r1 = 0 or r2 = 0 or if r1 + r2 = 0.
Assume r1 ≠ 0, r2 ≠ 0, and r1 + r2 ≠ 0.
Let r1 = p^k1 m1/n1 and r2 = p^k2 m2/n2, with k1, k2, m1, m2, n1, n2 ∈ Z and p ∤ m1, m2, n1, n2. We have
r1 + r2 = p^min{k1,k2} N/(n1n2),
where N := p^(k1−min{k1,k2}) m1n2 + p^(k2−min{k1,k2}) m2n1 (≠ 0, since r1 + r2 ≠ 0). By the Fundamental Theorem of Arithmetic, there exist a unique integer ℓ ≥ 0 and an integer m such that N = p^ℓ m and p ∤ m. Clearly p ∤ n1n2.
Hence r1 + r2 = p^(min{k1,k2}+ℓ) m/(n1n2), with p ∤ m, n1n2.
So |r1 + r2|p = p^(−(min{k1,k2}+ℓ)) ≤ p^(−min{k1,k2}) = max{p^(−k1), p^(−k2)} = max{|r1|p, |r2|p}.
This yields the Triangle Inequality: |r1 + r2|p ≤ max{|r1|p, |r2|p} ≤ |r1|p + |r2|p.
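The p-adic absolute value and the two properties just proved can be spot-checked with exact rational arithmetic; a minimal sketch (the helper p_adic_abs is ours):

```python
from fractions import Fraction

def p_adic_abs(r, p):
    """|r|_p = p**(-k), where r = p**k * m/n with p dividing neither m nor n."""
    r = Fraction(r)
    if r == 0:
        return Fraction(0)
    k, m, n = 0, r.numerator, r.denominator
    while m % p == 0:       # pull factors of p out of the numerator
        m //= p; k += 1
    while n % p == 0:       # and out of the denominator
        n //= p; k -= 1
    return Fraction(p) ** (-k)

p = 3
pairs = [(Fraction(9, 2), Fraction(5, 3)), (Fraction(6), Fraction(3)),
         (Fraction(1, 9), Fraction(2, 9))]
for a, b in pairs:
    # multiplicativity (3) and the strong triangle inequality (4)
    assert p_adic_abs(a * b, p) == p_adic_abs(a, p) * p_adic_abs(b, p)
    if a + b != 0:
        assert p_adic_abs(a + b, p) <= max(p_adic_abs(a, p), p_adic_abs(b, p))
```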
Solution to Exercise 1.13, page 17
(N1)Clearly for all M = [mij] ∈ Rm×n.
If ||M||∞ = 0, then |mij| = 0 for all 1 i
m, 1
j
n, that is, M = [mij] = 0, the zero matrix.
(N2)For M = [mij] ∈ Rm×n and α ∈ R, we have
(N3)For P = [pij], Q = [qij] ∈ Rm×n, |pij + qij| |pij| + |qij|
||P||∞ + ||Q||∞.
As this holds for all i, j, ||P + Q||∞ = .
Solution to Exercise 1.14, page 19
Consider the open ball B(x, r) = {y ∈ X : ||x − y|| < r} in X. If y ∈ B(x, r), then ||x − y|| < r. Define r′ = r − ||x − y|| > 0. We claim that B(y, r′) ⊂ B(x, r). Let z ∈ B(y, r′). Then ||z − y|| < r′ = r − ||x − y||, and so ||x − z|| ≤ ||x − y|| + ||y − z|| < r. Hence z ∈ B(x, r). The following picture illustrates this.
Solution to Exercise 1.15, page 19
The point c belongs to I, but for each r > 0, the ball B(c, r) contains a point y that does not belong to I: indeed, ||y − c||2 < r, while y fails the defining condition of I. So no ball centred at c is contained in I. See the following picture.
Solution to Exercise 1.16, page 19
Using the following picture, it can be seen that the collections O1, O2, O∞ of open sets in the normed spaces (R2, || · ||1), (R2, || · ||2), (R2, || · ||∞), respectively, coincide.
Solution to Exercise 1.17, page 20
If Fi, i ∈ I, is a family of closed sets, then X\Fi, i ∈ I, is a family of open sets. Hence their union, ∪_{i∈I}(X\Fi) = X\(∩_{i∈I}Fi), is open. So
∩_{i∈I}Fi is closed.
If F1, ··· , Fn are closed, then X\F1, ··· , X\Fn are open, and so the intersection of these finitely many open sets, (X\F1) ∩ ··· ∩ (X\Fn) = X\(F1 ∪ ··· ∪ Fn), is open as well.
Thus F1 ∪ ··· ∪ Fn is closed.
For showing that the finiteness condition cannot be dropped, we’ll consider the normed space X = R, and simply rework Example 1.15, page 20, by taking complements.
We know that Fn := R\(−1/n, 1/n), n ∈ N, is closed, and the union of these is ∪_{n∈N}Fn = R\{0}, which is not closed, since if it were, its complement R\(R\{0}) = {0} would be open, which is false.
Solution to Exercise 1.18, page 20
Consider the closed ball B(x, r) = {y ∈ X : ||x − y|| ≤ r} in X. To show that B(x, r) is closed, we'll show its complement, U := {y ∈ X : ||x − y|| > r}, is open. If y ∈ U, then ||x − y|| > r. Define r′ = ||x − y|| − r > 0. We claim that B(y, r′) ⊂ U. Let z ∈ B(y, r′). Then ||z − y|| < r′ = ||x − y|| − r, and so ||x − z|| ≥ ||x − y|| − ||y − z|| > ||x − y|| − (||x − y|| − r) = r. Hence z ∈ U.
Solution to Exercise 1.19, page 20
(1)False.
For example, in the normed space R, consider the set [0, 1). Then [0, 1) is not open, since every open ball B with centre 0 contains at least one negative real number, and so B has points not belonging to [0, 1).
On the other hand, this set [0, 1) is not closed either, as its complement is C := (−∞, 0) ∪ [1, ∞), which is not open, since every open ball B′ with centre 1 contains at least one positive real number strictly less than one, and so B′ contains points that do not belong to C.
(2)False. R is open in R, and it is also closed.
(3)True. Ø and X are both open and closed in any normed space X.
(4)True. [0, 1) is neither open nor closed in R.
(5)False.
0 ∈ Q, but every open ball centred at 0 contains irrational numbers; just consider √2/n, with a sufficiently large n.
(6)False.
Consider the sequence (an)n∈N given by a1 = 2, and for n ≥ 1, an+1 = (an + 2/an)/2. Then it can be shown, using induction on n, that (an)n∈N is bounded below by √2, and that (an)n∈N is monotone decreasing. (Example 1.19, page 31.) So (an)n∈N is convergent with a limit L satisfying L = (L + 2/L)/2, and so L^2 = 2.
As L must be positive (the sequence is bounded below by √2), it follows that L = √2. So every ball with centre √2 and a positive radius contains elements from Q (terms an for large n), showing that R\Q is not open, and hence Q is not closed.
(Alternately, let c ∈ R have the decimal expansion c = 0.101001000100001 ···. The number c is irrational because it has a nonterminating and nonrepeating decimal expansion. The sequence of rational numbers obtained by truncation, namely 0.1, 0.101, 0.101001, 0.1010010001, 0.101001000100001, ··· , converges with limit c, and so every ball with centre c and a positive radius contains elements from Q, showing again that R\Q is not open, and hence Q is not closed.)
(7)True. As each (n, n + 1) is open, so is their union.
Hence Z = R\(R\Z) is closed.
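The truncation argument in the alternate proof of (6) can be illustrated with exact rational arithmetic; a sketch (truncation(12) stands in for the irrational limit c, an assumption made purely for testing):

```python
from fractions import Fraction

# c = 0.101001000100001... has a 1 in decimal place k(k+1)/2 for k = 1, 2, 3, ...
def truncation(K):
    """The rational number obtained by keeping the first K ones of c."""
    return sum(Fraction(1, 10 ** (k * (k + 1) // 2)) for k in range(1, K + 1))

c_approx = truncation(12)   # a very fine truncation standing in for c
prev_err = None
for K in range(1, 8):
    err = c_approx - truncation(K)
    # the error is positive and smaller than 10 times the next retained digit
    assert 0 < err < Fraction(1, 10 ** ((K + 1) * (K + 2) // 2 - 1))
    if prev_err is not None:
        assert err < prev_err   # the rational truncations approach c
    prev_err = err
```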
We have already seen in Exercise 1.14, page 19, that the interior of S, namely the open ball B(0, 1) = {x ∈ X : ||x|| < 1} is open. Also, it follows from Exercise 1.18, page 20, that the exterior of the closed ball B(0, 1), namely the set U = {x ∈ X : ||x|| > 1} is open as well. Thus, the complement of S, being the union of the two open sets B(0, 1) and U, is open. Consequently, S is closed.
If X = {0}, then {0} is clearly closed, since X\{0} = Ø is open.
Now suppose that X ≠ {0}, and let x ∈ X. We want to show that U := X\{x} is open. Let y ∈ U, and set r := ||x − y|| > 0. We claim that the open ball B(y, r) is contained in U. If z ∈ B(y, r), then ||y − z|| < r, and so ||z − x|| ≥ ||x − y|| − ||y − z|| = r − ||y − z|| > r − r = 0. Hence z ≠ x, and so z ∈ X\{x} = U. Consequently U is open, and so {x} = X\U is closed.
If F is empty, then it is closed.
If F is not empty, then F = {x1, ··· , xn} = ∪_{i=1}^n {xi}, for some x1, ··· , xn ∈ X.
As F is the finite union of the closed sets {x1}, ···, {xn}, F is closed too.
Let x, y ∈ R and x < y. By the Archimedean property of R, there is a positive integer n such that n > 1/(y − x), that is, n(y − x) > 1. Also, there are positive integers m1, m2 such that m1 > nx and m2 > −nx, so that −m2 < nx < m1. Thus we have nx ∈ [−m2, −m2 + 1) ∪ [−m2 + 1, −m2 + 2) ∪ ··· ∪ [m1 − 1, m1). Hence there is an integer m such that m − 1 ≤ nx < m. We have nx < m ≤ 1 + nx < ny, and so dividing by n, we have x < q := m/n < y. Consequently, between any two real numbers, there is a rational number.
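The construction in this proof can be traced in code; a floating-point sketch (the helper rational_between is ours, and the test pairs are arbitrary):

```python
import math

def rational_between(x, y):
    """Following the proof: pick n with n*(y - x) > 1, then m with m - 1 <= n*x < m."""
    assert x < y
    n = math.floor(1.0 / (y - x)) + 1    # n > 1/(y - x), so n*(y - x) > 1
    m = math.floor(n * x) + 1            # m - 1 <= n*x < m
    return m, n                          # the rational m/n lies strictly between x and y

for x, y in [(0.1, 0.23), (-5.3, -5.2), (2.0, 3.0)]:
    m, n = rational_between(x, y)
    assert x < m / n < y
```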
Let x ∈ R and let ε > 0. Then there is a rational number y such that x − ε < y < x + ε, that is, |x − y| < ε. Hence Q is dense in R.
Let x ∈ R and let ε > 0. If x ∈ R\Q, then taking y = x, we have |x − y| = 0 < ε. If on the other hand x ∈ Q, then let n ∈ N be such that n > √2/ε, so that with y := x + √2/n, we have y ∈ R\Q, and |x − y| = √2/n < ε. So R\Q is dense in R.
Let x = (xn)n∈N ∈ ℓ2, and ε > 0. Let N ∈ N be such that Σ_{n>N} |xn|^2 < ε^2.
Then y := (x1, ··· , xN, 0, 0, ···) ∈ c00, and ||x − y||2^2 = Σ_{n>N} |xn|^2 < ε^2.
Thus ||x − y||2 < ε. Consequently, c00 is dense in ℓ2.
Consider the set D of all finitely supported sequences with rational terms. Then D is a countable set, since it is a countable union of countable sets. We now show that D is dense in ℓ1. Let x := (xn)n∈N ∈ ℓ1 and let r > 0.
Let N ∈ N be large enough so that Σ_{n>N} |xn| < r/2.
As Q is dense in R, there exist q1, ··· , qN ∈ Q such that |xn − qn| < r/(2N) for n = 1, ··· , N.
With x′ := (q1, ··· , qN, 0, 0, ···) ∈ D, ||x − x′||1 = Σ_{n=1}^N |xn − qn| + Σ_{n>N} |xn| < N · r/(2N) + r/2 = r.
By the Binomial Theorem, we have
(s + t)^n = Σ_{k=0}^n C(n, k) s^(n−k) t^k.    (7.1)
Putting s = 1 − t, we get 1 = (t + (1 − t))^n = Σ_{k=0}^n C(n, k) (1 − t)^(n−k) t^k.
Keeping s fixed, and differentiating (7.1) with respect to t yields
n(s + t)^(n−1) = Σ_{k=0}^n k C(n, k) s^(n−k) t^(k−1).
Multiplying throughout by t gives
nt(s + t)^(n−1) = Σ_{k=0}^n k C(n, k) s^(n−k) t^k.    (7.2)
With s = 1 − t, this gives Σ_{k=0}^n k C(n, k) (1 − t)^(n−k) t^k = nt.
Differentiating (7.2) with respect to t yields
n(s + t)^(n−1) + n(n − 1)t(s + t)^(n−2) = Σ_{k=0}^n k^2 C(n, k) s^(n−k) t^(k−1).
Multiplying throughout by t yields
nt(s + t)^(n−1) + n(n − 1)t^2 (s + t)^(n−2) = Σ_{k=0}^n k^2 C(n, k) s^(n−k) t^k.
Setting s = 1 − t now gives
Σ_{k=0}^n k^2 C(n, k) (1 − t)^(n−k) t^k = nt + n(n − 1)t^2.
Hence
Σ_{k=0}^n (k − nt)^2 C(n, k) t^k (1 − t)^(n−k) = (nt + n(n − 1)t^2) − 2nt · nt + n^2t^2 = nt(1 − t).
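The identities derived above can be verified with exact rational arithmetic for sample values of n and t; a minimal sketch:

```python
from fractions import Fraction
from math import comb

# Weights w_k = C(n,k) * t^k * (1-t)^(n-k), and the identities
#   sum_k w_k = 1,  sum_k k*w_k = n*t,  sum_k k^2*w_k = n*t + n*(n-1)*t^2,
# hence  sum_k (k - n*t)^2 * w_k = n*t*(1-t).
n, t = 7, Fraction(2, 5)
w = [comb(n, k) * t ** k * (1 - t) ** (n - k) for k in range(n + 1)]
assert sum(w) == 1
assert sum(k * w[k] for k in range(n + 1)) == n * t
assert sum(k * k * w[k] for k in range(n + 1)) == n * t + n * (n - 1) * t ** 2
assert sum((k - n * t) ** 2 * w[k] for k in range(n + 1)) == n * t * (1 - t)
```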
(1)We check that the relation ~ is reflexive, symmetric and transitive.
(ER1)(Reflexivity) If ||·|| is a norm on X, then for all x ∈ X, we have that 1 · ||x|| = ||x|| = 1 · ||x||, and so ||·|| ~ ||·||.
(ER2) (Symmetry) If ||·||a ~ ||·||b, then there exist positive m, M such that for all x ∈ X, m||x||b ≤ ||x||a ≤ M||x||b. A rearrangement of this gives (1/M)||x||a ≤ ||x||b ≤ (1/m)||x||a, x ∈ X, and so ||·||b ~ ||·||a.
(ER3) (Transitivity) If ||·||a ~ ||·||b and ||·||b ~ ||·||c, then there exist positive constants mab, Mab, mbc, Mbc such that for all x ∈ X, we have mab||x||b ≤ ||x||a ≤ Mab||x||b and mbc||x||c ≤ ||x||b ≤ Mbc||x||c.
Thus mabmbc||x||c ≤ mab||x||b ≤ ||x||a ≤ Mab||x||b ≤ MabMbc||x||c, and so ||·||a ~ ||·||c.
(2) Suppose that ||·||a ~ ||·||b. Because ~ is an equivalence relation, it is enough to prove that if U is open in (X, ||·||b), then U is open in (X, ||·||a) too, and similarly, that if (xn)n∈N is Cauchy (respectively convergent) in (X, ||·||b), then it is Cauchy (respectively convergent) in (X, ||·||a) as well. Let m, M > 0 be such that for all x ∈ X, m||x||b ≤ ||x||a ≤ M||x||b.
Let U be open in (X, ||·||b), and x ∈ U. Then as U is open in (X, ||·||b), there exists an r > 0 such that Bb(x, r) := {y ∈ X : ||y − x||b < r} ⊂ U. But if y ∈ X satisfies ||y − x||a < mr, then ||y − x||b ≤ (1/m)||y − x||a < (1/m)mr = r, and so y ∈ Bb(x, r) ⊂ U. Hence Ba(x, mr) := {y ∈ X : ||y − x||a < mr} ⊂ U. So it follows that U is open in (X, ||·||a) too.
Now suppose that (xn)n∈N is a Cauchy sequence in (X, ||·||b). Let ε > 0. Then there exists an N ∈ N such that for all n, m > N, ||xn − xm||b < ε/M. Hence for all n, m > N, ||xn − xm||a ≤ M||xn − xm||b < M · (ε/M) = ε.
Consequently, (xn)n∈N is a Cauchy sequence in (X, ||·||a) as well.
If (xn)n∈N is a convergent sequence in (X, ||·||b) with limit L, then given ε > 0, there exists an N ∈ N such that for all n > N, ||xn − L||b < ε/M. Thus for all n > N, ||xn − L||a ≤ M||xn − L||b < M · (ε/M) = ε. So (xn)n∈N is convergent with limit L in (X, ||·||a) too.
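A concrete instance of equivalent norms is ||·||1 and ||·||∞ on R^d, with m = 1 and M = d; a numerical spot-check (the random test data is our choice):

```python
import random

def norm1(x):
    """||x||_1 = sum of absolute values."""
    return sum(abs(t) for t in x)

def norm_inf(x):
    """||x||_inf = max of absolute values."""
    return max(abs(t) for t in x)

d = 6
random.seed(1)
for _ in range(500):
    x = [random.uniform(-9, 9) for _ in range(d)]
    # ||x||_inf <= ||x||_1 <= d * ||x||_inf, so the two norms are equivalent
    assert norm_inf(x) <= norm1(x) <= d * norm_inf(x)
```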
(1) Let L > 0 be such that for all x, y ∈ R, |f(x) − f(y)| ≤ L|x − y|.
Then in particular, with x = n ∈ N and y = 0, we obtain |f(n) − f(0)| ≤ Ln.
Thus n ≤ L for all n ∈ N, which is absurd. So f is not Lipschitz.
(2) x1(0) = 0 and x2(0) = 0^2/4 = 0, and so x1, x2 both satisfy the initial condition.
For all t ≥ 0, x1′(t) = 0 = √(x1(t)) and x2′(t) = t/2 = √(t^2/4) = √(x2(t)).
So x1, x2 are both solutions to the given Initial Value Problem.
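Assuming the Initial Value Problem in question is x′(t) = √(x(t)), x(0) = 0 (consistent with the two solutions above; its right-hand side is not Lipschitz at 0), both solutions can be checked pointwise; a sketch:

```python
import math

# Two distinct solutions of the assumed IVP x'(t) = sqrt(x(t)), x(0) = 0:
def x1(t): return 0.0
def x2(t): return t * t / 4.0
def dx1(t): return 0.0          # derivative of x1
def dx2(t): return t / 2.0      # derivative of x2

for t in [0.0, 0.5, 1.0, 2.0, 3.7]:
    assert math.isclose(dx1(t), math.sqrt(x1(t)))
    assert math.isclose(dx2(t), math.sqrt(x2(t)))
assert x1(0.0) == x2(0.0) == 0.0   # same initial condition, different solutions
```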
Let F be closed, and (xn)n∈N be a sequence in F which converges to x. Suppose that x ∉ F. Since F is closed, there is an open ball B(x, r) := {y ∈ X : ||y − x|| < r} with r > 0, which is contained in X\F. But with ε := r > 0, there exists an N ∈ N such that for all n > N, ||xn − x|| < r. In particular, ||xN+1 − x|| < r, so that F ∋ xN+1 ∈ B(x, r) ⊂ X\F, a contradiction. Hence x ∈ F.
Now suppose that for every sequence (xn)n∈N in F, convergent in X with a limit x ∈ X, we have that the limit x ∈ F. We want to show that X\F is open. Suppose it isn't. Then ¬[∀x ∈ X\F, ∃r > 0 such that B(x, r) ⊂ X\F]. In other words, ∃x ∈ X\F such that ∀r > 0, B(x, r) ∩ F ≠ Ø. So with r = 1/n, n ∈ N, we can find an xn ∈ B(x, r) ∩ F. Then we obtain a sequence (xn)n∈N in F satisfying ||xn − x|| < 1/n for all n ∈ N. Thus (xn)n∈N converges to x. But x ∉ F, contradicting the hypothesis. Hence X\F is open, that is, F is closed.
Let (xn)n∈N in c00 be given by xn = (1, 1/2, ··· , 1/n, 0, 0, ···), n ∈ N.
Then with x := (1, 1/2, 1/3, ···) we have ||xn − x||2^2 = Σ_{k>n} 1/k^2 → 0 as n → ∞, so (xn)n∈N converges to x, but x ∉ c00,
showing that c00 is not closed.
(1)Suppose that F is a closed set containing S. Let L be a limit point of S.
Then there exists a sequence (xn)n∈N in S\{L} which converges to L.
As each xn ∈ S\{L} ⊂ S ⊂ F, and since F is closed, it follows that L ∈ F.
So all the limit points of S belong to F. Hence S̄ ⊂ F.
Next we show that S̄ is closed. Suppose that (xn)n∈N is a sequence in S̄ that converges to L.
We would like to prove that L ∈ S̄. If L ∈ S, then L ∈ S̄, and we are done.
So suppose that L ∉ S. Now for each n, we define the new term x′n as follows:
1° If xn ∈ S, then x′n := xn.
2° If xn ∉ S, then xn must be a limit point of S, and so B(xn, 1/n) must contain some element, say x′n, of S.
Hence we have ||x′n − L|| ≤ ||x′n − xn|| + ||xn − L|| < 1/n + ||xn − L|| → 0 as n → ∞.
Thus (x′n)n∈N is a sequence in S\{L} which converges to L, and so L is a limit point of S, that is, L ∈ S̄. Consequently S̄ is closed.
(2) We first note that if y ∈ Ȳ, then there exists a sequence (yn)n∈N in Y that converges to y. Indeed, this is obvious if y is a limit point of Y, and if y ∈ Y, then we may just take (yn)n∈N to be the constant sequence with all terms equal to y. We have:
(S1) Let x, y ∈ Ȳ. Let (xn)n∈N, (yn)n∈N be sequences in Y that converge to x, y, respectively. Then xn + yn ∈ Y ⊂ Ȳ for each n ∈ N, and (xn + yn)n∈N converges to x + y. But as Ȳ is closed, it follows that x + y ∈ Ȳ too.
(S2) Let α ∈ K, y ∈ Ȳ. Let (yn)n∈N be a sequence in Y that converges to y. Then α · yn ∈ Y ⊂ Ȳ for each n ∈ N, and (α · yn)n∈N converges to α · y.
But as Ȳ is closed, it follows that α · y ∈ Ȳ too.
(S3) 0 ∈ Y ⊂ Ȳ.
Hence Ȳ is a closed subspace.
(3) The proof is similar to part (2). Let x, y ∈ C̄. Then there exist sequences (xn)n∈N and (yn)n∈N in C that converge to x, y, respectively. If α ∈ (0, 1), then
(1 − α)x + αy = (1 − α) lim xn + α lim yn = lim ((1 − α)xn + αyn).
As (1 − α)xn + αyn ∈ C ⊂ C̄ for all n ∈ N, and since C̄ is closed, it follows that (1 − α)x + αy ∈ C̄ too.
(4) Suppose that D is dense in X. Let x ∈ X\D. If n ∈ N, then the ball B(x, 1/n) must contain an element dn ∈ D. The sequence (dn)n∈N converges to x because ||x − dn|| < 1/n, n ∈ N. Hence x is a limit point of D, that is, x ∈ D̄.
So X\D ⊂ D̄. Also D ⊂ D̄. Thus X = D ∪ (X\D) ⊂ D̄ ⊂ X, and so X = D̄. Now suppose that X = D̄. Let x ∈ X and ε > 0. If x ∈ X\D = D̄\D, then x is a limit point of D, and so there is a sequence (dn)n∈N in D that converges to x. Thus there is an N such that ||x − dN|| < ε, that is, dN ∈ D ∩ B(x, ε).
On the other hand, if x ∈ D, then x ∈ B(x, ε) ∩ D.
Hence D is dense in X.
Let (xn)n∈N ∈ ℓ1. Then Σ_{n=1}^∞ |xn| converges, and so
lim_{n→∞} |xn| = 0.
Thus there exists an N ∈ N such that |xn| ≤ 1 for all n ≥ N. For all n ≥ N, |xn|^2 = |xn| · |xn| ≤ |xn| · 1 = |xn|. By the Comparison Test, Σ_{n=1}^∞ |xn|^2 converges.
Hence (xn)n∈N ∈ ℓ2.
The inclusion is strict: for example, (1/n)n∈N ∈ ℓ2\ℓ1, since Σ_{n=1}^∞ 1/n^2 converges, while the Harmonic Series Σ_{n=1}^∞ 1/n diverges.
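The convergence/divergence contrast behind this strict inclusion can be observed numerically from partial sums; a small sketch:

```python
# (1/n) is in l2 but not in l1: the partial sums of 1/n**2 stay bounded
# (by pi**2/6 < 2), while the harmonic partial sums exceed any fixed bound.
N = 10 ** 5
sq = sum(1.0 / n ** 2 for n in range(1, N))
harm = sum(1.0 / n for n in range(1, N))
assert sq < 2.0       # bounded: sum 1/n^2 = pi^2/6 ~ 1.645
assert harm > 10.0    # unbounded: harmonic sums grow like log(N)
```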
(ℓ1, ||·||2) is not a Banach space: Let us suppose, on the contrary, that it is a Banach space; we will arrive at a contradiction by exhibiting a Cauchy sequence which is not convergent in (ℓ1, ||·||2).
Consider, for n ∈ N, xn := (1, 1/2, ··· , 1/n, 0, 0, ···) ∈ ℓ1. Then (xn)n∈N converges in ℓ2 to x := (1, 1/2, 1/3, ···) ∈ ℓ2,
because ||xn − x||2^2 = Σ_{k>n} 1/k^2 → 0 as n → ∞.
So (xn)n∈N is a Cauchy sequence in (ℓ2, ||·||2), and so it is also Cauchy in (ℓ1, ||·||2). As we have assumed that (ℓ1, ||·||2) is a Banach space, it follows that the Cauchy sequence (xn)n∈N must be convergent to some element x′ ∈ ℓ1 ⊂ ℓ2. But by the uniqueness of limits (when we consider (xn)n∈N as a sequence in ℓ2), we must have x = x′ ∈ ℓ1, which is false, since the Harmonic Series diverges. This contradiction proves that (ℓ1, ||·||2) is not a Banach space.
Let (an)n∈N be a Cauchy sequence in c0. Then this is also a Cauchy sequence in ℓ∞, and hence convergent to some a ∈ ℓ∞. We'll show that a ∈ c0. We write a = (a(m))m∈N and an = (an(m))m∈N for n ∈ N.
Let ε > 0. Then there exists an N ∈ N such that ||aN − a||∞ < ε/2. In particular, for all m ∈ N, |aN(m) − a(m)| < ε/2. But as aN ∈ c0, we can find an M such that for all m > M, |aN(m)| < ε/2.
Consequently, for m > M, we have |a(m)| ≤ |a(m) − aN(m)| + |aN(m)| < ε/2 + ε/2 = ε.
Thus a ∈ c0 too.
Given ε > 0, let N ∈ N be large enough so that for all n > N, ||xn − x|| < ε. Then for all n > N, we have | ||xn|| − ||x|| | ≤ ||xn − x|| < ε, and so it follows that the sequence (||xn||)n∈N in R is convergent, with limit ||x||.
First consider the case 1 ≤ p < ∞.
(N1) ||x||p = (Σ_{n=1}^∞ |xn|^p)^(1/p) ≥ 0 for all x = (x1, x2, x3, ···) ∈ ℓp.
If ||x||p = 0, then Σ_{n=1}^∞ |xn|^p = 0, so |xn| = 0 for all n, and so x = 0.
(N2) ||α · x||p = (Σ_{n=1}^∞ |αxn|^p)^(1/p) = |α| ||x||p, for x ∈ ℓp, α ∈ K.
(N3) Let x = (x1, x2, ···) and y = (y1, y2, ···) belong to ℓp. Let d ∈ N.
By the Triangle Inequality for the ||·||p-norm on Rd,
(Σ_{n=1}^d |xn + yn|^p)^(1/p) ≤ (Σ_{n=1}^d |xn|^p)^(1/p) + (Σ_{n=1}^d |yn|^p)^(1/p) ≤ ||x||p + ||y||p.
Passing the limit as d tends to ∞ yields ||x + y||p ≤ ||x||p + ||y||p.
Now consider the case p = ∞.
(N1) ||x||∞ = sup_n |xn| ≥ 0 for all x = (x1, x2, x3, ···) ∈ ℓ∞.
If ||x||∞ = 0, then |xn| = 0 for all n, that is, x = 0.
(N2) ||α · x||∞ = sup_n |αxn| = |α| sup_n |xn| = |α| ||x||∞, for x ∈ ℓ∞, α ∈ K.
(N3) Let x = (x1, x2, ···) and y = (y1, y2, ···) belong to ℓ∞.
Then for all k, |xk + yk| ≤ |xk| + |yk| ≤ ||x||∞ + ||y||∞, and so
||x + y||∞ = sup_k |xk + yk| ≤ ||x||∞ + ||y||∞.
From Exercise 1.11, page 17, taking n = 1, (C1[a, b], ||·||1,∞) is a normed space. We show that (C1[a, b], ||·||1,∞) is complete. Let (xn)n∈N be a Cauchy sequence in C1[a, b]. Then ||xn − xm||∞ ≤ ||xn − xm||∞ + ||x′n − x′m||∞ = ||xn − xm||1,∞, and so (xn)n∈N is a Cauchy sequence in (C[a, b], ||·||∞), and hence convergent to, say, x ∈ C[a, b]. Also, ||x′n − x′m||∞ ≤ ||xn − xm||∞ + ||x′n − x′m||∞ = ||xn − xm||1,∞ shows that (x′n)n∈N is a Cauchy sequence in (C[a, b], ||·||∞), and hence convergent to, say, y ∈ C[a, b]. We will now show that x ∈ C1[a, b] and x′ = y. Let t ∈ [a, b]. By the Fundamental Theorem of Calculus,
xn(t) = xn(a) + ∫_a^t x′n(s) ds,
and so, passing the limit as n goes to ∞ (using the uniform convergence of (x′n)n∈N to y), for all t ∈ [a, b],
x(t) = x(a) + ∫_a^t y(s) ds.
By the Fundamental Theorem of Calculus, x′ = y ∈ C[a, b]. So x ∈ C1[a, b]. Finally, we'll show that (xn)n∈N converges to x in C1[a, b]. Let ε > 0, and let N be such that for all m, n > N, ||xn − xm||1,∞ < ε. Then for all t ∈ [a, b] and all m, n > N, we have |xn(t) − xm(t)| + |x′n(t) − x′m(t)| ≤ ||xn − xm||∞ + ||x′n − x′m||∞ = ||xn − xm||1,∞ < ε. Letting m go to ∞, it follows that for all n > N and all t ∈ [a, b], |xn(t) − x(t)| + |x′n(t) − x′(t)| ≤ ε.
As the choice of t ∈ [a, b] was arbitrary, it follows that for all n > N,
||xn − x||∞ + ||x′n − x′||∞ ≤ 2ε, that is, ||xn − x||1,∞ ≤ 2ε.
Let (xn)n∈N be any Cauchy sequence in X. We construct a subsequence (xnk)k∈N inductively, possessing the property that if n ≥ nk, then ||xn − xnk|| < 1/2^k, k ∈ N. Choose n1 large enough so that if n, m ≥ n1, then ||xn − xm|| < 1/2. Suppose xn1, ··· , xnk have been constructed. Choose nk+1 > nk such that if n, m ≥ nk+1, then ||xn − xm|| < 1/2^(k+1). In particular, for n ≥ nk+1, ||xn − xnk+1|| < 1/2^(k+1).
Now define u1 = xn1 and uk+1 = xnk+1 − xnk, k ∈ N.
We have ||uk+1|| = ||xnk+1 − xnk|| < 1/2^k for all k ∈ N. Thus Σ_{k=1}^∞ ||uk|| converges, and so by hypothesis Σ_{k=1}^∞ uk
converges.
But the partial sums of Σ_{k=1}^∞ uk are u1 + u2 + ··· + uk = xnk.
So (xnk)k∈N converges in X, to, say, x ∈ X. As (xnk)k∈N is a convergent subsequence of the Cauchy sequence (xn)n∈N, it now follows that (xn)n∈N is itself convergent with the same limit x. Indeed, given ε > 0, first let N be such that for all n, m > N, ||xn − xm|| < ε/2, and next let nK > N be such that ||xnK − x|| < ε/2, which yields, for all n > N, ||xn − x|| ≤ ||xn − xnK|| + ||xnK − x|| < ε/2 + ε/2 = ε.
(N1) For (x, y) ∈ X × Y, ||(x, y)|| = max{||x||, ||y||} ≥ 0.
If ||(x, y)|| = 0, then 0 ≤ ||x|| ≤ max{||x||, ||y||} = ||(x, y)|| = 0, and so ||x|| = 0, giving x = 0. Similarly, y = 0 too, and so (x, y) = 0 in X × Y.
(N2) For α ∈ K and (x, y) ∈ X × Y, ||α · (x, y)|| = max{||α · x||, ||α · y||} = |α| max{||x||, ||y||} = |α| ||(x, y)||.
(N3) Let (x1, y1), (x2, y2) ∈ X × Y. Then
||x1 + x2|| ≤ ||x1|| + ||x2|| ≤ ||(x1, y1)|| + ||(x2, y2)||, and similarly ||y1 + y2|| ≤ ||(x1, y1)|| + ||(x2, y2)||,
and so max{||x1 + x2||, ||y1 + y2||} ≤ ||(x1, y1)|| + ||(x2, y2)||. Thus
||(x1, y1) + (x2, y2)|| ≤ ||(x1, y1)|| + ||(x2, y2)||.
Hence (x, y) ↦ max{||x||, ||y||}, (x, y) ∈ X × Y, defines a norm on X × Y.
Let ((xn, yn))n∈N be Cauchy in X × Y. As ||x|| ≤ max{||x||, ||y||} = ||(x, y)||, (xn)n∈N is Cauchy in X. As X is Banach, (xn)n∈N converges to some x ∈ X. Similarly, (yn)n∈N converges to some y ∈ Y. Let ε > 0. Then there exists an Nx such that for all n > Nx, ||xn − x|| < ε, and there is an Ny such that for all n > Ny, ||yn − y|| < ε. So with N := max{Nx, Ny}, for all n > N, we have ||xn − x|| < ε and ||yn − y|| < ε. Thus ||(xn, yn) − (x, y)|| = ||(xn − x, yn − y)|| = max{||xn − x||, ||yn − y||} < ε, showing that ((xn, yn))n∈N converges to (x, y) in X × Y. So X × Y is Banach.
Since K is compact in Rd, it is closed and bounded. Let R > 0 be such that for all x ∈ K, ||x||2 ≤ R. In particular, for every x ∈ K ∩ F, we have ||x||2 ≤ R. Thus K ∩ F is bounded. Also, since both K and F are closed, K ∩ F is closed as well. Hence K ∩ F is closed and bounded, and so by Theorem 1.10, page 45, we conclude that K ∩ F is compact.
Clearly Sd−1 is bounded. It is also closed, as we now show. Let (xn)n∈N be a sequence in Sd−1 which converges to L in Rd. Let L = (L1, ··· , Ld) and xn = (xn(1), ··· , xn(d)) for n ∈ N. Then
lim_{n→∞} xn(k) = Lk (k = 1, ··· , d).
Since xn ∈ Sd−1 for each n ∈ N, we have (xn(1))^2 + ··· + (xn(d))^2 = 1. Passing the limit as n → ∞, we obtain L1^2 + ··· + Ld^2 = 1.
Hence L ∈ Sd−1. So Sd−1 is closed. As Sd−1 is closed and bounded, it follows from Theorem 1.10, page 45, that it is compact.
(1)Let (Rn)n∈N be a sequence in O(2).
Writing Rn = [an bn; cn dn], the orthogonality relation Rn^T Rn = I gives
an^2 + cn^2 = 1 and bn^2 + dn^2 = 1.
So each of the sequences (an)n∈N, (bn)n∈N, (cn)n∈N, (dn)n∈N is bounded.
By successively refining subsequences of these sequences, we can choose a sequence of indices n1 < n2 < n3 < ···, such that the sequences (ank)k∈N, (bnk)k∈N, (cnk)k∈N, (dnk)k∈N are convergent, to, say, a, b, c, d, respectively.
Hence (Rnk)k∈N is convergent with the limit R := [a b; c d].
From (Rnk)^T Rnk = I (k ∈ N), it follows that also R^T R = I, that is, R ∈ O(2).
(2) The hyperbolic rotations R(t) = [cosh t sinh t; sinh t cosh t] belong to O(1, 1) because cosh^2 t − sinh^2 t = 1.
But ||R(t)||∞ ≥ |cosh t| = cosh t → ∞ as t → ∞, showing that O(1, 1) is not bounded. Hence O(1, 1) can't be compact (as every compact set is necessarily bounded).
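Assuming the hyperbolic rotations are R(t) = [[cosh t, sinh t], [sinh t, cosh t]] and that O(1, 1) preserves the form J = diag(1, −1), both claims can be spot-checked; a sketch:

```python
import math

def R(t):
    """Hyperbolic rotation matrix (assumed form)."""
    c, s = math.cosh(t), math.sinh(t)
    return [[c, s], [s, c]]

def congruence(M):
    """M^T J M for J = diag(1, -1), written out for a 2x2 matrix M."""
    (a, b), (c, d) = M
    return [[a * a - c * c, a * b - c * d], [b * a - d * c, b * b - d * d]]

J = [[1.0, 0.0], [0.0, -1.0]]
for t in [0.0, 1.0, 5.0]:
    G = congruence(R(t))          # should recover J, by cosh^2 - sinh^2 = 1
    assert all(math.isclose(G[i][j], J[i][j], abs_tol=1e-6)
               for i in range(2) for j in range(2))
assert math.cosh(20.0) > 1e8      # the entries of R(t) blow up: O(1,1) is unbounded
```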
Let K := {0} ∪ {1/n : n ∈ N}. Since K ⊂ [0, 1], clearly K is bounded.
Moreover, R\K = (−∞, 0) ∪ (∪_{n∈N} (1/(n + 1), 1/n)) ∪ (1, ∞).
Thus R\K, being the union of open intervals, is open, that is, K is closed. Since K is closed and bounded, it is compact.
If 1 ∈ C[0, 1] denotes the constant function taking value 1 everywhere, then
and so
So (L2) is violated, showing that S1 is not a linear transformation.
On the other hand, S2 is a linear transformation. For all x1, x2 ∈ C[0, 1],
and so (L1) holds. Moreover, for all α ∈ R and x ∈ C[0, 1] we have
and so (L2) holds as well.
(1) Let α1, α2 ∈ R be such that α1f1 + α2f2 = 0, that is, α1e^(at) cos(bt) + α2e^(at) sin(bt) = 0 for all t ∈ R.
In particular, with t = 0, we obtain α1 = 0. Thus α2e^(at) sin(bt) = 0 for all t ∈ R. With t = π/(2b), we see that α2e^(aπ/(2b)) = 0, and so α2 = 0. Consequently, f1, f2 are linearly independent.
(2) First of all, D is a well-defined map from Sf1,f2 to itself, since the derivative of any linear combination of f1 and f2 is again a linear combination of f1 and f2.
Thus D(Sf1,f2) ⊂ Sf1,f2.
Furthermore, it is clear that D(g1 + g2) = D(g1) + D(g2) for all g1, g2 ∈ C1(R) (and in particular for g1, g2 ∈ Sf1,f2 ⊂ C1(R)), and also D(α · g) = α · D(g) for all α ∈ R and all g ∈ C1(R) (and in particular, for all g ∈ Sf1,f2).
Hence D is a linear transformation from Sf1,f2 to itself.
(3)We have Df1 = aeat cos(bt) – eatb sin(bt) = af1 – bf2, and
Df2 = aeat sin(bt) + eatb cos(bt) = bf1 + af2.
So the matrix of D with respect to the basis B = (f1, f2) is [D]B = [a b; −b a].
(4) As det[D]B = a^2 + b^2 ≠ 0, [D]B is invertible, and
([D]B)^(−1) = (1/(a^2 + b^2)) [a −b; b a].
Hence D is invertible, and the inverse D^(−1) : Sf1,f2 → Sf1,f2 has the matrix [D^(−1)]B (with respect to B) given by [D^(−1)]B = ([D]B)^(−1) found above.
(5) We note that f1 = D(D^(−1)f1), and from [D^(−1)]B we read off D^(−1)f1 = (af1 + bf2)/(a^2 + b^2). So,
by the definition of D,
∫ e^(at) cos(bt) dt = e^(at)(a cos(bt) + b sin(bt))/(a^2 + b^2) + C, C any constant.
Similarly, as D^(−1)f2 = (−bf1 + af2)/(a^2 + b^2), we have
∫ e^(at) sin(bt) dt = e^(at)(a sin(bt) − b cos(bt))/(a^2 + b^2) + C,
C any constant.
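The two antiderivative formulas can be verified by numerical differentiation for sample parameters a, b (our choice); a sketch:

```python
import math

a, b = 1.3, 2.1   # sample parameters, b != 0

def F1(t):
    """Candidate antiderivative of e^{at} cos(bt)."""
    return math.exp(a * t) * (a * math.cos(b * t) + b * math.sin(b * t)) / (a * a + b * b)

def F2(t):
    """Candidate antiderivative of e^{at} sin(bt)."""
    return math.exp(a * t) * (a * math.sin(b * t) - b * math.cos(b * t)) / (a * a + b * b)

h = 1e-6
for t in [0.0, 0.7, 2.0]:
    d1 = (F1(t + h) - F1(t - h)) / (2 * h)   # central difference approximates F1'
    d2 = (F2(t + h) - F2(t - h)) / (2 * h)
    assert abs(d1 - math.exp(a * t) * math.cos(b * t)) < 1e-5
    assert abs(d2 - math.exp(a * t) * math.sin(b * t)) < 1e-5
```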
(1)We have
(As expected, the arc length is simply the length of the line segment [0, 1].)
(2)We have and so
(3) Suppose that f is continuous at 0. Then with ε := 1 > 0, there exists a δ > 0 such that whenever x ∈ C1[0, 1] and ||x − 0|| < δ, we have |f(x) − f(0)| < 1.
For all sufficiently large n, we have ||xn − 0|| < δ.
Hence for such n there must hold that |f(xn) − f(0)| = |f(xn) − 1| < 1.
So for all such n we have
|f(xn)| ≤ |f(xn) − 1| + 1 < 1 + 1 = 2,
which contradicts the unboundedness of (f(xn))n∈N. Hence f is not continuous at 0.
Let x0, x ∈ C1[0, 1]. Using the triangle inequality in (R2, ||·||2), we obtain, for each t,
|√(1 + (x′(t))^2) − √(1 + (x0′(t))^2)| ≤ |x′(t) − x0′(t)| ≤ ||x′ − x0′||∞,
and so |f(x) − f(x0)| ≤ ||x′ − x0′||∞ ≤ ||x − x0||1,∞.
Thus given ε > 0, if we set δ := ε, then we have for all x ∈ C1[0, 1] satisfying ||x − x0||1,∞ < δ that |f(x) − f(x0)| ≤ ||x − x0||1,∞ < δ = ε.
So f is continuous at x0. As the choice of x0 was arbitrary, f is continuous.
Let x0 ∈ X. Given ε > 0, set δ := ε. Then for all x ∈ X satisfying ||x − x0|| < δ, we have | ||x|| − ||x0|| | ≤ ||x − x0|| < δ = ε. Thus ||·|| is continuous at x0. As x0 ∈ X was arbitrary, it follows that ||·|| is continuous on X.
f−1({−1, 1}) = {nπ : n ∈ Z}, f−1({1}) = {2nπ : n ∈ Z}, and f−1([−1, 1]) = R.
Since cos is periodic with period 2π (that is, f(x) = f(x + 2π) for all x ∈ R), we have f(R) = f([0, 2π]) = f([δ, δ + 2π]) = [–1, 1].
(“If” part) Suppose that for every closed F in Y, f–1(F) is closed in X.
Now let V be open in Y. Then Y\V is closed in Y.
Thus f–1(Y\V) = f–1(Y)\f–1(V) = X\f–1(V) is closed in X.
Hence f–1(V) = X\(X\f–1(V)) is open in X.
So for every open V in Y, f–1(V) is open in X.
By Theorem 2.1, page 63, f is continuous on X.
(“Only if” part) Suppose that f is continuous.
Let F be closed in Y, that is, Y\F is open in Y.
Hence f−1(Y\F) = f−1(Y)\f−1(F) = X\f−1(F) is open in X.
Consequently, we have that f–1(F) is closed in X.
If x ∈ (g ∘ f)−1(W), then (g ∘ f)(x) ∈ W, that is, g(f(x)) ∈ W. So f(x) ∈ g−1(W), that is, x ∈ f−1(g−1(W)). Thus (g ∘ f)−1(W) ⊂ f−1(g−1(W)).
If x ∈ f−1(g−1(W)), then f(x) ∈ g−1(W), that is, (g ∘ f)(x) = g(f(x)) ∈ W. Hence x ∈ (g ∘ f)−1(W). So we have f−1(g−1(W)) ⊂ (g ∘ f)−1(W).
Consequently, (g ∘ f)−1(W) = f−1(g−1(W)).
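The set identity just proved can be sanity-checked on finite sets, with functions encoded as dictionaries; a minimal sketch (the data is arbitrary):

```python
# Finite check of (g o f)^{-1}(W) = f^{-1}(g^{-1}(W)), with maps as dicts.
f = {1: 'a', 2: 'b', 3: 'a', 4: 'c'}
g = {'a': 10, 'b': 20, 'c': 10}

def preimage(h, S):
    """h^{-1}(S) for a function h given as a dict."""
    return {x for x, y in h.items() if y in S}

W = {10}
lhs = {x for x in f if g[f[x]] in W}   # (g o f)^{-1}(W), computed directly
rhs = preimage(f, preimage(g, W))      # f^{-1}(g^{-1}(W))
assert lhs == rhs == {1, 3, 4}
```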
(1)True.
Since (−∞, 1) is open in R and f : X → R is continuous, it follows that {x ∈ X : f(x) < 1} = f−1((−∞, 1)) is open in X by Theorem 2.1, page 63.
(2)True.
Because (1, ∞) is open in R, and f : X → R is continuous, it follows by Theorem 2.1, page 63, that {x ∈ X : f(x) > 1} = f−1((1, ∞)) is open in X.
(3)False.
Take for example X = R with the usual Euclidean norm, and consider the continuous function f(x) = x for all x ∈ R. Then {x ∈ X : f(x) = 1} = {1}, which is not open in R.
(4)True.
(−∞, 1] is closed in R because its complement is (1, ∞), which is open in R. As f : X → R is continuous, {x ∈ X : f(x) ≤ 1} = f−1((−∞, 1]) is closed in X by Corollary 2.1, page 64.
(5)True.
Since {1} is closed in R and since f : X → R is continuous, it follows by Corollary 2.1, page 64, that {x ∈ X : f(x) = 1} = f–1{1} is closed in X.
(6)True.
Each of the sets f–1{1} and f–1{2} are closed, and so their finite union, namely {x ∈ X : f(x) = 1 or 2} is closed as well.
(7)False.
Take for example X = R with the usual Euclidean norm, and consider the continuous function f(x) = 1 (x ∈ R). Then {x ∈ X : f(x) = 1} = R, which is not bounded, and hence can’t be compact.
For all x ∈ X, we have f(2x) = −f(x), and so f(x) = −f(x/2), whence by induction f(x) = (−1)^n f(x/2^n) for all n ∈ N, that is, (−1)^n f(x) = f(x/2^n).
Since the sequence (x/2^n)n∈N converges to 0, it follows by the continuity of f that (f(x/2^n))n∈N converges to f(0).
So we obtain that ((−1)^n f(x))n∈N is convergent with limit f(0). Thus the subsequence (f(x))n∈N = ((−1)^(2n) f(x))n∈N of ((−1)^n f(x))n∈N is also convergent with limit f(0). Hence f(x) = f(0) for all x ∈ X. As f(0) = f(2 · 0) = −f(0), it follows that f(0) = 0. Hence f(x) = 0 for all x ∈ X. So if f is continuous and satisfies the given identity, then it must be the constant function x ↦ 0 : X → Y.
Conversely, the constant function x ↦ 0 : X → Y is indeed continuous, and also f(2x) + f(x) = 0 + 0 = 0 for all x ∈ X.
The determinant of M = [mij] is given by the sum of expressions of the type
±m1p(1) m2p(2) m3p(3) ··· mnp(n),
where p : {1, 2, 3, ···, n} → {1, 2, 3, ···, n} is a permutation (and the sign is the signature of p). Since each of the maps M ↦ m1p(1) m2p(2) m3p(3) ··· mnp(n) is easily seen to be continuous using the characterisation of continuous functions provided by Theorem 2.3, page 64, it follows that their linear combination det is also continuous.
{0} is closed in R, and so its inverse image det–1{0} = {M ∈ Rn×n : det M = 0} under the continuous map det is also closed. Thus its complement, namely the set {M ∈ Rn×n : det M ≠ 0}, is open. But this is precisely the set of invertible matrices, since M ∈ Rn×n is invertible if and only if det M ≠ 0.
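The openness of the set of invertible matrices can be made concrete numerically. The following is a small sketch (the matrix M and the perturbation scale are arbitrary choices, not taken from the exercise): small perturbations of an invertible matrix keep the determinant bounded away from zero.

```python
import numpy as np

# An arbitrarily chosen invertible matrix: det M = 1*4 - 2*3 = -2 != 0.
M = np.array([[1.0, 2.0],
              [3.0, 4.0]])

# Perturb M by matrices of small norm; since det is continuous, the
# determinant of the perturbed matrix stays bounded away from 0,
# so every nearby matrix is still invertible.
rng = np.random.default_rng(0)
dets = []
for _ in range(100):
    E = 0.01 * rng.standard_normal((2, 2))  # a small perturbation
    dets.append(np.linalg.det(M + E))

min_abs_det = min(abs(d) for d in dets)  # stays well away from 0
```

This illustrates the statement of the exercise: a whole ball around M consists of invertible matrices.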
We saw in Exercise 1.21, page 21, that a singleton set in any normed space is closed. So {0} is closed in Rm. As the linear transformation TA : Rn → Rm is continuous, the inverse image of {0} under TA, namely TA–1({0}) = {x ∈ Rn : Ax = 0} = ker A, is closed in Rn.
Let V be a subspace of Rn, and let {v1, ···, vk} be a basis for V. Extend this to a basis {v1, ···, vk, vk+1, ···, vn} for Rn. By using the Gram–Schmidt orthogonalisation procedure, we can find an orthonormal set of vectors {u1, ···, un} such that for each j ∈ {1, ···, n}, the span of the vectors v1, ···, vj coincides with the span of u1, ···, uj. Now define A ∈ R(n–k)×n to be the matrix whose rows are the transposes of uk+1, ···, un.
It is clear from the orthonormality of the ujs that Au1 = ··· = Auk = 0, and so it follows that also any linear combination of u1, ···, uk lies in the kernel of A. In other words, V ⊂ ker A.
On the other hand, if x = α1u1 + ··· + αnun, where α1, ···, αn are scalars, and if Ax = 0, then it follows that αk+1 = ··· = αn = 0.
So x = α1u1 + ··· + αkuk ∈ V. Hence ker A ⊂ V.
Consequently V = ker A, and by the result of the previous exercise, it now follows that V is closed.
(1)The linearity of T follows immediately from the properties of the Riemann integral. Continuity follows from the straightforward estimate
(2)The partial sums sn of the series converge to f. Thus, since the continuous map T preserves convergent sequences, it follows that
We have for all t ∈ R that
|(f ∗ g)(t)| = |∫R f(τ)g(t – τ) dτ| ≤ ∫R |f(τ)| |g(t – τ)| dτ ≤ ||g||∞ ||f||1.
Thus ||f ∗ g||∞ ≤ ||g||∞ ||f||1 for all g ∈ L∞(R). So f∗ is well-defined. Linearity is easy to see. From the above estimate, it follows that the linear transformation f∗ is continuous as well.
Consider the reflection map R : L2(R) → L2(R) given by (Rf)(x) = f(–x), x ∈ R. Then it is straightforward to check that R ∈ L(L2(R)), and moreover it is continuous since ||Rf||2 = ||f||2 for all f ∈ L2(R). Clearly Y = ker(I – R), and so, being the inverse image of the closed set {0} under the continuous map I – R, it follows that Y is closed.
For all x = (an)n∈N ∈ ℓ2, we have ||Λx||2^2 = ∑n∈N |λn|^2 |an|^2 ≤ (sup{|λn| : n ∈ N})^2 ||x||2^2,
and so Λ ∈ CL(ℓ2) and ||Λ|| ≤ sup{|λn| : n ∈ N}.
Moreover, for en := (0, ···, 0, 1, 0, ···) ∈ ℓ2 (the sequence with all terms equal to 0 and nth term equal to 1), we have
|λn| = ||Λen||2 ≤ ||Λ|| ||en||2 = ||Λ||
for all n, and so ||Λ|| is an upper bound for {|λn| : n ∈ N}. Hence ||Λ|| ≥ sup{|λn| : n ∈ N}.
From the above, it now follows that ||Λ|| = sup{|λn| : n ∈ N}.
If λn = 1 – 1/n, n ∈ N, then ||Λ|| = sup{1 – 1/n : n ∈ N} = 1.
Suppose that x = (an)n∈N ∈ ℓ2 is such that ||x||2 ≤ 1 and ||Λx||2 = ||Λ|| = 1.
If 0 = a2 = a3 = ···, then Λx = 0 (as λ1 = 0), and this contradicts the fact that ||Λx||2 = 1.
So at least one of the terms a2, a3, ··· must be nonzero. But then
||Λx||2^2 = ∑n≥2 (1 – 1/n)^2 |an|^2 < ∑n≥2 |an|^2 ≤ ||x||2^2 ≤ 1,
a contradiction. So the operator norm is not attained for this particular Λ.
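The gap between the operator norm and what unit vectors actually achieve can be seen numerically. A minimal sketch, assuming the diagonal terms λn = 1 − 1/n as above: applying Λ to the unit vectors e_n gives values approaching the norm 1 but never reaching it.

```python
# Diagonal operator on l^2 with lambda_n = 1 - 1/n. For the unit vector
# e_n we have ||Lambda e_n||_2 = |lambda_n| = 1 - 1/n, which approaches
# the operator norm 1 as n grows, but is always strictly below it.
def lam(n):
    return 1.0 - 1.0 / n

values = [abs(lam(n)) for n in (1, 10, 100, 10000)]
# values climb towards 1 (the operator norm) without attaining it
```

This mirrors the proof: sup{1 − 1/n} = 1, yet no single vector realises it.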
Let x = (xn)n∈N ∈ ℓp, and let ε > 0.
Then there exists an N such that ∑k>N |xk|^p < ε^p. Let sn := ∑k=1..n xkek.
Then for n > N, x – sn = (0, ···, 0, xn+1, xn+2, xn+3, ···).
So ||x – sn||p^p = ∑k>n |xk|^p < ε^p, giving ||x – sn||p < ε.
So (sn)n∈N converges in ℓp to x, that is, x = ∑n∈N xnen.
The map φn : x = (x1, x2, x3, ···) ↦ xn : ℓp → K is easily seen to be linear.
It is continuous since for all x ∈ ℓp, |φn(x)| = |xn| = (|xn|^p)^(1/p) ≤ (∑k∈N |xk|^p)^(1/p) = ||x||p.
If x = ∑i∈N ξiei = ∑i∈N ηiei, where the ξis and ηis are scalars, then applying φn to both expressions gives ξn = φn(x) = ηn.
As the choice of n was arbitrary, ξn = ηn for all n.
(1)Let x = (xn)n∈N ∈ ℓ∞. Then for all n ∈ N, |xn| ≤ ||x||∞.
Thus for all n ∈ N, |(Ax)n| = |x1 + ··· + xn|/n ≤ ||x||∞.
Consequently Ax ∈ ℓ∞. So A is a well-defined map.
The linearity is easy to check.
Also, we see for all x ∈ ℓ∞ that ||Ax||∞ = sup{|(Ax)n| : n ∈ N} ≤ ||x||∞.
So A ∈ CL(ℓ∞), and ||A|| ≤ 1. Also, with 1 := (1, 1, 1, ···) ∈ ℓ∞, we have A1 = 1, and so ||A|| ≥ ||A1||∞/||1||∞ = 1.
Consequently, ||A|| = 1.
(2)Let x = (xn)n∈N ∈ c, and let its limit be denoted by L.
We’ll show that Ax ∈ c as well.
We will prove that Ax is convergent with the same limit L! (Intuitively, this makes sense since for large n, the xns are all close to L, and the average of these is approximately L: the first few terms do not “contribute much” once we average over a large collection.)
Let ε > 0. Then there exists an N1 ∈ N such that for all n > N1, |xn – L| < ε/2. Since (xn)n∈N is convergent, it is bounded, and so there exists an M > 0 such that for all n ∈ N, |xn| ≤ M.
Choose N ∈ N such that N > max{N1, 2N1(M + |L|)/ε}.
(This ghastly choice of N is arrived at by working backwards. Since we wish to make |(Ax)n – L| less than ε for n > N, we manipulate this, as shown in the chain of inequalities below, and then choose N large enough to achieve this.)
So N > N1 and N1(M + |L|)/N < ε/2. Then for all n > N, we have:
|(Ax)n – L| = |(x1 + ··· + xn)/n – L| ≤ (1/n) ∑k=1..N1 |xk – L| + (1/n) ∑k=N1+1..n |xk – L| ≤ N1(M + |L|)/n + ε/2 < ε/2 + ε/2 = ε.
So Ax = ((x1 + ··· + xn)/n)n∈N is a convergent sequence with limit L.
Hence Ax ∈ c. Consequently Ac ⊂ c, and c is an invariant subspace of A.
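The fact that the averaging operator preserves limits can be checked numerically. A small sketch with the convergent test sequence xn = 1 + (−1)^n/n (an arbitrary choice with limit 1):

```python
# The averaging (Cesaro) operator A sends (x_n) to the running means
# s_n = (x_1 + ... + x_n)/n. If x_n -> L, then (Ax)_n -> L as well.
def cesaro(xs):
    out, total = [], 0.0
    for n, x in enumerate(xs, start=1):
        total += x
        out.append(total / n)
    return out

N = 100000
xs = [1.0 + (-1) ** n / n for n in range(1, N + 1)]  # converges to 1
means = cesaro(xs)
err = abs(means[-1] - 1.0)  # the averages approach the same limit 1
```

Here the averages converge even faster than the sequence itself, since the oscillating terms largely cancel in the sums.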
(If part:) Since inf{|λn| : n ∈ N} > 0, we have |λk| ≥ inf{|λn| : n ∈ N} > 0, and so λk ≠ 0 for all k.
Moreover, sup{1/|λn| : n ∈ N} < ∞, and so V : ℓ2 → ℓ2 given by V(an)n∈N = (an/λn)n∈N
belongs to CL(ℓ2). Moreover for all (an)n∈N ∈ ℓ2 we have VΛ(an)n∈N = (an)n∈N = ΛV(an)n∈N,
and so VΛ = I = ΛV. Hence Λ is invertible in CL(ℓ2), with Λ–1 = V.
(Only if part:) Let Λ be invertible in CL(ℓ2). Then there exists a Λ–1 ∈ CL(ℓ2) such that Λ–1Λ = I = ΛΛ–1. So ||x||2 = ||Λ–1Λx||2 ≤ ||Λ–1|| ||Λx||2, for all x ∈ ℓ2.
Hence ||Λx||2 ≥ ||x||2/||Λ–1|| for all x ∈ ℓ2. So with x := ek (kth term 1, others 0),
|λk| = ||Λek||2 ≥ 1/||Λ–1|| > 0 for all k ∈ N.
Thus inf{|λn| : n ∈ N} ≥ 1/||Λ–1|| > 0.
We have tr(AB) = ∑i=1..n ∑j=1..n aij bji.
Similarly, tr(BA) = ∑j=1..n ∑i=1..n bji aij = tr(AB).
(1)If there exist matrices A, B such that AB – BA = I, then
n = tr(I) = tr(AB – BA) = tr(AB) – tr(BA) = 0,
a contradiction.
(2)If n = 1, then AB^n – B^nA = AB – BA = I = 1 · B^0 = nB^(n–1).
If for some n ∈ N, we have AB^n – B^nA = nB^(n–1), then
AB^(n+1) – B^(n+1)A = (AB^n – B^nA)B + B^n(AB – BA) = nB^(n–1)B + B^nI = (n + 1)B^n,
and so the result follows by induction.
Suppose that AB – BA = I. Then for all n ∈ N, AB^n – B^nA = nB^(n–1). Taking the operator norm on both sides yields
n||B^(n–1)|| = ||AB^n – B^nA|| ≤ 2||A|| ||B|| ||B^(n–1)||.    (7.3)
We claim that B^(n–1) ≠ 0 for all n ∈ N. Indeed, if n = 1, then B^0 := I ≠ 0. If B^(n–1) ≠ 0 for some n ∈ N, then B^n = 0 gives the contradiction that
0 = AB^n – B^nA = nB^(n–1) ≠ 0,
and so we must have B^n ≠ 0 too. By induction, our claim is proved. Thus in (7.3), we may cancel ||B^(n–1)|| > 0 on both sides of the inequality, obtaining n ≤ 2||A|| ||B|| for all n ∈ N, which is absurd. Consequently, our original assumption that AB − BA = I must be false.
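The trace obstruction used in part (1) is easy to witness numerically: whatever matrices A and B one picks, the commutator AB − BA has trace zero, while tr(I) = n. A quick sketch with random matrices:

```python
import numpy as np

# tr(AB) = tr(BA), hence tr(AB - BA) = 0 for any square A, B; so
# AB - BA = I is impossible in R^{n x n}, since tr(I) = n > 0.
rng = np.random.default_rng(1)
n = 4
A = rng.standard_normal((n, n))
B = rng.standard_normal((n, n))

commutator = A @ B - B @ A
trace_comm = np.trace(commutator)  # zero up to floating-point rounding
```

No choice of A and B can make the commutator's trace equal n, which is exactly the finite-dimensional impossibility argument.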
(3)If Ψ ∈ C∞(R), then (with B the operator of multiplication by the independent variable and A the differentiation operator)
((AB − BA)Ψ)(x) = (xΨ(x))′ − xΨ′(x) = Ψ(x) + xΨ′(x) − xΨ′(x) = Ψ(x), x ∈ R,
and so AB − BA = I.
Solution to Exercise 2.23, page 87
(1)For x = (x1, x2) ∈ R2, we have, using the Cauchy-Schwarz inequality, that
So
By the Neumann Series Theorem, (I − K)−1 exists in CL(R2).
So there is a unique solution x ∈ R2 to (I − K)x = y, given by x = (I − K)−1y.
(2)We have and so
Thus
(3)A computer program yielded the following numerical values:
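The original numerical values are not reproduced here, but the computation in part (3) can be sketched as follows. This is a minimal illustration, assuming a made-up matrix K with ||K|| < 1 in place of the exercise's operator: the Neumann series partial sums ∑ K^k y converge to the unique solution of (I − K)x = y.

```python
import numpy as np

# Solving (I - K)x = y via the Neumann series (I - K)^{-1} = I + K + K^2 + ...,
# valid when ||K|| < 1. K here is an arbitrary small-norm example matrix,
# not the operator from the exercise.
K = np.array([[0.1, 0.2],
              [0.0, 0.3]])
y = np.array([1.0, 2.0])

x = np.zeros(2)
term = y.copy()          # K^0 y
for _ in range(100):     # accumulate the partial sums sum_k K^k y
    x = x + term
    term = K @ term

residual = np.linalg.norm((np.eye(2) - K) @ x - y)  # essentially zero
```

Because the spectral radius of K is well below 1, the partial sums converge geometrically, which is why a modest number of terms already gives machine-precision accuracy.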
Solution to Exercise 2.24, page 88
If n = 1, then (I − A)P1 = (I − A)(I + A)(I + A^2) = (I − A^2)(I + A^2) = I − A^4 = I − A^(2^(1+1)).
If the claim is true for some k ∈ N, that is, (I − A)Pk = I − A^(2^(k+1)), then
(I − A)Pk+1 = (I − A)Pk(I + A^(2^(k+1))) = (I − A^(2^(k+1)))(I + A^(2^(k+1))) = I − A^(2^(k+2)).
So the claim follows by induction for all n ∈ N.
(I − A^(2^(n+1)))n∈N converges to I in L(X) since ||A|| < 1 and ||A^(2^(n+1))|| ≤ ||A||^(2^(n+1)) → 0 as n → ∞.
Also, since ||A|| < 1, I − A is invertible in CL(X). We have
(I − A)−1(I − A^(2^(n+1))) = (I − A)−1(I − A)Pn = Pn,
and so (Pn)n∈N = ((I − A)−1(I − A^(2^(n+1))))n∈N is convergent with limit (I − A)−1.
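The doubling products Pn converge to (I − A)−1 extremely fast, since the error term A^(2^(n+1)) squares at each step. A small sketch with an arbitrary small-norm matrix A:

```python
import numpy as np

# For ||A|| < 1, P_n = (I + A)(I + A^2)(I + A^4)...(I + A^{2^n}) satisfies
# (I - A) P_n = I - A^{2^{n+1}}, so P_n -> (I - A)^{-1}. The matrix A
# below is an arbitrary example with small norm.
A = np.array([[0.2, 0.1],
              [0.0, 0.3]])
I = np.eye(2)

P = I + A
power = A @ A            # holds A^{2^k}, starting at k = 1
for _ in range(6):       # multiply in (I + A^2), (I + A^4), ..., (I + A^64)
    P = P @ (I + power)
    power = power @ power

error = np.linalg.norm(P - np.linalg.inv(I - A))  # negligible
```

Six factors already push the error term to A^128, far below machine precision, illustrating the quadratic convergence of the product formula.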
Solution to Exercise 2.25, page 88
(1)Let T0 ∈ GL(X). Then T0−1 ∈ CL(X), and also r := ||T0−1|| ≠ 0.
If T ∈ CL(X) satisfies ||T − T0|| < 1/r, then ||(T − T0)T0−1|| ≤ ||T − T0|| ||T0−1|| < 1,
and so by the Neumann Series Theorem, I + (T − T0)T0−1 belongs to GL(X).
But as T0 ∈ GL(X) too, it now follows that T = (I + (T − T0)T0−1)T0 ∈ GL(X).
This completes the proof that GL(X) is an open subset of CL(X): the open ball of radius 1/r centred at T0 is contained in GL(X).
(2)Let T0 ∈ GL(X) and ε > 0. Set δ := min{1/(2r), ε/(2r^2)}, where again r := ||T0−1||.
Let T ∈ CL(X) be such that ||T − T0|| < δ.
Then in particular ||T − T0|| < 1/r, and so by part (1), T ∈ GL(X), with T−1 = T0−1(I + (T − T0)T0−1)−1.
Moreover, we have ||(T − T0)T0−1|| ≤ ||T − T0|| r < δr ≤ 1/2.
Thus, using the estimate from the Neumann Series Theorem,
||T−1 − T0−1|| = ||T0−1((I + (T − T0)T0−1)−1 − I)|| ≤ r · ||(T − T0)T0−1||/(1 − ||(T − T0)T0−1||) ≤ 2r^2 ||T − T0|| < 2r^2 δ ≤ ε.
Solution to Exercise 2.26, page 92
A2 = B2 = 0, and so A, B are nilpotent.
Hence and
We note that
Also,
We have and
Thus
and so
Solution to Exercise 2.27, page 94
Suppose that the Banach space X has an infinite countable Hamel basis {x1, x2, x3, ···}. We can ensure that for all n ∈ N, we have ||xn|| = 1. Let Fn := span{x1, x2, ···, xn}. Then each Fn is a finite dimensional normed space (with the induced norm from X), and so it is a Banach space. It follows that Fn is a closed subspace of X. By the Baire Lemma, there is an n ∈ N such that Fn contains an open set U, and in particular, an open ball B(x, 2r) for some x ∈ Fn and r > 0. The vector y := rxn+1 + x belongs to B(x, 2r) since ||y − x|| = ||rxn+1|| = r < 2r. Since y, x ∈ B(x, 2r) ⊂ Fn, and as Fn is a subspace, we conclude that (y − x)/r ∈ Fn too, that is, xn+1 ∈ Fn = span{x1, ···, xn}, a contradiction.
Solution to Exercise 2.28, page 96
In light of the Open Mapping Theorem, such a function must necessarily be nonlinear. If the function is constant on an open interval I, then the image f(I) will be a singleton, which is not open. The following function does the job:
f(x) = x + 1 for x < −1, f(x) = 0 for −1 ≤ x ≤ 1, and f(x) = x − 1 for x > 1.
If I := (−1, 1), then f(I) = {0}, which is not open. f is surjective and continuous, and its graph is depicted in the following picture.
Solution to Exercise 2.29, page 96
From Exercise 1.38, page 44, X × Y is a Banach space. Since G(T) is a closed subspace of the Banach space X × Y, it is a Banach space too. Let us now consider the map p : G(T) → X defined by p(x, Tx) = x for x ∈ X. Then p is a linear transformation:
p(α(x, Tx)) = p(αx, T(αx)) = αx = αp(x, Tx) and p((x1, Tx1) + (x2, Tx2)) = p(x1 + x2, T(x1 + x2)) = x1 + x2 = p(x1, Tx1) + p(x2, Tx2),
for α ∈ K, x, x1, x2 ∈ X. Moreover, p is continuous because ||p(x, Tx)|| = ||x|| ≤ ||x|| + ||Tx|| = ||(x, Tx)||.
p is also injective since if p(x, Tx) = x = 0, then (x, Tx) = (0, T0) = (0, 0).
Furthermore, if x ∈ X, then x = p(x, Tx), showing that p is surjective too.
Thus, p ∈ CL(G(T), X) is bijective, and hence invertible in CL(G(T), X), with inverse p−1 ∈ CL(X, G(T)). Hence for all x ∈ X,
||Tx|| ≤ ||x|| + ||Tx|| = ||(x, Tx)|| = ||p−1(x)|| ≤ ||p−1|| ||x||,
showing that T ∈ CL(X, Y).
Solution to Exercise 2.30, page 102
We have
Solution to Exercise 2.31, page 102
(1)We know that σ(T) ⊂ {λ ∈ C : |λ| ≤ ||T||}, and so ||T|| is an upper bound for {|λ| : λ ∈ σ(T)}. Thus rσ(T) ≤ ||T||.
(2)We have σ(TA) = {eigenvalues of A} = {1}, and so rσ(TA) = 1.
On the other hand, with we have ||x1||2 = 1, and so
Solution to Exercise 2.32, page 103
Suppose that λ^2 ∉ σ(T^2). Then λ^2 ∈ ρ(T^2), that is, λ^2I − T^2 is invertible in CL(X). From the identity λ^2I − T^2 = (λI − T)(λI + T) = (λI + T)(λI − T), we then obtain (λI − T)P = I = Q(λI − T), where P := (λI + T)(λ^2I − T^2)−1 and Q := (λ^2I − T^2)−1(λI + T).
But then Q = QI = Q(λI − T)P = IP = P, and so P = Q ∈ CL(X) is the inverse of λI − T, a contradiction to the fact that λ ∈ σ(T).
Solution to Exercise 2.33, page 103
If en ∈ ℓ2 denotes the sequence with the nth term equal to 1, and all others equal to 0, then Λen = λnen, and so each λn is an eigenvalue of Λ with eigenvector en ≠ 0. Thus {λn : n ∈ N} ⊂ σp(Λ).
Next we will show that σ(Λ) ⊂ {λn : n ∈ N} ∪ {0}. To this end, suppose that μ ∉ {λn : n ∈ N} ∪ {0}. Then we claim that μI − Λ is invertible in CL(ℓ2). By a previous exercise, we know that in order to show the invertibility of the diagonal operator μI − Λ (with diagonal terms μ − λn), it is enough to show that inf{|μ − λn| : n ∈ N} > 0. To see this, note that since (λn)n∈N converges to 0, there is an N large enough such that |λn| < |μ|/2 for all n > N, and so
|μ − λn| ≥ |μ| − |λn| > |μ|/2 for all n > N.
But also |μ − λ1|, ···, |μ − λN| are all positive, so that we do have inf{|μ − λn| : n ∈ N} > 0.
Hence μI − Λ ∈ CL(ℓ2) is invertible in CL(ℓ2), that is, μ ∈ ρ(Λ).
Thus σ(Λ) ⊂ {λn : n ∈ N} ∪ {0}.
But the spectrum σ(Λ) is closed, and since it contains σp(Λ) ⊃ {λn : n ∈ N}, it must contain the limit of (λn)n∈N, namely 0.
So we also obtain {λn : n ∈ N} ∪ {0} ⊂ σ(Λ).
Thus σ(Λ) = {λn : n ∈ N} ∪ {0}.
Consequently, {λn : n ∈ N} ⊂ σp(Λ) ⊂ {λn : n ∈ N} ∪ {0} = σ(Λ).
Solution to Exercise 2.34, page 103
(1)Suppose that λ ∈ σap(T). Then there exists a sequence (xn)n∈N of vectors in X such that ||xn|| = 1 for all n ∈ N, and ((T − λI)xn)n∈N converges to 0.
We will just prove that λ ∉ ρ(T), and so by definition it will follow that then λ ∈ σ(T). Suppose, on the contrary, that λ ∈ ρ(T). Then T − λI is invertible in CL(X). Thus
1 = ||xn|| = ||(T − λI)−1(T − λI)xn|| ≤ ||(T − λI)−1|| ||(T − λI)xn|| → 0 as n → ∞,
a contradiction. Consequently, λ ∉ ρ(T), that is, λ ∈ σ(T).
(2)For k ∈ N, let ek denote the sequence in ℓ2 whose kth term is 1 and all other terms are zeros. Then ||ek||2 = 1, and Λek = λkek, so that ||(Λ − λkI)ek||2 = 0,
that is, λk ∈ σap(Λ). Consequently, {λk : k ∈ N} ⊂ σap(Λ).
Solution to Exercise 2.35, page 103
Let λ ∈ C and Ψ ∈ DQ be such that xΨ(x) = λΨ(x) for almost all x ∈ R, that is, (x − λ)Ψ(x) = 0 for almost all x ∈ R. Now x − λ ≠ 0 for all x ∈ R\{λ}. Hence for almost all x ∈ R, Ψ(x) = 0, that is, Ψ = 0 in L2(R). Consequently, λ can’t be an eigenvalue of Q, and so σp(Q) = ∅.
Solution to Exercise 2.36, page 105
For simplicity we’ll assume K = R. If a = (an)n∈N ∈ ℓ1, then define the functional φa ∈ CL(c0, R) = (c0)′ by
Then a ↦ φa : ℓ1 → (c0)′ is an injective linear transformation, and it is also continuous because |φa(b)| ≤ ||b||∞ ||a||1 for all b ∈ c0, and so ||φa|| ≤ ||a||1. To see the surjectivity of this map, we need to show that given φ ∈ (c0)′, there exists an a ∈ ℓ1 such that φ = φa. Let en ∈ c0 be the sequence with nth term 1 and all others 0. Set a = (φ(e1), φ(e2), φ(e3), ···). We’ll show that a ∈ ℓ1, and that φ = φa.
Define the scalars αn, n ∈ N, by αn := sign(φ(en)) if φ(en) ≠ 0, and αn := 1 otherwise.
Then for all n we have αnφ(en) = |φ(en)|.
We have ||(α1, ···, αn, 0, ···)||∞ ≤ 1, and so
|φ(e1)| + ··· + |φ(en)| = φ(α1e1 + ··· + αnen) ≤ ||φ|| ||(α1, ···, αn, 0, ···)||∞ ≤ ||φ||
for all n ∈ N. Hence a ∈ ℓ1.
Finally, we need to show φ = φa. Let b = (bn)n∈N ∈ c0 and ε > 0. Then there exists an N such that for all n > N, |bn| < ε. Set b^(N) := (b1, ···, bN, 0, ···) ∈ c0. Then ||b − b^(N)||∞ = ||(0, ···, 0, bN+1, ···)||∞ ≤ ε. Moreover, we have that
φ(b^(N)) = b1φ(e1) + ··· + bNφ(eN) = φa(b^(N)).
Hence
|φ(b) − φa(b)| ≤ |φ(b) − φ(b^(N))| + |φa(b^(N)) − φa(b)| ≤ ||φ|| ||b − b^(N)||∞ + ||φa|| ||b^(N) − b||∞ ≤ (||φ|| + ||a||1)ε.
As the choice of ε > 0 was arbitrary, it follows that φ(b) = φa(b) for all b ∈ c0, that is, φ = φa.
Solution to Exercise 2.37, page 105
(1)BV [a, b] is a vector space: We prove that BV [a, b] is a subspace of the vector space R[a,b] of all real valued functions on [a, b] with pointwise operations.
(S1)The zero function 0 belongs to BV [a, b].
Indeed, for any partition a = t0 < t1 < ··· < tn = b, we have ∑i=1..n |0(ti) − 0(ti−1)| = 0, and so var(0) = 0 < ∞.
(S2)Let μ1, μ2 ∈ BV [a, b]. Then we have
and so μ1 + μ2 ∈ BV [a, b].
(S3)Let α ∈ R and μ ∈ BV [a, b]. Then
and so αμ ∈ BV [a, b].
(2)We show that μ ||μ|| defines a norm on BV [a, b].
(N1)If μ ∈ BV [a, b], then ||μ|| = |μ(a)| + var(μ) ≥ 0.
Let μ ∈ BV [a, b] be such that ||μ|| = 0. Then var(μ) = 0, and |μ(a)| = 0.
Hence μ(a) = 0. Suppose that μ ≠ 0. Then there exists a c ∈ [a, b] such that μ(c) ≠ 0. Clearly c ≠ a, since μ(a) = 0. Now consider the partition consisting of the points a, c, b (or just a, c if c = b).
Then var(μ) ≥ |μ(c) − μ(a)| = |μ(c)| > 0,
a contradiction. Hence μ = 0.
(N2)Let α ∈ R and μ ∈ BV [a, b]. Then αμ ∈ BV [a, b], and we have seen earlier that var(αμ) = |α| var(μ). Hence ||αμ|| = |αμ(a)| + var(αμ) = |α|(|μ(a)| + var(μ)) = |α| ||μ||.
(N3)Let μ1, μ2 ∈ BV [a, b]. Then μ1 + μ2 ∈ BV [a, b], and we’ve seen above that var(μ1 + μ2) ≤ var(μ1) + var(μ2). Thus ||μ1 + μ2|| = |μ1(a) + μ2(a)| + var(μ1 + μ2) ≤ ||μ1|| + ||μ2||.
Consequently BV [a, b] is a normed space with the norm ||·||.
(3)Let x ∈ C[a, b] and μ ∈ BV [a, b]. Given > 0, let δ > 0 be such that for every partition P satisfying δP < δ, we have
Then
As the choice of > 0 was arbitrary, it follows that
(4)For all x ∈ C[a, b], |φµ(x)| ≤ ||x||∞ var(μ).
From the linearity of the Riemann–Stieltjes integral, it follows that φµ is a linear transformation from C[a, b] to R. From the above estimate, we also see that φµ is continuous. Consequently φµ ∈ CL(C[a, b], R) = (C[a, b])′.
Moreover ||φµ|| ≤ var(μ).
(5)We will show that (x ↦ x(a)) = φµ, where μ(a) := 0 and μ(t) := 1 for t ∈ (a, b].
First of all, μ ∈ BV [a, b], since var(μ) = 1 < ∞.
Let x ∈ C[a, b], and ε > 0. Let δ > 0 be such that for all t with |t − a| < δ, we have |x(t) − x(a)| < ε.
Then for all partitions P : a = t0 < t1 < ··· < tn = b with mesh δP < δ, the corresponding Riemann–Stieltjes sum S(P) satisfies
|S(P) − x(a)| = |x(t1∗)(μ(t1) − μ(t0)) − x(a)| = |x(t1∗) − x(a)| < ε,
where the last inequality follows from the fact that |a − t1∗| ≤ δP < δ.
So φµ(x) = x(a). (μ is not unique: for any c ∈ R, μ + c also works!)
Solution to Exercise 2.38, page 109
On the one dimensional subspace Y :=span{x∗} ⊂ X, we have a continuous linear map φ : Y → C. (Simply define φ(αx∗) = α, then |φ(αx∗)| = |α| = ||αx∗||/||x∗||, and so ||φ|| = 1/||x∗|| < ∞.) By the Hahn-Banach Theorem, there exists an extension φ∗ ∈ CL(X, C) of φ, and so φ∗(x∗) = φ(x∗) = 1 ≠ 0. (Alternatively, one could just use Corollary 2.7, page 109, with x = x∗ and y = 0: there exists a functional φ∗ ∈ CL(X, C) such that φ∗(x∗) ≠ φ∗(0) = 0.)
Solution to Exercise 2.39, page 115
Consider the collection P of all linearly independent subsets S ⊂ X. Consider the partial order which is simply set inclusion ⊂. Then every chain in P has an upper bound, as explained below.
If C is a chain in P, then U := ∪S∈C S is an upper bound of C.
We just need to show the linear independence of this set U. To this end, let v1, ···, vn be any set of vectors from U for which there exist scalars α1, ···, αn in F such that α1v1 + ··· + αnvn = 0. Let the sets S1, ···, Sn ∈ C be such that v1 ∈ S1, ···, vn ∈ Sn. As C is a chain, we can arrange the finitely many Sks in “ascending order”, and there exists a k∗ ∈ {1, ···, n} such that S1, ···, Sn ⊂ Sk∗. Then v1, ···, vn ∈ Sk∗. But by the linear independence of Sk∗, we conclude that α1 = ··· = αn = 0. Thus U is linearly independent, showing that every chain in P has an upper bound.
By Zorn’s Lemma, P has a maximal element B. We claim that span B = X. For if not, then there exists an x ∈ X\span B. We will show B′ := B ∪ {x} is linearly independent. Suppose that α, α1, ···, αn ∈ K and v1, ···, vn ∈ B are such that αx + α1v1 + ··· + αnvn = 0. First we note that α = 0, since otherwise
x = −(α1/α)v1 − ··· − (αn/α)vn ∈ span B,
which is false. As α = 0, the equality αx + α1v1 + ··· + αnvn = 0 now becomes α1v1 + ··· + αnvn = 0. But by the independence of the set B, we conclude that α1 = ··· = αn = 0 too. Hence B′ is linearly independent, and so B′ belongs to P. As B′ = B ∪ {x} ⊋ B, we obtain a contradiction (to the maximality of B). Consequently, span B = X, and as B ∈ P, B is also linearly independent.
Solution to Exercise 2.40, page 115
Let B = {vi : i ∈ I}. Every x ∈ X has a unique decomposition
for some finite number of indices i1, ···, in ∈ I and scalars α1, ···, αn in F. Define F(x) = α1f(vi1) + ··· + αnf(vin). It is clear that F(vi) = f(vi), i ∈ I. Let us check that F : X → Y is linear.
(L1)Given x1, x2 ∈ X, there exist scalars α1, ···, αn and β1, ···, βn (possibly several of them equal to zero) and indices i1, ···, in ∈ I, such that x1 = α1vi1 + ··· + αnvin and x2 = β1vi1 + ··· + βnvin. Then x1 + x2 = (α1 + β1)vi1 + ··· + (αn + βn)vin, and so
F(x1 + x2) = (α1 + β1)f(vi1) + ··· + (αn + βn)f(vin) = F(x1) + F(x2).
(L2)Let α ∈ F. Given x ∈ X, there exist β1, ···, βn ∈ F and i1, ···, in ∈ I, such that x = β1vi1 + ··· + βnvin. Then αx = (αβ1)vi1 + ··· + (αβn)vin, and so F(αx) = (αβ1)f(vi1) + ··· + (αβn)f(vin) = αF(x).
Solution to Exercise 2.41, page 115
Let B be a Hamel basis for X. As X is infinite dimensional, B is an infinite set. Let {vn : n ∈ N} be a countable subset of B. Let y∗ ∈ Y be any nonzero vector.
Let f : B → Y be defined by f(vn) := n||vn||y∗ for n ∈ N, and f(v) := 0 for all other v ∈ B.
By the previous exercise, this f extends to a linear transformation F from X to Y. We claim that F ∉ CL(X, Y). Suppose, on the contrary, that F is continuous. Then there exists an M > 0 such that for all x ∈ X, ||F(x)|| ≤ M||x||. But if we put x = vn, n ∈ N, this yields n||vn|| ||y∗|| = ||f(vn)|| = ||F(vn)|| ≤ M||vn||, and so for all n ∈ N, n ≤ M/||y∗||, which is absurd. Thus F is a linear transformation from X to Y, but is not continuous.
Solution to Exercise 2.42, page 115
If R were finite dimensional, say d-dimensional over Q, then there would exist a one-to-one correspondence between R and Qd. But Qd is countable, while R isn’t, a contradiction. So R is an infinite dimensional vector space over Q.
Suppose that R has a countable basis B = {vn : n ∈ N} over Q.
We will define an injective map f : R → ∪n∈N Qn, yielding a contradiction.
Set f(0) := 0 ∈ Q1. If x ≠ 0, then x has a decomposition x = q1v1 + ··· + qnvn, where q1, ···, qn ∈ Q and qn ≠ 0. In this case, set f(x) = (q1, ···, qn) ∈ Qn. It can be seen that if f(x) = f(y), for some x, y ∈ R, then x = y. So f is injective.
As ∪n∈N Qn is countable, it follows that R is countable too, a contradiction.
Hence B can’t be countable.
Solution to Exercise 2.43, page 115
The set R is an infinite dimensional vector space over Q. Let {vi : i ∈ I} be a Hamel basis for this vector space. Fix any i∗ ∈ I.
We define a function f : B → R on the basis elements by f(vi∗) := 1, and f(vi) := 0 for all i ≠ i∗.
Let F be an extension of f from B to R, as provided by Exercise 2.40, page 115. Then F is linear, and in particular, additive. So F(x + y) = F(x) + F(y) for all x, y ∈ R.
We now show that F is not continuous on R: for otherwise, for any vi with i ≠ i∗, if (qn)n∈N is a sequence in Q converging to the real number vi/vi∗ (vi∗ ≠ 0 since it is a basis vector), then we would have
0 = f(vi) = F(vi) = F(lim n→∞ qnvi∗) = lim n→∞ qnF(vi∗) = lim n→∞ qn · 1 = vi/vi∗ ≠ 0,
a contradiction!
Solution to Exercise 2.44, page 116
(1)By the Algebra of Limits, the map l is linear.
Let (xn)n∈N ∈ c. For all n ∈ N, |xn| ≤ ||(xn)n∈N||∞.
Passing the limit as n → ∞, we obtain |l((xn)n∈N)| ≤ ||(xn)n∈N||∞.
Thus l ∈ CL(c, K).
(2)Y is a subspace of ℓ∞. Indeed we have:
(S1)Clearly (0)n∈N ∈ Y, since lim n→∞ (0 + ··· + 0)/n = 0 exists.
(S2)Let (xn)n∈N, (yn)n∈N ∈ Y.
Then lim n→∞ (x1 + ··· + xn)/n and lim n→∞ (y1 + ··· + yn)/n exist.
As ((x1 + y1) + ··· + (xn + yn))/n = (x1 + ··· + xn)/n + (y1 + ··· + yn)/n,
we conclude that lim n→∞ ((x1 + y1) + ··· + (xn + yn))/n exists as well.
Thus (xn)n∈N + (yn)n∈N ∈ Y too.
(S3)Let (xn)n∈N ∈ Y and α ∈ K. Then lim n→∞ (x1 + ··· + xn)/n exists.
As (αx1 + ··· + αxn)/n = α(x1 + ··· + xn)/n, it follows that lim n→∞ (αx1 + ··· + αxn)/n
exists, and so α · (xn)n∈N ∈ Y.
Consequently, Y is a subspace of ℓ∞.
(3)For all x ∈ ℓ∞, x − Sx ∈ Y : Let x = (xn)n∈N ∈ ℓ∞. Then we have
((x − Sx)1 + ··· + (x − Sx)n)/n = ((x1 − x2) + (x2 − x3) + ··· + (xn − xn+1))/n = (x1 − xn+1)/n.
As x ∈ ℓ∞, it follows that lim n→∞ (x1 − xn+1)/n = 0, and so x − Sx ∈ Y.
(4)If x = (xn)n∈N ∈ c, then Ax ∈ c, where A denotes the averaging operator (Exercise 2.19, page 77).
Hence lim n→∞ (x1 + ··· + xn)/n exists, and so x ∈ Y. Consequently, c ⊂ Y.
(5)Define L0 : Y → K by L0((xn)n∈N) = lim n→∞ (x1 + ··· + xn)/n.
Then it is easy to check that L0 : Y → K is a linear transformation.
Moreover, if x = (xn)n∈N ∈ Y, then |L0x| = |lim n→∞ (x1 + ··· + xn)/n|.
But |(x1 + ··· + xn)/n| ≤ (|x1| + ··· + |xn|)/n ≤ ||x||∞ for all n.
Hence |L0x| ≤ ||x||∞. Consequently, L0 ∈ CL(Y, K).
We had seen that if x ∈ c, then Ax ∈ c, and that l(Ax) = l(x).
Hence for all x ∈ c, L0(x) = l(Ax) = l(x), that is, L0|c = l.
Using the Hahn-Banach Theorem, there exists an L ∈ CL(ℓ∞, K) such that L|Y = L0 (and ||L|| = ||L0||).
In particular, if x ∈ c, then x ∈ Y and so Lx = L0x = lx. Thus L|c = l.
Also, if x = (xn)n∈N ∈ ℓ∞, then x − Sx ∈ Y, and L(x − Sx) = L0(x − Sx) = lim n→∞ (x1 − xn+1)/n = 0.
Thus Lx = LSx for all x ∈ ℓ∞, that is, L = LS.
(6)We have
Consequently,
Solutions to the exercises from Chapter 3
Solution to Exercise 3.1, page 124
f is a continuous linear transformation. Thus it follows that f′(x0) = f for all x0, and in particular also for x0 = 0.
Solution to Exercise 3.2, page 125
Suppose that f′(x0) = L ∈ CL(X, Y). Let M > 0 be such that ||Lh|| ≤ M||h|| for all h ∈ X. Let ε > 0. Then there exists a δ1 > 0 such that whenever x ∈ X satisfies 0 < ||x − x0|| < δ1, we have
||f(x) − f(x0) − L(x − x0)||/||x − x0|| < ε.
So if x ∈ X satisfies ||x − x0|| < δ1, then ||f(x) − f(x0) − L(x − x0)|| ≤ ε||x − x0||.
Let δ := min{δ1, ε/(M + ε)}. Then for all x ∈ X satisfying ||x − x0|| < δ, we have
||f(x) − f(x0)|| ≤ ||f(x) − f(x0) − L(x − x0)|| + ||L(x − x0)|| ≤ (ε + M)||x − x0|| < (ε + M)δ ≤ ε.
Hence f is continuous at x0.
Hence f is continuous at x0.
Solution to Exercise 3.3, page 125
(Rough work: We have for x ∈ C1[0, 1] that
f(x) − f(x0) = (x′(1))^2 − (x0′(1))^2 = 2x0′(1)(x′(1) − x0′(1)) + (x′(1) − x0′(1))^2 = L(x − x0) + (x′(1) − x0′(1))^2,
where L : C1[0, 1] → R is the map given by Lh = 2x0′(1)h′(1), h ∈ C1[0, 1]. So we make the guess that f′(x0) = L.)
Let us first check that L is a continuous linear transformation. L is linear because:
(L1)For all h1, h2 ∈ C1[0, 1], we have
(L2)For all h ∈ C1 [0, 1] and α ∈ R, we have
Also, L is continuous since for all h ∈ C1[0, 1], we have |Lh| = 2|x0′(1)| |h′(1)| ≤ 2|x0′(1)| ||h||1,∞.
So L is a continuous linear transformation. Moreover, for all x ∈ C1[0, 1],
f(x) − f(x0) − L(x − x0) = (x′(1) − x0′(1))^2,
so that |f(x) − f(x0) − L(x − x0)| = |x′(1) − x0′(1)|^2 ≤ ||x − x0||1,∞^2.
Given ε > 0, set δ := ε. Then if x ∈ C1[0, 1] satisfies 0 < ||x − x0||1,∞ < δ, we have
|f(x) − f(x0) − L(x − x0)|/||x − x0||1,∞ ≤ ||x − x0||1,∞ < δ = ε.
Solution to Exercise 3.4, page 125
Given ε > 0, let ε′ > 0 be such that ε′||x2 − x1|| < ε. Let δ′ > 0 be such that whenever 0 < ||x − γ(t0)|| < δ′, we have
||f(x) − f(γ(t0)) − f′(γ(t0))(x − γ(t0))|| ≤ ε′||x − γ(t0)||.
Let δ > 0 be such that δ||x2 − x1|| < δ′. For all t ∈ R satisfying 0 < |t − t0| < δ,
γ(t) − γ(t0) = (t − t0)(x2 − x1),
and so ||γ(t) − γ(t0)|| = |t − t0| ||x2 − x1|| ≤ δ||x2 − x1|| < δ′. Thus for all t ∈ R satisfying 0 < |t − t0| < δ, we have
||(f ∘ γ)(t) − (f ∘ γ)(t0) − f′(γ(t0))(x2 − x1)(t − t0)|| ≤ ε′|t − t0| ||x2 − x1|| < ε|t − t0|.
Thus f ∘ γ is differentiable at t0 and (f ∘ γ)′(t0) = f′(γ(t0))(x2 − x1).
Let x1, x2 ∈ X be such that g(x1) ≠ g(x2). With γ the same as above, we have for all t ∈ R that
(g ∘ γ)′(t) = g′(γ(t))(x2 − x1) = 0.
So g ∘ γ is constant. Thus g(x2) = (g ∘ γ)(1) = (g ∘ γ)(0) = g(x1), a contradiction. Consequently, g is constant.
Solution to Exercise 3.5, page 128
Suppose that f′(x0) = 0. Then for every
In particular, setting h = x0, we have giving x0 = 0 ∈ C[a, b].
Vice versa, if x0 = 0, then
for all h ∈ C[a, b], that is, f′(0) = 0.
Consequently, f′(x0) = 0 if and only if x0 = 0.
So we see that if x∗ is a minimiser, then f′(x∗) = 0, and so from the above x∗ = 0. We remark that 0 is easily seen to be the minimiser because
Solution to Exercise 3.6, page 129
If x1, x2 ∈ S and α ∈ (0, 1), then x1, x2 ∈ C1[a, b]. So (1 − α)x1 + αx2 ∈ C1[a, b]. Moreover, as x1(a) = x2(a) = ya and x1(b) = x2(b) = yb, we also have that
Thus (1 − α)x1 + αx2 ∈ S. Consequently, S is convex.
Solution to Exercise 3.7, page 129
For x1, x2 ∈ X and α ∈ (0, 1) we have by the triangle inequality that
Thus || · || is convex.
Solution to Exercise 3.8, page 129
(If part:) Let x1, x2 ∈ C and α ∈ (0, 1). Then we have that (x1, f(x1)) ∈ U(f) and (x2, f(x2)) ∈ U(f). Since U(f) is convex,
(1 − α) · (x1, f(x1)) + α · (x2, f(x2)) =: (x, y) ∈ U(f), where x = (1 − α)x1 + αx2 and y = (1 − α)f(x1) + αf(x2).
Consequently, (1 − α)f(x1) + αf(x2) = y ≥ f(x) = f((1 − α) · x1 + α · x2). Hence f is convex.
(Only if part:) Let (x1, y1), (x2, y2) ∈ U(f) and α ∈ (0, 1). Then we know that y1 ≥ f(x1) and y2 ≥ f(x2), and so
(1 − α)y1 + αy2 ≥ (1 − α)f(x1) + αf(x2) ≥ f((1 − α)x1 + αx2), using the convexity of f.
Consequently, (1 − α) · (x1, y1) + α · (x2, y2) = ((1 − α)x1 + αx2, (1 − α)y1 + αy2) ∈ U(f), that is,
U(f) is convex.
Solution to Exercise 3.9, page 129
We prove this using induction on n. The result is trivially true when n = 1, and in fact we have equality in this case. Suppose the inequality has been established for some n ∈ N. If x1, ···, xn, xn+1 are n + 1 vectors, and then
and so the claim follows for all n.
Solution to Exercise 3.10, page 130
We have for all x ∈ R
Thus f is convex.
(Alternately, one could note that (x, y) ↦ √(x^2 + y^2) is a norm on R2, and so it is convex. Now fixing y = 1, and keeping x variable, we get convexity of x ↦ √(x^2 + 1).)
Solution to Exercise 3.11, page 132
For x1, x2 ∈ C1[0, 1] and α ∈ (0, 1), we have, using the convexity of the function η ↦ √(1 + η^2) (Exercise 3.10, page 130), that
Solution to Exercise 3.12, page 133
(If:) Suppose that x0(t) = 0 for all t ∈ [0, 1]. Then we have that for all h ∈ C[0, 1],
and so f′(x0) = 0.
(Only if:) Now suppose that f′(x0) = 0. Thus for every h ∈ C[0, 1], we have (f′(x0))(h) = 0.
In particular, taking h := x0 ∈ C[0, 1], we obtain ∫0..1 (x0(t))^2 dt = 0.
As x0 is continuous on [0, 1], it follows that x0 = 0.
By the necessary condition for x0 to be a minimiser, we have that f′(x0) = 0 and so x0 must be the zero function 0 on [0, 1]. Furthermore, as f is convex and f′(0) = 0, it follows that the zero function is a minimiser. Consequently, there exists a unique solution to the optimisation problem, namely the zero function 0 ∈ C[0, 1]. The conclusion is also obvious from the fact that for all x ∈ C[0, 1],
Solution to Exercise 3.13, page 141
The integrand is L(t, y, v) = √(1 + v^2). Then ∂L/∂y = 0
and ∂L/∂v = v/√(1 + v^2).
The Euler-Lagrange equation is
d/dt (x∗′(t)/√(1 + (x∗′(t))^2)) = 0, t ∈ [a, b].
Upon integrating, we obtain x∗′(t)/√(1 + (x∗′(t))^2) = C on [a, b] for some constant C.
Thus (x∗′(t))^2 = C^2(1 + (x∗′(t))^2), that is, (x∗′(t))^2 = C^2/(1 − C^2) =: A, for all t ∈ [a, b].
So A ≥ 0, and x∗′(t) = √A or x∗′(t) = −√A
for each t ∈ [a, b]. As x∗′
is continuous, we can conclude that x∗′
must be either everywhere equal to √A, or everywhere equal to −√A. In either case,
x∗′ is constant, and so x∗ is given by x∗(t) = αt + β, t ∈ [a, b]. Since x∗(a) = xa and x∗(b) = xb, we have
α = (xb − xa)/(b − a) and β = (bxa − axb)/(b − a), so that x∗(t) = xa + ((xb − xa)/(b − a))(t − a) for all t ∈ [a, b].
That this x∗ ∈ S is indeed a minimiser can be concluded by noticing that the map x ↦ L(γx) : S → R is convex, thanks to the convexity of η ↦ √(1 + η^2)
for all η ∈ R (Exercise 3.10, page 130).
(The fact that x∗ is a minimiser, is of course expected geometrically, since the straight line is the curve of shortest length between two points in the Euclidean plane.)
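The conclusion can be illustrated numerically: the straight line has the smallest arc length among curves in S with the same endpoints. A minimal sketch with the arbitrary endpoint data a = 0, b = 1, xa = 0, xb = 1, comparing the line against perturbations x∗(t) + s·sin(πt):

```python
import math

# Arc length L(x) = integral over [a,b] of sqrt(1 + x'(t)^2) dt depends
# only on x'. Compare the straight line with same-endpoint perturbations.
def arc_length(fprime, a, b, n=2000):
    # midpoint rule for the integral of sqrt(1 + x'(t)^2)
    h = (b - a) / n
    return sum(math.sqrt(1.0 + fprime(a + (i + 0.5) * h) ** 2) * h
               for i in range(n))

a, b, xa, xb = 0.0, 1.0, 0.0, 1.0
alpha = (xb - xa) / (b - a)                   # slope of the straight line

line_len = arc_length(lambda t: alpha, a, b)  # equals sqrt(2) here

# Perturbations x*(t) + s*sin(pi t) vanish at both endpoints, so they
# stay in S; each one is strictly longer than the line.
perturbed = [arc_length(lambda t, s=s: alpha + s * math.pi * math.cos(math.pi * t), a, b)
             for s in (0.1, 0.3, 0.5)]
```

Larger perturbation amplitudes s give visibly larger lengths, matching the convexity argument above.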
Solution to Exercise 3.14, page 141
We have
Solution to Exercise 3.15, page 141
With we have
Then and
The Euler-Lagrange equation is
Upon integrating, we obtain on [a, b] for some constant C.
Thus
So A ≥ 0, and
for each t ∈ [a, b]. As
is continuous, we can conclude that
must be either everywhere equal to
, or everywhere equal to −
. In either case,
is constant, and so x∗ is given by x∗(t) = αt + β, t ∈ [a, b]. Since x∗(a) = xa and x∗(b) = xb, we have
and for all t ∈ [a, b]
We will now show that this x∗ is a maximiser of x ↦ L(γx) : S → R, that is, it is a minimiser of x ↦ −L(γx). Note that the map
is convex because
Hence x ↦ −L(γx) : S → R is convex too, and this proves our claim.
Solution to Exercise 3.16, page 142
We have Thus
So the Euler-Lagrange equations are
that is,
Solution to Exercise 3.17, page 143
(1)With we have that
We have
So the Euler-Lagrange equation is
We have
Similarly
Thus the Euler-Lagrange equation becomes (using uxy = uyx)
If u = Ax + By + C, then uxx = 0, uxy = 0 and uyy = 0, so that all the three
summands on the left-hand side of the Euler-Lagrange equation vanish, and so we see that the Euler-Lagrange equation is satisfied.
If u = tan–1(y/x), then we have ux = −y/(x^2 + y^2) and uy = x/(x^2 + y^2).
Thus uxx = 2xy/(x^2 + y^2)^2, uxy = uyx = (y^2 − x^2)/(x^2 + y^2)^2, and uyy = −2xy/(x^2 + y^2)^2.
Hence, substituting these into the Euler-Lagrange equation, the three summands cancel, and so the Euler-Lagrange equation is satisfied.
With s := √(x^2 + y^2) and t := tan–1(y/x) = u, we have tan t = y/x, and so
cos t = x/√(x^2 + y^2).
Thus x = √(x^2 + y^2) · cos t = s · cos t. Then y = x tan t = s cos t · tan t = s · sin t.
Vice versa, if x = s · cos t, y = s · sin t and u = t, then
x^2 + y^2 = s^2(cos^2 t + sin^2 t) = s^2,
and so s = √(x^2 + y^2). Also y/x = tan t, and so u = tan–1(y/x) = t.
Using the Maple command given in the exercise we obtain the following:
(2)If L(x1, x2, u, v1, v2) := v1^2 − v2^2, then I(u) = ∫∫ ((∂u/∂x1)^2 − (∂u/∂x2)^2) dx1 dx2.
We have ∂L/∂u = 0, ∂L/∂v1 = 2v1 and ∂L/∂v2 = −2v2.
So the Euler-Lagrange equation is:
(∂/∂x1)(∂u∗/∂x1) − (∂/∂x2)(∂u∗/∂x2) = 0.
Thus u∗ satisfies the wave equation ∂^2u/∂x^2 − ∂^2u/∂t^2 = 0 (writing (x1, x2) = (x, t)).
We can check by direct differentiation that the given u in terms of f satisfies the wave equation. We have
∂u/∂t = (f′(x + t) − f′(x − t))/2.
Differentiating again with respect to t, we obtain
∂^2u/∂t^2 = (f″(x + t) + f″(x − t))/2.    (∗)
Similarly, by differentiating u with respect to x we obtain
∂u/∂x = (f′(x + t) + f′(x − t))/2.
Differentiating again with respect to x, we obtain
∂^2u/∂x^2 = (f″(x + t) + f″(x − t))/2.    (∗∗)
It follows from (∗) and (∗∗) that ∂^2u/∂x^2 − ∂^2u/∂t^2 = 0.
Let us check that the boundary conditions are satisfied.
Note that u(0, t) = (f(t) + f(−t))/2 = 0 since f is odd.
Now we would like to check u(1, t) = 0 too.
Using the oddness and 2-periodicity of f, we have f(1 + t) = −f(−1 − t) = −f(1 − t).
So u(1, t) = (f(1 + t) + f(1 − t))/2 = 0.
Finally, we check that the initial conditions are satisfied.
We have u(x, 0) = (f(x) + f(x))/2 = f(x) for all x.
Also, from our previous calculation, we have
(∂u/∂t)(x, 0) = (f′(x) − f′(x))/2 = 0
for all x.
For a fixed t, the graph of f(· –t) is just a shifted version of the graph of f by t units to the right. As t increases, the graph travels to the right, representing a travelling wave, moving to the right with a speed 1. Similarly the graph of f(·+t) with increasing t represents a travelling wave moving to the left with speed 1. The solution of the wave equation is an average of these two travelling waves moving in opposite directions, and the shape of the wave is determined by the initial shape of the string.
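The d'Alembert solution can be sanity-checked numerically. A small sketch, using the odd, 2-periodic stand-in f(x) = sin(πx) (the exercise's particular f is not reproduced here): we verify the wave equation by centred finite differences, together with the boundary and initial conditions.

```python
import math

# d'Alembert's formula u(x, t) = (f(x + t) + f(x - t)) / 2 with an odd,
# 2-periodic example profile f(x) = sin(pi x).
f = lambda x: math.sin(math.pi * x)
u = lambda x, t: 0.5 * (f(x + t) + f(x - t))

h = 1e-3  # step for the centred second-difference approximations

def u_tt(x, t):
    return (u(x, t + h) - 2 * u(x, t) + u(x, t - h)) / h**2

def u_xx(x, t):
    return (u(x + h, t) - 2 * u(x, t) + u(x - h, t)) / h**2

# u_tt - u_xx should vanish (up to discretisation error) everywhere:
wave_residual = max(abs(u_tt(x, t) - u_xx(x, t))
                    for x in (0.2, 0.5, 0.7) for t in (0.1, 0.4))
# boundary conditions u(0, t) = u(1, t) = 0 (oddness and 2-periodicity):
boundary = max(abs(u(0.0, t)) + abs(u(1.0, t)) for t in (0.1, 0.4, 0.9))
# initial condition u(x, 0) = f(x):
initial = max(abs(u(x, 0.0) - f(x)) for x in (0.1, 0.3, 0.8))
```

The same check works for any sufficiently smooth odd 2-periodic profile, since the verification above only used those two properties of f.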
We have (suppressing the argument (q, p) everywhere)
Also,
Finally, we will prove the Jacobi Identity. In order to simplify the notation, we will use subscripts to denote partial derivatives; for example, Fp will mean ∂F/∂p. First we note that
Similarly, by making cyclic substitutions F → G → H above, we obtain
Thanks to the symmetry of the left-hand side of the expression in Jacobi’s Identity in F, G, H, it is enough to show that after collecting all the Fq, Fp terms, their overall coefficients are zero.
The overall coefficient of Fq is
Since Gpq = Gqp and Hpq = Hqp, we see that the above expression is 0.
The overall coefficient of Fp is
This completes the proof of the Jacobi Identity.
We have {Q, P} = (∂Q/∂q)(∂P/∂p) − (∂Q/∂p)(∂P/∂q) = 1 · 1 – 0 · 0 = 1.
With x := 1 = (t ↦ 1) and y := (t ↦ t), we have 2||x||∞^2 + 2||y||∞^2 = 2 · 1^2 + 2 · 1^2 = 4, while ||x + y||∞^2 + ||x – y||∞^2 = ||1 + t||∞^2 + ||1 – t||∞^2 = 2^2 + 1^2 = 5. So ||·||∞ does not obey the Parallelogram Law, and hence ||·||∞ cannot be a norm induced by some inner product on C[0, 1].
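The same counterexample can be checked by machine, approximating the sup-norm over a fine grid on [0, 1]:

```python
# The sup-norm on C[0,1] violates the parallelogram law
# ||x+y||^2 + ||x-y||^2 = 2||x||^2 + 2||y||^2, so it is not induced by
# any inner product. Check with x = 1 and y = t.
grid = [i / 1000 for i in range(1001)]
sup_norm = lambda g: max(abs(g(t)) for t in grid)

x = lambda t: 1.0
y = lambda t: t

lhs = sup_norm(lambda t: x(t) + y(t)) ** 2 + sup_norm(lambda t: x(t) - y(t)) ** 2
rhs = 2 * sup_norm(x) ** 2 + 2 * sup_norm(y) ** 2
# lhs = 2^2 + 1^2 = 5, while rhs = 2 + 2 = 4
```

The grid contains the endpoints 0 and 1, where the suprema of |1 + t| and |1 − t| are attained, so the computed values are exact here.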
Let x, y, z ∈ X. Then
Adding these, we obtain
Geometric interpretation in R2: If x, y, z are the vertices of a triangle ABC, then ||z − (x + y)/2|| is the length of the median AD (see the picture).
The Apollonius Identity gives AB^2 + AC^2 = (1/2)BC^2 + 2AD^2.
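A quick numerical check of the Apollonius identity with arbitrarily chosen vertices in R2:

```python
# Apollonius' identity: with D the midpoint of BC,
# AB^2 + AC^2 = (1/2) BC^2 + 2 AD^2. Example vertices chosen arbitrarily.
A = (1.0, 2.0)
B = (4.0, 6.0)
C = (-2.0, 3.0)
D = ((B[0] + C[0]) / 2, (B[1] + C[1]) / 2)  # midpoint of BC

dist2 = lambda P, Q: (P[0] - Q[0]) ** 2 + (P[1] - Q[1]) ** 2

lhs = dist2(A, B) + dist2(A, C)         # AB^2 + AC^2
rhs = 0.5 * dist2(B, C) + 2 * dist2(A, D)
# both sides agree (here both equal 35)
```

Since the identity is a polynomial consequence of the parallelogram law, it holds exactly for any choice of the three vertices.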
Let ε > 0. Let N1 ∈ N be such that for all n > N1, ||xn – x|| < ε/(2(||y|| + 1)).
Let N2 ∈ N be such that for all n > N2, ||yn – y|| < ε/(2M), where the number M := 1 + sup{||xn|| : n ∈ N} < ∞ (this exists since (xn)n∈N, being convergent, is bounded).
Consequently, for all n > N := max{N1, N2},
|〈xn, yn〉 – 〈x, y〉| ≤ |〈xn, yn – y〉| + |〈xn – x, y〉| ≤ ||xn|| ||yn – y|| + ||xn – x|| ||y|| < M · ε/(2M) + ε/2 = ε.
Hence (〈xn, yn〉)n∈N is convergent in K, with limit 〈x, y〉.
If the ellipse has major and minor axis lengths as 2a and 2b, respectively, then observe that the perimeter is given by
where the last expression is obtained by rotating the ellipse through 90°, obtaining a new ellipse with the same perimeter.
Using Cauchy-Schwarz Inequality we obtain
Thus P ≥ 2π√(ab). Since the areas of the circle and the ellipse are equal, it follows that πr^2 = πab, where r denotes the radius of the circle. Hence r = √(ab). So we have P ≥ 2π√(ab) = 2πr, that is, the perimeter P of the ellipse is at least as large as the circumference of the circle.
(IP1)If A ∈ Rm×n, then 〈A, A〉 = tr(AA⊤) = ∑k=1..m ∑i=1..n aki aki = ∑k,i aki^2 ≥ 0.
If A ∈ Rm×n and 〈A, A〉 = 0, then ∑k,i aki^2 = 0, and so for all
k ∈ {1, ···, m} and all i ∈ {1, ···, n}, aki = 0, that is, A = 0.
(IP2)For all A1, A2, B ∈ Rm×n,
For all A, B ∈ Rm×n and α ∈ R,
(IP3)For all A, B ∈ Rm×n,
This is a Hilbert space, since finite-dimensional normed spaces are complete.
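The inner-product axioms above are easy to spot-check numerically. A sketch assuming the Frobenius pairing is written as tr(AB⊤):

```python
import numpy as np

# Frobenius pairing <A, B> = tr(A B^T) on R^{m x n}: check positivity,
# linearity in the first argument, and symmetry on random matrices.
rng = np.random.default_rng(2)
A = rng.standard_normal((3, 2))
B = rng.standard_normal((3, 2))
C = rng.standard_normal((3, 2))

ip = lambda X, Y: np.trace(X @ Y.T)

positivity = ip(A, A)  # equals the sum of squares of the entries of A
linearity_gap = abs(ip(2 * A + B, C) - (2 * ip(A, C) + ip(B, C)))
symmetry_gap = abs(ip(A, B) - ip(B, A))
```

The pairing coincides with the standard Euclidean inner product after flattening each matrix into a vector of its mn entries, which is why the space is complete.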
Let x, y ∈ X. Then
Also,
From (∗) and (∗∗) it follows that for all x, y ∈ X, 〈Tx, Ty〉 = 0.
In particular, with y = Tx, we get 〈Tx, Tx〉 = 0, that is, ||Tx||2 = 0.
Hence for all x ∈ X, Tx = 0, that is, T = 0.
We have 〈Tx, x〉 = –x2x1 + x1x2 = 0 for all x = (x1, x2) ∈ R2.
There is no contradiction to the previous part since the vector space R2 is a vector space over the real scalars.
R is an equivalence relation on C:
(ER1)If x = (xn)n∈N ∈ C, then lim n→∞ ||xn – xn||X = lim n→∞ 0 = 0, and so (x, x) ∈ R.
(ER2)If x = (xn)n∈N, y = (yn)n∈N ∈ C, and (x, y) ∈ R, then lim n→∞ ||xn – yn||X = 0.
So lim n→∞ ||yn – xn||X = lim n→∞ |–1| ||xn – yn||X = lim n→∞ ||xn – yn||X = 0.
Hence (y, x) ∈ R.
(ER3)Let x = (xn)n∈N, y = (yn)n∈N, z = (zn)n∈N be in C, such that (x, y) ∈ R and (y, z) ∈ R. Then lim_{n→∞} ||xn – yn||X = 0 and lim_{n→∞} ||yn – zn||X = 0.
As 0 ≤ ||xn – zn||X ≤ ||xn – yn||X + ||yn – zn||X, we get lim_{n→∞} ||xn – zn||X = 0.
So (x, z) ∈ R.
Consequently, R is an equivalence relation on C.
Addition is well-defined:
If [(xn)n∈N] = [(x′n)n∈N] and [(yn)n∈N] = [(y′n)n∈N], then we wish to show that [(xn + yn)n∈N] = [(x′n + y′n)n∈N]. We have that (xn + yn)n∈N ∈ C, since (xn)n∈N, (yn)n∈N ∈ C and ||xn + yn – (xm + ym)||X ≤ ||xn – xm||X + ||yn – ym||X.
Similarly, (x′n + y′n)n∈N ∈ C.
Furthermore, 0 ≤ ||(xn + yn) – (x′n + y′n)||X ≤ ||xn – x′n||X + ||yn – y′n||X, and so lim_{n→∞} ||(xn + yn) – (x′n + y′n)||X = 0, that is, ((xn + yn)n∈N, (x′n + y′n)n∈N) ∈ R. So [(xn + yn)n∈N] = [(x′n + y′n)n∈N].
Scalar multiplication is well-defined:
Let α ∈ K and [(xn)n∈N] = [(x′n)n∈N]. Since ||αxn – αxm||X = |α| ||xn – xm||X, clearly (αxn)n∈N ∈ C. Similarly, (αx′n)n∈N ∈ C. We have
and so ((αxn)n∈N, (αx′n)n∈N) ∈ R. So [(αxn)n∈N] = [(αx′n)n∈N].
The inner product is well-defined:
Since Cauchy sequences are bounded, given (xn)n∈N, (yn)n∈N in C, we have that Mx := sup_{n∈N} ||xn||X < ∞ and My := sup_{n∈N} ||yn||X < ∞.
Let N be large enough so that if m, n > N, then
Thus for m, n > N,
So (〈xn, yn〉X)n∈N is a Cauchy sequence in K, and as K (= R or C) is complete, it follows that lim_{n→∞} 〈xn, yn〉X exists.
Now suppose that [(xn)n∈N] = [(x′n)n∈N] and [(yn)n∈N] = [(y′n)n∈N].
Given ε > 0, let N be such that for all n > N,
where Mx′ := sup_{n∈N} ||x′n||X < ∞. For n > N, we have
Passing the limit as n → ∞, we obtain
〈·, ·〉 defines an inner product on X:
(IP1)If [(xn)n∈N] ∈ X, then 〈[(xn)n∈N], [(xn)n∈N]〉 = lim_{n→∞} ||xn||2X ≥ 0.
Let [(xn)n∈N] be such that 〈[(xn)n∈N], [(xn)n∈N]〉 = 0.
Then lim_{n→∞} 〈xn, xn〉X = lim_{n→∞} ||xn||2X = 0.
(0)n∈N ∈ C and lim_{n→∞} ||xn – 0||X = lim_{n→∞} ||xn||X = 0 (using the above).
Thus [(xn)n∈N] = [(0)n∈N].
(IP2)For all x1, x2, y ∈ X,
For all α ∈ K and x, y ∈ X, we have
(IP3)For all x, y ∈ X, 〈x, y〉X = .
ι is a linear transformation:
ι is injective:
If ι(x) = [(x)n∈N] = [(0)n∈N], then ||x|| = ||x – 0|| = 0, and so x = 0.
ι preserves inner products: For x, y ∈ X, 〈ι(x), ι(y)〉 = lim_{n→∞} 〈x, y〉X = 〈x, y〉X.
As span{v1} = span{x1} = span{u1}, it follows that v1 = α1u1.
Thus 1 = ||v1|| = |α1|||u1|| = |α1| · 1 = |α1|.
For n > 1, vn ∈ span{v1, ···, vn} = span{x1, ···, xn} = span{u1, ···, un}.
So there are scalars β1, ···, βn–1, αn such that vn = β1u1 + ··· + βn–1un–1 + αnun. We also know that for all k < n, 〈vn, vk〉 = 0. So it follows that 〈vn, v〉 = 0 for all v ∈ span{v1, ···, vn–1} = span{x1, ···, xn–1} = span{u1, ···, un–1}. Thus 〈vn, uk〉 = 0 for all k < n. This gives β1 = ··· = βn–1 = 0, and vn = αnun. Moreover, 1 = ||vn|| = |αn| ||un|| = |αn| · 1 = |αn|.
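The conclusion — any two orthonormalisations of the same independent list agree up to unimodular scalars — can be illustrated numerically. The sketch below (not part of the original solution; the vectors are randomly generated) compares classical Gram–Schmidt with NumPy's QR factorisation: in the real case the scalars αk are ±1.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((5, 3))          # columns x1, x2, x3 (independent a.s.)

def gram_schmidt(X):
    # classical Gram-Schmidt: subtract projections onto earlier u's, normalise
    U = np.zeros_like(X)
    for k in range(X.shape[1]):
        v = X[:, k] - U[:, :k] @ (U[:, :k].T @ X[:, k])
        U[:, k] = v / np.linalg.norm(v)
    return U

U = gram_schmidt(X)
Q = np.linalg.qr(X)[0]                   # another orthonormalisation, same flags
alphas = np.sum(U * Q, axis=0)           # columnwise <u_k, q_k> = alpha_k
assert np.allclose(np.abs(alphas), 1.0)  # |alpha_k| = 1
```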
Let us first note that the derivative of an even monomial t^(2k) is odd, and that of an odd monomial t^(2k+1) is even. From here it follows that the derivative of a polynomial with only even monomials is a polynomial consisting of only odd monomials, while that of a polynomial with only odd monomials is a polynomial with only even monomials.
By the Binomial Theorem, we see that the polynomial (t^2 – 1)^n is the sum of even monomials of the form ck t^(2k), for suitable scalars ck, k = 0, ···, n.
So (d/dt)^n (t^2 – 1)^n will be a polynomial p with:
(1) only even monomials if n is even,
(2) only odd monomials if n is odd.
In the former case, when n is even, p, being the sum of even functions will be even, while in the latter case, p, being the sum of odd functions, will be odd. Thus Pn is even when n is even, and odd if n is odd.
If n is odd, then each of the terms ck t^(2k–n) is an odd polynomial, and hence so is their sum. Consequently, Pn is odd if n is odd.
We have Pn(–1) = (–1)^n Pn(1) = (–1)^n · 1 = (–1)^n for all n ≥ 0.
With y(t) := (t^2 – 1)^n, we have y′(t) = n(t^2 – 1)^(n–1) · 2t. So
By differentiating the left-hand side of (∗), we obtain
and by differentiating the right-hand side of (∗), we have
Equating the final expressions from the above calculations, we obtain
Multiplying by , we get (1 – t^2)Pn″(t) – 2tPn′(t) + n(n + 1)Pn(t) = 0.
(t^2 – 1)^n is zero at ±1. By Rolle’s Theorem, it follows that (d/dt)(t^2 – 1)^n is zero at some t(1) ∈ (–1, 1). But we had seen that (d/dt)(t^2 – 1)^n is also zero at the end points ±1. So by Rolle’s Theorem applied to the function (d/dt)(t^2 – 1)^n on the two intervals [–1, t(1)] and [t(1), 1], we get the existence of points t1(2) ∈ (–1, t(1)) and t2(2) ∈ (t(1), 1), where (d/dt)^2 (t^2 – 1)^n is zero. Proceeding in this manner, we get the existence of points t1(n), ···, tn(n) ∈ (–1, 1) where (d/dt)^n (t^2 – 1)^n vanishes. So Pn has at least n zeros on (–1, 1). But Pn has degree n, and hence it can have at most n zeros in C. This shows that all the zeros of Pn are real, and all of them lie in the open interval (–1, 1).
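The location of the zeros can be confirmed numerically: the nodes of Gauss–Legendre quadrature are exactly the roots of Pn. This quick check (not part of the original solution; the degree n = 7 is an arbitrary choice) verifies that all n roots are real and lie in (−1, 1).

```python
import numpy as np

# The Gauss-Legendre quadrature nodes are the roots of the Legendre
# polynomial P_n; NumPy returns them as a real array.
n = 7
nodes, _ = np.polynomial.legendre.leggauss(n)
assert nodes.shape == (n,)                 # P_n has exactly n roots
assert np.all((-1 < nodes) & (nodes < 1))  # all roots inside (-1, 1)
```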
The set {eij : 1 ≤ i ≤ m, 1 ≤ j ≤ n}, where eij is the matrix with 1 in the ith row and jth column, and all other entries 0, is a basis for Rm×n. To see that this basis is in fact orthonormal, observe that the map ι : Rm×n → Rmn given by A = [aij] ↦ (a11, ···, a1n, a21, ···, a2n, ···, am1, ···, amn) (that is, lay out the rows of A next to each other in one long row), is an isomorphism that preserves inner products:
{ι(eij) : 1 ≤ i ≤ m, 1 ≤ j ≤ n} is orthonormal, and so it follows that the set {eij : 1 ≤ i ≤ m, 1 ≤ j ≤ n} is orthonormal as well.
(1)We have H0 = e^(x^2) e^(–x^2) = 1. For n ≥ 0,
Thus if Hn is a polynomial, then 2xHn, H′n are polynomials too, and so is Hn+1 = 2xHn – H′n. Since H0 = 1 is a nonzero polynomial of degree 0, it follows by induction on n that each Hn, n ≥ 0, is a polynomial. Moreover, if Hn has degree d, and its leading term is cd x^d, then H′n has degree d – 1, while 2xHn has degree d + 1 with the leading term 2cd x^(d+1). Consequently, the recurrence relation together with H0 = 1 also reveals that Hn has the leading term 2^n x^n, and in particular has degree n.
Using the recursion relation, we get H1 = 2x, H2 = 4x^2 – 2, H3 = 8x^3 – 12x.
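The recursion Hn+1 = 2xHn − H′n is easy to run mechanically. The sketch below (not part of the original solution) stores each polynomial as a list of coefficients in ascending powers of x and reproduces H1, H2, H3.

```python
# Hermite polynomials via the recursion H_{n+1} = 2x H_n - H_n',
# starting from H_0 = 1; coefficients in ascending powers of x.
def hermite(n):
    H = [1.0]                                        # H_0 = 1
    for _ in range(n):
        deriv = [k * H[k] for k in range(1, len(H))]   # coefficients of H'
        shifted = [0.0] + [2.0 * c for c in H]         # coefficients of 2x*H
        H = [s - (deriv[i] if i < len(deriv) else 0.0)
             for i, s in enumerate(shifted)]
    return H

assert hermite(1) == [0.0, 2.0]                  # H_1 = 2x
assert hermite(2) == [-2.0, 0.0, 4.0]            # H_2 = 4x^2 - 2
assert hermite(3) == [0.0, -12.0, 0.0, 8.0]      # H_3 = 8x^3 - 12x
```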
(2)Let m < n. Then we have
As (d/dx)^(n–1) e^(–x^2) is a sum of terms of the form ck x^k e^(–x^2), and because Hm is a polynomial, it follows that the first summand in the right-hand side is 0.
So we have 〈φm, φn〉 = (–1)^(n+1) .
We can continue this process of integration by parts, until we arrive at
But as Hm has degree m < n, (d/dx)^n Hm = 0, so that 〈φm, φn〉 = 0.
The case m > n also follows from here, since the inner product is conjugate symmetric. Finally,
(The last equality can be justified as follows. With I := , we have
So I =
(3)For n ≥ 0, we have
(4)First let us note that if n ≥ 1, then we have
Hence for n ≥ 1,
(5)We have for all φ
Hence for all n ≥ 0,
(6)We have and
.
From the previous part, we have (–(d/dx)2 + x2)φn = (2n + 1)φn, giving
We have
In Schrödinger’s equation, a^2 = , and so
= a(2n + 1).
So En = , for n ≥ 0.
Since diverges,
does not converge absolutely.
If sn is the nth partial sum of , then for n > m, we have
and this can be made as small as we please since .
Hence (sn)n∈N is Cauchy in H, and since H is a Hilbert space, it converges.
For all N ∈ N, we have
Thus , and as N was arbitrary,
.
Let y ∈ Y ∩ Y⊥. As y ∈ Y⊥, we know that for all y′ ∈ Y , 〈y, y′〉 = 0. Taking y′ := y ∈ Y, we obtain 0 = 〈y, y′〉 = 〈y, y〉 = ||y||2, and so ||y|| = 0, giving y = 0. So Y ∩ Y⊥ ⊂ {0}. Also, since Y, Y⊥ are subspaces, it follows that each contains the zero vector 0. So Y ∩ Y⊥ = {0}.
(1)If y ∈ Y, then for each x ∈ Y⊥, 〈y, x〉 = 〈x, y〉∗ = 0, and so y ∈ (Y⊥)⊥. Thus Y ⊂ (Y⊥)⊥.
(2)Let x ∈ Z⊥. Then 〈x, z〉 = 0 for all z ∈ Z. As Y ⊂ Z, we also have 〈x, y〉 = 0 in particular for all y ∈ Y. Hence x ∈ Y⊥. This shows that Z⊥ ⊂ Y⊥.
(3)As Y ⊂ Y̅ (the closure of Y), it follows from part (2) that (Y̅)⊥ ⊂ Y⊥.
Now let x ∈ Y⊥. Then 〈x, y〉 = 0 for all y ∈ Y.
If y′ ∈ Y̅, then there exists a sequence (yn)n∈N in Y such that lim_{n→∞} yn = y′. Thus 〈x, y′〉 = lim_{n→∞} 〈x, yn〉 = 0.
Hence x ∈ (Y̅)⊥, showing that Y⊥ ⊂ (Y̅)⊥ as well.
(4)Suppose that x ∈ Y⊥.
As Y is dense in X, there is a sequence (yn)n∈N in Y converging to x in X. Thus 〈x, x〉 = lim_{n→∞} 〈x, yn〉 = lim_{n→∞} 0 = 0, and so x = 0.
(5)Suppose x = (xn)n∈N ∈ (Yeven)⊥. Since e2n ∈ Yeven for each n ∈ N, x2n = 〈x, e2n〉 = 0. Hence (Yeven)⊥ ⊂ Yodd, where Yodd denotes the subspace of ℓ2 of all sequences whose evenly indexed terms are 0.
Vice versa, if x ∈ Yodd, it is clear that for all y ∈ Yeven, 〈x, y〉 = 0. Thus Yodd ⊂ (Yeven)⊥.
Consequently, (Yeven)⊥ = Yodd.
Similarly, (Yodd)⊥ = Yeven. And so ((Yeven)⊥)⊥ = (Yodd)⊥ = Yeven.
(6)We know that c00 is dense in ℓ2. (Just truncate the series to the desired accuracy to get a finitely supported approximation!)
So (c00)⊥ = {0}, by part (4). But then
((c00)⊥)⊥ = {0}⊥ = ℓ2 ≠ c00.
Let .
Then E(m, b) = .
Thus the problem of finding the least square regression line is:
It follows from Theorem 4.5, page 174, that a minimiser Y∗ is given by
where {U1, U2} is any orthonormal basis for the subspace Y := span{Y1, Y2} of Rn with the usual Euclidean inner product. By the Gram-Schmidt Orthonormalisation Procedure, U1 = , and U2 =
.
For the given data, using the above formulae, we obtain m = –0.3184 million tonnes coal per °C, and b = 10.4667 million tonnes of coal. The y-intercept is b = 10.4667 million tonnes of coal, and this is the inland energy consumption when the mean temperature is 0°C (that is when it is freezing!). The x-intercept is 10.4667/0.3184 = 32.8728, which is the mean temperature when the inland consumption is 0 (that is, no heating required). The slope is m = –0.3184 million tonnes of coal per °C. Thus for each °C drop in temperature, the inland energy consumption increases by 0.3184 million tonnes of coal. Finally, the forecast of the energy consumption for a month with mean temperature 9°C is given by y = mx + b = (–0.3184)(9) + 10.4667 = 7.6011 million tonnes of coal.
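The least-squares line can be reproduced with any standard routine by projecting onto span{Y1, Y2}. The sketch below (not part of the original solution) uses made-up x, y data — the actual temperature/coal figures are not reproduced here, so the fitted m and b are illustrative only — and checks the defining property from Theorem 4.5: the residual is orthogonal to the subspace Y.

```python
import numpy as np

# Illustrative data only (an assumption), standing in for the temperature (x)
# and consumption (y) values of the exercise.
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
y = np.array([10.0, 9.6, 9.3, 9.1, 8.6])

Y1, Y2 = x, np.ones_like(x)             # Y = span{Y1, Y2} in R^n
A = np.column_stack([Y1, Y2])
m, b = np.linalg.lstsq(A, y, rcond=None)[0]

# The residual y - (m*Y1 + b*Y2) is orthogonal to Y, as the projection
# characterisation requires.
r = y - (m * Y1 + b * Y2)
assert abs(r @ Y1) < 1e-9 and abs(r @ Y2) < 1e-9
```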
Let C := L2+(R). Then C is convex. Thus C̅ is convex too. We will show that g∗ := max{f, 0} ∈ L2+(R) = C ⊂ C̅ satisfies: for all g ∈ C̅, 〈f – g∗, g – g∗〉 ≤ 0.
We have f = max{f, 0} + min{f, 0}. So f – g∗ = min{f, 0}. Also,
Hence we obtain for all g ∈ C that
So for all g ∈ C̅, ||f – g∗|| ≤ ||f – g||. In particular, for all g ∈ L2+(R) = C ⊂ C̅, we also have ||f – g∗|| ≤ ||f – g||.
We’d seen in Exercise 4.17, page 173, that . So
, where the last equality follows from Corollary 4.1, page 182, since Y is closed.
For all f ∈ L2(R), it is easy to check that fe := (f + f(–·))/2 is even, and fo := (f – f(–·))/2 is odd. Thus for all g ∈ Y, we have
Thus, by Theorem 4.7, page 180, PYf = fe for all f ∈ L2(R).
By Theorem 4.8, page 180, we have
PY⊥ = I – PY, and so for all f ∈ L2(R), PY⊥f = f – fe = fo.
We have f = If = PYf + PY⊥f = fe + fo.
Moreover, by Theorem 4.8, this decomposition is unique.
Y = ker(I – S), and so Y is a closed subspace of H.
For all x ∈ H, .
So for all x ∈ H. Moreover, for all y ∈ Y, we have
Thus, by Theorem 4.7, page 180, PYx = for all x ∈ H.
By Theorem 4.8, page 180, we have
Thus Z⊥ = (Y⊥)⊥ = Y.
PY⊥ = I – PY, and so for all x ∈ H, PY⊥x = x –
Consider the map , where
is the indicator function of
.
As < ∞, Mf ∈ L2(R).
It is also easy to see that M is linear. The above inequality then establishes that M ∈ CL(L2(R)). We have
Thus YA is closed.
For f ∈ L2(R), 1Af ∈ YA, and moreover, for any g ∈ YA,
Thus PAf = 1Af for all f ∈ L2(R).
Suppose that D⊥ = {0}. Then D̅ = (D⊥)⊥ = {0}⊥ = H. So D is dense in H.
Now suppose that D is dense in H. Then D̅ = H. Thus D⊥ = (D̅)⊥ = H⊥ = {0}.
Let x ∈ C[–1, 1] and ε > 0. By Weierstrass’s Approximation Theorem (Exercise 1.26, page 22), there is a polynomial p ∈ C[–1, 1] such that ||x – p||∞ < ε/√2.
Then
Hence ||x – p||2 < ε. Consequently the polynomials are dense in C[–1, 1] (with the usual inner product).
Moreover ι is continuous because ||ι(x)||2 = for all x ∈ H.
If x ∈ H is such that ι(x) = 0, then ||x|| = ||ι(x)|| = 0, and so x = 0.
Hence ι is injective.
If (cn)n∈N ∈ ℓ2, then x := ∑_{n=1}^∞ cn un ∈ H, and for all k ∈ N,
So ι(x) = (cn)n∈N, showing that ι is surjective too.
As ι ∈ CL(H, ℓ2) is a bijection, it has a continuous inverse ι–1 ∈ CL(ℓ2, H) (by Corollary 2.4 on page 96). Moreover, ||ι(x)|| = ||x|| for all x ∈ H, and so ι is an isometry.
Let x ∈ C[0, 1] be the function t ↦ t.
For n ≠ 0, we have 〈x, Tn〉 = , using integration by parts.
Also 〈x, T0〉 = 1/2. By Parseval’s Identity,
which yields
Let [(xn)n∈N] ∈ X. Consider the sequence (xn)n∈N in X. Since (xn)n∈N ∈ C, given any ε > 0, there exists an N ∈ N such that for all m, n > N, ||xn – xm|| < ε. Consequently, for all m > N, ||ι(xm) – [(xn)n∈N]||X = lim_{n→∞} ||xm – xn|| ≤ ε.
Hence lim_{m→∞} ι(xm) = [(xn)n∈N].
We have
with equality if and only if .
Thus the curve enclosing the maximum area is given by
with .
Let α ∈ [0, 2π) be such that cos α = and sin α =
. Then
Hence (x∗(s) – a0)2 + (y∗(s) – c0)2 = .
Consequently, s (x∗(s), y∗(s)) : [0, L] → R2 is the parametric representation of a circle with centre at (a0, c0) ∈ R2 and radius equal to
.
(1)Call un the nth vector in the list. If {un : n ∈ N} were an orthonormal basis, then
a contradiction. So the given set is not an orthonormal basis.
(2)Let us call the evenly indexed vectors vn, and the oddly indexed ones wn. Then clearly 〈vi, vj〉 = 〈wi, wj〉 = 〈vi, wj〉 = 0 whenever i ≠ j, since there are no overlapping nonzero terms. Also 〈vi, wi〉 = 0.
Finally ||vi|| = ||wi|| = 1. This shows that the given set B is orthonormal. In order to show density, we note that and
. Thus span B = span{en : n ∈ N}, and the latter is dense in ℓ2.
If X is a real vector space, then let KQ := Q, while if X is a complex vector space, then let KQ := Q + iQ. Set
Then D is countable. Let x ∈ X, and ε > 0.
Then there exists an N such that
Let cn ∈ KQ, n = 1, ···, N, be such that .
Then with y := cnun ∈ B, we have
Thus X is separable.
We have for λ ≠ μ that
On the other hand, ||e^(iλx)||^2 = 1. Thus
Hence
Suppose now that X is separable, with a dense subset D = {d1, d2, d3, ···}. Then for each λ ∈ R, there exists a dλ ∈ D such that ||e^(iλx) – dλ|| < 1/√2.
This gives us the existence of a map λ ↦ dλ : R → D.
This map is injective since if λ ≠ μ, then
giving ||dλ − dμ|| > 0, and in particular dλ ≠ dμ.
But this is absurd, since R is uncountable, while D is countable!
So the space under consideration is not separable.
Solution to Exercise 4.33, page 189
For n ∈ N, set
If Un has more than n − 1 elements, then for any distinct ui1, · · · , uin ∈ Un,
(where the former inequality is by virtue of the fact that the uik ’s belong to Un, and the latter is Bessel’s Inequality). So we obtain ||x||2 < ||x||2, which is absurd. Thus Un has at most n − 1 elements. Hence each Un is finite. But
and as each Un is finite, their union U is at most countable.
Consequently, 〈x, ui〉 is nonzero for at most a countable number of the ui ’s.
Solution to Exercise 4.34, page 190
(1)We have for all x ∈ H that |φy(x)| = |〈x, y〉| ≤ ||x|| ||y||, and so ||φy|| ≤ ||y||.
If y = 0, then ||φy|| ≤ ||y|| = 0, and so ||φy|| = 0 = ||y||.
If y ≠ 0, then define z = , and observe that ||z|| = 1, so that
Hence it follows that ||φy|| = ||y||.
(2)Let y ∈ H\{0}. Then for x ∈ H,
and so φiy = −iφy.
Also ||φy|| = ||y|| ≠ 0, so that φy ≠ 0, the zero linear functional.
If the map η ↦ φη : H → CL(H, C) were linear, then in particular, we would have φiy = iφy, and from the above, we would then get iφy = −iφy, giving φy = 0, which is absurd.
Solution to Exercise 4.35, page 195
We will show that Y := ran P = ker(I − P), and since the kernel of the continuous linear transformation I − P is closed, it follows that Y is closed.
That ran P = ker(I − P): If y ∈ ran P, then y = Px for some x ∈ H. Then
So y ∈ ker(I − P). Hence ran P ⊂ ker(I − P).
On the other hand, if y ∈ ker(I − P), then (I − P)y = 0 and so y = Py ∈ ran P. Thus ker(I − P) ⊂ ran P as well.
It remains to show that P = PY. We will use (ran P)⊥ = ker(P∗) = ker P, where the last equality follows thanks to the self-adjointness of P. Let x ∈ H. Then x = PY x + PY ⊥ x. But PY ⊥ x ∈ Y⊥ = ker P, and so
As PY x ∈ Y = ran P, PY x = Px1 for some x1 ∈ H.
Thus P (PY x) = P (Px1) = P2 x1 = Px1 = PY x. Hence Px = P (PY x) = PY x.
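In finite dimensions this characterisation is easy to illustrate. The sketch below (not part of the original solution; the vector and matrix are arbitrary choices) verifies that a self-adjoint idempotent matrix acts as the orthogonal projection onto its range.

```python
import numpy as np

v = np.array([[1.0], [2.0], [2.0]]) / 3.0    # unit vector in R^3
P = v @ v.T                                  # rank-one orthogonal projection
assert np.allclose(P, P.T)                   # self-adjoint
assert np.allclose(P @ P, P)                 # idempotent

x = np.array([1.0, -1.0, 4.0])
Px = P @ x
assert np.allclose(P @ Px, Px)               # Px lies in ran P = ker(I - P)
assert abs((x - Px) @ v.ravel()) < 1e-12     # x - Px is orthogonal to ran P
```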
Solution to Exercise 4.36, page 195
and so T1 is self-adjoint, while T2 is skew-adjoint.
Moreover,
In order to show uniqueness, suppose that T′1, T′2 are self-adjoint and skew-adjoint respectively such that T = T′1 + T′2. Then T1 + T2 = T′1 + T′2, and so we obtain T1 − T′1 = T′2 − T2. As the left-hand side is self-adjoint, and the right-hand side is skew-adjoint, both sides must be zero. (Indeed, if S := T1 − T′1 = T′2 − T2 is the common value, then S = S∗ = −S, and so 2S = 0, that is, S = 0.)
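The decomposition is the usual one, T1 = (T + T∗)/2 and T2 = (T − T∗)/2; the sketch below (not part of the original solution; the matrix is randomly generated) checks both the adjointness properties and the sum on a complex matrix.

```python
import numpy as np

rng = np.random.default_rng(2)
T = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))

T1 = (T + T.conj().T) / 2                # self-adjoint part
T2 = (T - T.conj().T) / 2                # skew-adjoint part
assert np.allclose(T1, T1.conj().T)      # T1* = T1
assert np.allclose(T2, -T2.conj().T)     # T2* = -T2
assert np.allclose(T1 + T2, T)           # T = T1 + T2
```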
Solution to Exercise 4.37, page 195
. Define T : ℓ2 → ℓ2 by Tk =
.
Then T is well-defined and T ∈ CL(ℓ2). We will show that Λ∗ = T.
For all h = (hn)n∈N and k = (kn)n∈N in ℓ2, we have
Thus Λ∗ = T.
Solution to Exercise 4.38, page 195
We’ll show that I∗ is given by
I∗ ∈ CL(L2[0, 1]) by Example 2.10 (page 70), with
For h, k ∈ L2 [0, 1], we have
and so I∗ ∈ CL(L2 [0, 1]) is given by
Solution to Exercise 4.39, page 195
(TA)∗ = TA∗, where
Thus T∗A is clockwise rotation through an angle θ in the plane.
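For a real matrix operator the adjoint is the transpose, so this claim reduces to a small matrix identity. The sketch below (not part of the original solution; the angle is arbitrary) checks that the transpose of a rotation matrix is the rotation through −θ.

```python
import numpy as np

theta = 0.7
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])       # rotation through theta
R_minus = np.array([[np.cos(-theta), -np.sin(-theta)],
                    [np.sin(-theta),  np.cos(-theta)]])  # rotation through -theta
assert np.allclose(R.T, R_minus)          # adjoint = clockwise rotation
assert np.allclose(R.T @ R, np.eye(2))    # rotations are orthogonal
```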
Solution to Exercise 4.40, page 196
For x ∈ H, we have
(Note that x′ ∈ Y⊥n because 〈x′, ui〉 = 0 for all i = 1, · · · ,n.)
So for all x ∈ H. For all x ∈ H, we have
since
Solution to Exercise 4.41, page 196
(1)If B′ = {u′n: n ∈ N} is another orthonormal basis, then
On the other hand, we also have
and so
(2)We will verify simultaneously the norm and subspace axioms:
(N1/S3) We have for all T ∈ S2(H) that
Now let T ∈ S2 (H) and ||T||HS = 0. Then
So Tun = 0 for all n. But then for all x ∈ H, we have
Consequently T = 0.
Clearly 0 ∈ S2 (H) since ||0||HS = 0 < ∞.
(N2/S2) For all T ∈ S2 (H) and α ∈ K, we have
and so ||α · T||HS = |α| ||T||HS.
Note that we’ve also shown for all T ∈ S2 (H), α ∈ K, that α · T ∈ S2 (H).
(N3/S1) Finally, if T1, T2 ∈ S2 (H), then we have
and so ||T1 + T2||HS ≤ ||T1||HS + ||T2||HS.
Also, this shows that for all T1, T2 ∈ S2 (H), T1 + T2 ∈ S2(H).
and so ||T|| = ||T∗|| ≤ ||T||HS.
Solution to Exercise 4.42, page 197
As CL(H) is an algebra, Λ(T) ∈ CL(H). We verify linearity:
(L1) For T1, T2 ∈ CL(H),
(L2) Λ(αT) = A∗(αT) + (αT)A = α(A∗T + TA) = αΛ(T), for all T ∈ CL(H), α ∈ K.
Continuity: For T ∈ CL(H),
and so Λ ∈ CL(CL(H)).
If T ∈ CL(H) is such that T = T∗, then
So Λ(T) is self-adjoint.
Solution to Exercise 4.43, page 197
Let (Tn)n∈N be a sequence of self-adjoint operators in CL(H) that converges to T ∈ CL(H). We’d like to show that for all x, y ∈ H, 〈Tx, y〉 = 〈x, Ty〉. As we have ||Tnx − Tx|| ≤ ||Tn − T|| ||x||, it follows that (Tnx)n∈N converges to Tx, and similarly, (Tny)n∈N converges to Ty. Thus
Solution to Exercise 4.44, page 197
Let μ ∈ ρ(T). Then there is an S ∈ CL(H) such that S(μI − T) = I = (μI − T)S. Taking adjoints, we obtain
Thus μ∗ I − T∗ is invertible in CL(H), and so μ∗ ∈ ρ(T∗).
So we have proved that for all T ∈ CL(H).
Applying this to T∗ instead of T gives:
Consequently for all T ∈ CL(H), μ ∈ ρ(T) if and only if μ∗ ∈ ρ(T∗).
We had seen that R = L∗ and that σ(L) = {z ∈ C : |z| ≤ 1}.
From the above, we obtain σ(R) = C\ρ(R) = C\(ρ(L))∗ = C\ρ(L) = σ(L).
Consequently the spectrum of R is the same as that of L, namely the closed unit disc {z ∈ C : |z| ≤ 1} in the complex plane.
Solution to Exercise 4.45, page 197
We have for λ ∉ {0, 1},
and similarly
The previous part shows that σ(PY) ⊂ {0, 1}.
We now show that both 0 and 1 are eigenvalues, so that σ(PY) = σp(PY) = {0, 1}.
As Y is a proper subspace, Y ≠ {0}. So there exist nonzero vectors y in Y, and all of these are eigenvectors of PY with eigenvalue 1: PY y = y = 1 · y.
Also, as Y is a proper subspace, Y ≠ H.
If Y⊥ = {0}, then we have that Y = (Y⊥)⊥ = {0}⊥ = H, a contradiction.
Thus Y⊥ ≠ {0}. But this means that there exist nonzero vectors x in Y⊥.
All of these are eigenvectors of PY with eigenvalue 0, since PY x = 0 = 0 · x.
Solution to Exercise 4.46, page 197
Let λ ∈ σp (U) with eigenvector v ≠ 0.
Then Uv = λv, and so |λ|^2 ||v||^2 = 〈λv, λv〉 = 〈Uv, Uv〉 = 〈U∗Uv, v〉 = 〈Iv, v〉 = ||v||^2.
Thus |λ| = 1, that is, λ lies on the unit circle with centre 0 in the complex plane.
If v1, v2 ∈ H\{0} are eigenvectors of U corresponding to distinct eigenvalues λ1, λ2, then we have
and so 〈v1, v2〉 = 0.
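Both facts can be illustrated on a concrete unitary matrix. The sketch below (not part of the original solution; a plane rotation through an arbitrary angle serves as U) checks that the eigenvalues have modulus 1 and that eigenvectors for the two distinct eigenvalues are orthogonal.

```python
import numpy as np

theta = 1.1
U = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])   # real orthogonal => unitary
assert np.allclose(U.conj().T @ U, np.eye(2))

lam, V = np.linalg.eig(U)                 # eigenvalues exp(+i*theta), exp(-i*theta)
assert np.allclose(np.abs(lam), 1.0)      # |lambda| = 1: on the unit circle
assert abs(np.vdot(V[:, 0], V[:, 1])) < 1e-12   # eigenvectors orthogonal
```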
Solution to Exercise 4.47, page 199
The spectrum of T is real, and hence T + iI is invertible in CL(H). Since (T + iI)(T − iI) = T2 + I = (T − iI)(T + iI), it follows by pre- and post-multiplying with (T + iI)−1 that (T − iI)(T + iI)−1 = (T + iI)−1(T − iI) =: U. Hence we have
So
Thus U is unitary. We have
Hence I − U is invertible in CL(H) with inverse . Similarly,
So
Solution to Exercise 4.48, page 199
(1)Suppose that PY ≤ PZ. If y ∈ Y, then
So PZ⊥ y = 0, giving y = PZ y + PZ⊥ y = PZ y + 0 = PZ y ∈ Z. Thus Y ⊂ Z.
(2)Now let Y ⊂ Z and x ∈ H. We have PZ x = PY x + (PZ x − PY x).
We first show that PZx − PYx is perpendicular to PYx.
As x = PY x + PY⊥x = PZ x + PZ⊥x, we have PZ x − PY x = PY⊥x − PZ⊥x.
So 〈PY x, PZ x − PY x〉 = 〈PY x, PY⊥x − PZ⊥x〉 = 0, since PY x ∈ Y ⊂ Z is orthogonal to both PY⊥x and PZ⊥x.
Hence
Consequently, PY ≤ PZ.
Solution to Exercise 4.49, page 204
(1)By the Fundamental Theorem of Calculus,
So .
As f(x) ≥ 0 for all x, we must have that L ≥ 0. Suppose that L > 0.
Then there exists an R > 0 such that for all x > R, L − f(x) ≤ |f(x) − L| < L/2, and in particular, f(x) > L/2 for all x > R. Hence for all x > R,
which is absurd. Hence L = 0.
(2)We apply part (1) with f (x) := |Ψ(x)|2.
We note that f′ = (|Ψ|^2)′ = (ΨΨ∗)′ = Ψ′Ψ∗ + Ψ(Ψ′)∗, and so |f′| ≤ 2|Ψ| |Ψ′|.
Thus
So
To show that , we apply the above to x ↦ Ψ(−x), and note that if Ψ, Ψ′ ∈ L2(R), then so do Ψ(−·) and (Ψ(−·))′ = −Ψ′(−·).
Solution to Exercise 4.50, page 205
We have for self-adjoint A, B that
[A, B]∗ = (AB − BA)∗ = B∗ A∗ − A∗B∗ = BA − AB = −(AB − BA) = −[A, B].
Solution to Exercise 4.51, page 205
We have
Similarly,
Hence
Solution to Exercise 4.52, page 205
If n = 1, then [Q, P] = −[P, Q] = −(−iℏI) = iℏ · 1 · Q^(1−1), and so the claim is true. If [Q^n, P] = iℏnQ^(n−1) for some n ∈ N, then we have
and so the claim follows for all n ∈ N by induction.
Solution to Exercise 4.53, page 207
We have in the classical case that
Thus {Q^2, P^2} = 4QP.
In the quantum mechanical case, we have, using Exercise 4.52, page 205, that
Thus (since otherwise QP = PQ, which is false since [Q, P] = iℏI ≠ 0).
QP is not self-adjoint, since if it were self-adjoint, then for all compactly supported Ψ and Φ, we would have
which would give iℏΦ = [Q, P]Φ = 0, which is clearly false for nonzero Φ! On the other hand, for all Ψ and Φ, we have
Solution to Exercise 4.54, page 207
We have
and so t ↦ ||ψ(t)||^2 is constant, giving ||ψ(t)||^2 = ||ψ(0)||^2 = 1.
Solution to Exercise 4.55, page 207
As V ≡ 0 for x ∈ (0, π), we have , that is,
.
Depending on the sign of E, the solution is given by
If E = 0, then the conditions X(0) = X(π) = 0 give A = B = 0. So X ≡ 0.
If E < 0, then the conditions X(0) = X(π) = 0 imply that A = B = 0, so that X ≡ 0.
So only the case E > 0 remains. The condition X(0) = 0 gives A = 0.
The condition X(π) = 0 implies B sin(kπ) = 0, where k > 0 is determined by E.
As we want nontrivial solutions, we know B ≠ 0 (otherwise X ≡ 0).
So sin(kπ) = 0, giving k = n for some n ∈ N.
Thus (discrete/“quantised” energy levels!).
We have |Ψ(x, t)| = |X(x)||T (t)| = |X(x)| · |C| = |C| · |B| · | sin(nx)|.
The plots of |Ψ|^2 = constant · (sin(nx))^2 when n = 1, 2 are shown below.
When n = 1, the probability is
When n = 2, the probability is
Solutions to the exercises from Chapter 5
Solution to Exercise 5.1, page 215
(1)Tm is linear:
(L1) For all x1, x2 ∈ H,
(L2) For all x ∈ H and α ∈ K,
So Tm is a linear transformation. Next we prove continuity: for all x ∈ H,
Conclusion: Tm ∈ CL(H).
For x ∈ H we have
(2)As , we have
Thus (Tm)m∈N converges to T in CL(H). Since the range of Tm is contained in the span of Tu1, · · ·, Tum, it follows that Tm has finite rank, and so Tm is compact. As T is the limit in CL(H) of a sequence of compact operators, it follows that T is compact.
Solution to Exercise 5.2, page 216
(1)(L1) For x1, x2 ∈ H, we have
(L2) For α ∈ K and x ∈ H, we have
Continuity: For x ∈ H, we have
So x0 ⊗ y0 ∈ CL(H), and ||x0 ⊗ y0|| ≤ ||x0|| ||y0||.
(2)As ran(x0 ⊗ y0) ⊂ span{x0}, we have that x0 ⊗ y0 has finite rank, and so it is compact.
(3)For all x ∈ H,
Since this is true for all x ∈ H, we conclude that A(x0 ⊗ y0)B = (Ax0) ⊗ (B∗y0).
Solution to Exercise 5.3, page 217
(1)Let H = ℓ2, and T be diagonal with 2 × 2 nilpotent blocks
More explicitly, T (a1, a2, a3, a4, a5, a6, · · ·) = (a2, 0, a4, 0, a6, 0, · · ·), for all (an)n∈N ∈ ℓ2. Thus T ∈ CL(ℓ2). Also, T2 = 0 is compact.
But if we take the bounded sequence (e2n)n∈N, then (Te2n)n∈N = (e2n−1)n∈N, and this has no convergent subsequence. Hence T is not compact.
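A finite truncation makes the action of T concrete (a sketch, not part of the original solution; no finite matrix captures the full operator on ℓ2, where the non-compactness argument via (e2n)n∈N lives).

```python
import numpy as np

# Truncation of T(a1, a2, a3, a4, ...) = (a2, 0, a4, 0, ...): block-diagonal
# with 2x2 nilpotent blocks [[0, 1], [0, 0]].
N = 6
T = np.zeros((N, N))
for k in range(0, N, 2):
    T[k, k + 1] = 1.0                    # entry a_{2j} moves to slot 2j - 1

a = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
assert np.allclose(T @ a, [2.0, 0.0, 4.0, 0.0, 6.0, 0.0])
assert np.allclose(T @ T, 0.0)           # T^2 = 0, which is (trivially) compact
```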
(2)Suppose that (xn)n∈N is a bounded sequence in H, and ||xn|| ≤ M for all n. Since T2 is compact, (T2xn)n∈N has a convergent subsequence, say (T2 xnk)k∈N. We will show that (T xnk)k∈N is also convergent, by showing that it is Cauchy. We have for j, k that
and so (T xnk)k∈N is Cauchy. As H is a Hilbert space, it follows that (T xnk)k∈N is convergent. Hence T is compact.
Solution to Exercise 5.4, page 217
(1)True.
Neither I nor −I is compact, but their sum is 0, which is compact.
(3)True.
(4)False.
See the example in the solution to Exercise 5.3, part (1), page 217.
Alternately, we could take two diagonal operators on ℓ2 corresponding to the sequences (1, 0, 1, 0, 1, 0, · · ·) and (0, 1, 0, 1, 0, 1, · · ·).
Solution to Exercise 5.5, page 217
If T ∈ K(H), then as A∗ ∈ CL(H), we have A∗T ∈ K(H). Also, TA ∈ K(H) because T ∈ K(H) and A ∈ CL(H). Since A∗T and TA are in K(H), also their sum A∗T + TA ∈ K(H), that is, Λ(T) ∈ K(H). Thus K(H) is Λ-invariant.
Solution to Exercise 5.6, page 226
We have ker T = {0}. So the closure of ran T equals (ker T∗)⊥ = (ker T)⊥ = {0}⊥ = H.
So T has infinite rank. Let x ∈ H (the closure of ran T), and ε > 0. Then there exists a y ∈ ran T such that ||x − y|| < ε/2.
As y ∈ ran T, we have y = T x′, for some x′ ∈ H, and
So there exists an N such that with
we have ||y − z|| < ε/2. Consequently, ||x − z|| ≤ ||x − y|| + ||y − z|| < ε/2 + ε/2 = ε, and so span{un : n ∈ N} is dense in H. Since {un : n ∈ N} is also an orthonormal set, it follows that it is an orthonormal basis for H.
Solution to Exercise 5.7, page 226
We note that each eigenvalue λ of T is nonnegative because if u is a corresponding unit-norm eigenvector, then λ = λ · 1 = λ〈u, u〉 = 〈Tu, u〉 ≥ 0. By the spectral theorem, we know that there exists a sequence of orthonormal eigenvectors u1, u2, u3, ··· of T with corresponding eigenvalues λ1 ≥ λ2 ≥ λ3 ≥ ··· ≥ 0.
We will show that for all x ∈ H, the series √T x := ∑_{n=1}^∞ √λn 〈x, un〉 un converges in H.
For N > M, we have,
In the above, we have used Bessel’s Inequality to get the last inequality.
Hence the sequence of partial sums is Cauchy in H. As H is a Hilbert space, it converges in H. Consequently √T x is well-defined for all x ∈ H.
Also, it is easy to see that √T is a linear transformation.
Continuity: For all N ∈ N,
Passing the limit N → ∞, we obtain ||√T x||^2 ≤ λ1 ||x||^2, and so √T ∈ CL(H).
We have for all x ∈ H that
So (√T)^2 = T.
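For a finite-dimensional analogue (a sketch, not part of the original solution; a randomly generated positive semidefinite matrix stands in for the positive compact operator T), the same spectral recipe produces a self-adjoint square root.

```python
import numpy as np

rng = np.random.default_rng(4)
B = rng.standard_normal((4, 4))
T = B @ B.T                               # positive semidefinite, self-adjoint

lam, U = np.linalg.eigh(T)                # orthonormal eigenvectors, lam >= 0
lam = np.clip(lam, 0.0, None)             # guard against tiny negative round-off
sqrtT = U @ np.diag(np.sqrt(lam)) @ U.T   # sum_k sqrt(lam_k) <., u_k> u_k
assert np.allclose(sqrtT @ sqrtT, T)      # (sqrt T)^2 = T
assert np.allclose(sqrtT, sqrtT.T)        # the square root is self-adjoint
```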
Solutions to the exercises from Chapter 6
Solution to Exercise 6.1, page 232
(1)Since exists, given an ε > 0, there exists a δ > 0 such that
whenever 0 < |h| < δ. Consider the interval [0, h] for some h which satisfies 0 < h < δ. Since f is differentiable in (0, h) and continuous on [0, h], it follows from the Mean Value Theorem that
Thus |θh| < δ and so
So for all h ∈ (0, δ),
Applying the Mean Value Theorem on [−h, 0], where 0 < h < δ, we also get
for all h ∈ (−δ, 0). Consequently, for all h satisfying 0 < |h| < δ, we have
that is, f is differentiable at 0, and
shows that f′ is continuous at 0. It was given that f′ is also continuous on R∗. So f is continuously differentiable on R.
(2)Applying the result from part (1) above, to the function f(n−1) : R → R, we obtain that f(n−1) is continuously differentiable on R, that is, f is n times continuously differentiable on R.
(3)We’ll show that for x > 0, where pk is a polynomial.
This holds for k = 1: f(x) = e−1/x for x > 0, and so
If the claim holds for some k, then
where is a polynomial.
Now e^(1/x) e^(–1/x) = 1, and since we have , it follows that
for x > 0. So 0 < x^(–2n) e^(–1/x) < (2n + 1)! x for x > 0. Thus
Consequently
By the previous part, it follows that f ∈ C∞(R).
Solution to Exercise 6.2, page 232
The equation says that u is constant along the lines parallel to the x-axis. So for each fixed y, there is a number Cy such that u(x, y) = Cy for all x ∈ R. But u ∈ D(R2) must have compact support, and so it is zero outside a ball B(0, R) with a large enough radius R. So Cy is forced to be 0 for all y! Hence u ≡ 0 is the only solution.
Solution to Exercise 6.3, page 232
It is clear that if Φ ∈ D(R), then Φ′ ∈ D(R). Moreover,
So we have
Now suppose that φ ∈ D(R) is such that
Define Φ by for x ∈ R. Then Φ′ = φ, and so Φ ∈ C∞.
If a > 0 is such that φ is zero outside [−a, a], then we have for x < −a that
On the other hand, for x > a,
So Φ also vanishes outside [−a, a], and hence Φ ∈ D(R).
Finally, let φ ∈ Y, and suppose that Φ1, Φ2 ∈ D(R) are such that Φ1′ = φ = Φ2′. Then (Φ1 − Φ2)′ = 0, and so Φ1 − Φ2 = C, where C is a constant. But as Φ1, Φ2 both have compact supports, it follows that C must be zero. Hence Φ1 = Φ2.
Solution to Exercise 6.4, page 233
From the solution to Exercise 6.3, page 232, we know that the Φns are given by
There is some a > 0 such that all the φn vanish outside [−a, a].
Then it follows that each Φn also vanishes outside [−a, a]. Also,
Hence it follows that (Φn)n∈N converges uniformly to 0 as n → ∞. Since it follows that
for k ≥ 1. Thus for each k ≥ 1, we have that
converges uniformly to 0 (thanks to the fact that
This completes the proof that
Solution to Exercise 6.5, page 236
Suppose that such a function δ exists. Let
For n ∈ N, let φn : R → R be defined by φn(x) := φ(nx), x ∈ R.
Then φn is smooth, takes values in [0, 1], and vanishes outside [−1/n, 1/n]. So we have
a contradiction.
Solution to Exercise 6.6, page 237
(1)For all φ ∈ D(R), there exists an N ∈ N such that φ = 0 on R\[−N, N].
So the sum in the definition of 〈T, φ〉 is finite:
Hence 〈T, φ〉 is well defined for each φ ∈ D(R). The linearity is obvious.
Now suppose that . Then there exists a K ∈ N such that each φn vanishes outside [−K, K]. Also, for all |k| ≤ K,
Thus and so T ∈ D′(R).
(2)Take any φ ∈ D(R) that is positive in (0, 1) and zero outside [0, 1].
(From Example 6.1, page 230, there is a ψ ∈ D(R) that is positive on (−1, 1) and zero outside [−1, 1]. By shifting and scaling, we see that the function φ defined by φ(x) := ψ(2x − 1), x ∈ R, is one such function.)
Now define φn ∈ D(R), n ∈ N, by
We have for k ∈ N that
Thus for all
Hence for all k ≥ 0, we have
uniformly. However, we have
(3)There is no contradiction to our conclusion from (1) that T is a distribution, since we observe that there is no compact set K ⊂ R such that for all n ∈ N, φn is zero outside K: Indeed,
Solution to Exercise 6.7, page 241
The function f(x) := H(x) cos x is continuously differentiable on R\{0}, and has a jump f(0+) − f(0−) = 1 at 0. For x < 0, H(x) = 0 and so (H(x) cos x)′ = 0.
For x > 0, H(x) = 1, and so (H(x) cos x)′ = (cos x)′ = − sin x.
Moreover,
Consequently,
The function g(x) := H(x) sin x is continuously differentiable on R\{0}, and has a jump g(0+) − g(0−) = 0 at 0. For x < 0, H(x) = 0 and so (H(x) sin x)′ = 0.
For x > 0, H(x) = 1, and so (H(x) sin x)′ = (sin x)′ = cos x.
Moreover,
Consequently,
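The distributional derivative (H(x) cos x)′ = δ − H(x) sin x can be checked numerically against a single test function; the sketch below (not part of the original solution) uses the standard bump φ(x) = exp(−1/(1 − x^2)) on (−1, 1) (an arbitrary choice) and verifies −〈H cos, φ′〉 = φ(0) − 〈H sin, φ〉.

```python
import numpy as np

def trap(y, x):
    # composite trapezoidal rule (np.trapz was removed in NumPy 2.0)
    return float(np.sum((y[1:] + y[:-1]) * np.diff(x)) / 2.0)

# Since H = 0 for x < 0 and phi is negligible near x = 1, integrating over
# [0, 0.999] captures both pairings to high accuracy.
x = np.linspace(0.0, 0.999, 500_001)
phi = np.exp(-1.0 / (1.0 - x**2))                # bump test function
dphi = phi * (-2.0 * x / (1.0 - x**2)**2)        # phi'

lhs = -trap(np.cos(x) * dphi, x)                 # <(H cos)', phi> = -<H cos, phi'>
rhs = phi[0] - trap(np.sin(x) * phi, x)          # <delta - H sin, phi>
assert abs(lhs - rhs) < 1e-6
```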
Solution to Exercise 6.8, page 241
The function f(x) := |x|/2 is continuously differentiable on R\{0}, and has a jump f(0+) − f(0−) = 0 at 0. Moreover, for x > 0, |x|/2 = x/2, and so we have (|x|/2)′ = (x/2)′ = 1/2 for x > 0. On the other hand, for x < 0, |x|/2 = −x/2, and so we obtain (|x|/2)′ = (−x/2)′ = −1/2 for x < 0.
Also,
Hence where
Again, g is continuously differentiable on R\{0}.
g has a jump of g(0+) − g(0−) = 1/2 − (−1/2) = 1 at 0.
Also g is constant for x > 0 (respectively for x < 0), and so g′(x) = 0 for x > 0 (respectively for x < 0).
Also,
Hence
Solution to Exercise 6.9, page 241
(1)Let us first consider the case when ℓ ≡ 0.
Then V = ker ℓ ⊂ ker L implies that ker L = V too, and so L = 0 as well.
Thus we may simply take c = 0, and then clearly L = 0 = 0ℓ is valid.
Now let us suppose that ℓ ≠ 0.
Then there is a vector v0 ∈ V such that ℓ(v0) ≠ 0.
This vector v0 must be nonzero, for otherwise ℓ(v0) = 0.
(To show the desired decomposition of an arbitrary vector as v = cvv0 + w,
with w ∈ ker ℓ, we need to find the appropriate scalar cv, because then we can set w := v − cvv0. To find what cv might work, we apply ℓ on both sides to obtain ℓ(v) = cvℓ(v0) + ℓ(w) = cvℓ(v0) + 0 = cvℓ(v0).
So it seems that should do the trick!)
Given v ∈ V, we now proceed to show that
We have and so w ∈ ker ℓ.
As w ∈ ker ℓ ⊂ ker L, we have L(w) = 0, and
Hence with we have L = cℓ.
(2)For φ ∈ D(R), 0 = 〈0, φ〉 = 〈T′, φ〉 = −〈T, φ′〉. So {φ′ : φ ∈ D(R)} ⊂ ker T.
Let 1 denote the constant function R ∋ x ↦ 1. By Exercise 6.3, page 232,
Finally, by part (1), applied to the vector space V = D(R), with L := T and ℓ := T1, we get the existence of a c ∈ C so that T = cT1 = Tc.
(Here Tc denotes the regular distribution corresponding to the constant function taking value c everywhere on R.)
Solution to Exercise 6.10, page 242
Fix any φ0 ∈ D(R)\{0} which is nonnegative everywhere; then ∫R φ0(x) dx > 0. For ψ ∈ D(R), set φ := ψ − ( ∫R ψ(x) dx / ∫R φ0(x) dx ) φ0.
As ψ and φ0 belong to D(R), so does φ. Moreover, ∫R φ(x) dx = ∫R ψ(x) dx − ( ∫R ψ(x) dx / ∫R φ0(x) dx ) ∫R φ0(x) dx = 0.
Thus φ possesses a primitive in D(R): by Exercise 6.3, page 232, there is a unique Φ ∈ D(R) such that Φ′ = φ.
We define S : D(R) → C by 〈S, ψ〉 = −〈T, Φ〉. Let us check that S is linear.
Let ψ1, ψ2 ∈ D(R), and let Φ1, Φ2 ∈ D(R) be such that Φ1′ = ψ1 − ( ∫R ψ1 dx / ∫R φ0 dx ) φ0 and Φ2′ = ψ2 − ( ∫R ψ2 dx / ∫R φ0 dx ) φ0.
Then (Φ1 + Φ2)′ = (ψ1 + ψ2) − ( ∫R (ψ1 + ψ2) dx / ∫R φ0 dx ) φ0, and so, by uniqueness, Φ1 + Φ2 is the primitive associated with ψ1 + ψ2.
So 〈S, ψ1 + ψ2〉 = −〈T, Φ1 + Φ2〉 = −〈T, Φ1〉 − 〈T, Φ2〉 = 〈S, ψ1〉 + 〈S, ψ2〉.
Similarly, 〈S, αψ〉 = α〈S, ψ〉 for all ψ ∈ D(R) and all α ∈ C.
Now we check the continuity of S. Let (ψn)n∈N be a sequence in D(R) such that ψn → 0 in D(R) as n → ∞. Then there exists an a > 0 such that all the ψn vanish outside [−a, a], and (ψn)n∈N converges uniformly to 0 as n → ∞, giving ∫R ψn(x) dx → 0 as n → ∞.
Now set φn := ψn − ( ∫R ψn(x) dx / ∫R φ0(x) dx ) φ0.
Then there exists a b > 0 such that each φn vanishes outside [−b, b].
Also, for k ≥ 0, φn(k) = ψn(k) − ( ∫R ψn(x) dx / ∫R φ0(x) dx ) φ0(k).
So for each k ≥ 0, (φn(k))n∈N
converges uniformly to 0. Thus φn → 0 in D(R) as n → ∞.
Let Φn be the unique element in D(R) such that Φn′ = φn.
From Exercise 6.4, page 233, we can conclude that Φn → 0 in D(R) as n → ∞.
Consequently, 〈S, ψn〉 = −〈T, Φn〉 → 0 as n → ∞. Hence S ∈ D′(R).
Finally, we’ll show that S′ = T.
If ψ ∈ D(R), then ∫R ψ′(x) dx = 0, and so the φ associated with ψ′ is ψ′ itself, whose unique primitive in D(R) is ψ.
Thus 〈S′, ψ〉 = −〈S, ψ′〉 = 〈T, ψ〉 for all ψ ∈ D(R), that is, S′ = T.
Solution to Exercise 6.11, page 242
Let φ ∈ D(R) be such that φ(0) ≠ 0. (For example we can simply take the test function from Example 6.1, page 230.) Then xnφ ∈ D(R) too, and we have 〈δ(n), xnφ〉 = (−1)n (xnφ)(n)(0) = (−1)n n! φ(0) ≠ 0.
So δ(n) ≠ 0.
Solution to Exercise 6.12, page 242
It is enough to show the linear independence of δ, δ′, · · ·, δ(n) for each n. Suppose that there are scalars c0, c1, · · ·, cn such that c0δ + c1δ′ + · · · + cnδ(n) = 0. Let φ ∈ D(R), and for λ > 0, set φλ(x) := φ(λx), for all x ∈ R. Then 0 = 〈c0δ + c1δ′ + · · · + cnδ(n), φλ〉 = c0φ(0) − c1λφ′(0) + · · · + (−1)n cnλnφ(n)(0).
The polynomial λ ↦ c0φ(0) − c1φ′(0)λ + · · · + (−1)n cnφ(n)(0)λn is zero on {λ : λ > 0}, and hence must be identically zero. So c0φ(0) = · · · = cn φ(n)(0) = 0. As the choice of φ was arbitrary, we have that for all test functions φ ∈ D(R), c0φ(0) = c1φ′(0) = · · · = cnφ(n)(0) = 0. (∗)
But if we look at the φ from Example 6.1, page 230, then φ(0) ≠ 0, and also xnφ, n ∈ N, belongs to D(R), which moreover satisfies (xnφ)(j)(0) = 0 for 0 ≤ j < n, and (xnφ)(n)(0) = n! φ(0) ≠ 0.
So using φ, xφ, · · ·, xnφ as the test functions in (∗), we obtain c0 = · · · = cn = 0.
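The scaling identity underlying this argument, 〈δ(k), φ(λ·)〉 = (−1)k λk φ(k)(0), can be verified symbolically (an illustration only; sympy is assumed available, and the particular φ below is a hypothetical smooth stand-in for a test function).

```python
# Symbolic check of <delta^{(k)}, phi(lambda .)> = (-1)^k lambda^k phi^{(k)}(0),
# the identity that produces the polynomial in lambda used above.
import sympy as sp

x, lam = sp.symbols('x lambda')
phi = sp.exp(-x**2)*(1 + x + x**3)   # smooth stand-in for a test function

diffs = []
for k in range(4):
    # pairing with phi(lambda x): differentiate k times, evaluate at 0
    lhs = (-1)**k * sp.diff(phi.subs(x, lam*x), x, k).subs(x, 0)
    rhs = (-1)**k * lam**k * sp.diff(phi, x, k).subs(x, 0)
    diffs.append(sp.simplify(lhs - rhs))
print(diffs)  # all zero
```

Since distinct powers of λ appear, a vanishing linear combination forces each coefficient ckφ(k)(0) to vanish, which is exactly how the proof concludes.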
Solution to Exercise 6.13, page 242
For any φ ∈ D(Rd), we have
So
Solution to Exercise 6.14, page 242
and so it defines a regular distribution on R2.
For φ ∈ D(R2), with a > 0 such that φ ≡ 0 on R2\(−a, a)2, we have
Thus
Solution to Exercise 6.15, page 242
If u : R2 → R is a radial function, say u(x) = f(r), where r = ||x||2, then Δu = f″(r) + (1/r) f′(r) for r > 0.
Thus Δ(log r) = −1/r2 + 1/r2 = 0 for all r > 0. Since for all R > 0 we have ∫B(0,R) |log r| dx = 2π ∫0R r |log r| dr < ∞,
we conclude that log r is locally integrable on R2, and hence defines a regular distribution on R2.
For φ ∈ D(R2) which vanishes outside the ball B(0, R), we have 〈Δ(log r), φ〉 = 〈log r, Δφ〉 = ∫R2 (log r)(Δφ) dx.
(log r)(Δφ) is integrable, as log r is locally integrable, and Δφ = 0 outside a ball.
Using Green’s formula in the annulus Ω := {x ∈ R2 : ε < ||x||2 < R} (with the boundary ∂Ω being the union of the two circles S(ε) = {x : ||x||2 = ε} and S(R) = {x : ||x||2 = R}), for the functions u = log r and v = φ, we obtain, since φ and ∇φ vanish on S(R),
∫Ω (log r)(Δφ) dx = ∫S(ε) (log r)(∂φ/∂n) ds − ∫S(ε) φ (∂(log r)/∂n) ds.
We’ll show below that the first integral on the right-hand side is O(ε log(1/ε)), and thus it tends to 0 as
ε → 0.
As ∂φ/∂n = ∇φ · n and ||n(x)||2 = 1, the Cauchy-Schwarz Inequality gives |(∂φ/∂n)(x)| ≤ ||∇φ(x)||2 ≤ M,
where M := max{||∇φ(x)||2 : ||x||2 ≤ R}. Finally, |∫S(ε) (log r)(∂φ/∂n) ds| ≤ |log ε| · M · 2πε → 0 as ε → 0.
Next we will look at the second integral, −∫S(ε) φ (∂(log r)/∂n) ds.
First, on S(ε) the outward unit normal of Ω points towards the origin, and so ∂(log r)/∂n = −1/ε there. Moreover, the circle S(ε) has length 2πε.
Given η > 0, |(1/(2πε)) ∫S(ε) φ ds − φ(0)| ≤ (1/(2πε)) ∫S(ε) |φ(x) − φ(0)| ds ≤ η,
where first ε0 > 0 is chosen small enough so that |φ(x) − φ(0)| ≤ η if ||x||2 ≤ ε0, and ε satisfies 0 < ε < ε0.
Thus (1/(2πε)) ∫S(ε) φ ds → φ(0) as ε → 0.
So −∫S(ε) φ (∂(log r)/∂n) ds = (1/ε) ∫S(ε) φ ds → 2πφ(0) as ε → 0. Hence 〈Δ(log r), φ〉 = ∫R2 (log r)(Δφ) dx = 2πφ(0) = 〈2πδ, φ〉, that is, Δ(log r) = 2πδ in D′(R2).
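The identity Δ(log r) = 2πδ can be sanity-checked numerically with a radial function (an illustration, not part of the original solution): for radial φ, Δφ = φ″ + φ′/r, and 〈Δ(log r), φ〉 reduces to a one-dimensional integral. Here φ(r) = e−r² is a hypothetical rapidly decaying stand-in for a test function; scipy is assumed available.

```python
# Numerical check of Delta(log r) = 2*pi*delta in R^2: with radial
# phi(r) = exp(-r^2), Delta(phi) = phi'' + phi'/r = (4r^2 - 4) exp(-r^2), and
# <Delta(log r), phi> = int_0^inf log(r) * Delta(phi)(r) * 2*pi*r dr.
import numpy as np
from scipy.integrate import quad

lap = lambda r: (4.0*r**2 - 4.0)*np.exp(-r**2)   # Laplacian of exp(-r^2)

val = quad(lambda r: np.log(r)*lap(r)*2.0*np.pi*r, 0, np.inf)[0]
print(val, 2*np.pi)  # should agree: 2*pi*phi(0) = 2*pi
```

The integrand behaves like r log r near 0, so the quadrature is well behaved, and the result matches 2πφ(0).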
Solution to Exercise 6.16, page 248
u is continuous on R, and continuously differentiable on R\{0}.
For x < 0, we have u′(x) = 0. For x > 0, u′(x) = 1.
Also, u(0+) − u(0−) = 0, that is, u has no jump at 0.
Thus, by the Jump Rule, u′ = H + 0 · δ = H in the sense of distributions.
So u is a weak solution of u′ = H.
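The stated properties of u (continuity, u′ = 0 for x < 0 and u′ = 1 for x > 0) are matched by u(x) = max{x, 0}; taking this as a hypothetical concrete choice, the weak equality u′ = H can be tested numerically (scipy assumed available, with φ(x) = e−x² as a rapidly decaying stand-in for a test function).

```python
# Numerical check that u(x) = max(x, 0) satisfies u' = H weakly:
# -<u, phi'> should equal <H, phi>.
import numpy as np
from scipy.integrate import quad

phi  = lambda x: np.exp(-x**2)
dphi = lambda x: -2.0*x*np.exp(-x**2)

lhs = -quad(lambda x: x*dphi(x), 0, np.inf)[0]   # -<u, phi'>; u = x on (0, inf)
rhs =  quad(phi, 0, np.inf)[0]                   # <H, phi>
print(abs(lhs - rhs))
```

Integration by parts on (0, ∞) shows the two sides agree exactly; the numerics confirm it up to quadrature error.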
Solution to Exercise 6.17, page 251
We view H(x) cos x as the product of the C∞ function cos with the regular distribution H. Using the Product Rule, we have (cos · H)′ = (cos)′ · H + cos · H′ = −H(x) sin x + (cos x) δ = −H(x) sin x + cos(0) δ = −H(x) sin x + δ.
Similarly, (sin · H)′ = (sin)′ · H + sin · H′ = H(x) cos x + (sin x) δ = H(x) cos x + sin(0) δ = H(x) cos x.
Solution to Exercise 6.18, page 251
(1)We have
Hence
(2)When
If the claim is true for some n ∈ N, then
So the claim follows for all n ∈ N by induction.
(3)We have
Consequently,
Solution to Exercise 6.19, page 252
We have for all φ ∈ D(R) that 〈αδ′, φ〉 = 〈δ′, αφ〉 = −(αφ)′(0) = −α′(0)φ(0) − α(0)φ′(0) = 〈α(0)δ′ − α′(0)δ, φ〉.
So αδ′ = α(0)δ′ − α′(0)δ. In particular, xδ′ = 0δ′ − 1δ = −δ.
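This computation can be verified symbolically by pairing both sides with a smooth function (an illustration only; sympy is assumed available, and the particular α and φ below are hypothetical choices).

```python
# Symbolic check of alpha*delta' = alpha(0)*delta' - alpha'(0)*delta,
# paired with a smooth stand-in phi for a test function.
import sympy as sp

x = sp.symbols('x')
alpha = sp.sin(x) + 2            # arbitrary smooth multiplier
phi   = sp.exp(-x**2)*(x + 3)    # smooth stand-in for a test function

lhs = -sp.diff(alpha*phi, x).subs(x, 0)   # <alpha*delta', phi> = -(alpha*phi)'(0)
a0  = alpha.subs(x, 0)                    # alpha(0)
a1  = sp.diff(alpha, x).subs(x, 0)        # alpha'(0)
rhs = -a0*sp.diff(phi, x).subs(x, 0) - a1*phi.subs(x, 0)
print(sp.simplify(lhs - rhs))  # 0
```

With α(x) = x this reproduces xδ′ = −δ, as in the solution above.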
Solution to Exercise 6.20, page 252
For all φ ∈ D(R), we have that
So
Solution to Exercise 6.21, page 252
With u := e−3yxH(y), we have
For all φ ∈ D(R2), we have
So Hence
Moreover, u(0, y) = e−0 H(y) = 1 · H(y) = H(y).
Solution to Exercise 6.22, page 252
First we note that for all φ ∈ D(R), we have 〈x · pv(1/x), φ〉 = 〈pv(1/x), xφ〉 = ∫R φ(x) dx = 〈T1, φ〉,
where 1 is the constant function R ∋ x ↦ 1. Thus x · pv(1/x) = T1.
Suppose, on the contrary, that it is possible to define an associative and commutative product such that for α ∈ C∞(R) and T ∈ D′(R), it agrees with Definition 6.6, page 249. Then
(pv(1/x) · x) · δ = T1 · δ = δ,
whereas pv(1/x) · (x · δ) = pv(1/x) · 0 = 0, and so δ = 0, a contradiction, violating associativity.
Solution to Exercise 6.23, page 252
(1)Let S := e−λxT. Then we have S′ = (e−λx)′ T + e−λx T′ = −λe−λxT + e−λx(λT) = 0.
From Exercise 6.9, page 241, there exists a c ∈ C such that e−λxT = c, that is, T = ceλx.
(2)Since f ∈ C∞, there exists an F ∈ C∞ such that F′ − λF = f.
(In fact, an explicit expression for one such F (for which F(0) = 0) is given by F(x) = eλx ∫0x e−λt f(t) dt. This can be checked by differentiation using the Product Rule and the Fundamental Theorem of Calculus.)
Hence we obtain (T − F)′ = T′ − F′ = (λT + f) − (λF + f) = λ(T − F).
From part (1), T − F = ceλx for some c ∈ C. Hence T = F + ceλx ∈ C∞.
Then
So P(ξ) = (ξ − λ)Q(ξ), where λ := λn and Q is a suitable polynomial.
Correspondingly, with
We’ll use induction (on the order n of D) to prove that if DT = f with f ∈ C∞, then T ∈ C∞.
This is true for n = 1, from part (2) above.
Suppose that the claim is true for all differential operators of order n.
Let D have order n + 1, and write D = (d/dx − λ) D1, where D1 has order n.
If DT = f ∈ C∞, then (d/dx − λ)(D1T) = f, and so, by part (2), D1T = Tg for some g ∈ C∞.
But by the induction hypothesis, it now follows that T = TF, with F ∈ C∞.
(4)If E is also a fundamental solution, then DE = δ.
But also DE∗ = δ, and so D(E − E∗) = 0.
Thus E − E∗ = F, where F is a classical solution of the homogeneous equation DF = 0. So E = E∗ + F.
Conversely, if F is a classical solution of the homogeneous equation DF = 0, then E := E∗ + F is a fundamental solution of D too: indeed, we have that DE = DE∗ + DF = δ + 0 = δ.
So we conclude that: E ∈ D′(R) satisfies DE = δ if and only if E = E∗ + F, where F is a classical solution of the homogeneous equation DF = 0.
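A concrete illustration (not taken from the original text) may be useful: for the first-order operator D = d/dx − λ with the hypothetical choice λ = −1, the distribution E∗ := H(x)eλx is a fundamental solution, and the defining property DE∗ = δ can be checked numerically (scipy assumed available, φ(x) = e−x² as a rapidly decaying stand-in for a test function).

```python
# Check that E* = H(x) e^{lam x} satisfies (d/dx - lam) E* = delta, for lam = -1.
import numpy as np
from scipy.integrate import quad

lam  = -1.0
phi  = lambda x: np.exp(-x**2)
dphi = lambda x: -2.0*x*np.exp(-x**2)
Est  = lambda x: np.exp(lam*x)          # E* on x > 0 (E* = 0 for x < 0)

# <DE*, phi> = <E*' - lam*E*, phi> = -<E*, phi'> - lam*<E*, phi>
val = (-quad(lambda x: Est(x)*dphi(x), 0, np.inf)[0]
       - lam*quad(lambda x: Est(x)*phi(x), 0, np.inf)[0])
print(val, phi(0))  # should agree: <delta, phi> = phi(0) = 1
```

Adding any classical solution ceλx of the homogeneous equation leaves the pairing unchanged except for terms that cancel, matching the conclusion above.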
Solution to Exercise 6.24, page 252
If T = cδ, where c ∈ C, then clearly xT = x(cδ) = 0.
Now suppose that T ∈ D′(R) is such that xT = 0.
This means that for all φ ∈ D(R), we have 0 = 〈xT, φ〉 = 〈T, xφ〉.
Hence {xφ : φ ∈ D(R)} ⊂ ker T. We will now identify the set on the left-hand side as ker δ = {ψ ∈ D(R) : ψ(0) = 0}, and then use part (1) of Exercise 6.9, page 241.
First, let us note that if ψ = xφ, where φ ∈ D(R), then ψ ∈ D(R), and moreover, ψ(0) = 0φ(0) = 0. So we have {xφ : φ ∈ D(R)} ⊂ {ψ ∈ D(R) : ψ(0) = 0}.
Next, let us show the reverse inclusion. Let ψ ∈ D(R) be such that ψ(0) = 0.
We have, by the Fundamental Theorem of Calculus: ψ(x) = ψ(x) − ψ(0) = ∫01 (d/dt)[ψ(tx)] dt = x ∫01 ψ′(tx) dt = xφ(x), where φ(x) := ∫01 ψ′(tx) dt (x ∈ R).
By differentiating under the integral sign we see that φ ∈ C∞.
If ψ is zero outside [−a, a] for some a > 0, then, as φ(x) = ψ(x)/x for x ≠ 0,
it follows that φ also vanishes outside [−a, a]. Thus φ ∈ D(R).
So we have {ψ ∈ D(R) : ψ(0) = 0} ⊂ {xφ : φ ∈ D(R)} as well.
Thus ker δ = {xφ : φ ∈ D(R)} ⊂ ker T, and by part (1) of Exercise 6.9, page 241 there exists a c ∈ C such that T = cδ.
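The key factorisation ψ(x) = xφ(x), with φ(x) = ∫01 ψ′(tx) dt, can be illustrated numerically (scipy assumed available; ψ(x) = x e−x² is a hypothetical smooth choice with ψ(0) = 0, standing in for a compactly supported one).

```python
# Numerical check of psi(x) = x * phi(x), where phi(x) = int_0^1 psi'(t*x) dt,
# for a smooth psi with psi(0) = 0.
import numpy as np
from scipy.integrate import quad

psi  = lambda x: x*np.exp(-x**2)
dpsi = lambda x: (1.0 - 2.0*x**2)*np.exp(-x**2)   # psi'

phi = lambda x: quad(lambda t: dpsi(t*x), 0, 1)[0]

errs = [abs(psi(x) - x*phi(x)) for x in (-2.0, -0.5, 0.3, 1.7)]
print(max(errs))
```

The errors are at the level of quadrature accuracy, reflecting the Fundamental Theorem of Calculus identity used in the proof.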
Solution to Exercise 6.25, page 254
First, we prove by induction that (e−x2)(n) = pn(x) e−x2, where pn is a polynomial.
This is indeed true for n = 0, with p0 := 1.
If it is true for some n, then (e−x2)(n+1) = (pn(x) e−x2)′ = (pn′(x) − 2xpn(x)) e−x2,
where pn+1(x) := pn′(x) − 2xpn(x) is a polynomial.
This finishes the proof of our claim.
Now to show e−x2 ∈ S(R), it is enough to show that sup_{x∈R} |xℓ e−x2| < ∞ for all nonnegative integers ℓ. For ℓ = 0, this is clear since |e−x2| ≤ 1 for all x ∈ R.
We have lim_{x→±∞} xℓ e−x2 = 0, and so for some M′ > 0, |xℓ e−x2| ≤ M′ for |x| > 1.
Since xℓ e−x2 is a continuous function, there is an M > 0 such that |xℓ e−x2| ≤ M
for x ∈ [−1, 1]. Consequently, sup_{x∈R} |xℓ e−x2| ≤ max{M, M′} < ∞.
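The induction claim, that each derivative of e−x² is a polynomial multiple of e−x² with the recursion pn+1 = pn′ − 2xpn, can be verified symbolically (an illustration; sympy assumed available).

```python
# Symbolic check that (e^{-x^2})^{(n)} = p_n(x) e^{-x^2}, where p_0 = 1 and
# p_{n+1} = p_n' - 2x p_n; also record the degrees of the p_n.
import sympy as sp

x = sp.symbols('x')
f = sp.exp(-x**2)

p = sp.Integer(1)          # p_0 = 1
degs, ok = [], True
for n in range(6):
    ok = ok and sp.simplify(sp.diff(f, x, n) - p*f) == 0
    degs.append(sp.degree(p, x))
    p = sp.diff(p, x) - 2*x*p     # recursion p_{n+1} = p_n' - 2x p_n
print(degs, ok)  # degrees [0, 1, 2, 3, 4, 5], ok True
```

The degree of pn grows by one at each step, consistent with the leading term (−2x)n.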
Solution to Exercise 6.26, page 254
Since φn → 0 in D(R) as n → ∞, we know that there exists an a > 0 such that all the φn vanish outside [−a, a], and moreover, φn and all its derivatives converge uniformly to 0 on [−a, a]. So for any nonnegative integers m, k, we have that sup_{x∈R} |xm φn(k)(x)| ≤ (1 + a)m sup_{x∈[−a,a]} |φn(k)(x)| → 0 as n → ∞.
So φn → 0 in S(R) as n → ∞.
Solution to Exercise 6.27, page 255
We have for φ ∈ S(R) that
From here it follows that if (φn)n∈N is a sequence in S(R) such that φn → 0 in S(R) as n → ∞, then 〈Tf, φn〉 → 0 as n → ∞. Thus Tf ∈ S′(R).
Solution to Exercise 6.28, page 256
For φ ∈ S(R), we have that
Note that in the last step, we have used the fact that the Fourier transform of an L1(R) function is bounded on R, and hence it defines a tempered distribution.
1 See for example [Sasane (2015), §2.4].
2 See for example [Sasane (2015), Chapter 6].
3 The symbol ¬ stands for “negation”. It is read as: “It is not the case that · · ·”.
4 See for example [Sasane (2015), page 311].
5 ui⊤uj = 0, unless i = j, in which case ui⊤ui = 1. Here (·)⊤ denotes transposition.
6 By the Axiom of Choice!