CHAPTER VIII

SPIN

§54 Spin

IN BOTH classical and quantum mechanics, the law of conservation of angular momentum is a consequence of the isotropy of space with respect to a closed system. This already demonstrates the relation between the angular moment um and the symmetry properties under rotation. In quantum mechanics, however, the relation in question is a particularly far-reaching one, and essentially constitutes the basic content of the concept of angular momentum, especially as the classical definition of the angular momentum of a particle as the product r × p has no direct significance in quantum mechanics, owing to the fact that position and momentum cannot be simultaneously measured.

We have seen in §28 that, if the values of l and m are specified, the angular dependence of the wave function of the particle is determined, and therefore so are all its symmetry properties under rotation. The most general formulation of these properties involves specifying the transformation of the wave functions when the coordinate system is rotated.

The wave function ψLM of a system of particles (with specified values of the angular momentum L and its component M remains unchanged only in a rotation of the coordinate system about the z-axis. Any rotation that alters the direction of this axis has the result that the z-component of the angular momentum does not have a definite value. This means that, in the new coordinates, the wave function in general becomes a superposition (a linear combination) of 2L + 1 functions corresponding to the different possible values of M for the given L. We can say that the 2L + 1 functions ψLM are transformed into linear combinations of one another when the coordinate system is rotated. The law governing this transformation (i.e. the coefficients in the superposition as functions of the angles of rotation of the coordinate axes) is entirely determined by specifying the value of L. Thus the angular momentum acquires the significance of a quantum number which classifies the states of the system according to their transformation properties under rotation of the coordinate system. This aspect of the concept of angular momentum in quantum mechanics is particularly important because it is not directly related to the explicit angular dependence of the wave functions; the law of mutual transformation of these functions can be stated without reference to that dependence.

Let us consider a composite particle, such as an atomic nucleus, which is at rest as a whole and is in a definite internal state. In addition to an internal energy, it has also an angular momentum of definite magnitude L, due to the motion of the particles within the nucleus. This angular momentum can have 2L + 1 different orientations in space. Thus, in considering the movement of a complex particle as a whole, we must assign to it, as well as its coordinates, another discrete variable: the projection of its internal angular momentum on some chosen direction in space.

However, with the preceding understanding of the concept of angular momentum, the origin of it becomes unimportant, and we naturally arrive at the concept of an “intrinsic” angular momentum which must be ascribed to the particle regardless of whether it is “composite” or “elementary”.

Thus, in quantum mechanics an elementary particle must be assigned a certain “intrinsic” angular momentum unconnected with its motion in space. This property of elementary particles is peculiar to quantum theory (it disappears in the limit ħ → 0), and therefore has in principle no classical interpretation.

The intrinsic angular momentum of a particle is called its spin, as distinct from the angular momentum due to the motion of the particle in space, called the orbital angular momentum. The particle concerned may be either elementary, or composite but behaving in some respect as an elementary particle (e.g. an atomic nucleus). The spin of a particle (measured, like the orbital angular momentum, in units of ħ) will be denoted by s.

For particles having spin, the description of the state by means of the wave function must determine the probability not only of its different positions in space but also of the possible orientations of the spin. Thus the wave function must depend not only on three continuous variables, the coordinates of the particle, but also on a discrete spin variable, which gives the value of the projection of the spin on a selected direction in space (the z-axis) and takes a limited number of discrete values, which we shall denote by σ.

Let ψ(x, y, z; σ) be such a wave function. It is essentially a set of several different functions of the coordinates, corresponding to different values of σ; these functions will be called the spin components of the wave function. The integral

image

determines the probability that the particle has a certain value of σ. The probability that the particle is in the volume element dV with any value of σ is

image

The quantum-mechanical spin operator, on being applied to the wave function, acts on the spin variable σ. In other words, it in some way linearly transforms the components of the wave function into one another. The form of this operator will be established later. However, it is easy to see from very general considerations that the operators ŝx, ŝy, ŝz satisfy the same commutation conditions as the operators of the orbital angular momentum.

The angular momentum operator is essentially the same as that of an infinitely small rotation. In deriving, in §26, the expression for the orbital angular momentum operator, we considered the result of applying the rotation operator to a function of the coordinates. In the case of the spin, this derivation becomes invalid, since the spin operator acts on the spin variable, and not on the coordinates. Hence, to obtain the required commutation relations, we must consider the operation of an infinitely small rotation in a general form, as a rotation of the system of coordinates. If we successively perform infinitely small rotations about the x-axis and the y-axis, and then about the same axes in the reverse order, it is easy to see by direct calculation that the difference between the results of these two operations is equivalent to an infinitely small rotation about the z-axis (through an angle equal to the product of the angles of rotation about the x and y-axes). We shall not pause here to carry out these simple calculations, as a result of which we again obtain the usual commutation relations between the operators of the components of angular momentum; these must therefore hold for the spin operators also:

image (54.1)

together with all the physical consequences resulting from them.

The commutation relations (54.1) enable us to determine the possible values of the absolute magnitude and components of the spin. All the results derived in §27 (formulae (27.7)(27.9)) were based only on the commutation relations, and hence are fully applicable here also; we need only replace L in these formulae by s. It follows from formula (27.7) that the eigenvalues of the component of the spin form a sequence of numbers differing by unity. However, we cannot now assert that these values must be integral, as we could for the component Ls of the orbital angular momentum (the derivation given at the beginning of §27 is invalid here, since it was based on the expression (26.14) for the operator imagez, which holds only for the orbital angular momentum).

Moreover, we find that the sequence of eigenvalues sz is limited above and below by values equal in absolute magnitude and opposite in sign, which we denote by ±s. The difference 2s between the greatest and least values of sz must be an integer or zero. Consequently s can take the values 0, ½, 1, 3/2, ….

Thus the eigenvalues of the square of the spin are

image (54.2)

where s can be either an integer (including zero) or half an integer. For given s, the component sz of the spin can take the values s, s − 1, …, −s, i.e. 2s+1 values in all. Accordingly, the wave function of a particle with spin s has 2s + 1 components.

Experiment shows that the majority of the elementary particles (electrons, positrons, protons, neutrons, μ-mesons and all hyperons (Λ, ∑, Ξ)) have a spin of image. There are also elementary particles, the π-mesons and the K-mesons, whose spin is zero.

The total angular momentum of a particle is composed of its orbital angular momentum 1 and its spin s. Their operators act on functions of different variables, and therefore, of course, commute. The eigenvalues of the total angular momentum

image (54.3)

are determined by the same “vector model” rule as the sum of the orbital angular momenta of two different particles (§31). That is, for given values of l and s, the total angular momentum can take the values l+s, l+s− 1, …, |ls|. Thus, for an electron (spin ½) with non-zero orbital angular momentum l, the total angular momentum can be j = l ± ½; for l = 0 the angular momentum j has, of course, only the one value j = ½.

The operator of the total angular momentum J of a system of particles is equal to the sum of the operators of the angular momentum j of each particle, so that its values are again determined by the vector model rules. The angular momentum J can be put in the form

image (54.4)

where S may be called the total spin and L the total orbital angular momentum of the system. We notice that, if the total spin of the system is half-integral (or integral), the same is true of the total angular momentum, since the orbital angular momentum is always integral. In particular, if the system consists of an even number of similar particles, its total spin is always integral, and therefore so is the total angular momentum.

The operators of the total angular momentum j of a particle (or J, of a system of particles) satisfy the same commutation rules as the operators of the orbital angular momentum or the spin, since these rules are general commutation rules holding for any angular momentum. The formulae (27.13) for the matrix elements of angular momentum, which follow from the commutation rules, are also valid for any angular momentum, provided that the matrix elements are defined with respect to the eigenstates of this angular momentum. Formulae (29.7)(29.10) for the matrix elements of arbitrary vector quantities also remain valid (with appropriate change of notation).

PROBLEMS

A particle with spin ½ is in a state with a definite value sz = ½. Determine the probabilities of the possible values of the component of the spin along an axis z′ at an angle θ to the z-axis.

SOLUTION.

The mean spin vector image is evidently along the z-axis and has magnitude ½. Taking the component along the z′-axis, we find that the mean value of the spin in that direction is image. We also have image) where ω± are the probabilities of the values sz′ = ± ½. Since ω+ + ω = 1, we find w+ = cos2imageθ, w− = sin2imageθ.

§55 The spin operator

In the rest of this chapter we shall not be interested in the dependence of the wave functions on the coordinates. For example, in speaking of the behaviour of the functions ψ(x, y, z; σ) when the system of coordinates is rotated, we can suppose that the particle is at the origin, so that its coordinates remain unchanged by such a rotation, and the results obtained will characterize the behaviour of the function ψ with regard to the spin variable σ.

The variable σ differs from the ordinary variables (the coordinates) by being discrete. The most general form of a linear operator acting on functions of a discrete variable σ is

image (55.1)

where the fσσ, are constants. We put imageψ in parentheses in order to emphasize that the spin argument following it is not that of the original function ψ but that of the function resulting from it under the action of the operator f. It is easy to see that the quantities fσσ, are the same as the matrix elements of the operator, defined by the usual rule (11.5).

The integration over the coordinates in (11.5) is here replaced by summation over the discrete variable, so that the definition of the matrix element is

image (55.2)

Here ψσ1(σ) and ψσ2(σ) are the eigenfunctions of the operator ŝz corresponding to the eigenvalues sz = σ1 and σ2; each such function corresponds to a state in which the particle has a definite value of sz, i.e. in which only one component of the wave function is non-zero:

image (55.3)

According to (55.1),

image

and on substitution of this and ψσ*(σ) the equation (55.2) is satisfied identically; this completes the proof.

Thus the operators acting on functions of σ can be represented in the form of (2s + 1)-rowed matrices. In particular, we have for the operator of the spin itself, acting on the wave function, by (55.1),

image (55.4)

According to what has been said at the end of §54, the matrices ŝx, ŝy, ŝz are identical with the matrices imagex, Ly, imagez obtained in §27, where the letters L and M need only be replaced by s and σ:

image (55.5)

This determines the spin operator.

In the important case of a spin of ½ (s = ½, σ = ± ½), these matrices have two rows, and are of the form

image (55.6)

where

image (55.7)

These are called Pauli matrices. The matrix image is diagonal, as it should be, since it is defined in terms of the eigenfunctions of the quantity sz itself.

The following are some specific properties of the Pauli matrices. Direct multiplication of the matrices (55.7) gives the equations

image (55.8)

Combining these with the general commutation rules (54.1), we find that

image (55.9)

i.e. the Pauli matrices anticommute with one another. By means of these equations, we can easily verify the following useful formulae:

image (55.10)

where a and b are any vectors. According to these relations, any scalar polynomial formed from the matrices ∂i can be reduced to terms independent of ŝ and terms linear in ∂; hence it follows that any scalar function of the operator ∂ reduces to a linear function (see Problem 1). Lastly, the values of the traces (sums of diagonal elements) of the Pauli matrices and their products are

image (55.11)

Subsequent sections of this chapter give a more detailed account of the spin properties of wave functions, including their behaviour under any rotation of the coordinate system, but we may note immediately an important property of these functions, namely their behaviour in respect of rotations about the z-axis.

Let there be an infinitesimal rotation through an angle δφ about the z-axis. The operator of such a rotation is expressed in terms of the angular momentum operator (in this case, the spin operator) as 1 + iδφ. ŝz. As a result of the rotation, the functions ψ(σ) therefore become ψ(σ) + δψ(σ), where

image

Writing this relation in the form dψ/dφ = iσψ(σ) and integrating, we find that a rotation through a finite angle φ changes the functions ψ(σ) into

image (55.12)

In particular, a rotation through 2π multiplies them by a factor eiσ, which is the same for all σ and is equal to (−1)2s (since 2σ always has the same parity as 2s). Thus, in a complete rotation of the coordinate system about the z-axis, the wave functions of particles with integral spin return to their original values, and those of particles with half-integral spin change sign.

PROBLEMS

PROBLEM 1.

Reduce an arbitrary function of the scalar a + b. ∂ linear in the Pauli matrices to another linear function.

SOLUTION.

TO determine the coefficients in the required formula f (a + b, ∂) = A + B. ∂, we note that, when the z-axis is taken in the direction of b, the eigenvalues of the operator a + b. ∂ are a ± b, and the corresponding eigenvalues of the operator f (a + b. ∂) are f (a ± b) Hence we find A = ½[f (a + b) + f (ab)], B = (b/2b) [f (a + b)-f (ab)].

PROBLEM 2.

Determine the values of the scalar product S1. S2 of spins (1/2) of two particles in states in which the total spin of the system, S = s1 + s2, has definite values (0 or 1).

SOLUTION.

From the general formula (31.3), which is valid for the addition of any two angular momenta, we find s1. s2 = 1/4 for S = 1, s1. s2 = −3/4 for S = 0.’

PROBLEM 3.

Which powers of the operator ŝ of an arbitrary spin s are independent?

SOLUTION.

The operator

image

formed from the differences between ŝz and all possible eigenvalues sz, gives zero when it is applied to any wave function, and is therefore itself zero. Hence it follows that (Ŝz)2s+1 is expressed in terms of lower powers of the operator Ŝz, so that only its powers from 1 to 2s are independent.

§56 Spinors

When the spin is zero, the wave function has only one component, ψ(0).

The effect of the spin operator is to reduce it to zero: ŝψ = 0. The relation between ŝ and the operator of an infinitesimal rotation implies that the wave function of a particle with zero spin is invariant under rotation of the coordinate system, i.e. it is a scalar.

The wave function of a particle with spin ½ has two components, ψ(1/2) and ψ(−1/2). For convenience in later generalizations, we shall distinguish these components by the superscripts 1 and 2 respectively. The two-component quantity

image (56.1)

is called a spinor.

In any rotation of the coordinate system, the components of the spinor undergo a linear transformation:

image (56.2)

This may be written

image (56.3)

where Û is the transformation matrix. Its elements are in general complex functions of the angles of rotation of the coordinate axes. They are connected by relations which follow directly from the physical conditions imposed on the spinor as the wave function of a particle.

Let us consider the bilinear form

image (56.4)

where ψ and φ are two spinors. A simple calculation gives

image

i.e. (56.4) is transformed into itself when the coordinate system is rotated. If, however, there is only one function which is transformed into itself, it can be regarded as corresponding to zero spin, and therefore must be a scalar, i.e. must remain unchanged when the coordinate system is rotated in any manner. Hence we have

image (56.5)

the determinant of the transformation matrix is unity.

Further relations follow from the requirement that the expression

image (56.6)

which determines the probability of finding the particle at a given point in space, should be a scalar. A transformation which leaves unchanged the sum of the squared moduli of the quantities is a unitary transformation, i.e. we must have Û+ = Û−1 (see §12). With the condition (56.5) the inverse matrix is

image

Equating this to the Hermitian conjugate matrix

image

we find

image (56.7)

By virtue of the relations (56.5) and (56.7), the four complex quantities a, b, c, d actually contain only three independent real parameters, corresponding to the three angles which define a rotation of a three-dimensional system of coordinates.

Comparison of the expressions for the scalars (56.4) and (56.6) shows that ψ1* and ψ2* must be transformed as ψ2 and − ψ1 respectively. It is easy to verify that this is in fact so, using (56.5) and (56.7).

It is possible to put the algebra of spinors in a form analogous to that of tensor algebra. This is done by introducing, in addition to contravariant spinor components ψ1, ψ2 (with superscripts), the covariant components (with subscripts), defined by

image (56.8)

The invariant combination (56.4) of the two spinors may also be written as a scalar product

image (56.9)

here and below, summation over repeated (dummy) indices is implied, as in tensor algebra. We may note the following rule which has to be borne in mind in spinor algebra. We have

image

Thus

image (56.10)

Hence it is evident that the scalar product of any spinor with itself is zero:

image (56.11)

According to the foregoing discussion, the quantities ψ1 and ψ2 are transformed as ψ1* and ψ2*, i.e.

image (56.12)

The product Û*ψ may also be written as ψ image*, with the transposed matrix image*. Since Û is unitary, we have image* = Û−1, so that ψ′λ = (ψÛ-1)λ or

image (56.13)

Analogously to the transition from vectors to tensors in ordinary tensor algebra, we can introduce the idea of spinors of higher rank. Thus, a quantity ψλμ, having four components which are transformed as the products ψλφμ of the components of two spinors of rank one, is called a spinor of rank two. Besides the contra variant components ψλμ we can consider the covariant components ψλμ and the mixed components ψλμ which are transformed as the products ψλφμ and ψλφμ respectively. Spinors of any rank are similarly defined.

The transition from contravariant to covariant spinor components and vice versa may be written

image (56.14)

where

image (56.15)

is the metric spinor in a vector space of two dimensions. Thus we have, for example,

image

so that ψ12 = − ψ11 = −ψ21, ψ11 = ψ12 = ψ22, and so on.

The quantities gλμ themselves form an antisymmetric unit spinor of rank two. It is easy to see that the values of its components remain unchanged under transformations of the coordinates, and that

image (56.16)

where δ11 = δ22 = 1, δ21 = δ12 = 0.

As in ordinary tensor algebra, there are two fundamental operations in spinor algebra: multiplication, and contraction with respect to a pair of indices. The multiplication of two spinors gives a spinor of higher rank; thus, from two spinors of ranks two and three, ψλμ and ψvρσ, we can form a spinor of rank five, ψλμψvρσ. Contraction with respect to a pair of indices (i.e. summation of the components over corresponding values of one covariant and one contravariant index) decreases the rank of a spinor by two. Thus, a contraction of the spinor ψλμvρσ with respect to the indices μ and v gives the spinor ψλμμρσ of rank three; the contraction of the spinor ψλμ gives the scalar ψλλ. Here there is a rule similar to that expressed by formula (56.10): if we interchange the upper and lower indices with respect to which the contraction is effected, the sign is changed (i.e. ψλλ = −ψλλ). Hence, in particular, it follows that, if a spinor is symmetrical with respect to any two of its indices, the result of a contraction with respect to these indices is zero. Thus, for a symmetrical spinor ψλμ of rank two, we have ψλλ = 0.

A spinor of rank n symmetrical with respect to all its indices is called a symmetrical spinor of rank n. From an asymmetrical spinor we can construct a symmetrical one by the process of symmetrization, i.e. summation of the components obtained by all possible interchanges of the indices. From what has been said above, it is impossible to construct (by contraction) a spinor of lower rank from the components of a symmetrical spinor.

Only a spinor of rank two can be antisymmetrical with respect to all its indices. For, since each index can take only two values, at least two out of three or more indices must have the same value, and therefore the components of the spinor are zero identically. Any antisymmetrical spinor of rank two is a scalar multiple of the unit spinor gλμ. We may notice here the following relation:

image (56.17)

(where ψλ is any spinor), which follows from the above; this rule is simply a consequence of the fact that the expression on the left is (as we may easily verify) an antisymmetrical spinor of rank 1 three.

The spinor which is the product of a spinor ψλμ with itself, on contraction with respect to one pair of indices, becomes antisymmetrical with respect to the other pair:

image

Hence, from what was said above, this spinor must be a scalar multiple of the spinor gλμ. Defining the scalar factor so that contraction with respect to the second pair of indices gives the correct result, we find

image (56.18)

The components of the spinor ψλμ…* which is the complex conjugate of ψλμ… are transformed as the components of a contravariant spinor φλμ…, and conversely. The sum of the squared moduli of the components of any spinor is consequently invariant.

§57 The wave functions of particles with arbitrary spin

Having developed a formal algebra for spinors of any rank, we can now turn to our immediate problem, to study the properties of wave functions of particles with arbitrary spin.

This subject is conveniently approached by considering an assembly of n particles with spin ½. The greatest possible value of the z-component of the total spin is ½n, which is obtained when sz = ½ for every particle (i.e. all the spins are directed the same way, along the z-axis). In this case we can evidently say that the total spin S of the system is also ½n.

All the components of the wave function ψ(σ1, σ2, …, σn) of the system of particles are then zero, except for ψ(1/2, ½, …,1/2). If we write the wave function as a product of n spinors ψλφμ …, each of which refers to one of the particles, only the component with λ, μ,… = 1 in each spinor is not zero. Thus only the product ψ1φ1… is not zero. The set of all these products, however, is a spinor of rank n which is symmetrical with respect to all its indices. If we transform the coordinate system (so that the spins are not directed along the z-axis), we obtain a spinor of rank n, general in form except that it is symmetrical as before.

The spin properties of wave functions, being essentially their properties with respect to rotations of the coordinate system, are identical for a particle with spin s and for a system of n = 2s particles each with spin ½ directed so that the total spin of the system is s. Hence we conclude that the wave function of a particle with spin s is a symmetrical spinor of rank n = 2s.

It is easy to see that the number of independent components of a symmetrical spinor of rank 2s is equal to 2s + 1, as it should be. For all those components are the same whose indices include 2s ones and 0 twos; so are all those with 2s−1 ones and 1 two, and so on up to 0 ones and 2s twos.

Mathematically, the symmetrical spinors provide a classification of the possible types of transformation of quantities when the coordinate system is rotated. If there are 2s + 1 different quantities which are transformed linearly into one another (and which cannot be reduced in number by any choice of linear combinations of them), then we can assert that their law of transformation is equivalent to that of the components of a symmetrical spinor of rank 2s. Any set of any number of functions which are transformed linearly into one another when the coordinate system is rotated can be reduced (by an appropriate linear transformation) to one or more symmetrical spinors.

Thus an arbitrary spinor ψλμv… of rank n can be reduced to symmetrical spinors of ranks n, n−2, n−4, …. In practice, such a reduction can be made as follows. By symmetrizing the spinor ψλμv… with respect to all its indices, we form a symmetrical spinor of the same rank n. Next, by contracting the original spinor ψλμv… with respect to various pairs of indices, we obtain spinors of rank n − 2, of the form ψλλv…, which, in turn, we symmetrize, so that symmetrical spinors of rank n − 2 are obtained. By symmetrizing the spinors obtained by contracting ψλμ… with respect to two pairs of indices, we obtain symmetrical spinors of rank n−4, and so on.

We have still to establish the relation between the components of a symmetrical spinor of rank 2s and the 2s + 1 functions ψ(σ), where σ = s, s − 1, …, −s. The component

image

in whose indices 1 occurs s + σ times and 2 s − σ times, corresponds to a value σ of the projection of the spin on the z-axis. For, if we again consider a system of n = 2s particles with spin ½, instead of one particle with spin s, the product image corresponds to the above component; this product belongs to a state in which s + σ particles have a projection of the spin equal to ½, and s − σ a projection of − ½, so that the total projection is ½(s + σ) − ½(s − σ) = σ. Finally, the proportionality coefficient between the above component of the spinor and ψ(σ) is chosen so that the equation

image (57.1)

holds; this sum is a scalar, as it should be, since it determines the probability of finding the particle at a given point in space. In the sum on the right-hand side, the components with (s + σ) indices 1 occur

image

times. Hence it is clear that the relation between the functions ψ(σ) and the components of the spinor is given by the formula

image (57.2)

The relation (57.2) ensures the fulfilment not only of the condition (57.1), but also, as we easily see, of the more general condition

image (57.3)

where ψλμ… and φλμ… are two different spinors of the same rank, while ψ(σ), φ(σ) are functions derived from these spinors by formula (57.2); the factor (−1)s−σ is due to the fact that, when all the indices of the spinor components are raised, the sign changes as many times as there are twos among the indices.

Formulae (55.5) determine the result of the action of the spin operator on the wave functions ψ(σ). It is not difficult to find how these operators act on a wave function written in the form of a spinor of rank 2s. For a spin ½, the functions ψ(1/2), ψ(− ½) are the same as the components ψ1, ψ2 of the spinor. According to (55.6) and (55,7), the result of the spin operators’ acting on them will be

image (57.4)

To pass to the general case of arbitrary spin, we again consider a system of 2s particles with spin ½, and write its wave function as a product of 2s spinors. The spin operator of the system is the sum of the spin operators of each particle, acting only on the corresponding spinor, the result of this action being given by formulae (57.4). Next, returning to arbitrary symmetrical spinors, i.e. to the wave functions of a particle with spin s, we obtain

image (57.5)

Hitherto we have spoken of spinors as wave functions of the intrinsic angular momentum of elementary particles. Formally, however, there is no difference between the spin of a single particle and the total angular momentum of any system regarded as a whole, neglecting its internal structure. It is therefore evident that the transformation properties of spinors apply equally to the behaviour, with respect to rotations in space, of the wave functions ψjm of any particle or system of particles with total angular momentum j, independent of whether orbital or spin angular momentum is concerned. There must therefore be some definite relation between the laws of transformation for the eigenfunctions ψjm under rotations of the coordinate system and those for the components of a symmetrical spinor of rank 2j.

In establishing this relation we must, however, make a clear distinction between two aspects of the dependence of the wave functions on the component m (for a given value of j). The wave function may be regarded as the probability amplitude for various values of m, or may be considered for a given value of m.

These two aspects have already been discussed at the beginning of §55, where we dealt with the eigenfunction image of the operator ŝz which corresponds to sz = σ0. The mathematical difference between them is especially clear for a particle of spin s = ½. In this case the spin function is, with respect to the variable σ, a contravariant spinor of rank 1, i.e. must be written in spinor notation as δσσ. With respect to σ0 it is therefore a covariant spinor.

This is evidently a general result: the eigenfunctions ψjm can be put in correspondence with the components of a covariant symmetrical spinor of rank 2j by means of formulae analogous to (57.2):

image (57.6)

The eigenfunctions of integral angular momentum j are spherical harmonics Yjm. The case j = 1 is of particular importance. The three spherical harmonics Y1m are

image

where n is a unit vector along the radius vector. It is seen that these three functions are equivalent, as regards their transformation properties, to the components of a vector a, with the relations

image (57.7)

Comparing with (57.6), we see that the components of a symmetrical spinor of rank two can be brought into correspondence with the components of thevector by the formulae

image (57.8)

image (57.9)

Conversely

image (57.10)

It is easily verified that with these definitions we have

image (57.11)

where a and b are vectors corresponding to the symmetrical spinors ψλμ and φλμ. It is also not difficult to see that there is a correspondence between the spinor and the vector

image (57.12)

Formulae (57.10) may be compactly written by means of the Pauli matrices:

image (57.13)

the matrix indices of image are written as superscript and subscript in correspondence with the position of the spinor indices in ψλμ. The origin of this formula is easily understood by considering the particular case where the spinor of rank two ψλμ reduces to a product of a spinor of rank one ψμ and its complex conjugate ψλ*. Then the quantity ½ψλ*σλμψμ is the mean value of the spin (for a particle with wave function ψμ) and it is therefore evidently a vector.

The relations (57.8) or (57.9) are a particular case of a general rule: any symmetrical spinor of even rank 2j, where j is integral, can be correlated with a symmetrical tensor of half the rank (j) which gives zero on contraction with respect to any pair of indices; we call this an irreducible tensor.

This follows from the fact that the numbers of independent components of the spinor and of the tensor are the same (2j + 1), as may easily be seen. The relation between the components of the spinor and of the tensor can be found by means of formulae (57.8)(57.10), if we consider a spinor of the rank concerned as the product of several spinors of rank two, and the tensor as a product of vectors.

PROBLEMS

PROBLEM 1.

Rewrite the definition (57.4) of the operator of spin ½ in terms of the spinor components of the vector ŝ.

SOLUTION.

By means of formulae (57.9), which give the relation between the vector ŝ and the spinor ŝλμ, the definition (57.4) can be written as

image

PROBLEM 2.

Derive formulae which determine the effect of the spin operator on a vector wave function of a particle with spin 1.

SOLUTION.

The relation between the components of the vector function ψ and the components of the spinor ψλμ is given by formulae (57.9), and from (57.5) we have

image

(where image) or

image

The remaining formulae are derived from these by cyclic permutation of the suffixes x, y, z. They can be written together as

image

The complex vector ψ can be put in the form ψ = e(u + iv), where u and v are real vectors, which can be taken to be mutually perpendicular if the common phase α is suitably chosen. The two vectors u and v determine a plane which has the property that the spin component perpendicular to it can take only the values ± 1.

§58 The operator of finite rotations

Let us now return to the transformation of spinors, and show how the coefficients of this transformation can in fact be expressed in terms of the angles of rotation of the coordinate axes.

By the definition of the angular momentum operator (in this case, the spin operator), 1 + iδφ. n. ŝ is the operator of a rotation through an angle δφ about a direction specified by the unit vector n; for application to the wave function of a particle with spin ½, i.e. a spinor of rank one, we must take image in this operator. The operator of a rotation through a finite angle φ about the same direction will be correspondingly given by

image (58.1)

cf. (15.13). Like any function of the Pauli matrices (see §55, Problem 1), this expression reduces to one that is linear in these matrices:

image (58.2)

For example, with a rotation about the z-axis,

image (58.3)

This means that the components of the spinor are transformed in such a rotation according to

image

In particular, in a rotation through an angle 2π the spinor components change sign; spinors of any odd rank must therefore have the same property (cf. the end of §55).

Similarly, we can find the matrices of transformations consisting of a rotation through an angle φ about the x-axis or the y-axis:

image (58.4)

We may note the particular case of a rotation through an angle π about the y-axis, for which

image

i.e.

image (58.5)

It is now easy to write down the transformation matrix for any rotation of the coordinate axes, as a function of the Eulerian angles which specify the rotation.

A rotation of the axes, defined by the Eulerian angles α, β, γ, is carried out in three stages: (1) a rotation through the angle α (0 ≤ α ≤ 2π) about the z-axis, (2) a rotation through the angle β (0 ≤ β ≤ π) about the new position of the y-axis (ON in Fig. 20, called the line of nodes), (3) a rotation through the angle γ (0 ≤ γ ≤ 2π) about the resulting final position z′ of the z-axis.

image

FIG. 20

It is evident that the angles α and β are the spherical polar angles φ and θ of the new z′-axis with respect to the xyz axes: α = φ, β = θ.

In accordance with this manner of rotating the axes, the matrix of the complete transformation is equal to the product of three matrices (58.3) and (58.4):

image

By direct multiplication of the matrices we finally obtain

image (58.6)

Spinors of higher ranks are, by definition, transformed as products of components of a spinor of rank one. In physical applications, however, we are interested in the wave functions ψjm rather than the transformation laws of the spinors themselves.

Let the functions ψjm (m = j, j − 1, …, −j) describe, in a coordinate system xyz, a state having a definite value of the angular momentum j, and ψjm the same state for the axes x’y’z’; in the first case m is the value of jz, and in the second case m′ is the value of jz. The two sets of functions are connected by linear relations, which we write in the form

image (58.7)

The coefficients D(j)m·m form a matrix of order 2j + 1 with respect to m′ and m, called the finite-rotation matriximage(j); its elements are functions of the angles α, β, γ of rotation of the system x’y’z’ relative to xyz.

The finite-rotation matrix can be built up by means of the spinor representation of the functions ψjm. For j = ½, the two functions ψ1/2m(w = ± ½) form a covariant spinor of rank 1. According to (56.13), its transformation from x’y’z’ to xyz is effected by the matrix û (58.6), so that image(1/2) = û. Its elements may be written

image

where

image (58.8)

For any value of j, the functions ψjm are related to the components of a symmetrical covariant spinor of rank 2j by (57.6). The transformation matrix for the components of a spinor of rank 2j is the product of 2j matrices image(1/2), each acting on one of the spinor indices. Carrying out the multiplication and returning to the functions ψjm, we find their transformation matrix:

image (58.9)

the functions dm’m(j) (β) being given by

image (58.10)

where

image (58.11)

are called Jacobi polynomials. We may note that

image (58.12)

The functions image possess a number of symmetry properties which might be derived from the expressions (58.11) and (58.12), but it is simpler to obtain them directly from the definition as coefficients in the rotational transformation.

The matrix image(j) is unitary, being the matrix of a rotational transformation. Since the transformation inverse to the rotation (α, β, γ) is the rotation (− γ, − β, − α), we have for the real matrix d(j) the relations

image (58.13)

The following equations are also valid:

image (58.14)

image (58.15)

When j = ½ these are evident from (58.8); the generalization to arbitrary j is evident from the manner of construction of the transformation matrix, described above.

A rotation through an angle π − β can be carried out as two successive rotations through π and − β:

image

or, using (58.13),

image (58.16)

The result of two rotations about the same axis is independent of the sequence in which they occur. We must therefore arrive at the same result by carrying out the rotations through − β and π in the opposite order. Comparison of the result with (58.16) gives the relation

image (58.17)

From (58.17), (58.14) and (58.13), it follows that

image (58.18)

Using (58.13)(58.18), we can deduce various symmetry properties of the complete matrix elements image. In particular, the complex conjugate function is given by

image (58.19)

Mathematically, the matrices image(j) give the unitary irreducible representations of the rotation group having dimension 2j + 1 (see §98 below). Hence we have immediately the orthonormality relation

image (58.20)

where dω = sin β dα dβ dγ.

The orthogonality of the functions with respect to the suffixes m and m′ is ensured by the factor ei (mz+mγ); that with respect to the index j arises from the functions image, for which we have

image (58.21)

Lastly, we shall give for reference the expressions for the functions image for various particular values of the parameters. For j = 1, we have

image (58.22)

For integral j = l and m′ = 0, formulae (58.10) and (58.11) give

image (58.23)

The derivation of this formula is easily seen from the original definition (58.7). We shall assign the values of the functions ψjm. on the right of (58.7) to the z-axis, on which (for j = l)

image (58.24)

The function ψjm on the left is then the spherical harmonic function Ylm(β, α) of the spherical polar angles φ ≡ α, θ ≡ β giving the direction of the z′-axis. Substitution of (58.24) in (58.7) leads to

image (58.25)

which is equivalent to (58.23).

Lastly, there is the following expression for the function with the maximum possible value of m or m′:

image (58.26)

§59 Partial polarization of particles

By a suitable choice of the direction of the z-axis, we can always cause one component (e.g. ψ2) of a given spinor ψλ, the wave function of a particle with spin ½, to vanish. This is evident from the fact that a direction in space is determined by two quantities (angles), i.e. the number of disposable parameters is just equal to the number of quantities (the real and imaginary parts of the complex ψ2) which it is desired to make zero.

Physically this means that, if a particle with spin ½ (for definiteness, we shall speak of an electron) is in a state described by a spin wave function, then there is a direction in space in which the component of the particle spin has the definite value σ = ½. We can say that in such a state the electron is completely polarized.

There are also, however, states of an electron which may be said to be partially polarized. Such states are not described by wave functions but only by density matrices, i.e. they are mixed states (with respect to spin) (see §14).

The spin or polarization density matrix of an electron is a spinor ρλμ of rank two normalized by the condition

image (59.1)

and satisfying the “Hermitian” condition

image (59.2)

For a pure (i.e. completely polarized) spin state of the electron the spinor image reduces to a product of components of the wave function ψλ:

image (59.3)

The diagonal components of the density matrix determine the probabilities of the values + ½ and − ½ of the z-component of the electron spin. The mean value of this component is therefore

image

or, using (59.1),

image (59.4)

In a pure state the mean value of the quantities s± = sx ± isy is calculated as

image

Since, according to (55.6) and (55.7), the operators ŝ± are given by the matrices

image

we find that

image

Accordingly we have in a mixed state

image (59.5)

Using the Pauli matrices, formulae (59.4) and (59.5) can be combined as

image (59.6)

Thus all the components of the polarization density matrix of the electron are expressed in terms of the mean values of components of its spin vector. In other words, the real vector image entirely determines the polarization properties of a particle with spin ½. In the limit of complete polarization one of the components of this vector (with an appropriate choice of the directions of the axes) is ½ and the other two are zero. In the opposite case of an unpolarized state all three components are zero. In the general case of an arbitrary partial polarization and any choice of the coordinate system we have 0 ≤ ρ ≤ 1, where

image

is a quantity which may be called the degree of polarization of the electron.

For a particle of arbitrary spin s, the density matrix is a spinor ρλμ…ρσ… of rank 4s, symmetrical in the first 2s and the last 2s indices and satisfying the conditions

image (59.7)

image (59.8)

To calculate the number of independent components of the density matrix, we note that, among the possible sets of values of the indices λ, μ, … (or ρ, σ, …) there are only 2s + 1 which are essentially different. Using also the fact that the components of the spinor ρλμ…ρσ… are related by (59.7), we find that the number of different components is (2s + 1)2− 1 = 4s (s + 1). Although these components are complex, the relation (59.8) shows that this does not increase the total number of independent quantities describing the state of partial polarization of the particle, which is therefore 4s (s+ 1). For comparison, it may be remarked that the state of complete polarization of the particle is described by only 4s quantities (the 2s + 1 complex components of the wave function ψλμ…, related by one normalization condition and containing one common phase which is unimportant in the description of the state).

Like any spinor of rank 4s, the spinor ρλμ…ρσ… is equivalent to a set of irreducible tensors of ranks 4s, 4s − 2, …, 0. In the present case there is only one tensor of each rank, since, on account of the symmetry properties of the spinor ρλμ…ρσ…, each contraction of it can be carried out in only one way: with respect to any one of the indices λ, μ, …, and one of ρ, σ,…. In addition, the scalar (tensor of rank 0) does not appear, reducing to unity by virtue of the condition (59.7).

§60 Time reversal and Kramers’ theorem

The symmetry of motion with respect to a change in the sign of the time is expressed in quantum mechanics by the fact that, if ψ is the wave function of a stationary state of the system, the “time-reversed” wave function (which we denote by ψrev) describes a possible state with the same energy. At the end of §18 it has been pointed out that ψrev is the same as the complex conjugate function ψ*. In this simple form the statement applies to wave functions where the spin of particles is neglected. When spin is present, a refinement is necessary.

Let us take the wave function of a particle of spin s in the form of the contravariant spinor ψλμ… (of rank 2s). On taking the complex conjugate function ψλμ…* we obtain a set of quantities which are transformed as components of a covariant spinor. Hence the operation of time reversal corresponds to a change from the wave function ψλμ… to a new wave function whose covariant components are given by

image (60.1)

For a given set of values of the indices λ, μ, …, the components of covariant and contravariant spinors correspond to values of the angular-momentum component which differ in sign. In terms of the functions ψsσ therefore, time reversal corresponds to a change from ψsσ to ψs,-σ, as it should, since a change in the sign of the time changes the direction of the angular momentum. The exact relation is given by (60.1):

image (60.2)

Thus the change ψsσ → ψsσ* required by the operation of time reversal signifies the change

image (60.3)

When this operation is repeated, we have

image

Thus a twofold time reversal restores the wave function to its original value only if the spin is integral; if the spin is half-integral, the sign of the wave function is changed.

Let us consider an arbitrary system of interacting particles. The orbital and spin angular momenta of such a system are not in general separately conserved when relativistic interactions are taken into account. Only the total angular momentum J is conserved. If there is no external field, each energy level of the system has (2J + 1)-fold degeneracy. When an external field is applied, the degeneracy is removed. The question arises whether the degeneracy can be removed completely, i.e. so that the system has only simple levels. This is closely related to the symmetry with respect to time reversal.

In classical electrodynamics the equations are invariant with respect to a change in the sign of the time, if the electric field is left unchanged and the sign of the magnetic field is reversed. This fundamental property of motion must be preserved in quantum mechanics. Hence, not only in a closed system but in any external electric field (there being no magnetic field), there is symmetry with respect to time reversal.

The wave functions of the system are spinors ψλμ…, whose rank n is twice the sum of the spins sa of all the particles (n = 2 σ sa); this sum may not be equal to the total spin S of the system.

According to what was said above, we can assert that, in any electric field, the wave function and its time reversal must correspond to states with the same energy. If a level is non-degenerate, it is necessary that these states should be identical, i.e. the corresponding wave functions must be the same apart from a constant factor (both, of course, being expressed as similar (covariant or contravariant) spinors).

We write image = Cψλμ… or, by (60.1),

image (60.4)

where C is a constant.

Taking the complex conjugate of both sides of this equation, we obtain

image

We lower the indices on the left-hand side of the equation and correspondingly raise them on the right. This means that we multiply both sides of the equation by gαλgβμ … and sum over the indices λ, μ, …; on the right-hand side we must use the fact that

image

As a result we have

image

Substituting ψλμ…* from (60.4), we find

image

This equation must be satisfied identically, i.e. we must have (−1)nCC* = 1. Since, however, |C|2 is always positive, it is clear that this is possible only for even n (i.e. for integral values of the sum Σ sa). For odd n (half-integral values of σ sa) the condition (60.4) cannot be fulfilled.

Thus we reach the result that an electric field can completely remove the degeneracy only for a system with an integral value of the sum of the spins of the particles. For a system with a half-integral value of this sum, in an arbitrary electric field, all the levels must be doubly degenerate, and complex conjugate spinors correspond to two different states with the same energy (H. A. Kramers 1930).

One further, mathematical, comment may be made. A relation of the form (60.4) with a real constant C is mathematically the condition that the components of the spinor may be put in correspondence with a set of real quantities, and may be called the condition for the spinor to be “real”. || The impossibility of fulfilling the condition (60.4) for odd n signifies that no real quantity can correspond to a spinor of odd rank. For even n, on the other hand, the condition (60.4) can be satisfied, and C can be real. In particular, a real vector can correspond to a symmetrical spinor of rank two if the condition (60.4) is satisfied with C = 1:

image

(as is easily seen by means of (57.8) and (57.9)). The condition (60.4) with C = 1 is in fact the condition for a symmetrical spinor of any even rank to be “real”.


Apart from an unimportant phase factor.

In mathematical terms, these functions are the irreducible representations of the rotation group. The number of functions which are transformed into linear combinations of one another is called the dimension of the representation; it is assumed that this number cannot be made smaller by taking any other linear combinations of these functions.

In particular, it would be wholly meaningless to imagine the “intrinsic” angular momentum of an elementary particle as being the result of its rotation “about its own axis”.

The physical idea that an electron has an intrinsic angular momentum was put forward by G. Uhlenbeck and S. Goudsmit in 1925. Spin was introduced into quantum mechanics in 1927 by W. Pauli.

Since s is fixed for each kind of particle, the spin angular momentum ħs becomes zero in the limit of classical mechanics (ħ → 0). This consideration does not apply to the orbital angular momentum, since l can take any value. The transition to classical mechanics is represented by ħ tending to zero and l simultaneously tending to infinity, in such a way that the product ħl remains finite.

Note that the suffixes in the matrix elements on the right of (55.1) are written in an order which is, in a sense, the reverse of the usual order in (11.11).

More precisely, we should write

image

in (55.3) the coordinate factors are omitted, being unimportant in this connection.
We must once again emphasize the distinction between the specified eigenvalue σ1 or σ2 of sz and the independent variable σ.

In the tabular matrices (55.7) the rows and columns are numbered by the values of σ, the row number corresponding to the first and the column number to the second suffix of the matrix element. In the present case, these numbers are + ½ and − ½. The action of the. operator shown by (55.4) multiplies row σ of the matrix by a column matrix containing the components of the wave function:

image

There should be no misunderstanding because of the use of the same letter to denote the spin component and the Pauli matrices, since the latter always have the circumflex.

The terms on the right of (55.8)(55.10) which are independent of image must, of course, be understood as constants multiplying the unit two-by-two matrix.

The notation Ûφ implies that the rows of the matrix Û are multiplied by the column φ.

Such a transformation of two quantities is called a binary transformation.

This property is closely associated with symmetry under time reversal. The latter (see §18) involves the replacement of the wave function by its complex conjugate. Under time reversal, the angular momentum components also change sign. Hence the functions that are the complex conjugates of the components ψ1 ≡ ψ(1/2) and ψ2 ≡ ψ(−1/2) must have properties equivalent to those of the components corresponding to spin projections −1/2 and ½ respectively.

The notation ψÛ (with ψ to the left of Û) denotes that the components (ψ1ψ2) as a row are multiplied by the columns of the matrix Û.

In other words, the symmetrical spinors form what are called irreducible representations of the rotation group (see §98).

This result can also be regarded somewhat differently. If the wave function ψ of a particle in a state with angular momentum j is expanded in terms of the eignfunctions ψjm:

image

then the coefficients am are the probability amplitudes for various values of m. In this sense they correspond to the “components” ψ(m) of a spin wave function, and this gives their law of transformation. On the other hand, the value of ψ at a given point in space cannot depend on the choice of the coordinate system, i.e. the sum σamψjm must be a scalar. Comparing with the scalar (57.3), we see that am must transform as (−1)jmψj,-m.

The mixed components of a symmetrical spinor may be written in the form ψμλ, without distinction between ψμλ and ψμλ.

We can say that the 2j + 1 components of an irreducible tensor of rank j (an integer), the 2j + 1 spherical harmonics Yjm, and the 2j + 1 components of a symmetrical spinor of rank 2j give the same irreducible representation of the rotation group.

The systems xyz and x’y’z’ are, as always, right-handed, and a positive angle corresponds to the movement of a corkscrew advanced in the positive direction of the axis of rotation.
The definition of the Eulerian angles given here (and usual in quantum mechanics) differs from that in Mechanics, §35, in that the second rotation is about the y-axis and not the x-axis. The angles α, β, γ are related to the angles φ, θ, ψ used in Mechanics (not the spherical polar angles φ and θ) by φ = α + ½π, θ = β, ψ = γ − ½π.

Note that the matrix indices in (58.7) are placed in the order that corresponds to multiplying the columns of the matrix image(j) by the functions ψjm′ arranged in a row. In the symbolic notation, (58.7) would have to be written ψjm = (ψjD(j))m in accordance with (56.13).

The calculations are described by A. R. Edmonds, Angular Momentum in Quantum Mechanics, Princeton, 1957. The definition of the functions Dm’m(j) by (58.9) differs from that used in Edmonds’s book by the interchange of α and γ, this being the more natural treatment in the approach given here.

See §e of the Mathematical Appendices, formula (e.11), for the relation between these polynomials and the hypergeometric series.

When these quantities are given, so are the mean values of the components of the vector s and all their powers and products 2, 3, …, 2s at a time, which do not reduce to lower powers (see §55, Problem 3).

Note that the rule for the complex conjugate of a spherical harmonic function, according to (28.9), coincides with the general rule (60.3).

See, for example, Fields, §17, and the end of §111 below.

When the sum σsa is integral (or halt-integral), all possible values of the total spin S of the system are also integral (or half-integral).

If the electric field possesses a high (cubic) symmetry, fourfold degeneracy may occur (see §99, including the Problem).
It is meaningless to call the spinor real in the literal sense, since complex conjugate spinors have different laws of transformation.