17
Symmetry and Conservation Laws
17-1 Symmetry
In classical physics there are a number of quantities which are conserved— such as momentum, energy, and angular momentum. Conservation theorems about corresponding quantities also exist in quantum mechanics. The most beautiful thing of quantum mechanics is that the conservation theorems can, in a sense, be derived from something else, whereas in classical mechanics they are practically the starting points of the laws. (There are ways in classical mechanics to do an analogous thing to what we will do in quantum mechanics, but it can be done only at a very advanced level.) In quantum mechanics, however, the conservation laws are very deeply related to the principle of superposition of amplitudes, and to the symmetry of physical systems under various changes. This is the subject of the present chapter. Although we will apply these ideas mostly to the conservation of angular momentum, the essential point is that the theorems about the conservation of all kinds of quantities are—in the quantum mechanics—related to the symmetries of the system.
We begin, therefore, by studying the question of symmetries of systems. A very simple example is the hydrogen molecular ion—we could equally well take the ammonia molecule—in which there are two states. For the hydrogen molecular ion we took as our base states one in which the electron was located near proton number 1, and another in which the electron was located near proton number 2. The two states—which we called |
1〉 and |
2 〉—are shown again in
Fig. 17-1(a). Now, so long as the two nuclei are both exactly the same, then there is a certain
symmetry in this physical system. That is to say, if we were to
reflect the system in the plane halfway between the two protons—by which we mean that everything on one side of the plane gets moved to the symmetric position on the other side—we would get the situations in
Fig. 17-1(b). Since the protons are identical, the
operation of reflection changes |
1〉 into |
2〉 and |
2〉 into |
1〉. We’ll call this reflection operation
and write
(17.1)
So our
is an operator in the sense that it “does
something” to a state to make a new state. The interesting thing is that
operating on
any state produces some
other state of the system.
Now
, like any of the other operators we have described, has matrix elements which can be defined by the usual obvious notation. Namely,
are the matrix elements we get if we multiply
and
on the left by 〈
1|. From Eq. (17.1) they are
(17.2)
In the same way we can get
P21 and
P22. The matrix of
—
with respect to the base systems |
1〉
and |
2〉—is
(17.3)
We see once again that the words operator and matrix in quantum mechanics are practically interchangeable. There are slight technical differences—like the difference between a “numeral” and a “number”—but the distinction is something pedantic that we don’t have to worry about. So whether defines an operation, or is actually used to define a matrix of numbers, we will call it interchangeably an operator or a matrix.
Review: Chapter 52, Vol. I, Symmetry in Physical Laws
Reference: A. R. Edmonds, Angular Momentum in Quantum Mechanics, Princeton University Press, 1957
Fig. 17-1. If the states |
1〉 and |
2〉 are reflected in the plane
P-P, they go into |
2〉 and |
1〉, respectively.
Fig. 17-2. In a symmetric system, if a pure |
1〉 state develops as shown in part (a), a pure |
2〉 state will develop as in part (b).
Now we would like to point out something. We will
suppose that the
physics of the whole hydrogen molecular ion system is
symmetrical. It doesn’t have to be—it depends, for instance, on what else is near it. But if the system is symmetrical, the following idea should certainly be true. Suppose we start at
t = 0 with the system in the state |
1〉 and find after an interval of time
t that the system turns out to be in a more complicated situation—in some linear combination of the two base states. Remember that in Chapter 8 we used to represent “going for a period of time” by multiplying by the operator
Û. That means that the system would after a while—say 15 seconds to be definite—be in some other state. For example, it might be
parts of the state |
1〉 and
parts of the state |
2〉, and we would write
(17.4)
Now we ask what happens if we start the system in the
symmetric state |
2〉 and wait for 15 seconds under the
same conditions? It is clear that if the world is symmetric—as we are supposing—we should get the state symmetric to (17.4):
(17.5)
The same ideas are sketched diagrammatically in
Fig. 17-2. So if the
physics of a system is symmetrical with respect to some plane, and we work out the behavior of a particular state, we also know the behavior of the state we would get by reflecting the original state in the symmetry plane.
We would like to say the same things a little bit more generally—which means a little more abstractly. Let
be any one of a number of operations that you could perform on a system
without changing the physics. For instance, for
we might be thinking of
, the operation of a
reflection in the plane between the two atoms in the hydrogen molecule. Or, in a system with two electrons, we might be thinking of the operation of
interchanging the two electrons. Another possibility would be, in a spherically symmetric system, the operation of a
rotation of the whole system through a finite angle around some axis—which wouldn’t change the physics. Of course, we would normally want to give each special case some special notation for
. Specifically, we will normally define the
to be the operation “rotate the system about the
y-axis by the angle
θ”. By Q we mean just any one of the operators we have described or any other one—which leaves the basic physical situation unchanged.
Let’s think of some more examples. If we have an atom with no external magnetic field or no external electric field, and if we were to turn the coordinates around any axis, it would be the same physical system. Again, the ammonia molecule is symmetrical with respect to a reflection in a plane parallel to that of the three hydrogens—so long as there is no electric field. When there is an electric field, when we make a reflection we would have to change the electric field also, and that changes the physical problem. But if we have no external field, the molecule is symmetrical.
Now we consider a general situation. Suppose we start with the state
and after some time or other under given physical conditions it has become the state |
ψ2〉. We can write
(17.6)
[You can be thinking of Eq. (17.4).] Now imagine we perform the operation
on the whole system. The state |
ψ1〉 will be transformed to a state
, which we can also write as
|
ψ1〉. Also the state |
ψ2〉 is changed into
|
ψ2〉 Now
if the physics is symmetrical under
(don’t forget the if; it is not a general property of systems), then, waiting for the same time under the same conditions, we should have
(17.7)
[Like Eq. (17.5).] But we can write
for
and
for
so (17.7) can also be written
(17.8)
If we now replace |
ψ2〉 by
|
ψ1〉—Eq. (17.6)—we get
(17.9)
It’s not hard to understand what this means. Thinking of the hydrogen ion it says that: “making a reflection and waiting a while”—the expression on the right of Eq. (17.9)—is the same as “waiting a while and then making a reflection”—the expression on the left of (17.9). These should be the same so long as U doesn’t change under the reflection.
Since (17.9) is true for
any starting state |
ψ1〉, it is really an equation about the operators:
(17.10)
This is what we wanted to get—
it is a mathematical statement of symmetry. When Eq. (17.10) is true, we say that the operators
Û and
commute. We can then
define “symmetry” in the following way: A physical system is
symmetric with respect to the operation
when
commutes with U, the operation of the passage of time. [In terms of matrices, the product of two operators is equivalent to the matrix product, so Eq. (17.10) also holds for the matrices
Q and
U for a system which is symmetric under the transformation
Q.]
Incidentally, since for infinitesimal times ∈ we have
Û = 1 –
iĤ∈/ℏ—where
Ĥ is the usual Hamiltonian (see Chapter 8)—you can see that if (17.10) is true, it is also true that
(17.11)
So (17.11) is the mathematical statement of the condition for the symmetry of a physical situation under the operator
. It
defines a symmetry.
17-2 Symmetry and conservation
Before applying the result we have just found, we would like to discuss the idea of symmetry a little more. Suppose that we have a very special situation: after we operate on a state with
, we get the same state. This is a very special case, but let’s suppose it happens to be true for a state |
ψ0〉 that
is physically the same state as |
ψ0〉. That means that |
ψʹ〉 is equal to |
ψ0〉 except for some phase factor.
59 How can that happen? For instance, suppose that we have an
ion in the state which we once called |
I〉.
60 For this state there is equal amplitude to be in the base states |
1〉 and |
2〉. The probabilities are shown as a bar graph in
Fig. 17-3(a). If we operate on |
I〉 with the reflection operator
, it flips the state over changing |
1〉 to |
2〉 and |
2〉 to |
1〉—we get the probabilities shown in
Fig. 17-3(b). But that’s just the state |
I〉 all over again. If we start with state |
II〉 the probabilities before and after reflection look just the same. However, there is a difference if we look at the amplitudes. For the state |
I〉 the amplitudes are the same after the reflection, but for the state |
II〉 the amplitudes have the opposite sign. In other words,
(17.12)
If we write
, we have that
eiδ = 1 for the state |
I〉 and
eiδ =–1 for the state |
II〉.
Let’s look at another example. Suppose we have a RHC polarized photon propagating in the z-direction. If we do the operation of a rotation around the z-axis, we know that this just multiplies the amplitude by eiφ when φ is the angle of the rotation. So for the rotation operation in this case, δ is just equal to the angle of rotation.
Fig. 17-3. The state |
l〉 and the state
obtained by deflecting |
l〉 in the central plane.
Now it is clear that if it happens to be true that an operator
just changes the phase of a state at some time, say
t = 0, it is true forever. In other words, if the state |
ψ1〉 goes over into the state |
ψ2〉 after a time
t, or
(17.13)
and if the symmetry of the situation makes it so that
(17.14)
then it is also true that
(17.15)
This is clear, since
and if
, then
[The sequence of equalities follows from (17.13) and (17.10) for a symmetrical system, from (17.14), and from the fact that a number like eiδ commutes with an operator.]
So with certain symmetries something which is true initially is true for all times. But isn’t that just a conservation law? Yes! It says that if you look at the original state and by making a little computation on the side discover that an operation which is a symmetry operation of the system produces only a multiplication by a certain phase, then you know that the same property will be true of the final state—the same operation multiplies the final state by the same phase factor. This is always true even though we may not know anything else about the inner mechanism of the universe which changes a system from the initial to the final state. Even if we do not care to look at the details of the machinery by which the system gets from one state to another, we can still say that if a thing is in a state with a certain symmetry character originally, and if the Hamiltonian for this thing is symmetrical under that symmetry operation, then the state will have the same symmetry character for all times. That’s the basis of all the conservation laws of quantum mechanics.
Let’s look at a special example. Let’s go back to the
operator. We would like first to modify a little our definition of
. We want to take for
not just a mirror reflection, because that requires defining the plane in which we put the mirror. There is a special kind of a reflection that doesn’t require the specification of a plane. Suppose we redefine the operation
this way: First you reflect in a mirror in the
z-plane so that z goes to–
z, x stays x, and
y stays
y; then you turn the system 180° about the
z-axis so that
x is made to go to–
x and y to–
y. The whole thing is called an inversion. Every point is projected
through the origin to the diametrically opposite position. All the coordinates of everything are reversed. We will still use the symbol
for this operation. It is shown in
Fig. 17-4. It is a little more convenient than a simple reflection because it doesn’t require that you specify which coordinate plane you used for the reflection—you need specify only the point which is at the center of symmetry.
Now let’s suppose that we have a state |
ψ0〉 which under the inversion operation goes into e
iδ|
ψ0〉—that is,
(17.16)
Then suppose that we invert again. After two inversions we are right back where we started from—nothing is changed at all. We must have that
So
if the inversion operator is a symmetry operation of a state, there are only two possibilities for
eiδ:
which means that
(17.17)
Fig. 17-4. The operation of inversion, P. Whatever is at the point A at (x, y, z) is moved to the point
A’ at (−
x, −
y, −z).
Classically, if a state is symmetric under an inversion, the operation gives back the same state. In quantum mechanics, however, there are the two possibilities: we get the same state or minus the same state. When we get the same state,
, we say that the state |
ψ0〉 has even parity. When the sign is reversed so that
, we say that the state has odd parity. (The inversion operator
is also known as the parity operator.) The state |
I〉 of the
ion has even parity; and the state |
II〉 has odd parity—see Eq. (17.12). There are, of course, states which are not symmetric under the operation
; these are states with no definite parity. For instance, in the
system the state |
I〉 has even parity, the state |
II〉 has odd parity, and the state |
1〉 has no definite parity.
When we speak of an operation like inversion being performed
“on a physical system” we can think about it in two ways. We can think of physically moving whatever is at r to the inverse point at–
r, or we can think of looking at the same system from a new frame of reference
xʹ,
yʹ,
z related to the old by
xʹ =–
x,
y’ =–
y, and
z’ = −
z. Similarly, when we think of rotations, we can think of rotating bodily a physical system, or of rotating the coordinate frame with respect to which we measure the system, keeping the “system” fixed in space. Generally, the two points of view are essentially equivalent. For rotation they are equivalent except that rotating a system by the angle
θ is like rotating the reference frame by the negative of
θ. In these lectures we have usually considered what happens when a projection is made into a new set of axes. What you get that way is the same as what you get if you leave the axes fixed and rotate the system backwards by the same amount. When you do that, the signs of the angles are reversed.
61
Many of the laws of physics—but not all—are unchanged by a reflection or an inversion of the coordinates. They are symmetric with respect to an inversion. The laws of electrodynamics, for instance, are unchanged if we change x to −
x,
y to–
y, and z to–
z in all the equations. The same is true for the laws of gravity, and for the strong interactions of nuclear physics. Only the weak interactions—responsible for
β-decay—do not have this symmetry. (We discussed this in some detail in Chapter 52, Vol. I.) We will for now leave out any consideration of the
β-decays. Then in any physical system where
β-decays are not expected to produce any appreciable effect—an example would be the emission of light by an atom—the Hamiltonian
Ĥ and the operator
will commute. Under these circumstances we have the following proposition. If a state originally has even parity, and if you look at the physical situation at some later time, it will again have even parity. For instance, suppose an atom about to emit a photon is in a state known to have even parity. You look at the whole thing—including the photon—after the emission; it will again have even parity (likewise if you start with odd parity). This principle is called the conservation of parity. You can see why the words “conservation of parity” and “reflection symmetry” are closely intertwined in the quantum mechanics. Although until a few years ago it was thought that nature always conserved parity, it is now known that this is not true. It has been discovered to be false because the
β-decay reaction does not have the inversion symmetry which is found in the other laws of physics.
Now we can prove an interesting theorem (which is true so long as we can disregard weak interactions): Any state of definite energy which is not degenerate must have a definite parity. It must have either even parity or odd parity. (Remember that we have sometimes seen systems in which several states have the same energy—we say that such states are degenerate. Our theorem will not apply to them.)
For a state |
ψ0〉 of definite energy, we know that
(17.18)
where
E is just a number—the energy of the state. If we have any operator
which is a symmetry operator of the system we can prove that
(17.19)
so long as |
ψ0〉 is a unique state of definite energy. Consider the new state |
ψʹ
0〉 that you get from operating with
. If the physics is symmetric, then |
ψ’0〉 must have the same energy as |
ψ0〉. But we have taken a situation in which there is only one state of that energy, namely |
ψ0〉, so |
ψ’
0〉 must be the same state—it can only differ by a phase. That’s the physical argument.
The same thing comes out of our mathematics. Our definition of symmetry is Eq. (17.10) or Eq. (17.11) (good for any state
ψ),
(17.20)
But we are considering only a state |
ψ0〉 which is a definite energy state, so that
Ĥ | ψ
0〉 = |ψ
0〉. Since E is just a number that floats through Q if we want, we have
So
(17.21)
So
= is also a definite energy state of
Ĥ—and with the same
E. But by our hypothesis, there is only one such state; it must be that |ψʹ
0〉 = e
iδ |
ψ0〉·
What we have just proved is true for any operator
that is a symmetry operator of the physical system. Therefore, in a situation in which we consider only electrical forces and strong interactions—and no
β-decay—so that inversion symmetry is an allowed approximation, we have that
. But we have also seen that
eiδ must be either +1 or–1. So any state of a definite energy (which is not degenerate) has got either an even parity or an odd parity.
17-3 The conservation laws
We turn now to another interesting example of an operation: a rotation. We consider the special case of an operator that rotates an atomic system by angle
φ around the
z-axis. We will call this operator
62 . We are going to suppose that we have a physical situation where we have no influences lined up along the
x− and
y-axes. Any electric field or magnetic field is taken to be parallel to the
z-axis
63 so that there will be no change in the external conditions if we rotate the whole physical system about the
z-axis. For example, if we have an atom in empty space and we turn the atom around the
z-axis by an angle
φ, we have the same physical system.
Now then, there are special states which have the property that such an operation produces a new state which is the original state multiplied by some phase factor. Let us make a quick side remark to show you that when this is true the phase change must always be proportional to the angle
φ. Suppose that you would rotate twice by the angle
φ. That’s the same thing as rotating by the angle 2
φ. If a rotation by
φ has the effect of multiplying the state |ψ
0〉 by a phase
eiδ so that
two such rotations in succession would multiply the state by the factor
(eiδ)2 = ei2δ, since
The phase change
δ must be proportional to
φ.
64 We are considering then those special states |
ψ0〉 for which
(17.22)
where
m is some real number.
We also know the remarkable fact that if the system is symmetrical for a rotation around z and if the original state happens to have the property that (17.22) is true, then it will also have the same property later on. So this number m is a very important one. If we know its value initially, we know its value at the end of the game. It is a number which is conserved—m is a constant of the motion. The reason that we pull out m is because it hasn’t anything to do with any special angle φ, and also because it corresponds to something in classical mechanics. In quantum mechanics we choose to call mℏ—for such states as |ψ0〉—the angular momentum about the z-axis. If we do that we find that in the limit of large systems the same quantity is equal to the z-component of the angular momentum of classical mechanics. So if we have a state for which a rotation about the z-axis just produces a phase factor eimφ, then we have a state of definite angular momentum about that axis—and the angular momentum is conserved. It is mℏ now and forever. Of course, you can rotate about any axis, and you get the conservation of angular momentum for the various axes. You see that the conservation of angular momentum is related to the fact that when you turn a system you get the same state with only a new phase factor.
We would like to show you how general this idea is. We will apply it to two other conservation laws which have exact correspondence in the physical ideas to the conservation of angular momentum. In classical physics we also have conservation of momentum and conservation of energy, and it is interesting to see that both of these are related in the same way to some physical symmetry.
Suppose that we have a physical system—an atom, some complicated nucleus, or a molecule, or something—and it doesn’t make any difference if we take the whole system and move it over to a different place. So we have a Hamiltonian which has the property that it depends only on the
internal coordinates in some sense, and does not depend on the
absolute position in space. Under those circumstances there is a special symmetry operation we can perform which is a translation in space. Let’s define
as the operation of a displacement by the distance a along the
x-axis. Then for any state we can make this operation and get a new state. But again there can be very special states which have the property that when you displace them by a along the
x-axis you get the same state except for a phase factor. It’s also possible to prove, just as we did above, that when this happens, the phase must be proportional to a. So we can write for these special states |
ψ0〉
(17.23)
The coefficient k, when multiplied by ℏ, is called the x-component of the momentum. And the reason it is called that is that this number is numerically equal to the classical momentum px when we have a large system. The general statement is this: If the Hamiltonian is unchanged when the system is displaced, and if the state starts with a definite momentum in the x-direction, then the momentum in the x-direction will remain the same as time goes on. The total momentum of a system before and after collisions—or after explosions or what not—will be the same.
There is another operation that is quite analogous to the displacement in space: a delay in time. Suppose that we have a physical situation where there is nothing external that depends on time, and we start something off at a certain moment in a given state and let it roll. Now if we were to start the same thing off again (in another experiment) two seconds later—or/say, delayed by a time
τ—and if nothing in the external conditions depends on the absolute time, the development would be the same and the final state would be the same as the other final state, except that it will get there later by the time
τ. Under those circumstances we can also find special states which have the property that the development in time has the special characteristic that the delayed state is just the old, multiplied by a phase factor. Once more it is clear that for these special states the phase change must be proportional to
τ. We can write
(17.24)
It is conventional to use the negative sign in defining ω; with this convention ωℏ is the energy of the system, and it is conserved. So a system of definite energy is one which when displaced τ in time reproduces itself multiplied by e−iωτ. (That’s what we have said before when we defined a quantum state of definite energy, so we’re consistent with ourselves.) It means that if a system is in a state of definite energy, and if the Hamiltonian doesn’t depend on t, then no matter what goes on, the system will have the same energy at all later times.
You see, therefore, the relation between the conservation laws and the symmetry of the world. Symmetry with respect to displacements in time implies the conservation of energy; symmetry with respect to position in x, y, or z implies the conservation of that component of momentum. Symmetry with respect to rotations around the x-, y-, and z-axes implies the conservation of the x-, y-, and z-components of angular momentum. Symmetry with respect to reflection implies the conservation of parity. Symmetry with respect to the interchange of two electrons implies the conservation of something we don’t have a name for, and so on. Some of these principles have classical analogs and others do not. There are more conservation laws in quantum mechanics than are useful in classical mechanics—or, at least, than are usually made use of.
In order that you will be able to read other books on quantum mechanics, we must make a small technical aside—to describe the notation that people use. The operation of a displacement with respect to time is, of course, just the operation
Û that we talked about before:
(17.25)
Most people like to discuss everything in terms of
infinitesimal displacements in time, or in terms of infinitesimal displacements in space, or in terms of rotations through infinitesimal angles. Since any finite displacement or angle can be accumulated by a succession of infinitesimal displacements or angles, it is often easier to analyze first the infinitesimal case. The operator of an infinitesimal displacement Δ
t in time is—as we have defined it in Chapter 8—
(17.26)
Then Ĥ is analogous to the classical quantity we call energy, because if Ĥ |ψ〉 happens to be a constant times |ψ〉 namely, Ĥ |Ψ〉 then that constant is the energy of the system.
The same thing is done for the other operations. If we make a small displacement in x, say by the amount Δ
x, a state |ψ〉 will,
in general, go over into some other state |ψʹ〉. We can write
(17.27)
since as Δ
x goes to zero, the |ψʹ〉 should become just |ψ〉 or
1, and for small Δ
x the change of
from 1 should be proportional to Δ
x. Defined this way, the operator
is called the momentum operator—for the
x-component, of course.
For identical reasons, people usually write for small rotations
(17.28)
and call
the operator of the
z-component of angular momentum. For those special states for which
, we can for any small angle—say Δ
φ—expand the right-hand side to first order in Δ
φ and get
Comparing this with the definition of
in Eq. (17.28), we get that
(17.29)
In other words, if you operate with
on a state
with a definite angular momentum about the
z-axis, you get
mℏ times the same state, where
mℏ is the amount of
z-component of angular momentum. It is quite analogous to operating on a definite energy state with
to get |
ψ〉.
We would now like to make some applications of the ideas of the conservation of angular momentum—to show you how they work. The point is that they are really very simple. You knew before that angular momentum is conserved. The only thing you really have to remember from this chapter is that if a state |ψ
0〉 has the property that upon a rotation through an angle
φ about the
z-axis, it becomes
|ψ0〉; it has a
z-component of angular momentum equal to
mℏ. That’s all we will need to do a number of interesting things.
17-4 Polarized light
First of all we would like to check on one idea. In Section 11-4 we showed that when RHC polarized light is viewed in a frame rotated by the angle
φ about the
z-axis
65 it gets multiplied by
eiφ . Does that mean then that the photons of light that are right circularly polarized carry an angular momentum of
one unit
66 along the
z-axis?
Indeed it does. It also means that if we have a beam of light containing a large number of photons all circularly polarized the same way—as we would have in a classical beam—it will carry angular momentum. If the total energy carried by the beam in a certain time is
W, then there are
N =
Wlħω photons. Each one carries the angular momentum
ħ, so there is a total angular momentum of
(17.30)
Can we prove classically that light which is right circularly polarized carries an energy and angular momentum in proportion to
W/
ω? That should be a classical proposition if everything is right. Here we have a case where we can go from the quantum thing to the classical thing. We should see if the classical physics checks. It will give us an idea whether we have a right to call m the angular momentum. Remember what right circularly polarized light is, classically. It’s described by an electric field with an oscillating
x-component and an oscillating
y-component 90° out of phase so that the resultant electric vector ε goes in a circle—as drawn in
Fig. 17-5(a). Now suppose that such light shines on a wall which is going to absorb it—or at least some of it—and consider an atom in the wall according to the classical physics. We have often described the motion of the electron in the atom as a harmonic oscillator which can be driven into oscillation by an external electric field. We’ll suppose that the atom is isotropic, so that it can oscillate equally well in the
x- or
y-directions. Then in the circularly polarized light, the
x-displacement and the
y-displacement are the same, but one is 90° behind the other. The net result is that the electron moves in a circle, as shown in
Fig. 17-5(b). The electron is displaced at some displacement
r from its equilibrium position at the origin and goes around with some phase lag with respect to the vector ε. The relation between ε and r might be as shown in
Fig. 17-5(b). As time goes on, the electric field rotates and the displacement rotates with the same frequency, so their relative orientation stays the same. Now let’s look at the work being done on this electron. The rate that energy is being put into this electron is
υ, its velocity, times the component of
qε- parallel to the velocity:
(17.31)
Fig. 17-5. (a) The electric field ε in a circularly polarized light wave. (b) The motion of an electron being driven by the circularly polarized light.
But look, there is angular momentum being poured into this electron, because there is always a torque about the origin. The torque is
qεtr, which must be equal to the rate of change of angular momentum
dJz/
dt:
(17.32)
Remembering that
υ =
ωr, we have that
Therefore, if we integrate the total angular momentum which is absorbed, it is proportional to the total energy—the constant of proportionality being 1/ω, which agrees with Eq. (17.30). Light does carry angular momentum—1 unit (times ħ) if it is right circularly polarized along the z-axis, and—1 unit along the z-axis if it is left circularly polarized.
Now let’s ask the following question: If light is linearly polarized in the x-direction, what is its angular momentum? Light polarized in the x-direction can be represented as the superposition of RHC and LHC polarized light. Therefore, there is a certain amplitude that the angular momentum is +ħ and another amplitude that the angular momentum is–ħ, so it doesn’t have a definite angular momentum. It has an amplitude to appear with +ħ and an equal amplitude to appear with–ħ. The interference of these two amplitudes produces the linear polarization, but it has equal probabilities to appear with plus or minus one unit of angular momentum. Macroscopic measurements made on a beam of linearly polarized light will show that it carries zero angular momentum, because in a large number of photons there are nearly equal numbers of RHC and LHC photons contributing opposite amounts of angular momentum—the average angular momentum is zero. And in the classical theory you don’t find the angular momentum unless there is some circular polarization.
We have said that any spin-one particle can have three values of
Jz, namely +1, 0,–1 (the three states we saw in the Stern-Gerlach experiment). But light is screwy; it has only two states. It does not have the zero case. This strange lack is related to the fact that light cannot stand still. For a particle of spin
j which is standing still, there must be the 2
j + 1 possible states with values of
jz going in steps of 1 from–
j to +
j. But it turns out that for something of spin
j with zero mass only the states with the components +
j and–
j along the direction of motion exist. For example, light does not have three states, but only two—although a photon is still an object of spin one. How is this consistent with our earlier proofs—based on what happens under rotations in space—that for spin-one particles three states are necessary? For a particle at rest, rotations can be made about any axis without changing the momentum state. Particles with zero rest mass (like photons and neutrinos) cannot be at rest; only rotations about the axis along the direction of motion do not change the momentum state. Arguments about rotations around one axis only are insufficient to prove that three states are required, given that one of them varies as
eiφ under rotations by the angle
φ.
67
One further side remark. For a zero rest mass particle, in general, only one of the two spin states with respect to the line of motion (+j,–j) is really necessary. For neutrinos—which are spin one-half particles—only the states with the component of angular momentum opposite to the direction of motion (–ħ/2) exist in nature [and only along the motion (+ħ/2) for antineutrinos]. When a system has inversion symmetry (so that parity is conserved, as it is for light) both components (+j, and–j) are required.
17-5 The disintegration of the ∧0
Now we want to give an example of how we use the theorem of conservation of angular momentum in a specifically quantum physical problem. We look at break-up of the lambda particle (∧
0), which disintegrates into a proton and a π
— meson by a “weak” interaction:
Assume we know that the pion has spin zero, that the proton has spin one-half, and that the ∧
0 has spin one-half. We would like to solve the following problem: Suppose that a ∧
0were to be produced in a way that caused it to be completely polarized—by which we mean that its spin is, say “up,” with respect to some suitably chosen
z-axis—see
Fig. 17-6(a). The question is, with what probability will it disintegrate so that the proton goes off at an angle
θ with respect to the
z-axis—as in
Fig. 17-6(b)? In other words, what is the angular distribution of the disintegrations? We will look at the disintegration in the coordinate system in which the ∧
0 is at rest—we will measure the angles in this rest frame; then they can always be transformed to another frame if we want.
Fig. 17-6. A ∧
0 with spin “up” decays into a proton and a pion (in the CM system). What is the probability that the proton will go off at the angle
θ?
We begin by looking at the special circumstance in which the proton is emitted into a small solid angle Δ
Ω along the
z-axis (
Fig. 17-7). Before the disintegration we have a ∧
0 with its spin “up,” as in part (a) of the figure. After a short time—for reasons unknown to this day, except that they are connected with the weak decays—the ∧
0explodes into a proton and a pion. Suppose the proton goes up along the +
z-axis. Then, from the conservation of momentum, the pion must go down. Since the proton is a spin one-half particle, its spin must be either “up” or “down”—there are, in principle, the two possibilities shown in parts (b) and (c) of the figure. The conservation of angular momentum, however, requires that the proton have spin “up.” This is most easily seen from the following argument. A particle moving along the
z-axis cannot contribute any angular momentum about this axis by virtue of its motion; therefore, only the spins can contribute to
Jz. The spin angular momentum about the
z-axis is +
ħ/2 before the disintegration, so it must also be +
ħ/2 afterward. We can say that since the pion has no spin, the proton spin must be “up.”
Fig. 17-7. Two possibilities for the decay of a spin “up” ∧
0 with the proton going along the +
z-axis. Only (b) conserves angular momentum.
If you are worried that arguments of this kind may not be valid in quantum mechanics, we can take a moment to show you that they are. The initial state (before the disintegration), which we can call ∣∧
0, spin +z) has the property that if it is rotated about the
z-axis by the angle
φ, the state vector gets multiplied by the phase factor
eiφ/2. (In the rotated system the state vector is e
iφ/2∣ ∧
0, spin +
z〉.) That’s what we mean by spin “up” for a spin one-half particle. Since nature’s behavior doesn’t depend on our choice of axes, the final state (the proton plus pion) must have the same property. We could write the final state as, say,
But we really do not need to specify the pion motion, since in the frame we have chosen the pion always moves opposite the proton; we can simplify our description of the final state to
Now what happens to this state vector if we rotate the coordinates about the z-axis by the angle φ?
Since the proton and pion are moving along the z-axis, their motion isn’t changed by the rotation. (That’s why we picked this special case; we couldn’t make the argument otherwise.) Also, nothing happens to the pion, because it is spin zero. The proton, however, has spin one-half. If its spin is “up” it will contribute a phase change of eiφ/2 in response to the rotation. (If its spin were “down” the phase change due to the proton would be e—iφ/2.) But the phase change with rotation before and after the excitement must be the same if angular momentum is to be conserved. (And it will be, since there are no outside influences in the Hamiltonian.) So the only possibility is that the proton spin will be “up.” If the proton goes up, its spin must also be “up.”
Fig. 17-8. The decay along the
z-axis for a ∧
0 with spin “down.”
We conclude, then, that the conservation of angular momentum permits the process shown in part (b) of
Fig. 17-7, but does not permit the process shown in part (c). Since we know that the disintegration occurs, there is some amplitude for process (b)—proton going up with spin “up.” We’ll let a stand for the amplitude that the disintegration occurs in this way in any infinitesimal interval of time.
68
Now let’s see what would happen if the ∧
0 spin were initially “down.” Again we ask about the decays in which the proton goes up along the
z-axis, as shown in
Fig. 17-8. You will appreciate that in this case the proton must have spin “down” if angular momentum is conserved. Let’s say that the amplitude for such a disintegration is b.
We can’t say anything more about the two amplitudes a and b. They depend on the inner machinery of ∧0, and the weak decays, and nobody yet knows how to calculate them. We’ll have to get them from experiment. But with just these two amplitudes we can find out all we want to know about the angular distribution of the disintegration. We only have to be careful always to define completely the states we are talking about.
We want to know the probability that the proton will go off at the angle
θ with respect to the
z-axis (into a small solid angle Δ
Ω as drawn in
Fig. 17-6. Let’s put a new
z-axis in this direction and call it the zʹ-axis. We know how to analyze what happens along this axis. With respect to this new axis, the ∧
0 no longer has its spin “up,” but has a certain amplitude to have its spin “up” and another amplitude to have its spin “down.” We have already worked these out in Chapter 6, and again in Chapter 10, Eq. (10.30). The amplitude to be spin “up” is cos
θ/2, and the amplitude to be spin “down” is
69—sin
θ/2. When the ∧
0 spin is “up” along the zʹ-axis it will emit a proton in the +
z′-direction with the amplitude a. So the amplitude to find an “up”-spinning proton coming out along the z′-direction is
(17.33)
Similarly, the amplitude to find a “down”-spinning proton coming along the positive
zʹ-axis is
(17.34)
The two processes that these amplitudes refer to are shown in
Fig. 17-9.
Fig. 17-9. Two possible decay states for the ∧
0.
Let’s now ask the following easy question. If the ∧
0has spin up along the
z-axis, what is the probability that the decay proton will go off at the angle
θ? The two spin states (“up” or “down” along z′) are distinguishable even though we are not going to look at them. So to get the probability we square the amplitudes and add. The probability
ƒ(
θ) of finding a proton in a small solid angle Δ
Ω at
θ is
(17.35)
Remembering that sin
2 θ/2 =
(1—cos
θ) and that cos
2 θ/2 =
(1 + cos
θ), we can write
ƒ(
θ) as
(17.36)
The angular distribution has the form
(17.37)
The probability has one part that is independent of
θ and one part that varies linearly with cos
θ. From measuring the angular distribution we can get a and
β, and therefore, ∣
a∣ and ∣
b∣.
Now there are many other questions we can answer. Are we interested only in protons with spin “up” along the old
z-axis? Each of the terms in (17.33) and (17.34) will give an amplitude to find a proton with spin “up” and with spin “down” with respect to the
zʹ-axis (+
zʹ and–
zʹ). Spin “up” with respect to the old axis ∣+
z〉 can be expressed in terms of the base states ∣ +
z′〉 and ∣-
z′〉. We can then combine the two amplitudes (17.33) and (17.34) with the proper coefficients (cos
θ/2 and—sin
θ/2) to get the total amplitude
Its square is the probability that the proton comes out at the angle θ with its spin the same as the ∧0 (“up” along the z-axis).
If parity were conserved, we could say one more thing. The disintegration of
Fig. 17-8 is just the reflection—in, say, the
yz-plane of the disintegration—of
Fig. 17-7.
70 If parity were conserved,
b would have to be equal to
a or to—
a. Then the coefficient α of (17.37) would be zero, and the disintegration would be equally likely to occur in all directions.
The experimental results show, however, that there is an asymmetry in the disintegration. The measured angular distribution does go as cos θ as we predict—and not as cos2 θ or any other power. In fact, since the angular distribution has this form, we can deduce from these measurements that the spin of the ∧0 is 1/2. Also, we see that parity is not conserved. In fact, the coefficient α is found experimentally to be–0.62± 0.05, so b is about twice as large as a. The lack of symmetry under a reflection is quite clear.
You see how much we can get from the conservation of angular momentum. We will give some more examples in the next chapter.
Parenthetical note. By the amplitude
a in this section we mean the amplitude that the state proton going +
z, spin +
z〉 is generated in an infinitesimal time
dt from the state ∣∧, spin +
z〉, or, in other words, that
(17.38)
where
H is the Hamiltonian of the world—or, at least, of whatever is responsible for the ∧-decay. The conservation of angular momentum means that the Hamiltonian must have the property that
(17.39)
By the amplitude
b we mean that
(17.40)
Conservation of angular momentum implies that
(17.41)
If the amplitudes written in (17.33) and (17.34) are not clear, we can express them more mathematically as follows. By (17.33) we intend the amplitude that the A with spin along +
z will disintegrate into a proton moving along the +
z′-direction with its spin also in the +
z′-direction, namely the amplitude
(17.42)
By the general theorems of quantum mechanics, this amplitude can be written as
(17.43)
where the sum is to be taken over the base states ∣∧,
i〉 of the ∧-particle at rest. Since the ∧-particle is spin one-half, there are two such base states which can be in any reference base we wish. If we use for base states spin “up” and spin “down”
with respect to z′ (+
zʹ,–
zʹ), the amplitude of (17.43) is equal to the sum
The first factor of the first term is
a, and the first factor of the second term is zero—from the definition of (17.38), and from (17.41), which in turn follows from angular momentum conservation. The remaining factor 〈∧, +
z′∣ ∧, +
z〉 of the first term is just the amplitude that a spin one-half particle which has spin “up” along one axis will also have spin “up” along an axis tilted at the angle
θ, which is cos
θ/2—see
Table 6-2. So (17.44) is just
a cos
θ/2, as we wrote in (17.33). The amplitude of (17.34) follows from the same kind of arguments for a spin “down” ∧-particle.
17-6 Summary of the rotation matrices
We would like now to bring together in one place the various things we have learned about the rotations for particles of spin one-half and spin one—so they will be convenient for future reference. On the next page you will find tables of the two rotation matrices
Rz (
φ) and
Ry(
θ) for spin one-half particles, for spin-one particles, and for photons (spin-one particles with zero rest mass). For each spin we will give the terms of the matrix 〈
j∣
R∣
i〉 for rotations about the
z-axis or the
y-axis. They are, of course, exactly equivalent to the amplitudes like 〈+
T∣0
S〉 we have used in earlier chapters. We mean by
Rz(
φ) that the state is projected into a new coordinate system which is rotated through the angle
φ about the
z-axis—using always the right-hand rule to define the positive sense of the rotation. By
Ry(
θ) we mean that the reference axes are rotated by the angle
θ about the
y-axis. Knowing these two rotations, you can, of course, work out any arbitrary rotation. As usual, we write the matrix elements so that the state on the left is a base state of the new (rotated) frame and the state on the right is a base state of the old (unrotated) frame. You can interpret the entries in the tables in many ways. For instance, the entry
e—iφ/2 in
Table 17-1 means that the matrix element 〈—|
R|—〉 =
e—iφ/2. It also means that
|—〉 =
e—iφ/2 |—〉, or that 〈—∣
= 〈—∣
e—iφ/2. It’s all the same thing.
Rotation matrices for spin one-half
Rotation matrices for spin one
Photons