Next: Lorentz Invariance of Dirac
Up: Relativistic Electron Theory
Previous: Preliminary Analysis
Consider the motion of an electron in the absence of electromagnetic fields. In classical relativity,
electron energy,
, is related to electron momentum,
, according to the well-known formula
|
(11.15) |
where
is the electron rest mass [93].
The quantum mechanical equivalent of this expression is the wave equation
|
(11.16) |
where the
's are interpreted as differential operators according to Equation (11.14). The previous equation
takes into account the correct relativistic relation between electron energy and momentum, but is nevertheless unsatisfactory from the
point of view of relativistic theory, because it is highly asymmetric between
and the other
's. This makes the equation
difficult to generalize, in a manifestly Lorentz-invariant manner, in the presence of electromagnetic fields. We must therefore look for a new equation.
If we multiply the wave equation (11.16) by the operator
|
(11.17) |
then we obtain
|
(11.18) |
This equation is manifestly Lorentz invariant, and, therefore, forms a more convenient starting point for relativistic quantum mechanics.
Note, however, that Equation (11.18) is not entirely equivalent to Equation (11.16), because, although each
solution of Equation (11.16) is also a solution of Equation (11.18), the converse is not true. In fact, only those solutions of Equation (11.18)
belonging to positive values of
are also solutions of Equation (11.16).
The wave equation (11.18) is quadratic in
, and is, thus, not of the form required by the laws of quantum theory. (Recall that we demonstrated, from
general principles, in Chapter 3, that the
time evolution equation for the wavefunction should be linear in the operator
, and, hence, in
.) We, therefore, seek a wave equation that is
equivalent to Equation (11.18), but is
linear in
. In order to ensure that this equation transforms in a simple way under a Lorentz transformation, we
shall require it to be rational and linear in
,
,
, as well as
. We are, thus,
lead to a wave equation of the form
|
(11.19) |
where the
's and
are dimensionless, and independent of the
's. Moreover, according to standard relativity, because we are considering the case of no electromagnetic fields, all points in space-time
must be equivalent. Hence, the
's and
must also be independent of
the
's. This implies that the
's and
commute with the
's and the
's. We, therefore, deduce that the
's and
describe an internal degree of freedom that is independent of space-time coordinates. In fact, we shall demonstrate later on in this chapter that these operators are related to electron spin. Note that the previous equation can be
written in the standard form
|
(11.20) |
where the effective Hamiltonian,
, is
|
(11.21) |
Multiplying Equation (11.19) by the operator
|
(11.22) |
we obtain
|
(11.23) |
where
.
This equation is equivalent to Equation (11.18) provided that
for
.
It is helpful to define the
, for
, where
for
.
Equations (11.24)-(11.26) can then be shown to reduce to
|
(11.29) |
(See Exercise 2.)
One way of satisfying the previous anti-commutation relations is to represent the operators
as matrices.
It follows that the operators
and
are also matrices.
[Incidentally, it is clear from Equation (11.21) that the
and
matrices must be Hermitian, otherwise
our effective Hamiltonian would not be an Hermitian operator.]
In fact, it is possible to prove that the
and
must be even-dimensional matrices. (See Exercise 3.)
Unfortunately, it is not possible to find a system of four appropriate
matrices. (The three Pauli
matrices have the correct properties, but there is no fourth matrix.)
It turns
out that the smallest dimension in which the
can be realized is four [9]. In fact, it is easily verified that the
matrices
for
, satisfy the appropriate anti-commutation relations. (See Exercise 4) Here, 0
and
denote
null and identity matrices, respectively, whereas the
represent the
Pauli matrices
introduced in Section 5.7.
It follows from Equations (11.27) and (11.28) that
Note that
,
, and the
, are all Hermitian matrices, whereas the
, for
, are anti-Hermitian.
However, the matrices
, for
, are Hermitian. Moreover, it is easily demonstrated that
|
(11.34) |
for
.
Equation (11.19) can be written in the form
|
(11.35) |
where
.
Alternatively, we can write
where
, and
is the vector of the
matrices. The previous expression is known as the Dirac equation [30].
Incidentally, it is clear that, corresponding to the four rows and columns of the
matrices, the wavefunction
must take the form of a
column matrix, each element of which is, in general, a function of the
.
We saw in Section 5.7 that the spin of the electron requires the wavefunction to have two components. The reason
our present theory requires the wavefunction to have four components is because the wave equation (11.18) has twice
as many solutions as it ought to have, half of them corresponding to negative energy states.
We can incorporate an electromagnetic field into the previous formalism by means of the standard prescription
, and
, where
is the magnitude of the electron charge,
the scalar potential, and
the
vector potential [67]. This prescription can be expressed in the Lorentz-invariant form
|
(11.37) |
where
is the potential 4-vector [93]. Thus,
Equation (11.35) becomes
|
(11.38) |
whereas
Equation (11.36) generalizes to
|
(11.39) |
If we write the wavefunction in the spinor form
|
(11.40) |
then the Hermitian conjugate of Equation (11.39) becomes
|
(11.41) |
where
|
(11.42) |
Here, use has been made of the fact that the
and the
are Hermitian matrices that commute with the
and the
.
It follows from
times Equation (11.38) that
|
(11.43) |
The Hermitian conjugate of this expression is
|
(11.44) |
where
now acts backward on
, and use has been made of the fact that the matrices
and
are Hermitian. Taking the difference between the previous two equations, we obtain
|
(11.45) |
where
|
(11.46) |
Writing
, where
Equation (11.45) becomes
|
(11.49) |
The previous expression has the same form as the non-relativistic probability conservation equation, (3.65). This
suggests that we can interpret the positive-definite real scalar field
as the relativistic probability density, and the vector field
as the relativistic probability current. Integration of the preceding expression over all space, assuming that
as
, yields
|
(11.50) |
This ensures that if the wavefunction is properly normalized at time
, such that
|
(11.51) |
then the wavefunction remains properly normalized at all subsequent times, as it evolves in accordance with the Dirac equation.
In fact, if this were not the case then it would be impossible to interpret
as a probability density. Now, relativistic
invariance demands that if the wavefunction is properly normalized in one particular inertial frame then it should
be properly normalized in all inertial frames [93]. This is the case provided that Equation (11.45) is Lorentz invariant (i.e., it has the property
that if it holds
in one inertial frame then it holds in all inertial frames),
which is true as long as the
transform as the contravariant components of a 4-vector under Lorentz transformation. (See the following section, and Exercise 6.)
Next: Lorentz Invariance of Dirac
Up: Relativistic Electron Theory
Previous: Preliminary Analysis
Richard Fitzpatrick
2016-01-22