Local
Gauge Symmetries and Gauge Bosons I – “Classical” QED:
In
previous lectures we have outlined the quark/parton model as a description of
hadrons. While this model supplies an
intuitively appealing and approximately correct formalism for understanding the
experimental data, it fails to provide a “fundamental” understanding of the
physics (especially the symmetries) and, perhaps more importantly, fails to
provide a systematic procedure for
defining corrections to the zeroth order picture. We would prefer to have a true theory (instead
of a model), which rigorously defines the structure of corrections. Prior to the mid-1970s many physicists
believed that field theory was not the correct language for such a theory of
particle physics. While quantum field
theory was clearly the correct language for QED, it apparently failed to offer
a description of either the weak or the strong interactions. However, once the
richness of Yang-Mills theories with non-Abelian symmetries became apparent,
this situation changed dramatically. So
the next step in our discussion of symmetries and particle physics will be to
introduce the very helpful language of field theory. In particular, with the language of field
theory and Lagrangians we will be able to discuss the relationship between
interactions (i.e., dynamics) and symmetries.
As
a first and familiar example we will consider the electromagnetic interactions,
QED, and the underlying local gauge symmetry.
To proceed we must establish (recall) some formalism. First let us reach back to classical
mechanics and define the Lagrangian describing an ensemble of particles labeled
by coordinates qi(t).
The Lagrangian is formally given by the difference between the kinetic
energy, a function of the velocities, and the potential energy, a function of
the positions,
![]()
The
action, S, is defined as the time integral of the Lagrangian

By
applying
![]()
where
ei is a small parameter
and h(t) is an arbitrary function of
time subject only to the constraint that the variation of the coordinates must
vanish at the boundaries, i.e.,
![]()
Now consider the dependence of the action on the parameter ei

For the action to be extremum we require that

Thus, returning to the previous equation and integrating the last terms by parts, we require that

where the last term vanishes due to the boundary conditions on h(t). Finally the Principle of Least Action tells us that this expression should be true for any value of i and for any h(t) that satisfies the boundary conditions. This latter constraint says that the content of the square brackets must vanish at any t (think about replacing h(t) with a delta function at any t). Thus for any i we have the Euler-Lagrange Equation

As a simple example, consider a single, nonrelativistic particle in a potential V,
![]()
Noting that

the Euler-Lagrange Equation becomes

For
our purposes we want to generalize from particles to fields (still in the
classical limit) that are defined at each point in space-time. Thus the Lagrangian becomes a spatial
integral of a density
![]()
with the action defined in 4-D fashion
![]()
where both S and L are Lorentz scalars. Since the “engineering” dimension of the action must be zero, i.e., it can be thought of as the argument of an exponential, the dimension of the Lagrangian density, L, must be length–4 or energy+4 ([L] = E4).
The Lagrangian density is a function of fields and their derivatives, i.e., we substitute above with

and find the new form of the
Euler-Lagrange equation to be

for
all values of x. [The constraint
on the allowed variations of the fields is that they must vanish at the
4-surface defining the boundary of the integral for the action, which may be at
infinity.] If the Lagrangian depends on
more than one field, there will one such equation for each field.
A
simple example is a free scalar field of mass m. The corresponding Lagrangian density is
![]()
where we interpret the first terms as the relativistic analog of the kinematic energy and the second term as the potential for a free, massive field. From the discussion of dimensions above we note that the engineering dimension of such a scalar field must be E1 or L-1 ([j] = E1). Applying the Euler-Lagrange expression we obtain the expected Klein-Gordon Equation as the equation of motion for a scalar field,

with
.
To move to the quantum version of this theory we make the usual quantum mechanical identification of operators (and take h = 1 instead of h = 0). We have

(note the sign) and
, while the Euler-Lagrange Equation becomes the “mass shell”
constraint
![]()
Note
that this Lagrangian has a discrete symmetry, i.e., the action and L are invariant under the transformation j ® -j. It is generally more interesting to consider
a continuous symmetry. Imagine that the
field j depends on some continuous
parameter a and that the Lagrangian is
invariant under changes in a. Then we can consider variations parameterized
by changes in a,

Since
the Lagrangian is invariant, we can use these expressions to write

This
result must be true for any da (with the correct boundary
conditions) so that the coefficient of da must vanish. Finally we can use the Euler-Lagrange
equation to rewrite the first term and find

Thus
the quantity in the final bracket can be interpreted as a conserved current,

If the Lagrangian depends on
more than one field, each of which depends on a, the current defined in this
way will be a sum of terms, one for each such field. The German mathematician Emmy Noether (1882-1935) first derived this
connection between a continuous symmetry of the Lagrangian and the existence of
a conserved current in 1918 and such currents are often called Noether
currents. She was the daughter of a
professor of mathematics and chose to pursue a career in mathematics at a time
in
Recall from your studies of currents in classical electromagnetism or in classical mechanics that a 4-D current can be written as

where the 0th component is the spatial
density of whatever is flowing in the current, while the 3-vector part is the
flux of whatever is flowing (
). The conservation
(or continuity) equation says that the time rate of change of the density,
, is the flux into the region of interest,
, or

It may be helpful to think about the charge Q inside of a closed surface S, defining a volume V,

i.e., the charge inside changes as minus the charge flowing out through the surface.
Consider
next the simplest nontrivial example of the above structure, which is provided
by a complex scalar field (so that the charge conjugation operator can yield a
nontrivial result, i.e., the field can have nontrivial quantum
numbers). The Lagrangian (density) is

where
there are now two degrees of freedom, j and j* (or Re j and Im j). Both fields satisfy the Klein-Gordon equation
as before, i.e., the particle and antiparticle have the same mass. This Lagrangian has a continuous U(1)
symmetry, i.e., invariance under a change in the phase of the fields
(with a a real, continuous
parameter)
![]()
The
corresponding Noether current is easily derived,

When this U(1), i.e., phase invariance, is identified with electromagnetism (see the next lecture), the current will contain a multiplicative factor of the electric charge of the field.
Here we continue with the discussion of classical E&M (see the textbook by J. D. Jackson). This will allow us to “naturally” introduce vector fields. Recall that (in Heavyside-Lorentz notation with no magnetic monopoles) Maxwell’s equations look like

The 6 degrees of freedom represented by the 3-vector
electric and magnetic fields are most simply represented in terms of the
4-vector potential
with engineering dimension L-1 or E1,
where

The form of the expression for
(a cross product) reminds us that, while the electric field
is an ordinary vector (odd under parity transformations), the magnetic field is
a pseudovector (even under parity transformations).
We next note that the last two of Maxwell’s equations (the homogeneous ones) are satisfied automatically by the (new) definitions of these 3-vectors, while the inhomogeneous equations can be easily expressed in terms of a 4-curl, the field-strength tensor, defined by
![]()
This antisymmetric 4-tensor has 6 degrees of freedom that are just the components of the electric and magnetic fields

The inhomogeneous Maxwell equations are now contained in a single 4-vector equation
![]()
After a little thought we recognize that this
equation is the Euler-Lagrange Equation, with respect to the 4-vector field An,
arising from the following Lagrangian
![]()
We can interpret the first term as the “kinetic energy” for a vector field and the second term as describing the coupling of that field to an external current Jn.
Related to Fmn is a second tensor, “F Dual”, defined by

In terms of this new tensor the homogeneous Maxwell equations take the form
![]()
You may recall from your studies of classical E&M that the above equations do not fully specify the vector field. The “physical fields”, i.e., the electric and magnetic fields or the field-strength tensor Fmn, are invariant under the following “gauge transformation” depending on the scalar function l(xm)

Thus, without changing the physics, we are free to apply another condition on Am, the “gauge condition”. As usual with such freedom, the specific “choice of gauge” will be made based on making the problem at hand as simple as possible. An example is the “Lorentz gauge” where we require
![]()
i.e., Am is polarized orthogonal to the 4-direction of its variation. In this case the Euler-Lagrange Equation for the vector field has the simple form
![]()
or, in a region with no sources,
![]()
This tells us that the vector field (the photon) satisfies a massless Klein-Gordon equation. The momentum, qm, of a freely propagating photon should satisfy qm qm = 0. In the free case we can easily write down the explicit form as a plane wave with a polarization vector
![]()
The equation of motion requires that qm is a “light-like” vector, qm qm = 0 (it is precisely light after all), while the Lorentz gauge condition requires
![]()
i.e., the wave is transversely polarized, as a 4-vector, and seems to have 3 degrees of freedom. But we are not done yet! Recall that we defined the gauge transformation in terms of the function l(x) in order to get to the Lorentz gauge. But this constraint does not fully specify l(x). In particular, we can add to l(x) any scalar function L(x), which satisfies the free, massless wave equation
![]()
and still satisfy the gauge condition,

If L(x) is a function of the scalar variable xmqm, ¶nL will be proportional to qn and ¶n¶nL will be proportional to q2 = 0. In particular, for a plane wave form we have

Thus we can always choose the coefficient c so as to cancel the 0th component of An, i.e.,

Explicitly we choose c = ie0/q0
and find
. Thus we are always
free to select a gauge in which the polarization of the vector field is
transverse (as a 3-vector) to its direction of motion in 3-space. Thus it must be the case that there are only
2 degrees of freedom for a freely propagating (on-shell) photon. If the photon (or classical E&M plane
wave) is moving in the z direction, these degrees of freedom correspond
to linear polarization in either the x or y directions. As we have discussed earlier, it is sometimes
more illuminating to use the circular polarization or helicity basis

which are also called right-handed (positive helicity) and left-handed (negative helicity).
If we instead consider a massive vector field, where there is no gauge symmetry, the Lagrangian has the form
![]()
Note that the mass term is not invariant under a gauge transformation as defined earlier. We can, however, still determine the equation of motion, i.e., the Euler-Lagrange Equation, as before to find
![]()
If we operate on this equation with ¶n and recall that Fmn is antisymmetric, we have
![]()
Thus, as long as m ¹ 0, the massive vector must satisfy
![]()
This looks like our Lorentz gauge condition above, but it arises in quite a different fashion. Above we had a gauge symmetry and no mass, while here we have a mass and no gauge symmetry. It is still true that this constraint equation means that the massive vector field has only 3 degrees of freedom. However, there is no gauge symmetry to reduce that number to 2. A massive vector field can be represented by 3 linear polarizations, x, y, z (2 transverse and 1 longitudinal polarization) or by +, - and 0 helicities. For motion in the z direction we have (where the symbol l now represents the helicity)

These forms have the required properties

So we have introduced notation for scalar and vector fields. The final type of fields we need are fermions. In the 4-component notation of Dirac,

we have the following equation of motion (the Dirac equation) for a free massive fermion (recall Lecture 9)
![]()
In the last expression the indices have been made explicit (m = 0,1,2,3 – the usual Lorentz index; a,b = 1,2,3,4 – the spinor index). The 4x4 Dirac matrices satisfy the anticommutation relations
![]()
Recall that this property is required so that a solution of the (linear) Dirac equation is also a solution of the (quadratic) Klein-Gordon equation. There is also a fifth matrix conventionally defined (with some variation in the literature),
![]()
that anticommutes with the others,
![]()
The above relations specify that
and we generally desire a unitary representation where
. Thus we can write
. In Lecture 9 we
displayed the following explicit representation


If we take the Hermitian conjugate of the Dirac Equation, we find
![]()
With the standard definition
we have
![]()
If we multiply this expression on the right by
and the original Dirac equation on the left by
, we obtain

where the first two expressions were subtracted to
obtain the last. Thus the vector
quantity
represents a conserved current if y is a free Dirac
field, independent of the size of its mass.
Analogously to our earlier discussion of the of the complex scalar field
j,
this is a Noether current corresponding to the continuous global symmetry
![]()
The conserved charge is just the fermion number carried by y. The Lagrangian corresponding to the Dirac Equation is
![]()
where we note that the dimension of the fermion
field is L-1.5, E1.5([y] = E1.5). It is clearly invariant under the (U(1)) symmetry
transformation just mentioned. Applying
the Euler-Lagrange equation for variations with respect to
yields the Dirac
equation, while varying
yields the equation of motion for
.
Recall that we define the field for the antiparticle as
![]()
The matrix representing the C operator satisfies
![]()
Thus, for a given operator G, we can relate the matrix element for antiparticles to that for particles via

where we included a minus sign in the last step from commuting the 2 fermions. The phases, h, for various operators, are given by the following table.
|
G = |
1 |
g5 |
gm |
gmg5 |
smn º i [gm,gn]/2 |
|
h = |
1 |
1 |
-1 |
1 |
-1 |
Note that the conserved current defined by
is odd under C
[
] and thus acts like an electric current. We will see that it is just that. An explicit representation for C is
and
.
Recall (from the appendix to Lecture 9) that we can define fermions with specific helicities and handedness. Particularly useful are the eigenstates of g5, which are often called either chiral fermions or Weyl fermions and labeled with the “handed” notation (even though these definitions match the helicity definition of handedness only in the massless case),
![]()
As defined earlier, these states are obtained with the projection operators

With these projection operators it is easy to verify that

i.e., in the chiral basis the scalar expression is “off-diagonal” while the vector is “diagonal”. Thus, in order to have a mass term for a fermion in the Lagrangian, both helicities must be present. A pure chiral state (i.e., with only yL or yR present) must be massless (in the Dirac language). Since only the mass term in the Lagrangian mixes the two chiral states, the massless case with both chiral states present exhibits chiral symmetry in the sense that we have two independent U(1) symmetries

A related but orthogonal concept is the Majorana fermion. This addresses the question of whether a fermion with zero additive quantum numbers can be its own antiparticle (analogous to the photon),
![]()
In contrast to the case of Dirac particles, which have 4-components corresponding to distinct particle and antiparticle states, each with two helicities, the Majorana particle has just 2 components corresponding to the two helicities of the identical particle and antiparticle. Note that this is not the same as a (2-component) Weyl or chiral particle. In the latter case, C relates a right-handed particle to a left-handed antiparticle [recall the table above]

Thus the particle and antiparticle are distinct and not Majorana-like. Further, it is possible in the Majorana case, unlike the Weyl case, to include a mass term in the Lagrangian (but we will not work out the details here). It remains unclear whether the idea of a Majorana particle plays a role in neutrino physics. An important experiment test is the search for neutrinoless contributions to nuclear double beta-decay, e.g., in 82Se ® 82Kr. In the usual Dirac (chiral weak interaction) case, the underlying process is
![]()
If the neutrino is a Majorana particle and has a small mass (so that L and R mix), the antineutrino emitted in the “first” decay can be absorbed as a (left-handed) neutrino in the “second” decay. Thus we would see
![]()
In the next lecture we will relate the above structure of classical E&M to the idea of a local U(1) gauge symmetry and see how a massless vector particle, the photon, is a necessary component.