Physics 557 – Lecture 13

 

Local Gauge Symmetries and Gauge Bosons I – “Classical” QED:

 

In previous lectures we have outlined the quark/parton model as a description of hadrons.  While this model supplies an intuitively appealing and approximately correct formalism for understanding the experimental data, it fails to provide a “fundamental” understanding of the physics (especially the symmetries) and, perhaps more importantly, fails to provide a systematic procedure for defining corrections to the zeroth order picture.  We would prefer to have a true theory (instead of a model), which rigorously defines the structure of corrections.  Prior to the mid-1970s many physicists believed that field theory was not the correct language for such a theory of particle physics.  While quantum field theory was clearly the correct language for QED, it apparently failed to offer a description of either the weak or the strong interactions. However, once the richness of Yang-Mills theories with non-Abelian symmetries became apparent, this situation changed dramatically.  So the next step in our discussion of symmetries and particle physics will be to introduce the very helpful language of field theory.  In particular, with the language of field theory and Lagrangians we will be able to discuss the relationship between interactions (i.e., dynamics) and symmetries.

 

As a first and familiar example we will consider the electromagnetic interactions, QED, and the underlying local gauge symmetry.  To proceed we must establish (recall) some formalism.  First let us reach back to classical mechanics and define the Lagrangian describing an ensemble of particles labeled by coordinates qi(t).   The Lagrangian is formally given by the difference between the kinetic energy, a function of the velocities, and the potential energy, a function of the positions,

 

                                   

 

The action, S, is defined as the time integral of the Lagrangian

 

                                                     

 

By applying Hamilton’s Principle of Least Action, i.e., the action describing the “true” behavior of the system is an extremum, we are led to the Euler-Lagrange Equations or the usual Equations of Motion.  We proceed by considering a variation in each of the coordinates of the form

 

                                        

 

where ei is a small parameter and h(t) is an arbitrary function of time subject only to the constraint that the variation of the coordinates must vanish at the boundaries, i.e.,

                                                

Now consider the dependence of the action on the parameter ei

 

           

 

For the action to be extremum we require that

 

                                                    

 

Thus, returning to the previous equation and integrating the last terms by parts, we require that

 

                        

 

where the last term vanishes due to the boundary conditions on h(t).  Finally the Principle of Least Action tells us that this expression should be true for any value of i and for any h(t) that satisfies the boundary conditions.  This latter constraint says that the content of the square brackets must vanish at any t (think about replacing h(t) with a delta function at any t).  Thus for any i we have the Euler-Lagrange Equation

 

                                                

 

As a simple example, consider a single, nonrelativistic particle in a potential V,

 

             

 

Noting that

 

                                              

 

the Euler-Lagrange Equation becomes Newton’s Equation

 

                                             

 

For our purposes we want to generalize from particles to fields (still in the classical limit) that are defined at each point in space-time.  Thus the Lagrangian becomes a spatial integral of a density

 

                                                    

 

with the action defined in 4-D fashion

 

                                                  

 

where both S and L are Lorentz scalars.  Since the “engineering” dimension of the action must be zero, i.e., it can be thought of as the argument of an exponential, the dimension of the Lagrangian density, L, must be length–4 or energy+4 ([L] = E4). 

 

The Lagrangian density is a function of fields and their derivatives, i.e., we substitute above with

 

                            

 

 and find the new form of the Euler-Lagrange equation to be

 

                                    

 

for all values of x.  [The constraint on the allowed variations of the fields is that they must vanish at the 4-surface defining the boundary of the integral for the action, which may be at infinity.]  If the Lagrangian depends on more than one field, there will one such equation for each field.

 

A simple example is a free scalar field of mass m.  The corresponding Lagrangian density is

 

                                    

 

where we interpret the first terms as the relativistic analog of the kinematic energy and the second term as the potential for a free, massive field.  From the discussion of dimensions above we note that the engineering dimension of such a scalar field must be E1 or L-1 ([j] = E1).  Applying the Euler-Lagrange expression we obtain the expected Klein-Gordon Equation as the equation of motion for a scalar field,

 

                               

 

with . 

 

To move to the quantum version of this theory we make the usual quantum mechanical identification of operators (and take h = 1 instead of h = 0).  We have

                                 

 

(note the sign) and , while the Euler-Lagrange Equation becomes the “mass shell” constraint

 

                                               

 

Note that this Lagrangian has a discrete symmetry, i.e., the action and L are invariant under the transformation j ® -j.  It is generally more interesting to consider a continuous symmetry.  Imagine that the field j depends on some continuous parameter a and that the Lagrangian is invariant under changes in a.  Then we can consider variations parameterized by changes in a,

 

             

 

Since the Lagrangian is invariant, we can use these expressions to write

 

                    

 

This result must be true for any da (with the correct boundary conditions) so that the coefficient of da must vanish.  Finally we can use the Euler-Lagrange equation to rewrite the first term and find

 

   

 

Thus the quantity in the final bracket can be interpreted as a conserved current,

 

                                  

 

If the Lagrangian depends on more than one field, each of which depends on a, the current defined in this way will be a sum of terms, one for each such field.  The German mathematician Emmy Noether (1882-1935) first derived this connection between a continuous symmetry of the Lagrangian and the existence of a conserved current in 1918 and such currents are often called Noether currents.  She was the daughter of a professor of mathematics and chose to pursue a career in mathematics at a time in Germany when women typically were not allowed to study at universities nor join their faculties.

 

Recall from your studies of currents in classical electromagnetism or in classical mechanics that a 4-D current can be written as

 

                                                 

 

where the 0th component is the spatial density of whatever is flowing in the current, while the 3-vector part is the flux of whatever is flowing ().  The conservation (or continuity) equation says that the time rate of change of the density, , is the flux into the region of interest, , or

 

                                          

 

It may be helpful to think about the charge Q inside of a closed surface S, defining a volume V,

 

                         

 

i.e., the charge inside changes as minus the charge flowing out through the surface.

 

Consider next the simplest nontrivial example of the above structure, which is provided by a complex scalar field (so that the charge conjugation operator can yield a nontrivial result, i.e., the field can have nontrivial quantum numbers).  The Lagrangian (density) is

 

                  

 

where there are now two degrees of freedom, j and j* (or Re j and Im j).  Both fields satisfy the Klein-Gordon equation as before, i.e., the particle and antiparticle have the same mass.  This Lagrangian has a continuous U(1) symmetry, i.e., invariance under a change in the phase of the fields (with a a real, continuous parameter)

 

                                       

 

The corresponding Noether current is easily derived,

 

                                  

 

When this U(1), i.e., phase invariance, is identified with electromagnetism (see the next lecture), the current will contain a multiplicative factor of the electric charge of the field.

 

Here we continue with the discussion of classical E&M (see the textbook by J. D. Jackson).  This will allow us to “naturally” introduce vector fields.  Recall that (in Heavyside-Lorentz notation with no magnetic monopoles) Maxwell’s equations look like

 

                                     

 

The 6 degrees of freedom represented by the 3-vector electric and magnetic fields are most simply represented in terms of the 4-vector potential with engineering dimension L-1 or E1, where

 

                                     

 

The form of the expression for (a cross product) reminds us that, while the electric field is an ordinary vector (odd under parity transformations), the magnetic field is a pseudovector (even under parity transformations). 

 

We next note that the last two of Maxwell’s equations (the homogeneous ones) are satisfied automatically by the (new) definitions of these 3-vectors, while the inhomogeneous equations can be easily expressed in terms of a 4-curl, the field-strength tensor, defined by

 

                                           

 

This antisymmetric 4-tensor has 6 degrees of freedom that are just the components of the electric and magnetic fields

 

                                 

 

 

The inhomogeneous Maxwell equations are now contained in a single 4-vector equation

 

                           

 

After a little thought we recognize that this equation is the Euler-Lagrange Equation, with respect to the 4-vector field An, arising from the following Lagrangian

                                        

 

We can interpret the first term as the “kinetic energy” for a vector field and the second term as describing the coupling of that field to an external current Jn.

 

Related to Fmn is a second tensor, “F Dual”, defined by

 

                    

 

In terms of this new tensor the homogeneous Maxwell equations take the form

 

                                                    

 

You may recall from your studies of classical E&M that the above equations do not fully specify the vector field.  The “physical fields”, i.e., the electric and magnetic fields or the field-strength tensor Fmn, are invariant under the following “gauge transformation” depending on the scalar function l(xm)

 

                               

 

Thus, without changing the physics, we are free to apply another condition on Am, the “gauge condition”.  As usual with such freedom, the specific “choice of gauge” will be made based on making the problem at hand as simple as possible.  An example is the “Lorentz gauge” where we require

 

                                                      

 

i.e., Am is polarized orthogonal to the 4-direction of its variation.  In this case the Euler-Lagrange Equation for the vector field has the simple form

 

                                       

 

or, in a region with no sources,

 

                                                 

 

This tells us that the vector field (the photon) satisfies a massless Klein-Gordon equation.  The momentum, qm, of a freely propagating photon should satisfy qm qm = 0.  In the free case we can easily write down the explicit form as a plane wave with a polarization vector

 

                                                

 

The equation of motion requires that qm is a “light-like” vector, qm qm = 0 (it is precisely light after all), while the Lorentz gauge condition requires

 

                                                 

 

i.e., the wave is transversely polarized, as a 4-vector, and seems to have 3 degrees of freedom.  But we are not done yet!  Recall that we defined the gauge transformation in terms of the function l(x) in order to get to the Lorentz gauge.  But this constraint does not fully specify l(x).  In particular, we can add to l(x) any scalar function L(x), which satisfies the free, massless wave equation

 

                                                 

 

and still satisfy the gauge condition,

 

                             

 

If L(x) is a function of the scalar variable xmqm, nL will be proportional to qn and nnL will be proportional to q2 = 0.  In particular, for a plane wave form we have

 

                                     

 

Thus we can always choose the coefficient c so as to cancel the 0th component of An, i.e.,

 

                                  

 

Explicitly we choose c = ie0/q0 and find .  Thus we are always free to select a gauge in which the polarization of the vector field is transverse (as a 3-vector) to its direction of motion in 3-space.  Thus it must be the case that there are only 2 degrees of freedom for a freely propagating (on-shell) photon.  If the photon (or classical E&M plane wave) is moving in the z direction, these degrees of freedom correspond to linear polarization in either the x or y directions.  As we have discussed earlier, it is sometimes more illuminating to use the circular polarization or helicity basis

 

                                                

 

which are also called right-handed (positive helicity) and left-handed (negative helicity).

 

If we instead consider a massive vector field, where there is no gauge symmetry, the Lagrangian has the form

 

                                    

 

Note that the mass term is not invariant under a gauge transformation as defined earlier.  We can, however, still determine the equation of motion, i.e., the Euler-Lagrange Equation, as before to find

 

                                             

 

If we operate on this equation with n and recall that Fmn is antisymmetric, we have

 

                               

 

Thus, as long as m ¹ 0, the massive vector must satisfy

 

                                                     

 

This looks like our Lorentz gauge condition above, but it arises in quite a different fashion.  Above we had a gauge symmetry and no mass, while here we have a mass and no gauge symmetry.  It is still true that this constraint equation means that the massive vector field has only 3 degrees of freedom.  However, there is no gauge symmetry to reduce that number to 2.  A massive vector field can be represented by 3 linear polarizations, x, y, z (2 transverse and 1 longitudinal polarization) or by +, - and 0 helicities.  For motion in the z direction we have (where the symbol l now represents the helicity)

 

                        

 

These forms have the required properties

 

                       

 

So we have introduced notation for scalar and vector fields.  The final type of fields we need are fermions.  In the 4-component notation of Dirac,

 

                                                     

 

we have the following equation of motion (the Dirac equation) for a free massive fermion (recall Lecture 9)

 

    

 

In the last expression the indices have been made explicit (m = 0,1,2,3 – the usual Lorentz index; a,b  =  1,2,3,4 – the spinor index).  The 4x4 Dirac matrices satisfy the anticommutation relations

 

                        

 

Recall that this property is required so that a solution of the (linear) Dirac equation is also a solution of the (quadratic) Klein-Gordon equation.  There is also a fifth matrix conventionally defined (with some variation in the literature),

 

                                             

 

that anticommutes with the others,

                                                  

 

The above relations specify that and we generally desire a unitary representation where .  Thus we can write .  In Lecture 9 we displayed the following explicit representation

 

                       

 

If we take the Hermitian conjugate of the Dirac Equation, we find

 

                                        

 

With the standard definition we have

 

                                            

 

If we multiply this expression on the right by and the original Dirac equation on the left by , we obtain

 

                                        

 

where the first two expressions were subtracted to obtain the last.  Thus the vector quantity represents a conserved current if y is a free Dirac field, independent of the size of its mass.  Analogously to our earlier discussion of the of the complex scalar field j, this is a Noether current corresponding to the continuous global symmetry

 

                                       

 

The conserved charge is just the fermion number carried by y.  The Lagrangian corresponding to the Dirac Equation is

 

                                        

 

where we note that the dimension of the fermion field is L-1.5, E1.5([y] = E1.5).   It is clearly invariant under the (U(1)) symmetry transformation just mentioned.  Applying the Euler-Lagrange equation for variations with respect to  yields the Dirac equation, while varying yields the equation of motion for .

 

Recall that we define the field for the antiparticle as

 

                                        

 

The matrix representing the C operator satisfies

 

                                             

 

Thus, for a given operator G, we can relate the matrix element for antiparticles to that for particles via

                     

 

where we included a minus sign in the last step from commuting the 2 fermions.  The phases, h, for various operators, are given by the following table.

 

G =

1

g5

gm

gmg5

smn º i [gm,gn]/2

h =

1

1

-1

1

-1

 

Note that the conserved current defined by  is odd under C [] and thus acts like an electric current.  We will see that it is just that.  An explicit representation for C is and .

 

Recall (from the appendix to Lecture 9) that we can define fermions with specific helicities and handedness.  Particularly useful are the eigenstates of g5, which are often called either chiral fermions or Weyl fermions and labeled with the “handed” notation (even though these definitions match the helicity definition of handedness only in the massless case),

 

                                               

 

As defined earlier, these states are obtained with the projection operators

 

                                             

 

With these projection operators it is easy to verify that

 

                                    

 

i.e., in the chiral basis the scalar expression is “off-diagonal” while the vector is “diagonal”.  Thus, in order to have a mass term for a fermion in the Lagrangian, both helicities must be present.  A pure chiral state (i.e., with only yL or yR present) must be massless (in the Dirac language).  Since only the mass term in the Lagrangian mixes the two chiral states, the massless case with both chiral states present exhibits chiral symmetry in the sense that we have two independent U(1) symmetries

 

                           

 

A related but orthogonal concept is the Majorana fermion.  This addresses the question of whether a fermion with zero additive quantum numbers can be its own antiparticle (analogous to the photon),

 

                                        

 

In contrast to the case of Dirac particles, which have 4-components corresponding to distinct particle and antiparticle states, each with two helicities, the Majorana particle has just 2 components corresponding to the two helicities of the identical particle and antiparticle.  Note that this is not the same as a (2-component) Weyl or chiral particle.  In the latter case, C relates a right-handed particle to a left-handed antiparticle [recall the table above]

 

         

 

Thus the particle and antiparticle are distinct and not Majorana-like.  Further, it is possible in the Majorana case, unlike the Weyl case, to include a mass term in the Lagrangian (but we will not work out the details here).  It remains unclear whether the idea of a Majorana particle plays a role in neutrino physics.  An important experiment test is the search for neutrinoless contributions to nuclear double beta-decay, e.g., in 82Se ® 82Kr.  In the usual Dirac (chiral weak interaction) case, the underlying process is

 

                                           

 

 If the neutrino is a Majorana particle and has a small mass (so that L and R mix), the antineutrino emitted in the “first” decay can be absorbed as a (left-handed) neutrino in the “second” decay.  Thus we would see

 

                                                

 

In the next lecture we will relate the above structure of classical E&M to the idea of a local U(1) gauge symmetry and see how a massless vector particle, the photon, is a necessary component.