Academia.eduAcademia.edu

Advanced Quantum Mechanics - Schwabl

The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

Advanced Quantum Mechanics Springer-Verlag Berlin Heidelberg GmbH Physics and Astronomy ONLINE LIBRARY https://www.springer.de/phys/ Advanced Texts in Physics This program of advanced texts covers a broad spectrum of topics which are of current and emerging interest in physics. Each book provides a comprehensive and yet accessible introduction to a field at the forefront of modern research. As such, these texts are intended for senior undergraduate and graduate students at the MS and PhD level; however, research scientists seeking an introduction to particular areas of physics will also benefit from the titles in this collection. Franz Schwabl Advanced Quantum Mechanics Translated by Roginald Hilton and Angela Lahee Second Edition With 79 Figures, 4 Tables, and 103 Problems , Springer Professor Dr. Franz Schwabl Physik -Department Technische Universtă Munchen James-Franck-Strasse 85747 Garching, Germany E-mail: [email protected] Translators: Dr. Roginald Hilton Dr. Angela Lahee Title of the original German edition: Quantenmechanik fur Fortgeschrittene (QM II) (Springer-Lehrbuch) © Springer-Verlag Berlin Heidelberg 2000 ISSN 1439-2674 ISBN 978-3-662-05420-8 DOI 10.1007/978-3-662-05418-5 ISBN 978-3-662-05418-5 (eBook) Cataloging-in-Publication Data applied for. A catalog record for this book is available from the Library of Congress. Bibliographic informat ion published by Die Deutsche Bibliothek: Die Deutsche Bibliothek lists this publicat ion in the Deutsche Nationalbibliografie; detailed bibliographic data is available in the Internet at <https://dnb.ddb.de> This work is subject to copyright. AII rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of iIIustrations, recitation, broadcasting, reproduction on microfilm or in any other way, and storage in data banks. Duplication of this publicat ion or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer-Verlag. Violations are liable for prosecution under the German Copyright Law. https://www.springer.de © Springer-Verlag Berlin Heidelberg 1999,2004 Originally published by Springer-Verlag Berlin Heidelberg New York in 2004 Softcover reprint of the hardcover 2nd edition 2004 The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. Typesetting: A. Lahee and F. Herweg EDV Beratung using a Springer TI;X macro package Cover design: design & production GmbH, Heidelberg Printed on acid-free paper SPIN 10925171 56/3141/jl 54321o The true physics is that which will, one day, achieve the inclusion of man in his wholeness in a coherent picture of the world. Pierre Teilhard de Chardin To my daughter Birgitta Preface to the Second Edition In the new edition, supplements, additional explanations and cross references have been added at numerous places, including new formulations of some of the problems. Figures have been redrawn and the layout has been improved. In all these additions I have attempted not to change the compact character of the book. The present second English edition is identical to the current German third edition. The proofs were read by E. Bauer, E. MarquardSchmitt und T. Wollenweber. Special thanks go to them and to Dr. R. Hilton for comments on some of the formulations and to Mrs. Jorg-Miiller for general supervision. I would like to thank all colleagues and students who have made suggestions to improve the book, as well as the publisher, Dr. Thorsten Schneider and Mrs. J. Lenz for the excellent cooperation. Munich, May 2003 F. Schwabl Preface to the First Edition This textbook deals with advanced topics in the field of quantum mechanics, material which is usually encountered in a second university course on quantum mechanics. The book, which comprises a total of 15 chapters, is divided into three parts: I. Many-Body Systems, II. Relativistic Wave Equations, and III. Relativistic Fields. The text is written in such a way as to attach importance to a rigorous presentation while, at the same time, requiring no prior knowledge, except in the field of basic quantum mechanics. The inclusion of all mathematical steps and full presentation of intermediate calculations ensures ease of understanding. A number of problems are included at the end of each chapter. Sections or parts thereof that can be omitted in a first reading are marked with a star, and subsidiary calculations and remarks not essential for comprehension are given in small print. It is not necessary to have read Part I in order to understand Parts II and III. References to other works in the literature are given whenever it is felt they serve a useful purpose. These are by no means complete and are simply intended to encourage further reading. A list of other textbooks is included at the end of each of the three parts. In contrast to Quantum Mechanics I, the present book treats relativistic phenomena, and classical and relativistic quantum fields. Part I introduces the formalism of second quantization and applies this to the most important problems that can be described using simple methods. These include the weakly interacting electron gas and excitations in weakly interacting Bose gases. The basic properties of the correlation and response functions of many-particle systems are also treated here. The second part deals with the Klein-Gordon and Dirac equations. Important aspects, such as motion in a Coulomb potential are discussed, and particular attention is paid to symmetry properties. The third part presents Noether's theorem, the quantization of the KleinGordon, Dirac, and radiation fields, and the spin-statistics theorem. The final chapter treats interacting fields using the example of quantum electrodynamics: S-matrix theory, Wick's theorem, Feynman rules, a few simple processes such as Mott scattering and electron-electron scattering, and basic aspects of radiative corrections are discussed. X Preface to the First Edition The book is aimed at advanced students of physics and related disciplines, and it is hoped that some sections will also serve to augment the teaching material already available. This book stems from lectures given regularly by the author at the Technical University Munich. Many colleagues and coworkers assisted in the production and correction of the manuscript: Ms. I. Wefers, Ms. E. J6rg-Miiller, Ms. C. Schwierz, A. Vilfan, S. Clar, K. Schenk, M. Hummel, E. Wefers, B. Kaufmann, M. Bulenda, J. Wilhelm, K. Kroy, P. Maier, C. Feuchter, A. Wonhas. The problems were conceived with the help of E. Frey and W. Gasser. Dr. Gasser also read through the entire manuscript and made many valuable suggestions. I am indebted to Dr. A. Lahee for supplying the initial English version of this difficult text, and my special thanks go to Dr. Roginald Hilton for his perceptive revision that has ensured the fidelity of the final rendition. To all those mentioned here, and to the numerous other colleagues who gave their help so generously, as well as to Dr. Hans-Jiirgen K6lsch of Springer-Verlag, I wish to express my sincere gratitude. Munich, March 1999 F. Schwabl Table of Contents Part I. Nonrelativistic Many-Particle Systems 1. 2. Second Quantization. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1 Identical Particles, Many-Particle States, and Permutation Symmetry. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1.1 States and Observables of Identical Particles. . . . . . . . . 1.1.2 Examples....................................... 1.2 Completely Symmetric and Antisymmetric States .......... 1.3 Bosons................................................ 1.3.1 States, Fock Space, Creation and Annihilation Operators. . . . . . . . . . . . . . . . . . . . . . .. 1.3.2 The Particle-Number Operator. . . . . . . . . . . . . . . . . . . .. 1.3.3 General Single- and Many-Particle Operators. . . . . . .. 1.4 Fermions.............................................. 1.4.1 States, Fock Space, Creation and Annihilation Operators. . . . . . . . . . . . . . . . . . . . . . .. 1.4.2 Single- and Many-Particle Operators. . . . . . . . . . . . . . .. 1.5 Field Operators. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 1.5.1 Transformations Between Different Basis Systems .... 1.5.2 Field Operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 1.5.3 Field Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 1.6 Momentum Representation . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 1.6.1 Momentum Eigenfunctions and the Hamiltonian. . . . .. 1.6.2 Fourier Transformation of the Density .............. 1.6.3 The Inclusion of Spin . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Problems .................................................. Spin-l/2 Fermions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 2.1 Noninteracting Fermions ................................ 2.1.1 The Fermi Sphere, Excitations. . . . . . . . . . . . . . . . . . . .. 2.1.2 Single-Particle Correlation Function ................ 2.1.3 Pair Distribution Function. . . . . . . . . . . . . . . . . . . . . . . .. *2.1.4 Pair Distribution Function, Density Correlation Functions, and Structure Factor .. 3 3 3 6 8 10 10 13 14 16 16 19 20 20 21 23 25 25 27 27 29 33 33 33 35 36 39 XII Table of Contents Ground State Energy and Elementary Theory of the Electron Gas. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 2.2.1 Hamiltonian..................................... 2.2.2 Ground State Energy in the Hartree-Fock Approximation. . . . . . . . . . . . . . . .. 2.2.3 Modification of Electron Energy Levels due to the Coulomb Interaction . . . . . . . . . . . . . . . . . . .. 2.3 Hartree-Fock Equations for Atoms ....................... Problems .................................................. 2.2 3. Bosons................................................... 3.1 Free Bosons. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 3.1.1 Pair Distribution Function for Free Bosons .......... *3.1.2 Two-Particle States of Bosons. . . . . . . . . .. . . . . . . . . . .. 3.2 Weakly Interacting, Dilute Bose Gas. . . . . . . . . . . . . . . . . . . . .. 3.2.1 Quantum Fluids and Bose-Einstein Condensation. . .. 3.2.2 Bogoliubov Theory of the Weakly Interacting Bose Gas. . . . . . . . . . . . . . . .. *3.2.3 Superfluidity..................................... Problems .................................................. 4. Correlation Functions, Scattering, and Response. . . . . . . . .. 4.1 4.2 4.3 4.4 4.5 4.6 4.7 *4.8 Scattering and Response ................................ Density Matrix, Correlation Functions .................... Dynamical Susceptibility . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Dispersion Relations. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Spectral Representation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Fluctuation-Dissipation Theorem. . . . . . . . . . . . . . . . . . . . . . . .. Examples of Applications. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. Symmetry Properties ................................... 4.8.1 General Symmetry Relations. . . . . . . . . . . . . . . . . . . . . .. 4.8.2 Symmetry Properties of the Response Function for Hermitian Operators ........................... 4.9 Sum Rules ............................................. 4.9.1 General Structure of Sum Rules .................... 4.9.2 Application to the Excitations in He II .............. Problems .................................................. 41 41 42 46 49 52 55 55 55 57 60 60 62 69 72 75 75 82 85 88 90 91 92 98 98 101 106 106 108 109 Bibliography for Part I ....................................... 111 Table of Contents XIII Part II. Relativistic Wave Equations 5. 6. Relativistic Wave Equations and their Derivation . ..................................... 5.1 Introduction ........................................... 5.2 The Klein-Gordon Equation ............................. 5.2.1 Derivation by Means of the Correspondence Principle. 5.2.2 The Continuity Equation .......................... 5.2.3 Free Solutions of the Klein-Gordon Equation ........ 5.3 Dirac Equation ......................................... 5.3.1 Derivation of the Dirac Equation ................... 5.3.2 The Continuity Equation .......................... 5.3.3 Properties of the Dirac Matrices .................... 5.3.4 The Dirac Equation in Covariant Form .............. 5.3.5 Nonrelativistic Limit and Coupling to the Electromagnetic Field ....................... Problems .................................................. 115 115 116 116 119 120 120 120 122 123 123 125 130 Lorentz Transformations and Covariance of the Dirac Equation .................... 6.1 Lorentz Transformations ................................ 6.2 Lorentz Covariance of the Dirac Equation ................. 6.2.1 Lorentz Covariance and Transformation of Spinors .... 6.2.2 Determination of the Representation S(A) .......... 6.2.3 Further Properties of S ........................... 6.2.4 Transformation of Bilinear Forms ................... 6.2.5 Properties of the I Matrices ....................... 6.3 Solutions of the Dirac Equation for Free Particles ........... 6.3.1 Spinors with Finite Momentum .................... 6.3.2 Orthogonality Relations and Density ................ 6.3.3 Projection Operators ............................. Problems .................................................. 131 131 135 135 136 142 144 145 146 146 149 151 152 7. Orbital Angular Momentum and Spin .................... 7.1 Passive and Active Transformations ....................... 7.2 Rotations and Angular Momentum ....................... Problems .................................................. 155 155 156 159 8. The Coulomb Potential .. ................................. 8.1 Klein-Gordon Equation with Electromagnetic Field ......... 8.1.1 Coupling to the Electromagnetic Field .............. 8.1.2 Klein-Gordon Equation in a Coulomb Field ......... 8.2 Dirac Equation for the Coulomb Potential ................. Problems .................................................. 161 161 161 162 168 179 XIV 9. Table of Contents The Foldy-Wouthuysen Transformation and Relativistic Corrections .............................. 9.1 The Foldy-Wouthuysen Transformation ................... 9.1.1 Description of the Problem ........................ 9.1.2 Transformation for Free Particles ................... 9.1.3 Interaction with the Electromagnetic Field .......... 9.2 Relativistic Corrections and the Lamb Shift ................ 9.2.1 Relativistic Corrections ........................... 9.2.2 Estimate of the Lamb Shift ........................ Problems .................................................. 10. Physical Interpretation of the Solutions to the Dirac Equation .................... 10.1 Wave Packets and "Zitterbewegung" ...................... 10.1.1 Superposition of Positive Energy States ............. 10.1.2 The General Wave Packet ......................... *10.1.3 General Solution of the Free Dirac Equation in the Heisenberg Representation . . . . . . . . . . . . . . . . . . . *10.1.4 Potential Steps and the Klein Paradox .............. 10.2 The Hole Theory ....................................... Problems .................................................. 11. Symmetries and Further Properties of the Dirac Equation ..................................... *11.1 Active and Passive Transformations, Transformations of Vectors .............................. 11.2 Invariance and Conservation Laws ........................ 11.2.1 The General Transformation ....................... 11.2.2 Rotations ....................................... 11.2.3 Translations ..................................... 11.2.4 Spatial Reflection (Parity Transformation) ........... 11.3 Charge Conjugation .................................... 11.4 Time Reversal (Motion Reversal) ......................... 11.4.1 Reversal of Motion in Classical Physics .............. 11.4.2 Time Reversal in Quantum Mechanics .............. 11.4.3 Time-Reversal Invariance of the Dirac Equation ...... *11.4.4 Racah Time Reflection ............................ *11.5 Helicity ............................................... *11.6 Zero-Mass Fermions (Neutrinos) . . . . . . . . . . . . . . . . . . . . . . . . . . Problems .................................................. 181 181 181 182 183 187 187 189 193 195 195 196 197 200 202 204 207 209 209 212 212 212 213 213 214 217 218 221 229 235 236 239 244 Bibliography for Part II ...................................... 245 Table of Contents XV Part III. Relativistic Fields 12. Quantization of Relativistic Fields ........................ 12.1 Coupled Oscillators, the Linear Chain, Lattice Vibrations .... 12.1.1 Linear Chain of Coupled Oscillators ................ 12.1.2 Continuum Limit, Vibrating String ................. 12.1.3 Generalization to Three Dimensions, Relationship to the Klein-Gordon Field . . . . . . . . . . . . . 12.2 Classical Field Theory .................................. 12.2.1 Lagrangian and Euler-Lagrange Equations of Motion. 12.3 Canonical Quantization ................................. 12.4 Symmetries and Conservation Laws, Noether's Theorem ..... 12.4.1 The Energy-Momentum Tensor, Continuity Equations, and Conservation Laws. . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.4.2 Derivation from Noether's Theorem of the Conservation Laws for Four-Momentum, Angular Momentum, and Charge . . . . . . . . . . . . . . . . . . . Problems .................................................. 249 249 249 255 13. Free Fields ............................................... 13.1 The Real Klein-Gordon Field ............................ 13.1.1 The Lagrangian Density, Commutation Relations, and the Hamiltonian. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.1.2 Propagators ..................................... 13.2 The Complex Klein-Gordon Field ........................ 13.3 Quantization of the Dirac Field .......................... 13.3.1 Field Equations .................................. 13.3.2 Conserved Quantities ............................. 13.3.3 Quantization ..................................... 13.3.4 Charge .......................................... *13.3.5 The Infinite-Volume Limit ......................... 13.4 The Spin Statistics Theorem ............................. 13.4.1 Propagators and the Spin Statistics Theorem ........ 13.4.2 Further Properties of Anticommutators and Propagators of the Dirac Field ................. Problems .................................................. 277 277 301 303 14. Quantization of the Radiation Field ...................... 14.1 Classical Electrodynamics ............................... 14.1.1 Maxwell Equations ............................... 14.1.2 Gauge Transformations ........................... 14.2 The Coulomb Gauge .................................... 14.3 The Lagrangian Density for the Electromagnetic Field ...... 14.4 The Free Electromagnatic Field and its Quantization ....... 307 307 307 309 309 311 312 258 261 261 266 266 266 268 275 277 281 285 287 287 289 290 293 295 296 296 XVI Table of Contents 14.5 Calculation of the Photon Propagator ..................... 316 Problems .................................................. 320 15. Interacting Fields, Quantum Electrodynamics . ........... 15.1 Lagrangians, Interacting Fields ........................... 15.1.1 Nonlinear Lagrangians ............................ 15.1.2 Fermions in an External Field ...................... 15.1.3 Interaction of Electrons with the Radiation Field: Quantum Electrodynamics (QED) .................. 15.2 The Interaction Representation, Perturbation Theory ....... 15.2.1 The Interaction Representation (Dirac Representation) 15.2.2 Perturbation Theory .............................. 15.3 The S Matrix .......................................... 15.3.1 General Formulation .............................. 15.3.2 Simple Transitions ................................ * 15.4 Wick's Theorem ........................................ 15.5 Simple Scattering Processes, Feynman Diagrams ........... 15.5.1 The First-Order Term ............................. 15.5.2 Mott Scattering .................................. 15.5.3 Second-Order Processes ........................... 15.5.4 Feynman Rules of Quantum Electrodynamics ........ * 15.6 Radiative Corrections ................................... 15.6.1 The Self-Energy of the Electron .................... 15.6.2 Self-Energy of the Photon, Vacuum Polarization ...... 15.6.3 Vertex Corrections ............................... 15.6.4 The Ward Identity and Charge Renormalization ...... 15.6.5 Anomalous Magnetic Moment of the Electron ........ Problems .................................................. 321 321 321 322 322 323 324 327 328 328 332 335 339 339 341 346 356 358 359 365 366 368 371 373 Bibliography for Part III ..................................... 375 Appendix . .................................................... A Alternative Derivation of the Dirac Equation. . . . . . . . . . . . . . . B Dirac Matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . B.1 Standard Representation .......................... B.2 Chiral Representation ............................. B.3 Majorana Representations ......................... C Projection Operators for the Spin ........................ C.1 Definition ....................................... C.2 Rest Frame ...................................... C.3 General Significance of the Projection Operator P(n) . D The Path-Integral Representation of Quantum Mechanics .... E Covariant Quantization of the Electromagnetic Field, the Gupta-Bleuler Method .............................. E.1 Quantization and the Feynman Propagator . . . . . . . . . . 377 377 379 379 379 380 380 380 380 381 385 387 387 Table of Contents XVII E.2 F The Physical Significance of Longitudinal and Scalar Photons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . E.3 The Feynman Photon Propagator .................. E.4 Conserved Quantities ............................. Coupling of Charged Scalar Mesons to the Electromagnetic Field . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 389 392 393 394 Index ......................................................... 397 Part I Nonrelativistic Many-Particle Systems 1. Second Quantization In this first part, we shall consider nonrelativistic systems consisting of a large number of identical particles. In order to treat these, we will introduce a particularly efficient formalism, namely, the method of second quantization. Nature has given us two types of particle, bosons and fermions. These have states that are, respectively, completely symmetric and completely antisymmetric. Fermions possess half-integer spin values, whereas boson spins have integer values. This connection between spin and symmetry (statistics) is proved within relativistic quantum field theory (the spin-statistics theorem). An important consequence in many-particle physics is the existence of Fermi-Dirac statistics and Bose-Einstein statistics. We shall begin in Sect. 1.1 with some preliminary remarks which follow on from Chap. 13 of Quantum Mechanics l . For the later sections, only the first part, Sect. 1.1.1, is essential. 1.1 Identical Particles, Many-Particle States, and Permutation Symmetry 1.1.1 States and Observables of Identical Particles We consider N identical particles (e.g., electrons, 'IT mesons). The Hamiltonian H=H(1,2, ... ,N) (1.1.1) is symmetric in the variables 1, 2, ... ,N. Here 1 == xl, 0"1 denotes the position and spin degrees of freedom of particle 1 and correspondingly for the other particles. Similarly, we write a wave function in the form 'IjJ = 'IjJ(1, 2, ... ,N). (1.1.2) The permutation operator Pij , which interchanges i and j, has the following effect on an arbitrary N-particle wave function 1 F. Schwabl, Quantum Mechanics, 3 rd ed., Springer, Berlin Heidelberg, 2002; in subsequent citations this book will be referred to as QM I. 4 1. Second Quantization P ij ,¢( ... ,i, ... ,j, ... )=,¢( ... ,j, ... ,i, ... ). (1.1.3) We remind the reader of a few important properties of this operator. Since Pi~ = 1, the eigenvalues of P ij are ±1. Due to the symmetry of the Hamiltonian, one has for every element P of the permutation group PH = (1.1.4) HP. The permutation group 8N which consists of all permutations of N objects has N! elements. Every permutation P can be represented as a product of transpositions P ij . An element is said to be even (odd) when the number of Pij's is even (odd).2 A few properties: (i) If '¢(1, ... , N) is an eigenfunction of H with eigenvalue E, then the same also holds true for P'¢ (1, . .. , N). Proof. H'¢ = E'¢ =} HP,¢ = PH'¢ = EP,¢ . (ii) For every permutation one has (cpl'¢) = (PcpIP,¢) , (1.1.5) as follows by renaming the integration variables. (iii) The adjoint permutation operator pt is defined as usual by (cpIP,¢) = (ptcpl,¢) . It follows from this that and thus P is unitary ptp = ppt = 1. (1.1.6) (iv) For every symmetric operator 8(1, ... , N) we have (1.1. 7) [P,8] =0 and (P'¢il 8 IP'¢j) = ('¢il pt 8P I'¢j) = ('¢il pt P8 I'¢j) = ('¢il 8 I'¢j) . (1.1.8) This proves that the matrix elements of symmetric operators are the same in the states '¢i and in the permutated states P'¢i. 2 It is well known that every permutation can be represented as a product of cycles that have no element in common, e.g., (124)(35). Every cycle can be written as a product of transpositions, e.g. H24 (12) == (124) = (14)(12) odd even Each cycle is carried out from left to right (1 -+ 2,2 -+ 4,4 -+ 1), whereas the products of cycles are applied from right to left. 1.1 Identical Particles, Many-Particle States, and Permutation Symmetry 5 (v) The converse of (iv) is also true. The requirement that an exchange of identical particles should not have any observable consequences implies that all observables 0 must be symmetric, i.e., permutation invariant. Proof. ('l/JIO I'l/J) = (P'l/JI 0 IP'l/J) = ('l/JI ptOP I'l/J) holds for arbitrary 'l/J. Thus, ptOP = 0 and, hence, PO = OP. Since identical particles are all influenced identically by any physical process, all physical operators must be symmetric. Hence, the states 'l/J and P'l/J are experimentally indistinguishable. The question arises as to whether all these N! states are realized in nature. In fact, the totally symmetric and totally antisymmetric states 'l/Js and'l/Ja do play a special role. These states are defined by (1.1.9) PiJ°'l/Js( ... ,i, ... ,j, ... )=±'l/Js( ... ,i, ... ,j, ... ) a a for all Pij . It is an experimental fact that there are two types of particle, bosons and jermions, whose states are totally symmetric and totally antisymmetric, respectively. As mentioned at the outset, bosons have integral, and fermions half-integral spin. Remarks: (i) The symmetry character of a state does not change in the course of time: 'l/J(t) = Te -* t J dt' H(t') 0 'l/J(O) '* P'l/J(t) = Te -* t J dt' H(t') 0 P'l/J(O) , (1.1.10) where T is the time-ordering operator. 3 (ii) For arbitrary permutations P, the states introduced in (1.1.9) satisfy (1.1.11) P'l/J = (-l)P'l/J a with (-l)P = { a , 1 for even permuta~ions -1 for odd permutatIOns. Thus, the states 'l/Js and'l/Ja form the basis of two one-dimensional representations of the permutation group SN. For 'l/Js, every P is assigned the number 1, and for 'l/Ja every even (odd) element is assigned the number 1 ( -1). Since, in the case of three or more particles, the Pij do not all commute with one another, there are, in addition to 'l/Js and 'l/Ja, also states for which not all Pij are diagonal. Due to noncommutativity, a complete set of common eigenfunctions of all Pij cannot exist. These states are basis functions of higher-dimensional representations of the permutation group. These states are not realized in nature; they are referred to 3 QM I, Chap. 16. 6 1. Second Quantization as parasymmetric states. 4 . The fictitious particles that are described by these states are known as paraparticles and are said to obey parastatistics. 1.1.2 Examples (i) Two particles Let 1/!(1, 2) be an arbitrary wave function. The permutation P12 leads to H21/!(1, 2) = 1/!(2, 1). From these two wave functions one can form 1/!s = 1/!(1, 2) + 1/!(2, 1) symmetric 1/!a = 1/!(1, 2) -1/!(2, 1) antisymmetric (1.1.12) under the operation P12. For two particles, the symmetric and antisymmetric states exhaust all possibilities. (ii) Three particles We consider the example of a wave function that is a function only of the spatial coordinates Application of the permutation H23 yields P123 1/!(Xl,X2,X3) = 1/!(X2,X3,xd, i.e., particle 1 is replaced by particle 2, particle 2 by particle 3, and part i2( 2 2)2 2( 2 2)2 cle 3 by particle 1, e.g., 1/!(1,2,3) = e- x, X2- X 3 ,P12 1/!(1,2,3) = e-x2 X , -X3 , 2( 2 2)2 P 123 1/!(1, 2, 3) = e - x2 x 3 - x , • We consider H3P121/!(1, 2, 3) = H31/!(2, 1,3) = 1/!(2, 3,1) = P123 1/!(1, 2, 3) H 2P13 1/!(1, 2, 3) = H21/!(3, 2,1) = 1/!(3, 1,2) = P132 1/!(1, 2, 3) (P123 )21/!(1, 2, 3) = P 123 1/!(2, 3,1) = 1/!(3, 1, 2) = P132 1/!(1, 2, 3). Clearly, H3H2 =f. P 12 P 13 . 83, the permutation group for three objects, consists of the following 3! ments: = 6 ele- (1.1.13) We now consider the effect of a permutation P on a ket vector. Thus far we have only allowed P to act on spatial wave functions or inside scalar products which lead to integrals over products of spatial wave functions. Let us assume that we have the state direct product 11/!)= 4 L ..--.-... IXl)1Ix2)2I x3)31/!(Xl,X2,X3) (1.1.14) A.M.L. Messiah and O.W. Greenberg, Phys. Rev. B 136, 248 (1964), B 138, 1155 (1965). 1.1 Identical Particles, Many-Particle States, and Permutation Symmetry 7 with 1/i(XI,X2,X3) = (xIII (x212 (x31311/i)· In IXi)j the particle is labeled by the number j and the spatial coordinate is Xi. The effect of P123, for example, is defined as follows: P I 2311/i) = L IX I)2Ix2)3I x3)11/i(xI,x2,x3). In the second line the basis vectors of the three particles in the direct product are once more written in the usual order, 1,2,3. We can now rename the summation variables according to (Xl, X2, X3) -+ H23 (Xl, X2, X3) = (X2, X3, Xl). From this, it follows that If the state 11/i) has the wave function 1/i(XI,X2,X3), then P I'I/J) has the wave function P1/i(XI,X2,X3). The particles are exchanged under the permutation. Finally, we discuss the basis vectors for three particles: If we start from the state In) LB) b) and apply the elements of the group 8 3 , we get the six states In) 1{3) b) P 12 1a) 1{3) b) = 1{3) la) b) , P 23 la) 1m b) = In) Ii') 1m , P31 la) 1{3) b) = b) 1m la) , H231a)1 1{3)2 b)3 = la)2Im 3 1')')1 = I')') la) 1{3) , P 132 la) 1{3) b) = 1{3) I')') la) . (1.1.15) Except in the fourth line, the indices for the particle number are not written out, but are determined by the position within the product (particle 1 is the first factor, etc.). It is the particles that are permutated, not the arguments of the states. If we assume that n, {3, and')' are all different, then the same is true of the six states given in (1.1.15). One can group and combine these in the following way to yield invariant su bspaces 5: Invariant subspaces: Basis 1 (symmetric basis): 1 v'6 (Ia) 1{3) b) + 1{3) la) b) + la) I')') 1m + b) 1{3) la) + I')') la) 1m + 1m b) In)) (1.1.16a) Basis 2 (antisymmetric basis): 1 v'6 (Ia) 1{3) b) - 1m la) b) - la) b) 1{3) - b) 1{3) la) + I')') la) 1m + 1m b) In)) (1.1.16b) 5 An invariant subspace is a subspace of states which transforms into itself on application of the group elements. 8 1. Second Quantization Basis 3: vb (2Ia) L8) b) + 21,8) la) I'Y) - { - I'Y) la) 1,8) - 1,8) I'Y) la)) ~ (0 + 0 - la) b) 1,8) + I'Y) 1,8) la) la) I'Y) 1,8) - b) 1,8) la) (1.1.16c) + I'Y) la) 1,8) - 1,8) b) la)) Basis 4: { ~(O + 0 -Ia) I'Y) 1,8) + I'Y) 1,8) la) -I'Y) la) 1,8) + 1,8) I'Y) la)) 21,8) la) b) + la) b) 1,8) + b) 1,8) la) vb (2Ia) 1,8) I'Y) - - I'Y) la) 1,8) - 1,8) b) la)) (1.1.16d) . In the bases 3 and 4, the first of the two functions in each case is even under H2 and the second is odd under P12 (immediately below we shall call these two functions IWl) and IW2)). Other operations give rise to a linear combination of the two functions: (1.1.17a) (1.1.17b) with coefficients aij. In matrix form, (1.1.17b) can be written as (1.1.17c) The elements P 12 and P13 are thus represented by 2 x 2 matrices P12 = (1 0) 0 -1 P ,13 = (au a 12 ) a21 a22 . (1.1.18) This fact implies that the basis vectors IWl) and IW2) span a two-dimensional representation of the permutation group 83. The explicit calculation will be carried out in Problem 1.2. 1.2 Completely Symmetric and Antisymmetric States We begin with the single-particle states Ii): 11), 12), .... The single-particle states of the particles 1, 2, ... , a, ... , N are denoted by li)I' lib···, Ii)"" ... , Ii) N' These enable us to write the basis states of the N-particle system (1.2.1) where particle 1 is in state lil)1 and particle a in state Ii",)"" etc. (The subscript outside the ket is the number labeling the particle, and the index within the ket identifies the state of this particle.) Provided that the {Ii)} form a complete orthonormal set, the product states defined above likewise represent a complete orthonormal system in the 1.2 Completely Symmetric and Antisymmetric States 9 space of N -particle states. The symmetrized and antisymmetrized basis states are then defined by (1.2.2) In other words, we apply all N! elements of the permutation group SN of N objects and, for fermions, we multiply by ( -1) when P is an odd permutation. The states defined in (1.2.2) are of two types: completely symmetric and completely antisymmetric. Remarks regarding the properties of S± == k Lp(±1)PP: (i) Let SN be the permutation group (or symmetric group) of N quantities. Assertion: For every element PESN, one has P SN = SN. Proof. The set PSN contains exactly the same number of elements as SN and these, due to the group property, are all contained in SN. Furthermore, the elements of PSN are all different since, if one had PH = PP2 , then, after multiplication by p-l, it would follow that H = P2. Thus (1.2.3) (ii) It follows from this that (1.2.4a) PS+ = S+P= S+ and (1.2.4b) If P is even, then even elements remain even and odd ones remain odd. If P is odd, then multiplication by P changes even into odd elements and vice versa. = S+ IiI, ... ,iN) PS_lil' ... ,iN) = (-1)PS_li l , ... ,iN) Special case PijS-lil, ... ,iN) = -S_li l , ... PS+ IiI, ... ,iN) ,iN) (iii) If IiI, . . . , iN) contains single-particle states occurring more than once, then S+ IiI, ... , iN) is no longer normalized to unity. Let us assume that the first state occurs nl times, the second n2 times, etc. Since S+ IiI, ... , iN) contains a total of N! terms, of which ----f!J-r-are different, each of these nl·n2···· terms occurs with a multiplicity of nl !n2! .... . . t . . " 2 N! ,(nl.n2 .... ) " =nl!n2!··. N. nl.n2 .... __ 1 (Zl, ... ,zNIS+S+IZl, ... ,ZN) - 10 1. Second Quantization Thus, the normalized Bose basis functions are (iv) A further property of S± is (1.2.6a) k k since Si = Lp(±l)PPS± = Lp S± = ..;Nfs±. We now consider an arbitrary N-particle state, which we expand in the basis li 1 ) ... liN) Application of S± yields and further application of kS±, with the identity (1.2.6a), results in (1.2.6b) Equation (1.2.6b) implies that every symmetrized state can be expanded in terms of the symmetrized basis states (1.2.2). 1.3 Bosons 1.3.1 States, Fock Space, Creation and Annihilation Operators The state (1.2.5) is fully characterized by specifying the occupation numbers (1.3.1) Here, nl is the number of times that the state 1 occurs, n2 the number of times that state 2 occurs, . " . Alternatively: nl is the number of particles in state 1, n2 is the number of particles in state 2, .... The sum of all occupation numbers ni must be equal to the total number of particles: 00 (1.3.2) 1.3 Bosons 11 Apart from this constraint, the ni can take any of the values 0,1,2, .... The factor (nl!n2! ... )-1/2, together with the factor I/VN! contained in S+, has the effect of normalizing Inl' n2,"') (see point (iii)). These states form a complete set of completely symmetric N-particle states. By linear superposition, one can construct from these any desired symmetric N-particle state. We now combine the states for N = 0,1,2, ... and obtain a complete orthonormal system of states for arbitrary particle number, which satisfy the orthogonality relation 6 (1.3.3a) and the completeness relation (1.3.3b) nl,n2,··· This extended space is the direct sum of the space with no particles (vacuum state 10)), the space with one particle, the space with two particles, etc.; it is known as Fock space. The operators we have considered so far act only within a subspace of fixed particle number. On applying p, x etc. to an N-particle state, we obtain again an N-particle state. We now define creation and annihilation operators, which lead from the space of N-particle states to the spaces of N ± I-particle states: (1.3.4) Taking the adjoint of this equation and relabeling ni ---t n/, we have (... ,ni', .. ·1 ai = J ni' + 1 (... ,n/ + 1, .. ·1 . Multiplying this equation by I... (1.3.5) ,ni,"') yields (... ,n/,···I ai I··· ,ni,"') = .,;n;, on,'+I,ni . Expressed in words, the operator ai reduces the occupation number by 1. Assertion: .,;n;, I· .. ,ni - ai I· .. ,ni,"') ai I... ,ni = 0, ... ) = = 1, ... ) for ni ~ 1 (1.3.6) and 6 °. In the states Inl, n2, ... ), the nl, n2 etc. are arbitrary natural numbers whose sum is not constrained. The (vanishing) scalar product between states of differing particle number is defined by (1.3.3a). 12 1. Second Quantization Proof: 00 ai I·.. ,ni, ... ) = L I··· ,n/, ... )( ... , n/, .. ·1 ai I· .. ,ni,"') ni'=O = L I··· , n/, ... ) ,;n; 6n i'+1,ni n1,'=O = { ,;n; I.. · ,ni - o 1, ... ) for ni ~ 1 for ni - 0 The operator at increases the occupation number of the state Ii) by 1, and the operator ai reduces it by 1. The operators at and ai are thus called creation and annihilation operators. The above relations and the completeness of the states yield the Bose commutation relations (1.3.7a,b,c) Proof. It is clear that (1.3.7a) holds for i i =I j, = j, since ai commutes with itself. For it follows from (1.3.6) that aiaj I... ,ni, ... ,nj, ... ) = y!n:i.jnjl·.· ,ni -1, ... ,n) -1, ... ) = ajai I... ,ni,.·. ,njl"') which proves (1.3.7a) and, by taking the hermitian conjugate, also (1.3.7b). For j =I i we have ai a} I... ,ni, ... ,nj, ... ) = y!n:i.jnj+ll ... ,ni-l, ... ,nj+1, ... ) := a}ai I ... ,ni, ... ,nj, ... ) and (aia; -a;ai) I... ,ni, .. · ,nj, ... ) = (Vni + 1vni + 1 - y!n:iy!n:i) I... , ni, ... , nj, ... ) hence also proving (1.3.7c). Starting from the ground state == vacuum state 10) == 10,0, ... ) , which contains no particles at all, we can construct all states: single-particle states at 10) , ... , two-particle states (1.3.8) 1.3 Bosons 13 and the general many-particle state (1.3.9) Normalization: (1.3.10) 1 In) = -at In-1) ..;n 1.3.2 The Particle-Number Operator The particle-number operator (occupation-number operator for the state Ii) ) is defined by (1.3.11) The states introduced above are eigenfunctions of ni: (1.3.12) and the corresponding eigenvalue of ni is the number of particles in the state i. The operator for the total number of particles is given by (1.3.13) Applying this operator to the states I... ,ni, ... ) yields (1.3.14) Assuming that the particles do not interact with one another and, furthermore, that the states Ii) are the eigenstates of the single-particle Hamiltonian with eigenvalues fi' the full Hamiltonian can be written as (1.3.15a) (1.3.15b) The commutation relations and the properties of the particle-number operator are analogous to those of the harmonic oscillator. 14 1. Second Quantization 1.3.3 General Single- and Many-Particle Operators Let us consider an operator for the N-particle system which is a sum of single-particle operators (1.3.16) e.g., for the kinetic energy ta; = p:;,/2m, and for the potential V(xa;). For one particle, the single-particle operator is t. Its matrix elements in the basis Ii) are (1.3.17) such that t = 2: tij (1.3.18) Ii) (jl i,j and for the full N-particle system N T = 2: t 2: li)a; (jla;· (1.3.19) ij i,j n=l Our aim is to represent this operator in terms of creation and annihilation operators. We begin by taking a pair of states i,j from (1.3.19) and calculating their effect on an arbitrary state (1.3.1). We assume initially that j i=- i (1.3.20) It is possible, as was done in the third line, to bring the S+ to the front, since it commutes with every symmetric operator. If the state j is nj-fold occupied, it gives rise to nj terms in which Ij) is replaced by Ii). Hence, the effect of S+ is to yield nj states I... ,ni + 1, ... ,nj - 1, ... ), where the change in the normalization should be noted. Equation (1.3.20) thus leads on to = nj'Jni 1 + 1--1··· = yIn;,,;'ni yin; + 11 ... ,ni ,ni + 1, ... + 1, ... =a!aj I... ,ni,··· ,nj, ... ). ,nj -1, ... ) ,nj - 1, ... ) (1.3.20') 1.3 Bosons = i, For j 15 the i is replaced ni times by itself, thus yielding Thus, for any N, we have N L li)a Ula = a!aj. a=l From this it follows that, for any single-particle operator, T =L (1.3.21 ) tija!aj , i,j where tij = (il t IJ) . (1.3.22) The special case tij = EiOij leads to Ho = L Eia!ai , i i.e., to (1.3.15a). In a similar way one can show that two-particle operators (1.3.23) can be written in the form (1.3.24) where (i,jli2 ) Ik,m) = JJ dx dY'P;(x)'Pj(y)f(2)(x,Y)'Pk(X)'Pm(Y). (1.3.25) In (1.3.23), the condition a =I- (3 is required as, otherwise, we would have only a single-particle operator. The factor ~ in (1.3.23) is to ensure that each interaction is included only once since, for identical particles, symmetry implies that f(2)(x a ,x(3) = f(2)(X(3,X a ). Proof of (1.3.24). One first expresses F in the form F =~ L L a#(3 i,j,k,m (i,jl f(2) Ik, m) li)a Ij)(3 (kl a (ml(3· 16 1. Second Quantization We now investigate the action of one term of the sum constituting F: Lli)alj),e(kla(ml,el ... ,ni, ... ,nj, ... ,nk, ... ,nm, ... ) ai-,e = nknm I ... 1 .jii:k,.;n;;; ..jni + IJnj + 1 nk nm ,ni+ 1, ... ,nj+1, ... ,nk-1, ... ,nm -1, ... ) = a!ajakam I··· , ni,··· , nj, ... , nk,··· , nm, ... ) . Here, we have assumed that the states are different. If the states are identical, the derivation has to be supplemented in a similar way to that for the singleparticle operators. A somewhat shorter derivation, and one which also covers the case of fermions, proceeds as follows: The commutator and anticommutator for bosons and fermions, respectively, are combined in the form [ak, aj]'f = Okj. --aiakajam-ai t t t [ak,aj'fam t] --..-.' akaj'fajak = ±a!ajakam = a!ajamak , (1.3.26) £ bosons or fermions. This completes the proof of the form (1.3.24). 1.4 Fermions 1.4.1 States, Fock Space, Creation and Annihilation Operators For fermions, one needs to consider the states S_ lil, i2, ... , iN) defined in (1.2.2), which can also be represented in the form of a determinant: (1.4.1) 1.4 Fermions 17 The determinants of one-particle states are called Slater determinants. If any of the single-particle states in (1.4.1) are the same, the result is zero. This is a statement of the Pauli principle: two identical fermions must not occupy the same state. On the other hand, when all the ia are different, then this antisymmetrized state is normalized to 1. In addition, we have (1.4.2) This dependence on the order is a general property of determinants. Here, too, we shall characterize the states by specifying their occupation numbers, which can now take the values and 1. The state with nl particles in state 1 and n2 particles in state 2, etc., is ° The state in which there are no particles is the vacuum state, represented by 10) = 10,0, ... ) . This state must not be confused with the null vector! We combine these states (vacuum state, single-particle states, two-particle states, ... ) to give a state space. In other words, we form the direct sum of the state spaces for the various fixed particle numbers. For fermions, this space is once again known as Fock space. In this state space a scalar product is defined as follows: (1.4.3a) i.e., for states with equal particle number (from a single subspace), it is identical to the previous scalar product, and for states from different subspaces it always vanishes. Furthermore, we have the completeness relation 1 1 L L .. ·Inl' n2, ... ) (nll n2,···1 = 11. . (1.4.3b) nl=O n2=O at Here, we wish to introduce creation operators once again. These must be defined such that the result of applying them twice is zero. Furthermore, the order in which they are applied must play a role. We thus define the creation operators by at (1.4.4) Since these states are equal except in sign, the anticommutator is {ai,aj} =0, (1.4.5a) 18 1. Second Quantization which also implies the impossibility of double occupation (1.4.5b) The anticommutator encountered in (1.4.5a) and the commutator of two operators A and B are defined by {A,B} [A,B] == == == AB+BA == AB - BA. [A,B]+ [A,B]_ (1.4.6) Given these preliminaries, we can now address the precise formulation. If one wants to characterize the states by means of occupation numbers, one has to choose a particular ordering of the states. This is arbitrary but, once chosen, must be adhered to. The states are then represented as (1.4.7) The effect of the operator aJ must be aJ I··· ,ni, ... ) = (1- ni)( _1)E j <, n;I··· ,ni + 1, ... ). (1.4.8) The number of particles is increased by 1, but for a state that is already occupied, the factor (1- ni) yields zero. The phase factor corresponds to the number of anticommutations necessary to bring the aJ to the position i. The adjoint relation reads: (1.4.9) This yields the matrix element (... ,ni, ... 1 ai I... ,n/, ... ) = (1 - ni)( -l)L;<, n; bn ,+l,n,'. (1.4.10) We now calculate ail··· ,n/, ... ) = Llni) (nilail n /) n, (1.4.11) ni = (2 - n/)( -l)Lj<i nj I... ,n/ - 1, ... ) n/. Here, we have introduced the factor n/, since, for n/ = 0, the Kronecker delta bni +l,ni' = 0 always gives zero. The factor n/ also ensures that the right-hand side cannot become equal to the state I... ,n/ -1, ... ) = I... ,-1, ... ). To summarize, the effects of the creation and annihilation operators are I··· ,ni, ... ) = ai I... ,ni, ... ) = aJ (1 - ni)( -l)L;<, n; ni( -l)Lj<, n; I... I... ,ni + 1, ... ) ,ni -1, ... ). (1.4.12) 1.4 Fermions 19 It follows from this that aia!I ... ,ni,"') = = (1-ni)(-1)2LJ<,nJ(ni+1)1 .. · ,ni,"') (1 - ni) I.. · ,ni,"') (l.4.13a) a;ail .. · ,ni,"') =ni(-1)2LJ<,nJ(1-ni+1)1··· ,ni,"') == ni I· . . ,ni,' .. ) (l.4.13b) , nr since for ni E {O, I} we have = ni and (_1)2LJ<' nj = l. On account of the property (l.4.13b) one can regard a; ai as the occupation-number operator for the state Ii). By taking the sum of (l.4.13a,b), one obtains the anticommutator In the anticommutator [ai, a}l+ with i is different: =1= j, the phase factor of the two terms Likewise, [ai, aj 1+ for i =1= j, also has different phase factors in the two summands and, since aiai I... ,ni,"') ex ni(ni -1) = 0, one obtains the following anticommutation rules for fermions: (l.4.14) 1.4.2 Single- and Many-Particle Operators For fermions, too, the operators can be expressed in terms of creation and annihilation operators. The form is exactly the same as for bosons, (l.3.21) and (l.3.24). Now, however, one has to pay special attention to the order of the creation and annihilation operators. The important relation (1.4.15) from which, according to (1.3.26), one also obtains two-particle (and many-particle) operators, can be proved as follows: Given the state S_ Iii, i2, ... ,iN), we assume, without loss of generality, the arrangement to be i 1 < i2 < ... < iN. Application of the left-hand side of (1.4.15) gives L li)a (jla S-li l,i2, ... ,iN) = S_ L li)a (jln li ,i2, ... ,iN) 1 The symbol Ij-+i implies that the state Ij) is replaced by Ii). In order to bring the i into the right position, one has to carry out L:k<j nk + L:k<i nk permutations of rows for i :::; j and L:k<j nk + L:k<i nk - 1 permutations for i > j. 1. Second Quantization 20 This yields the same phase factor as does the right-hand side of (1.4.15): In summary, for bosons and fermions, the single- and two-particle operators can be written, respectively, as (1.4.16a) i,j (1.4.16b) where the operators ai obey the commutation relations (1.3.7) for bosons and, for fermions, the anticommutation relations (1.4.14). The Hamiltonian of a many-particle system with kinetic energy T, potential energy U and a two-particle interaction 1(2) has the form H = ~)tij + Uij)a!aj + ~ L (i,jl 1(2) Ik,m) a!a}amak , (1.4.16c) i,j,k,m i,j where the matrix elements are defined in (1.3.21, 1.3.22, 1.3.25) and, for fermions, particular attention must be paid to the order of the two annihilation operators in the two-particle operator. From this point on, the development of the theory can be presented simultaneously for bosons and fermions. 1.5 Field Operators 1.5.1 Transformations Between Different Basis Systems Consider two basis systems {Ii)} and {IA)}. What is the relationship between the operators ai and a>..? The state IA) can be expanded in the basis {Ii)}: IA) = L Ii) (iIA) . (1.5.1) The operator a! creates particles in the state Ii). Hence, the superposition Li(iIA) a! yields one particle in the state IA). This leads to the relation al = L (iIA) a! (1.5.2a) 1.5 Field Operators 21 with the adjoint (1.5.2b) The position eigenstates Ix) represent an important special case (xli) = 'Pi(X), (1.5.3) where 'Pi(X) is the single-particle wave function in the coordinate representation. The creation and annihilation operators corresponding to the eigenstates of position are called field operators. 1.5.2 Field Operators The field operators are defined by 'ljJ(x) = L 'Pi (x)ai (1.5.4a) 'ljJt(x) = L 'P;(x)a! . (1.5.4b) The operator 'ljJt(x) ('ljJ(x)) creates (annihilates) a particle in the position eigenstate Ix), i.e., at the position x. The field operators obey the following commutation relations: ['ljJ(x) , 'ljJ(x')l± 0, (1.5.5a) ['ljJt (x), 'ljJt (x')l± = 0 , (1.5.5b) = ['ljJ(x) , 'ljJt(x')l± = L 'Pi (x)'Pj (x')[ai, a}l± (1.5.5c) i,j = L'Pi(X)'Pj(X')Oij = O(3)(X ~ x') , i,j where the upper sign applies to fermions and the lower one to bosons. We shall now express a few important operators in terms of the field operators. Kinetic energy 7 L:a!Tijaj = 'l,) (~; V 2 ) 'Pj(x)aj 'l,) = 7 L:jd3xa!'P;(x) ~2m jd 3XV'ljJt(x)V'ljJ(x) (1.5.6a) The second line in (1.5.6a) holds when the wave function on which the operator acts decreases sufficiently fast at infinity that one can neglect the surface contribution to the partial integration. 22 1. Second Quantization Single-particle potential I:a!Uijaj = 1,J I: Jd xa!cp;(x)U(x)cpj(x)aj 3 'I-,J = J d3 x U(x)'I/} (x)'ljJ(x) (1.5.6b) Two-particle interaction or any two-particle operator ~ J L d3 xd 3 x' cP; (x)cp; (x')V(x, X')CPk (x)CPm (x')a!a}amak i,j,k,m = J ~ d3 xd3 x' V(x, x')'ljJt (x)'ljJt (x')'lj;(x')'ljJ(x) (1.5.6c) Hamiltonian H = J d3 x ( : : V'ljJt(x)V'lj;(x) ~ + U(X)'ljJt(x)'lj;(x)) + J d3 xd3 x' 'lj;t (x)'lj;t (x')V(x, x') 'ljJ (x') 'lj; (x) (1.5.6d) Particle density (particle-number density) The particle-density operator is given by (1.5.7) Hence its representation in terms of creation and annihilation operators is n(x) = I: a!aj Jd ycP; (y)8(3)(x - y)cpj(Y) 3 ',J = L al ajcp; (x)cpj (x). (1.5.8) i,j This representation is valid in any basis and can also be expressed in terms of the field operators n(x) = 'ljJt(x)'lj;(x). (1.5.9) Total-particle-number operator (1.5.10) Formally, at least, the particle-density operator (1.5.9) of the manyparticle system looks like the probability density of a particle in the state 'lj; (x). However, the analogy is no more than a formal one since the former is an operator and the latter a complex function. This formal correspondence has given rise to the term second quantization, since the operators, in the creation and annihilation operator formalism, can be obtained by replacing the 1.5 Field Operators 23 wave function 'ljJ(x) in the single-particle densities by the operator 'ljJ(x). This immediately enables one to write down, e.g., the current-density operator (see Problem 1.6) j(x) = -2~ 1m ['ljJt(x)V'ljJ(x) - (V'ljJt(x))'ljJ(x)] . (1.5.11) The kinetic energy (1.5.12) has a formal similarity to the expectation value of the kinetic energy of a single particle, where, however, the wavefunction is replaced by the field operator. Remark. The representations of the operators in terms of field operators that we found above could also have been obtained directly. For example, for the particlenumber density (1.5.12) e where is the position operator of a single particle and where we have made use of the fact that the matrix element within the integral is equal to 8(3) (x-e)8(3) (e -e'). In general, for a k-particle operator Vk: / d3 6 ... d3 ekd3 6' ... d3 ek' 'ljJt (el) ... 'ljJt (ek) <ele2'" eki Vk le~; ... e~) 'IjJ(e~)· 'IjJ(e~). (1.5.13) 1.5.3 Field Equations The equations of motion of the field operators 'ljJ(x, t) in the Heisenberg rep- resentation 'ljJ(x, t) = eiHt / 1i 'ljJ(x, 0) e- iHt / 1i (1.5.14) read, for the Hamiltonian (1.5.6d), in! 'ljJ(x, t) = (-:;: V2 + U(x)) 'ljJ(x, t) + + J d3 x' 'ljJt (x', t)V(x, x')'ljJ(x', t)'ljJ(x, t). (1.5.15) The structure is that of a nonlinear Schrodinger equation, another reason for using the expression "second quantization" . Proof: One starts from the Heisenberg equation of motion in :t 'ljJ(x, t) = -[H, 'ljJ(x, t)] = _eiHt / 1i [H, 'ljJ(x, 0)] e- iHt / 1i • (1.5.16) Using the relation [AB, C]- = A[B, G]± =f [A, G]±B Fermi Bose ' (1.5.17) 24 1. Second Quantization one obtains for the commutators with the kinetic energy: 3 / d x' ~[V'Ij;t(x),l 2m = /d 3 x' ~(_V'O3)x - x)· V''Ij;(x')) = 2m the potential energy: ~V27j;(X) 2m , / d3 x' U(x')['Ij; t (x')'Ij;(x'), 'Ij;(x)] = / d3 x' U(x') (_0(3) (x' - x) 'Ij; (x') ) = - U(x)7j;(x) , and the interaction: ~ [ / d 3 x' d 3 x" 'Ij; t (x')'Ij; t (x") V(x', XI )7j;(X")'Ij;(x'), 'Ij;(x)] = ~ / =~ / X d3 x' / d3 x" ['Ij;t(x')'Ij;t(x"),7j;(x)]V(x', X")7j; (x") 7j; (x') d3 x' / d3 x" {±0(3) (x" - x)'Ij;t(x') - 'lj;t(X")0(3) (x' - x)} V(x', XI)'Ij;(X")'Ij;(x') =- / d 3 x' 7j;t (x')V(x, x')'Ij;(x')'Ij;(x). In this last equation, (1.5.17) and (1.5.5c) are used to proceed from the second line. Also, after the third line, in addition to 7j;(x")7j;(x') = =f'lj;(x')7j;(x"), the symmetry V(x, x') = V(x', x) is exploited. Together, these expressions give the equation of motion (1.5.15) of the field operator, which is also known as the field equation. The equation of motion for the adjoint field operator reads: in?j;t(x, t) = - { - ; : V2 + U(x)} 'lj;t(x, t) - / d 3 x' 'lj;t (x, t)'Ij;t (x', t)V(x, x')7j;(x', t), (1.5.18) where it is assumed that V(x, x')* = V(x, x'). If (1.5.15) is multiplied from the left by 7j;t(x, t) and (1.5.18) from the right by 'Ij;(x, t), one obtains the equation of motion for the density operator n(x,t) = ('Ij;t~+l) = i~ ( - ; : ) {'Ij;tV2'1j;_ (V27j;t)'Ij;} , and thus n(x) = -Vj(x), (1.5.19) where j(x) is the particle current density defined in (1.5.11). Equation (1.5.19) is the continuity equation for the particle-number density. 1.6 Momentum Representation 25 1.6 Momentum Representation 1.6.1 Momentum Eigenfunctions and the Hamiltonian The momentum representation is particularly useful in translationally invariant systems. We base our considerations on a rectangular normalization volume of dimensions Lx, Ly and L z . The momentum eigenfunctions, which are used in place of ~i(X), are normalized to 1 and are given by (1.6.1) = LxLyLz. with the volume V eik(x+L x By assuming periodic boundary conditions = e ikx , etc. , ) (1.6.2a) the allowed values of the wave vector k are restricted to k nx ny nz) Lx' Ly' Lz ,nx = 27r ( = 0, ±1, ... ,ny = 0, ±1, ... ,nz = 0, ±1, ... (1.6.2b) The eigenfunctions (1.6.1) obey the following orthonormality relation: (1.6.3) In order to represent the Hamiltonian in second-quantized form, we need the matrix elements of the operators that it contains. The kinetic energy is proportional to (1.6.4a) and the matrix element of the single-particle potential is given by the Fourier transform of the latter: (1.6.4b) For two-particle potentials V(x - x') that depend only on the relative coordinates of the two particles, it is useful to introduce their Fourier transform (1.6.5a) and also its inverse V(x) = ~ L q Vqe iq . x . (1.6.5b) 26 1. Second Quantization For the matrix element of the two-particle potential, one then finds (p', k'i V(x - x') Ip, k) 1 V2 = - ~ - V3 J d3 X d3 x , e -ip'·x e -ik' ·x' V( x - x ') eik·x' eip·x ' " V; ~ q J J d3 d3 x , -ip'.x-ik'.x'+iq.(x-x')+ik.x'+ip.x xe q = 1 V3 I : Vq VIL p '+q+p,oV8_ k '_q+k,O. q (1.6.5c) Inserting (1.6.5a,b,c) into the general representation (1.4.16c) of the Hamiltonian yields: The creation operators of a particle with wave vector k (i.e., in the state 'Pk) are denoted by at and the annihilation operators by ak. Their commutation relations are (1.6.7) The interaction term allows a pictorial interpretation. It causes the annihilation of two particles with wave vectors k and p and creates in their place two particles with wave vectors k - q and p + q. This is represented in Fig. 1.1a. The full lines denote the particles and the dotted lines the interaction potential Vq . The amplitude for this transition is proportional to Vq . This diak- ql - q2 k- q) k ............ y~L. <p+q P k a) p b) Fig. 1.1. a) Diagrammatic representation of the interaction term in the Hamiltonian (1.6.6) b) The diagrammatic representation of the double scattering of two particles 1.6 Momentum Representation 27 grammatic form is a useful way of representing the perturbation-theoretical description of such processes. The double scattering of two particles can be represented as shown in Fig. 1.1 b, where one must sum over all intermediate states. 1.6.2 Fourier Transformation of the Density The other operators considered in the previous section can also be expressed in the momentum representation. As an important example, we shall look at the density operator. The Fourier transform of the density operatorS is defined by nq = f d3xn(x)e- iq .x f d3x~t()e-iq. = . (1.6.8) From (1.5.4a,b) we insert ~(x) = Jv LeiP'Xap , ~t(x) = Jv Le-iP'Xa~ P , (1.6.9) P which yields and thus, with (1.6.3), one finally obtains nq = L a~p+q (1.6.10) . P We have thus found the Fourier transform of the density operator in the momentum representation. 1.6.3 The Inclusion of Spin Up until now, we have not explicitly considered the spin. One can think of it as being included in the previous formulas as part of the spatial degree of freedom x. If the spin is to be given explicitly, then one has to make the replacements ~(x) --+ ~u(x) and a p --+ a pu and, in addition, introduce the sum over a, the z component of the spin. The particle-number density, for example, then takes the form nq = L a~up+qU (1.6.11) . p,U 8 The hat on the operator, as used here for nq and previously for the occupationnumber operator, will only be retained where it is needed to avoid confusion. 28 1. Second Quantization The Hamiltonian for the case of a spin-independent interaction reads: H = J 1i2 d3x(2m V1/J!V1/Ja L a +~ + U(X)1/J!(X)1/Ja(X)) Jd3Xx'1/J!()~V, L X')1/Ja (X')1/J(7(X) , l (1.6.12) a,u' the corresponding form applying in the momentum representation. For spin- ~ fermions, the two possible spin quantum numbers for the z component of S are ± ~. The spin density operator N S(x) = L 8(x - xa)Sa (1.6.13a) a=l is, in this case, S(x) = Ii "2 L (1.6.13b) 1/J! (x)lTaa 1/Ja (x), l l u,u' where IT aal are the matrix elements of the Pauli matrices. The commutation relations of the field operators and the operators in the momentum representation read: [1/J!(x), 1/J~(X')]± [1/J(7(X), 1/J(7I(X')]± = 0 , [1/Ja(x), 1/J~(X')]± = 0 = 8aa l 8(x - x') (1.6.14) and (1.6.15) The equations of motion are given by a iii at 1/Ja(x, t) = (- 1i2 V2 2m +L Jd3x'1/J~, + U(x) ) 1/Ja(x, t) (x', t)V(x, X')1/J(71 (x', t)1/Ja(x, t) (1.6.16) al and (1.6.17) Problems 29 Problems 1.1 Show that the fully symmetrized (antisymmetrized) basis functions 8±'Pil (Xd'Pi2 (X2) ... 'PiN (XN) are complete in the space of the symmetric (antisymmetric) wave functions 1/;./a(Xl,X2, ... ,XN). Hint: Assume that the product states 'Pil (Xl) ... 'PiN (XN), composed of the single-particle wave functions 'Pi(X), form a complete basis set, and express 1/;./a in this basis. Show that the expansion coefficients c::,a .. ,iN possess the symmetry property k8±c:~ .. ,iN = c:~ .. ,iN' The above assertion then follows directly by utilizing the identity k8±1/;./a = 1/;./a demonstrated in the main text. 1.2 Consider the three-particle state la) 1,6) IT), where the particle number is determined by its position in the product. a) Apply the elements of the permutation group 8 3 . One thereby finds six different states, which can be combined into four invariant subspaces. b) Consider the following basis, given in (1.1.16c), of one of these subspaces, comprising two states: 11/;1) vb (2 = la) 1,6) h') +2 1,6) la) IT) -la)IT)I,6) -IT)I,6)la) - 11') la) 1,8) -1,6) IT) la») , 11/;2) = ~ (0+0 -Ia) IT) 1;3) + IT) 1,6) la) + 11,) la) 1,6) -1,6) I,) la») and find the corresponding two-dimensional representation of 83. 1.3 For a simple harmonic oscillator, [a, at] = 1, (or for the equivalent Bose operator) prove the following relations: [a,e aat ] = ae aat e- aat ae aat = a + a, -aat {3a aat {3a (3a e e e =e e , eaataae-aata =e-aa, where a and ,8 are complex numbers. Hint: a) First demonstrate the validity of the following relations [a, j(a t )] = a~t j(a t ), [at,j(a)] = - ;aj(a). b) In some parts of the problem it is useful to consider the left-hand side of the identity as a function of a, to derive a differential equation for these functions, and then to solve the corresponding initial value problem. c) The Baker-Hausdorff identity eA Be -A = B + [A, B] + ~ [A, [A, Bll + ... can likewise be used to prove some of the above relations. 30 1. Second Quantization 1.4 For independent harmonic oscillators (or noninteracting bosons) described by the Hamiltonian determine the equation of motion for the creation and annihilation operators in the Heisenberg representation, Give the solution of the equation of motion by (i) solving the corresponding initial value problem and (ii) by explicitly carrying out the commutator operations in the expression ai(t) = eiHt/haie-iHt/h. 1.5 Consider a two-particle potential V(x', x") symmetric in x' and x". Calculate the commutator ~ [I 1 d3 X"'l/Jt(X')'l/Jt(X")V(X',X")'l/J(X")'l/J(X I ),'l/J(X)] , d 3 x' for fermionic and bosonic field operators 'l/J(x). 1.6 (a) Verify, for an N-particle system, the form of the current-density operator, j(x) = ~ N L ,,=1 {~ ,8(x - x,,) } in second quantization. Use a basis consisting of plane waves. Also give the form of the operator in the momentum representation, i.e., evaluate the integral, j(q) = J d3 xe- iq .x j(x). (b) For spin-~ particles, determine, in the momentum representation, the spindensity operator, N Sex) = L 8(x - x")S,, , 0:=1 in second quantization. 1. 7 Consider electrons on a lattice with the single-particle wave function localized at the lattice point Ri given by 'PiO"(X) = XO"'Pi(X) with 'Pi(X) = ¢(x - Ri). A Hamiltonian, H = T + V, consisting of a spin-independent single-particle operator T = E::=l t" and a two-particle operator V = ~ E"#i3 V(2)(X",Xi3) can be represented in the basis {'PiO"} by H =L i,j L tija!O"ajO" + ~ L L u i,j,k,l Vijkla!O"a;O"IaIO"IakO" , CT,U I where the matrix elements are given by tij = (i I t I j) and Vijkl = (ij I V(2) I kl). If one assumes that the overlap of the wave functions 'Pi(X) at different lattice points is negligible, one can make the following approximations: Problems 31 for i = j, for i and j adjacent sites , otherwise = Vi j 8il 8jk Vijkl with Vij = ! ! d3x d3y I 'Pi(X) 12 V(2)(x,y) I 'Pj(Y) 12 (a) Determine the matrix elements Vi] for a contact potential V = A "2 L 8(xa - X,B) ai',B between the electrons for the following cases: (i) "on-site" interaction i = j, and (ii) nearest-neighbor interaction, i.e., i and j adjacent lattice points. Assume a square lattice with lattice constant a and wave functions that are Gaussians, 'P(x) = ,13(i".3(4 exp{ _X2 /2Ll2}. (b) In the limit Ll « a, the "on-site" interaction U = Vii is the dominant contribution. Determine for this limiting case the form of the Hamiltonian in second quantization. The model thereby obtained is known as the Hubbard model. 1.8 Show, for bosons and fermions, that the particle-number operator commutes with the Hamiltonian H = La! (il T Ij) aj '] +~ IV = L:i a! ai La!a} (ijl V Iki) alak . ijkl 1.9 Determine, for bosons and fermions, the thermal expectation value of the occupation-number operator ni for the state Ii) in the grand canonical ensemble, whose density matrix is given by 1 pG = ZG e -,B(H-J.LN) with ZG = Tr (e-,B(H-J.LN)) 1.10 (a) Show, by verifying the relation n(x) I¢» = 8(x - x') I¢» , that the state (10) = vacuum state) describes a particle with the position x'. (b) The operator for the total particle number reads: ,= ! N d3 xn(x). Show that for spinless particles [1jJ(x), NJ = 1jJ(x) . 2. Spin-l/2 Fermions In this and the following chapters, we shall apply the second quantization formalism to a number of simple problems. To begin with, we consider a gas of noninteracting spin-~ fermions for which we will obtain correlation functions and, finally, some properties of the electron gas that take into account the Coulomb interaction. 2.1 Noninteracting Fermions 2.1.1 The Fermi Sphere, Excitations In the ground state of N free fermions, 14>0), all single-particle states lie within the Fermi sphere (Fig. 2.1), i.e., states with wave number up to kF' the Fermi wave number, are occupied: (2.1.1) Fig. 2.1. The Fermi sphere The expectation value of the particle-number operator in momentum space is np,a For = (4)01 Ipi > t apaa pa kF, we have 14>0) = a pa {1 14>0) 0 = Ipl:::; Ipi > kF kF . (2.1.2) II II a~',p 10) = O. According to p' U' Ip'l<kF (2.1.2), the total particle number is related to the Fermi momentum byl 1 = I:k (~k)3f is 11k = (2{)3, c.f. I:kf(k) = U;.)3 J d3kf(k). Eq. (1.6.2b). The volume of k-space per point 34 2. Spin-l/2 Fermions {kF N = L npa = 1 = 2V Jo 2 L p,a d3 p (27f)3 Vk} = 37f2 ' (2.1.3) Ipl:-S;kF whence it follows that _ 37f2 N _ 3 2 k 3p-y-7fn. (2.1.4) Here, k p is the Fermi wave vector, pp = nk p the Fermi momentum 2 , and n = the mean particle density. The Fermi energy is defined by Ep = (fikp) j(2m). For the x-dependence of the ground-state expectation value of the particle density, one obtains y (n(x)) = L (cPo 1'l/Jt (x)'l/Ja (x) IcPo) a 1 = V Lnpa = n. p,a As was to be expected, the density is homogeneous. The simplest excitation of a degenerate electron gas is obtained by promoting an electron from a state within the Fermi sphere to a state outside this sphere (see Fig. 2.2). One also describes this as the creation of an electronhole pair; its state is written as (2.1.5) Fig. 2.2. Excited state of a degenerate electron gas; electronhole pair The absence of an electron in the state Ikb al) has an effect similar to that of a positively charged particle (hole). If one defines bka == a~k -a and btu == a-k,-a, then the hole annihilation and creation operators b' and bt likewise satisfy anticommutation relations. 2 We denote wave vectors by p, q, k etc. Solely PF has the dimension of "momentum". 2.1 Noninteracting Fermions 35 2.1.2 Single-Particle Correlation Function The correlation function of the field operators in the ground state (2.1.6) signifies the probability amplitude that the annihilation of a particle at x' and the creation of a particle at x once more yields the initial state. The function G(7(x - x') can also be viewed as the probability amplitude for the transition of the state 'IjJ(7(x / ) I¢o) (in which one particle at x' has been removed) into 'IjJ(7(x) I¢o) (in which one particle at x has been removed). L G(7(x - x') = (¢ol ~e-iP.X+' ab(7a p '(7I¢o) p,p' - ~ ~ -ip·(x-x') we -v n p ,(7p = - -12 lkF (27r) dp p2 0 J d3p (27r)3 e -ip.(x-x')e(k F - p ) 11 . ,1'1, drye,plX-x -1 (2.1. 7) where we have used polar coordinates and introduced the abbreviation ry = with r = Ix - xii. Thus, we cos e. The integration over ry yields e ipr i;,~-pr have G(7(x - x') = 1 -2- 27r r lkF 0 dppsinpr = ~(sinkFr 1 27r r - kFr cos kFr) 3n sin kFr - kFr cos kFr (kFr)3 2 The single-particle correlation function oscillates with a characteristic period of l/kF under an envelope which falls off to zero (see Fig. 2.3). The values at r = 0 and for r -+ 00 are G(7(r = 0) = ~, lim r --+ oo G(7(r) = 0; the zeros are determined by tan x = x, i.e., for large x they are at n21r. G".(x - x') n "2 Fig. 2.3. Correlation function G".(x - x') as a function of kFT 36 2. Spin-l/2 Fermions Remark. In relation to the first interpretation of GO" (x) given above, it should be noted that the state 1/J0" (x') I¢o; is not normalized, (¢ol1/J~x')O" I¢o; = ~. (2.1.8) The probability amplitude is obtained from the single-particle correlation function by multiplying the latter by the factor (~) -1. Now (2.1.9) The probability amplitude for a transition from the (normalized) state 'l/>a~o) to the (normalized) state 'l/>a~o) n/2 n/2 is equal to the overlap of the two states. 2.1.3 Pair Distribution Function As a result of the Pauli principle, even noninteracting fermions are correlated with one another when they have the same spin. The Pauli principle forbids two fermions with the same spin from possessing the same spatial wave function. Hence, such fermions have a tendency to avoid one another and the probability of their being found close together is relatively small. The Coulomb repulsion enhances this tendency. In the following, however, we will consider only noninteracting fermions. A measure of the correlations just descibed is the pair distribution function, which can be introduced as follows: Suppose that at point x a particle is removed from the state 14>0) so as to yield the (N - I)-particle state (2.1.10) The density distribution for this state is (4)' (x, (T) I 'ljJ~, (x')'ljJu' (x') 14>' (x, (T)) = (4)01 'ljJ!(X)~/ux == 14>0) (~)2gcralx_' (2.1.11) This expression also defines the pair distribution function gaul (x - x'). Note: (~r 90"0"' (x - x') = = (¢ol1/J~ (x)1/JO" (X)1/J~, (X')1/J,,1 (x') I¢o; -OO"O"IO(X - x') (¢ol1/J~x),' I¢o; (¢ol n(x)n(x') I¢o; - O""IO(X - x') (¢ol n(x) I¢o; For the sake of convenience, the pair distribution function is calculated in Fourier space: 2.1 Noninteracting Fermions (~fgalX-') = ~2 37 LLe-i(k-k/).X-i(q_q/)ox l (2.1.12) k,k/q,q' x (4)01 aL~",q/C7lk 14>0) . We will distinguish two cases: (i) u=/-u': For u =/- u', we must have k orthogonal to one another: (~)2galX_' = k' and q = q', otherwise the states would be ~2 = L(4)olnk,,nq,,'I4>o) k,q ~N = N V 2 "C7 I =~ N .N 2 V2 2 = (~)2 2· Thus, for u =/- u', (2.1.13) independent of the separation. Particles with opposite spin are not affected by the Pauli principle. (ii) u = u': For u = u' there are two possibilities: either k (4)01 atC7~"q/kl4>o) = k', q = q' or k = q', q = k': = 8kk/8qql (4)01 at"~qC7kl4>o) + 8kq/8qk' (4)01 aL~"kC7q = 14>0) (4)01 aLk"~qC7l4>o) 8kq/8qk' )nkC7nqC7 . (8kk/8qql - 8kq/8qk/) = (8kk/8qql - (2.1.14) Since (ak,,)2 = 0, we must have k =/- q and thus, by anticommutating - see (1.6.15) - we obtain the expression (2.1.14), and from (2.1.12) one gains: (~)2 g"C7(X - x') = ~2 L (1- / e-i(k-q)(X-X )) nk"n q " k,q = n)2 -(G,,(x-x)). , 2 ( "2 (2.1.15) With the single-particle correlation function G,,(x - x') from (2.1.8) and the abbreviation x = kF Ix - x'l, we finally obtain g",,(x - x') = 1- 96 (sinx - xcosX)2 . X (2.1.16) 38 2. Spin-l/2 Fermions 9uu(X - x') 1.0 I-----"?"'-------------1.00 1----..---...,""""-7'""........=--1 0.5 0.99 0.0 L L . . -........- ....'--------,!2'-7r-.........-----:!3-7r-.......--4-!-7r-- 7r kFlx-X/1 Fig. 2.4. The pair distribution function 9uu(X - x'). The correlation hole and the weak oscillations with wave number kF should be noted Let us give a physical interpretation of the pair distribution function (2.1.16) plotted in Fig. 2.4. If a fermion is removed at x, the particle density in the vicinity of this point is strongly reduced. In other words, the probability of finding two fermions with the same spin at separations ;S k p1 is small. The reduction of guu(x - x') at such separations is referred to as an exchange, or correlation hole. It should be emphasized once again that this effective repulsion stems solely from the antisymmetric nature of the state and not from any genuine interaction. For the noninteracting electron gas at T = 0, one has 1" gaa = 2 "4 ~ l -1 (1 + gaa(x)) (2.1.17a) a,a' L (¢ol1jJtx)~/Oa n2 I¢o) = 4 a,a' L gaa l (x) a,a' = n2 2(1 + gaa(x)) -+ n 2 for x -+ 00 n2 -+ 2 for x -+ O. (2.1.17b) The next section provides a compilation of the definitions of the pair distribution function and other correlation functions. According to this, the spin-dependent pair distribution function 2.1 Noninteracting Fermions 39 is proportional to the probability of finding a particle with spin a at position x when it is known with certainty that a particle with spin a' is located at O. It is equal to the probability that two particles with spins a and a' are to be found at a separation x. *2.1.4 Pair Distribution Function, Density Correlation Functions, and Structure Factor The definitions and relationships given in this section hold for arbitrary manybody systems and for fermions as well as bosons 3 . The standard definition of the pair distribution function of N particles reads: g(x) = N(:-l)\ t 8(X-xo +x,6)). (2.1.18) 0#,6=1 Here, g(x) is the probability density that a pair of particles has the separation x; in other words, the probability density that a particle is located at x when with certainty there is a particle at the position O. As a probability density, g(x) is normalized to 1: (2.1.19) The density-density correlation function G(x) for translationally invariant systems is given by G(x) = (n(x)n(O)) = (n(x + x')n(x')) = L (8(x + x' - x o )8(x' - x,6)). (2.1.20) 0,,6 Due to translational invariance, this is independent of x' and we may integrate over x', whence (with -& d3 x' = 1) it follows that J 1 G(x) = V L (8(x - Xo + x,6)) . 0,,6 This leads to the relationship G(x) = ~ (~8X) = n8(x) + 3 + N(NV - 1) g(X)) (2.1.21) N(N -1) V2 g(x) . The brackets signify an arbitrary expectation value, e.g., a quantum-mechanical expectation value in a particular state or a thermal expectation value. 40 2. Spin-l/2 Fermions For interactions of finite range, the densities become independent of each other at large separations: lim G(x) x-+= = (n(x))(n(O)) = n2• From this it follows that The static structure factor S (q) is defined by ~(:2e-iqXa/3») S(q) = -N8qQ . (2.1.22) + 1- N8qQ (2.1.23) a,f3 One may also write S(q) = ~ :2:: (e- iq (Xa- X/3») a¥-f3 or where nq = j d xe3 iqx n(x) = :2:: e- iqxa . a Since N(N - 1) -+ N 2 for large N j d x e-iqxg(x) ;;2 j d xe- (:2:: 8(x - xa + X(3)) 3 3 = iqX a¥-(3 = ;;2 (:2:: e- iq (Xa- X/3»), a¥-f3 and it follows that S(q) = ~ j d xe3 iqx g(x) With N8qO = N V one obtains jd3xe- iqX ' +1- N8qQ . 2.2 Ground State Energy and Elementary Theory of the Electron Gas 41 (2.1.24a) and the inverse g(x) - 1 = -1 n J 3 d q e1qX(S(q) . __ - 1) . (2.1.24b) (21f)3 In the classical case, lim S(q) = nkTXT , (2.1.25) q--+O where XT is the isothermal compressibility. The above definitions yield the following second-quantized representations of the density-density correlation function and the pair-distribution function: G(x - x') = (lpt(X)'ljJ(x)'ljJt(x')'ljJ(x')) (2.1.26a) g(x) = ~: (2.1.26b) ('ljJ t (x)'ljJ t (O)'ljJ (O)'ljJ (x) ). The first formula, (2.1.26a), is self-evident; the second follows from the former and (2.1.21) and a permutation of the field operators. Proof of the last formula based on the definition (2.1.18) and on (1.5.6c): L 8(x - x'" + x{3) ",#{3 ~ J d 3 x' d 3 x"'I/} (x')'ljJt (x")8(x = I x' + x")'IjJ(x")'IjJ(x') d3 x''ljJt(x')'ljJt(x' -x)'IjJ(x' -x)'IjJ(x') / L 8(x - x", + X{3)) = V( 'ljJt (x')'ljJt (x' \0# x)'IjJ(x' - x)'IjJ(x')) . 2.2 Ground State Energy and Elementary Theory of the Electron Gas 2.2.1 Hamiltonian The Hamiltonian, including the Coulomb repulsion, reads: (2.2.1) The q = 0 contribution, which, because of the long-range nature of the Coulomb interaction, would diverge, is excluded here since it is canceled by the interaction of the electrons with the positive background of ions and by the interaction between the ions. This can be seen from the following. 42 2. Spin-l/2 Fermions The interaction energy of the background of positive ions is H = lOll ~e2 2 J d3 xd 3 x,n(x)n(x') e-JLlx-x'l Ix - x'i (2.2.2a) . Here, n(x) = ~ and we have introduced a cutoff at J.l- 1 . At the end of the calculation we will take J.l --+ 0 2 H·lOll = J (Xl 1 2 (N) -e V 4n 2 V drre- JL r = 2 1 2N 4n -e 2 V J.l2 • (2.2.2a') o The interaction of the electrons with the positive background reads: 2 Hioll,el = -e ~ N N V J 3 d x e -JLlx-x;1 2 N 2 4n Ix-xii = -e V J.l2 . (2.2.2b) Finally, we consider the q = 0 contribution to the electron-€lectron interaction , where --qr41l"e 2 --+ 41l"e 2 q2+JL2' (2.2.2c) The leading terms, proportional to N 2 , in the three evaluated energy contributions cancel one another. The term - ;~ ,. N yields an energy contribution per particle of ~ ex: ~ and vanishes in the thermodynamic limit. The limits are taken in the order N, V --+ 00 and then J.l --+ o. -k 2.2.2 Ground State Energy in the Hartree-Fock Approximation The ground state energy is calculated in perturbation theory by assuming a ground state 14>0), in which all single-particle states up to kF are occupied: (2.2.3) 2.2 Ground State Energy and Elementary Theory of the Electron Gas 43 The kinetic energy in this state is diagonal: E(O) = {<Pol Hkin l<Po} = !!.-. '" k 8(k 2m~ 2 F - k) k,O" (2.2.4) Here, according to (2.1.4), we have used n = 3: k3 3 = 47fr3 = 47fa3r~ 2 3 (2.2.5) and introduced ro, the radius of a sphere of volume equal to the volume per particle. The quantity ao = ~ me is the Bohr radius and r s = 2:Q.. ao The potential energy in first-order perturbation theory4 reads: e2 - 2V (1) _ E 47f t t q2 {<Pol ak+q,crak'_q,cr,ak'cr,akcr l<Po} . '" ~ (2.2.6) k,k' ,q,a,u' The prime on the summation sign indicates that the term q = 0 is excluded. The only contribution for which every annihilation operator is compensated by a creation operator is proportional to 8crcr ,8k , ,k+qat+qcratcr,ak+qcr' akcr, thus: , 2 E(l) =_~ L 2V e2 k,q,a '" = - 2V ~ ~ 0" =- 2 47fe V (27f)6 ",'47f qx8(k F -Iq + kJ)8(kF k,q J d3 k 8(k F - k) J d3 k' - k) Ik _1k'I2 8(k F - k'). (226') .. One then finds 1 8(k _k,)=_2e2 k F(~) _ 47fe 2 Jd 3 k' (27f)3 Ik _ k'I 2 F 7f F kF' where 1+ -1--4xlxo g 11-I-x +-x I F(x) = - 2 4 2 (2.2.6") This first-order perturbation theory can also be considered as the Hartree-Fock theory with the variational state (2.2.3); see also Problem 2.5. 44 2. Spin-l/2 Fermions Fig. 2.5. Integration region for E(l) consisting of the region of overlap of two Fermi spheres with relative displacement q; see Eq. (2.2.6') and J d3k [ (2'71-)3 1 + k} - k 2 1 kF + k I] 2kkF log kF - k k<kF = _N~e2kF = Taking E(O) _~(97f)1/3N 4 7f 2aors and E(l) = _~ 27f 4 0.916 N. 2ao rs (2.2.7) together yields: E e 2 [2.21 0.916 N = 2ao r; - --;::- + . .. ] (2.2.8) (rs«l)' The first term is the kinetic energy, and the second the exchange term. The pressure and the bulk modulus are given by P = _(BE) = _ dE drs = Ne 2 ~ [4.42 _ 0.916] BV N drs dV 2ao 3V r~ r; and B = .!. = -v (BP) K, = BV Ne 2 [11.05 _ 1.832] 9Vao r; rs (2.2.9) For r s = 4.83 the energy takes on its minimum value corresponding to ~ = -1.29 eV. This is ofthe same order of magnitude as in simple metals, e.g., Na (rs = 3.96, ~ = -1.13eV). However, these values of rs lie outside the range of validity of the present theory. Higher order corrections to the energy can be obtained in the random phase approximation (RPA): E { 2.21 NR y = -2+~·06lnrs.9AB ~ 0.916 G .. ~ } v correlation energy (2.2.10) 2 4 = ~Ii = 13.6 eV. where we have made use of the Rydberg, 1 Ry = 2~o The RPA yields an energy that contains, in addition to the Hartree-Fock 2.2 Ground State Energy and Elementary Theory of the Electron Gas 45 energy, the summation of an infinite series arising from perturbation theory. It is the latter that yields the logarithmic contributions. That perturbation theory should lead to a series in powers of r s, can be seen from the rescaling in (2.2.23). Remarks For r s ~ 00, one expects the electrons to form a Wigner crystal crystallize. For large r s one finds the expansion6 . E e 2 [ 1.79 hm N = - -rs-too 2ao rs ] + 2.64 3/2 + .. . , rs 5, i.e., to (2.2.11) which, for rs » 10, is quantitatively reliable (see Problem 2.7). The Wigner crystal has a lower energy than the fluid. Corrections arising from correlation effects are discussed in other advanced texts 7 . 0.10 >: II: -.. ~ 0.05 w'" >- 0.00 GI C GI GI .... -0.05 c -0.10 C) IU 1/1 I 'tI .. :::II 0 C) -0.15 0.0 5.0 10.0 15.0 20.0 Fig. 2.6. Energies of the electron gas in the Hartree--Fock approximation and of the Wigner crystal, in each case as a function of r s rs To date, Wigner crystallization5 in three dimensions has not been detected experimentally. It is possible that this is due to quantum fluctuations, which destroy (melt) the lattice6 . On the basis of a Lindemann criterions , one finds that the Wigner lattice is stable for r s > r~ = 0.41 8- 4 , where 8 (0.15 < 8 < 0.5) is the Lindemann parameter. Even for 8 = 0.5, the value of r~ = 6.49 is already larger than the minimum value of (2.2.11), rs = 4.88. In two dimensions, a triangular lattice structure has been theoretically pre- 5 6 7 8 E.P. Wigner, Phys. Rev. 46, 1002 (1934) , Trans. Faraday Soc. 34, 678 (1938) R.A. Coldwell-Horsfall and A.A. Maradudin, J. Math. Phys. 1, 395 (1960) G.D. Mahan, Many Particle Physics, Plenum Press, New York, 1990, 2nd edn, Sect. 5.2 See, e.g., J. M. Ziman, Principles of the Theory of Solids, 2nd edn, Cambridge University Press, Cambridge, 1972, p.65. 46 2. Spin-1/2 Fermions dicted 9 and experimentally observed for electrons on the surface of helium. 10 Its melting curve has also been determined. Figure 2.6 compares the HartreeFock energy (2.2.8) with the energy of the Wigner crystal (2.2.11). The minimum of the Hartree-Fock energy as a function of Ts lies at Ts = 4.83 and has the value E/N = -O.095e2/2ao. To summarize, the range of validity of the RPA, equation(2.2.1O), is restricted to Ts « 1, whereas (2.2.11) for the Wigner crystal is valid for Ts » 10; real metals lie between these two regimes: 1.8 :::; T s :::; 5.6. 2.2.3 Modification of Electron Energy Levels due to the Coulomb Interaction The Coulomb interaction modifies the electron energy levels fo(k) = (~2. We can calculate this effect approximately by considering the equation of motion of the operator akO"(t): ~ akO"(t) = = - [2: ~ 2: k' ,0"' akO"(t) = -~fo(k)aO"t fo(k')at,O",ak'O"', akO"] k',u' fO (k') [at, 0"" akO" L ak' 0"' "-v-" . (2.2.12) We now define the correlation function (2.2.13) Multiplying the equation of motion by ato"(O) yields an equation of motion for GkO"(t): d i dt GkO"(t) = -/ifo(k)GkO"(t). 9 10 (2.2.14) G. Meissner, H. Namaizawa, and M. Voss, Phys. Rev. B13, 1360 (1976); L. Bonsall, and A.A. Maradudin, Phys. Rev. B15, 1959 (1977) C.C. Grimes, and G. Adams, Phys. Rev. Lett. 42 795 (1979) 2.2 Ground State Energy and Elementary Theory of the Electron Gas 47 Its solution is (2.2.15) since (1)01 aka(O)aL(O) 11>0) = -nka + l. When the Coulomb repulsion is included, the equation of motion for the annihilation operator aka reads: (2.2.16) as can be immediately seen from the field equation. From this it follows that On the right-hand side there now appears not only Gka(t), but also a higherorder correlation function. In a systematic treatment we could derive an equation of motion for this, too. We introduce the following factorization approximation for the expectation value l l : (a~+q a' (t)ak+q a(t)apa,(t)ata(O)) = (a~+q = (2.2.18) a' (t)ak+q a(t))( apa,(t)aL(o)) <laa,<lPk( a~+q a' (t)a p+q a,(t))( aka (t)ata (0) ). The equation of motion thus reads: (2.2.19) From this, we can read off the energy levels E(k) as 11 The other possible factorization \ a~+q 0"' (t)a p 0"' (t) ) \ a~+q q = 0, which is excluded in the summation of Eq. (2.2.17). 0" (t)akO" (0) ) requires 48 2. Spin-l/2 Fermions (2.2.20) The second term leads to a change in E(k), LlE(k) = - J d 3 k' (21f)3 41fe 2 8(k - k') Ik _ k'I 2 kF F 1 2 = _ e Jdk'k,2 Jd 1f o TJ k2 -1 1 + k,2 - 2kk'TJ kF = - ~1fk J = _ 2e 2 k F dk' k' I o (~ 1f 2 Ik + k' I og k - k' + 1- x 2 log 11 + x I) 4x , . k x = kF I-x (2.2.21) ' I F(x) Here again the function F(x) of Eq. (2.2.6") appears. The Hartree-Fock energy levels are reduced in comparison to those of the free electron gas. However, the estimated reduction turns out to be greater than that actually observed. Figure 2.7 shows F(x) and E(k) in comparison to Eo(k) = 1i;:;'2 for rs = 4. Notes: (i) A shorter derivation of the Hartree-Fock energy is obtained by introducing the following approximation in the Hamiltonian 2 F(x) ,, , ,, ,, , 0.8 1.25 1.5 k/kF 0.6 -1 0.4 0.2 +-~=",:'x -2 0.2 0.4 0.6 0.8 1.2 1.4 Fig. 2.7. (a) The function F(x), Eq. (2.2.6"), and (b) the Hartree-Fock energy levels E(k) as a function of the wave number for Ts = 4, compared with the energy of the free-electron gas Eo(k) (dashed). 2.3 Hartree-Fock Equations for Atoms 2~ L k,k',q#O 4;:2 (\ a~+q" ak' ,,' ) a~, _q ,,' ak" + a~+q " ak' ,,' \ a~, _q ,,' 49 ak" ) ) 0',0" This yields: H = L E(k)a~" k," with (ii) The perturbation-theoretical expansion in terms of the Coulomb interaction leads to a power series (with logarithmic corrections) in rs. This structure can be seen from the following scaling of the Hamiltonian: H = L. 't 2 1 2 L~ 2.-1.' r'i l!i... + 2m To this end, we carry out a canonical transformation r' characteristic length ro is defined by 4;r8N = V, i.e., _(~)1/3 ro - 47rN (2.2.22) "'-r-J = r/ro p' = pro. The . In the new variables the Hamiltonian reads: (2.2.23) The Coulomb interaction becomes less and less important in comparison to the kinetic energy as ro (or rs) becomes smaller, i.e., as the density of the gas increases. 2.3 Hartree-Fock Equations for Atoms In this section, we consider atoms (possibly ionized) with N electrons and the nuclear charge number Z. The nucleus is assumed to be fixed and thus the Hamiltonian written in second quantized form is 50 2. Spin-l/2 Fermions i,j i,j (2.3.1) where p2 T=2m Ze 2 u= - , r (2.3.2a) (2.3.2b) r=lxl and e2 V = .,----,. Ix-x'i (2.3.2c) represent the kinetic energy of an electron, the potential felt by an electron due to the nucleus, and the Coulomb repulsion between two electrons, respectively. Although the Hartree and the Hartree-Fock approximations have already been discussed in Sect. 13.3 of QM 112, we will present here a derivation of the Hartree-Fock equations within the second quantization formalism. This method is easier to follow than that using Slater determinants. We write the state of the N electrons as (2.3.3) Here, 10) is the vacuum state containing no electrons and a! is the creation operator for the state Ii) == l'Pi, m s ;) , m s , = ±~. The states Ii) are mutually orthogonal and the 'Pi(X) are single-particle wave functions which are yet to be determined. We begin by calculating the expectation value for the general Hamiltonian (2.3.1) ('ljJ1 H 1'ljJ) without particular reference to the atom. For the single-particle contributions, one immediately finds N L (il T Ij) ('ljJ1 a!aj 1'ljJ) = L (il T Ii) i,j (2.3.4a) i=1 N L (il u Ij) ('ljJ1 aJaj 1'ljJ) = L i,j (il U Ii) , (2.3.4b) i=1 whilst the two-particle contributions are found as ('ljJ1 aJajamak 1'ljJ) = ('ljJ1 (8im8jk at"al + 8ik8jmalat,,)amak 1'ljJ) = (8ik 8jm - 8im 8jk) 8(m, k E 1, ... , N) . (2.3.4c) The first factor implies that the expectation value vanishes whenever the creation and annihilation operators fail to compensate one another. The second 12 QM lop. cit. 2.3 Hartree-Fock Equations for Atoms 51 implies that both the operators am and ak must be present in the set a1 ... aN occurring in the state (2.3.3), otherwise their application to the right on the vacuum state 10) would give zero. Therefore, the total expectation value of H reads: 11,2 (1/!1 HI1/!) = 2m L Jd3xlV'Pi + L Jd3xU(x) l'Pi(X)12 N N i=l i=l N +~ L J d3xd3x'V(x - x') {1'Pi(X)1 21'Pj(x')1 2 i,j=l (2.3.5) - Oms,m sJ 'P: (x)'P; (X')'Pi(X')'Pj (x) }. In the spirit of the Ritz variational principle, the single-particle wave functions 'Pi(X) are now determined so as to minimize the expectation value of H. As subsidiary conditions, one must take account of the normalizations l'Pil 2d3x = 1; this leads to the additional terms -Ei(J d3xl'Pi(X) 12 -1) with Lagrange parameters Ei. In all, one thus has to take the functional derivative J 1) (J of (1/!1 HI1/!) - I:~1 Ei d3xl'Pi(X)12 with respect to 'Pi(X) and 'PiCx) and set this equal to zero, where one uses (2.3.6) The following equations refer once again to atoms, i.e., they take into account (2.3.2a-c). Taking the variational derivative with respect to 'Pi yields: - tOms, mSJ J=l J d3x'ix ~2 x'i 'P; (X')'Pi(X') . 'Pj (x) = Ei'Pi(X) . (2.3.7) These are the Hartree-Fock equations. Compared to the Hartree equations, they contain the additional term J d3x'ix ~2 - "" Om ~ j =- ~Oms,J Jr' x'll'Pi(X')1 2'Pi(X) St m SJ J Jd3X'lx~2IP;()ij. 2 d3x' IX -e X, I 'PJ* (X')'Pi(X')'PJ (x) (2.3.8) 52 2. Spin-l/2 Fermions The second term of the interaction on the left-hand side is known as the exchange integral, since it derives from the antisymmetry of the fermion state. The interaction term can also be written in the form J d3 x'ix ~2 x'i L cpj(x') [cpj(X')CPi(X) - CPj(x)CPi(x')8ms,ms,] . J The exchange term is a nonlocal term which only occurs for m s , = m s ,. The term in square brackets is equal to the probability amplitude that i and j are at the positions x and x'. For further discussion of the Hartree-Fock equations and their physical implications we refer to Sect. 13.3.2 of QM I. Problems 2.1 Calculate the static structure function for noninteracting fermions SO(q) == ~(rPO 1 nqn_q 1 rPo), where nq = Lk C7 a~C7k+q is the particle density operator in the momentum representation a~d IrPo) is the ground state. Take the continuum limit Lk,C7 = 2V J d 3k/(27r)3 and calculate So(q) explicitly. Hint: Consider the cases q = 0 and q i- 0 separately. 2.2 Prove the validity of the following relations, which have been used in the evaluation of the energy shift LlE(k) of the electron gas, Eq. (2.2.21): 2! d 3 k' 1 , 2 e2 (27r)3 1 k _ k' 12 8(k F - k) = ---;-kFF(k/k F ), a) - 47re F(x) = ! + 1- x 2 with b) 4x 2 In 11I-x + x I. E(1) = _ e 2 kF V ! d 3k 7r = _~ 4 (27r)3 e 2 kF N = 7r _~ [1 + 2aoTs I k} - k 2 In kF + k 2kkF kF - k I] 8(kF - k) (97r)1/3 3N 27r ' 4 where Ts is a dimensionless number which characterizes the mean particle separation in units of the Bohr radius ao = fj,2 /me 2 . Furthermore, k~ = 37r2 n = 1/(a:aoTs)3 with a: = (4/97r)1/3. 2.3 Apply the atomic Hartree-Fock equations to the electron gas. a) Show that the Hartree-Fock equations are solved by plane waves. b) Replace the nuclei by a uniform positive background charge of the same total charge and show that the Hartree term is canceled by the Coulomb attraction of the positive background and the electrons. Problems 53 The electronic energy levels are then given by E(k) = (hk? 2m - 1 "" 47Te 2 L...J I k _ q 12 8(kF - q) . V q According to Problem 2.2, this can also be written as E(k) 2.4 Show that the Hartree-Fock states Ii) orthogonal and that the Ei are real. 1: = (~2 _ 2;2 kFF(kjkF)' == l<Pi, ms,) following from (2.3.7) are 2.5 Show that, for noninteracting fermions, SO(q,w) == ~ 00 2:~N dteiwt(¢>olnq(t)iLq(O)I¢>o) I x8( d3 k 8(kF - k) 8(lk + ql - kF) nw - :: (l + 2k . q) ) Also, prove the relationship for q = 0 , for q =I- 0 2.6 Derive the following relations for Fermi operators: a) 2 t (t t) e -aa t ae aa t =a-aa+aaa-aa 2 e-aaateaa = at - a a - a(aa t - ata) b) o:ata -o:ata -Q e ae =e a eaataate-aata = e-aat . 2.7 According to a prediction made by Wigner 13 , at low temperatures and sufficiently low densities, an electron gas should undergo a phase transition to a crystalline structure (bcc). For a qualitative analysis 14, consider the energy of a lattice of electrons embedded in a homogeneous, positively charged background. Assume that the potential in which each electron moves can be approximated by the potential of a uniformly charged sphere of radius ro = rsao surrounding each electron. 13 14 E.P. Wigner, Phys. Rev. 46, 1002 (1934) E.P. Wigner, Trans. Faraday Soc. 34, 678 (1938) 54 2. Spin-1/2 Fermions Here, ro is the mean particle separation in the Wigner crystal with electron density n, i.e., = lin. This leads to a model of independent electrons (Einstein approximation) in an oscillator potential 4; rg p2 e2 2 H=-+-r 2m 2rg 3e 2 2ro Determine the zero-point energy Eo of this three-dimensional harmonic oscillator and compare this with the result found in the literature l5 : Eo =~ {_1.792 2ao rs + 2.638} r~/2 . By minimizing the zero-point energy, determine the mean separation of the electrons. 15 R.A. Coldwell-Horsfall and A.A. Maradudin, J. Math. Phys. 1, 395 (1960) 3. Bosons 3.1 Free Bosons In this section, we study the characteristic properties of noninteracting bosons. We first calculate the pair distribution function in order to investigate correlation effects. 3.1.1 Pair Distribution Function for Free Bosons We shall assume that the bosons are noninteracting and that they carry zero spin. Hence, their only quantum number is their momentum. We consider a given state of an N -particle system (3.1.1) where the occupation numbers can take the values 0,1,2, ... etc. The expectation value of the particle density is (4)1 'l/Jt(x)'l/J(x) 14» = ~ L e-ikx+ik'x (4)1 atak' 14» k,k' = 1 V N (3.1.2) Lnk = V =n. k The density in the state (3.1.1) is independent of position. The pair distribution function is given by (3.1.3) The expectation value (4)1 at~q,k' 14» differs from zero only if k = k' and q = q', or k = q' and q = k'. The case k = q, which, in contrast to fermions, is possible for bosons, has to be treated separately. Hence, it follows that 56 3. Bosons (4)1 at~q,k' 14» = (1 - t5kq) (t5kk,t5qq, (4)1 at~qk 14» + t5kq,t5qk, (4)1 at~kq 14») + t5kqt5kk,t5qq' (4)1 atatakak 14» = (1 - t5kq)(t5kk,t5qq' + t5kq,t5qk, )nknq + t5kqt5kk,t5qqrnk(nk - 1) (3.1.4) and (3.1.5) = :2 - (1 {~)1 t5kq ) + e-i(k-q)(x-x'))nknq + L nk(nk - k,q = :2 {Lnknq k,q + Ln~ 1)} k Ln~ + ILe-ik(X-X')nkI2 k k Ln~ k - Lnk} k ~ n' + I~ ~ k e-ik(X-X')nkl' - : ' ~ nk(nk + 1) . In contrast to fermions, the second term here is positive due to the permutation symmetry of the wave function. For fermions, there is no multiple occupancy so the last term does not arise. We now consider two examples. When all the bosons occupy the same state Po, then (3.1.5) yields: 1 n2g(x-x')=n2+n2-V2N(N+1)= N(N - 1) V2 . (3.1.6) In this case, the pair distribution function is position independent; there are no correlations. The right-hand side signifies that the probability of detecting the first particle is N/V, and that of the second particle (N - 1)/V. If, on the other hand, the particles are distributed over many different momentum values and the distribution is described, e.g., by a Gaussian (3.1. 7) with the normalization 3.1 Free Bosons 57 it then follows that J d3k -1'k(x-x ') _ (21f)3 e nk - ne __ -~ L).2( x-x ')2 e 'k0 ( x-x ') -1 and If the density and the width .1 of the momentum distribution are held fixed, then, in the limit of large volume V, the third term in (3.1.5) disappears. The pair distribution function is then given by n 2 g(x _ x') = n 2 (1 + e- ~2 (X_XI)2) . (3.1.8) As can be seen from Fig. 3.1, for bosons the probability density of finding g(x - x') 2 1 -.~, 0o'-~ 2 3 4 Lllx - x'Wig. 3.1. Pair distribution function for bosons two particles at a small separation, i.e., r < .1-1, is increased. Due to the symmetry of the wave function, bosons have a tendency to "cluster together" . From Fig. 3.1, one sees that the probability density of finding two bosons at exactly the same place is twice that at large separations. *3.1.2 Two-Particle States of Bosons In order to investigate the consequences of Bose-Einstein statistics further, we now turn to boson interference and fluctuation processes. Such interference can already be found in two-particle states. The general two-particle state is (3.1.9) with the normalization 58 3. Bosons (3.1.10) We could have restricted ourselves from the outset to symmetric 'P(Xl, X2) since [1/!t(Xl).'l/,t(X2)] = 0 and thus the odd part of 'P(Xl,X2) makes no contribution. In the following, we shall consider functions 'P(Xl, X2) of the form (3.1.11) Had the particles been distinguishable, for such a wave function, they would have been completely independent. Furthermore, we assume (3.1.12) and then the normalization condition (3.1.10) yields: (3.1.13) with ('Pi, 'Pj) == J d 3x<pi (X)'Pj (x). For the two-particle state (3.1.9) with (3.1.13)1, the expectation value of the density is (21 n(x) 12) = J d 3xl d3x2X~ d3x~'Pr (Xl)'P~ (x~) r + 1('Pl, 'P2)1 2 l (01/!X2)ltx~ = [I'PI (x) 12 + 1'P2 (x) 12 + ('PI, 'P2)~ 2 -1 x [1 + 1('Pl,'P2)1] . x [1 (X2)'PI (X~)'P2 10) (X)'PI (x) + c.c.] (3.1.14) In (3.1.14), in addition to l'Pl(X)1 2 + 1'P2(X)1 2, an interference term occurs. When the two single-particle wave functions are orthogonal, i.e., ('PI, 'P2) = 0, the density (3.1.15) equals the sum of the single-particle densities, as would be the case for independent particles. For overlapping Gaussians, it is easy to calculate the clustering effect for bosons. Let (3.1.16) 1 The Schrodinger two-particle wave function corresponding to (3.1.9) with )'P2(X2)+'P2 (Xl )'Pl (X2) (3. 1. 13) reads 'PI (XlV2(HI('Pl,'P2)1 2)l/2' 3.1 Free Bosons with the properties ('Pi,'Pi) = 1 and ('P1,'P2) = ,fir I dxe- x2 - a2 these states the density expectation value (3.1.14) is (21 n(x) 12) = 1 + e- 2a2 ) y'7f(1 {e-(X-a)2 = 59 e- a2 ; for + e-(x+a)2 + 2e- 2a2 e- x2 } . (3.1.17) The integrated density is equal to the number of particles. Figure 3.2 shows (21 n(x) 12) for the separations a = 3 and a = 1. For the smaller separation the wave functions overlap and, for small x, the particle density is greater than it would be for independent particles. 0.8 1'P112 + 1'P212 (2In(x)12) 0.6 ,/~ /1 \~,./' I~' 0.4 1/ :/ '/ :/ /I~\ \ \_,,' I '/ ,I 0.2 0.0 -6 -4 -2 0 2 4 6 Fig. 3.2. Densities for two-boson states. The full line is the case a = 3. Since there is no overlap here, 1'P112 + 1'P212 and (n(x)) are indistinguishable from one another. The dashed lines are for a = 1: in this case (21 n(x) 12) is increased at small separations in comparison to 'Pi + 'P~ Photon Correlations Photons represent the ideal example of noninteracting particles. In photon correlation experiments it has actually been possible to observe the predicted tendency of bosons to cluster together. 2 These correlation effects can be understood theoretically with the help of pair correlations of the form (3.1.8).3 Since the classical electromagnetic waves of Maxwell's theory are coherent 2 3 R. Hanbury Brown and R.G. Twiss, Nature 177, 27 (1956); 178, 1447 (1956) E.M. Purcell, Nature 178, 1449 (1956) 60 3. Bosons 40 hcp ; 30 I ~ :E... 20 0... He II 10 superfluid Hel normal fluid O~-gaseou o 1 234 T[K] 5 6 Fig. 3.3. The phase diagram of He 4 . The solid phases are: hcp (hexagonal close packed) and bcc (body centered cubic). The fluid region is divided into a normal (He I) and a superfluid (He II) phase states of photons in quantum mechanics, it is not surprising that these correlation effects also follow from classical electrodynamics. 4 3.2 Weakly Interacting, Dilute Bose Gas 3.2.1 Quantum Fluids and Bose-Einstein Condensation The most important Bose fluid is He 4 , which has spin S = O. Another example is spin-polarized atomic hydrogen; this, however, is extremely difficult to produce for long enough periods at sufficient density. All other atomic bosons are heavier and more strongly interacting, causing them to crystallize at temperatures far above any possible superfluid transition. At normal pressures, He 4 remains fluid down to T = 0 and at the lambda point T>. = 2.18K it enters the superfluid state (Fig. 3.3). The normal and the superfluid phases are also known as He I and He II. In order for He 4 to crystallize, it must be subjected to a pressure of at least 25 bar. Although they are rare in comparison to Fermi fluids, which are realized in He 3 and by every metal, Bose fluids are a rewarding topic of study due to their fascinating properties. Corresponding to superfluidity there is the superconducting phase in fermion systems. He 3 , electrons in metals, and electrons in a number of oxidic high-Tc perovskites can form pairs of fermions that obey Bose statistics. Real helium 4 Discussions of the Hanbury-Brown and Twiss experiments can be found in C. Kittel, Elementary Statistical Physics, p. 123, J. Wiley, New York, 1958 and G. Baym, Lectures on Quantum Mechanics, p. 431, W.A. Benjamin, London, 1973 3.2 Weakly Interacting, Dilute Bose Gas 61 is only mimicked by an ideal Bose gas since, in additions to quantum effects, it is also plagued by the difficulties associated with a dense fluid. In an ideal (i.e., noninteracting) Bose gas at temperatures below Tc( v) = [271"n2 ~3 (for v·2.61 the mass and density of He4 this gives 3.14 K) Bose-Einstein condensation occurs 5 . The single-particle ground state becomes macroscopically occupied in conjunction with the disappearance of the chemical potential j.L -+ o. In reality, He 4 atoms have approximately a Lennard-Jones potential, (3.2.1) E (J = 1.411 x 1O-15 erg = 2.556A . It consists of a repulsive (hard-core) part and an attractive component. At small separations the potential (3.2.1) is equivalent to the potential of an almost ideal hard sphere of diameter 2 A. For fcc close packing of spheres, this would correspond to a molar volume of 12 cm3 , whereas the actually observed molar volume at P = 30 bar is 26 cm3 . The reason for this higher value lies in the large amplitude of the quantum-mechanical zero-point oscillations. In the fluid phase VM = 27 cm3 . The various phases of He 4 and He3 are also known as quantum fluids or quantum crystals. Note: Recently, Bose-Einstein condensation has been observed, 70 years after its original prediction, in a gas of about 2000 spin-polarized 87Rb atoms confined in a quadrapole trap. 6 7 The transition temperature is 170 x 1O- 9 K. One might expect that at low temperatures alkali atoms would form a solid; however, even at temperatures in the nano-Kelvin regime, it is possible to maintain a metastable gaseous state. A similar experiment has been carried out with a gas of 2 x 10 5 spin-polarized 7Li atoms. 8 In this case, the condensation temperature is Tc ~ 400 x 1O- 9 K. In 87Rb the s-wave scattering length is positive, whereas in 7Li it is negative. Despite this, the gaseous phase of 7Li does not collapse into the fluid or the solid phase, not, at least, in the spatially inhomogeneous case. 8 Bose-Einstein condensation has also been observed in sodium in a sample of 5 x 105 atoms at a density of 1014 cm- 3 and temperatures below 2p,K. 9 See for instance F. Schwabl, Statistical Mechanics, Springer, Berlin Heidelberg, 2002, Sect. 4.4; in subsequent citations this book will be referred to as SM. 6 M.H. Andersen, J.R. Enscher, M.R. Matthews, C.E. Wieman, and E. A. Cornell, Science 269, 198 (1995) 7 See also G.P. Collins, Physics Today, August 1995, 17 8 C.C. Bradley, C.A. Sackett, J.J. Tollett, and R.G. Hulet, Phys. Rev. Lett. 75, 1687 (1995) 9 K. B. Davis, M.-O. Mewes, M.R. Andrews, N.J. van Druten, D.S. Durfee, D.M. Kurn, and W. Ketterle, Phys. Rev. Lett. 75, 2969 (1995) 5 62 3. Bosons 3.2.2 Bogoliubov Theory of the Weakly Interacting Bose Gas In the momentum representation, the Hamiltonian reads: (3.2.2) where we have set Ii = 1. This Hamiltonian is still completely general, but in the following we will introduce approximations which restrict the validity of the theory to dilute, weakly interacting Bose gases. The creation and annihilation operators at and ak satisfy the Bose commutation relations and Vq is the Fourier transform of the two-particle interaction Vq = J d3 xe- iqx V(x). (3.2.3) At low temperatures, a Bose-Einstein condensation takes place in the k = 0 mode, i.e., in analogy to the ideal Bose gas it is expected that in the ground state lO 10) the single-particle state with k = 0 is macroscopically occupied, No = (01 a~o 10) ;S N , (3.2.4a) and thus the number of excited particles is N - No « No ;S N . (3.2.4b) Hence, we can neglect the interaction of the excited particles with one another and restrict ourselves to the interaction of the excited particles with the condensed particles: H = 1 k2 -atak + V Voa~ 2m 2 +VL 1 ~"'"" Vk(aka_kaOao t t 2V t t 3 + aOaoaka-k) + O(ak) L k + 1 k , (Vo + Vk)a~ot . (3.2.5) k The prime on the sum indicates that the value k = 0 is excluded. Due to momentum conservation, there is no term containing ak#O and three operators with k = O. The effect of ao and a~ on the state with No particles in the condensate is ao INo , ... ) a~ lNo, ... ) = ~INo -1, ... ) = J No + 11No + 1, ... ) (3.2.6) Since No is such a huge number, No >:::: 1023 , both of these correspond to multiplication by ~. Furthermore, it is physically obvious that the removal 10 Here, 10) is the ground state of the N bosons and not the vacuum state with respect to the ak, which would contain no bosons at all. It will emerge that 10) is the vacuum state for the operators Qk to be introduced below. 3.2 Weakly Interacting, Dilute Bose Gas 63 or addition of one particle from or to the condensate will make no difference to the physical properties of the system. In comparison to No, the effect of the commutator aoab - abao = 1 is negligible, i.e., the operators ao = ab = Fa (3.2.7) can be approximated by a c-number. The Hamiltonian then becomes ,,' k2 t -akak k 2m H =L 1 2 + -V No Vo 2 tIt t + Vk)akak + "2 Vk(aka_ k No,,' L [(Vo + 11 (3.2.8) + aka-k)] + ... k The value of No is unknown at the present stage. It is determined by the density (or the particle number for a given volume) and by the interaction. We express No in terms of the total particle number N and the number of particles in the excited state N = No + 2:' atak (3.2.9) . k (3.2.10) The Hamiltonian follows as N' N 2:'k-atak + V 2: Vkatak + -Vo 2m 2V N , + 2V L , 'Vk (t aka_t k + aka-k ) + H' . 2 H = 2 k k (3.2.11 ) k The operator H' contains terms with four creation or annihilation operators, and these are of order n,2, where n' = N -VNo is the density of the particles that are not part of the condensate. The Bogoliubov approximation, which amounts to neglecting these anharmonic terms, is a good approximation when n' « n. We shall see later, when we calculate n', that exactly this condition is fulfilled by the dilute, weakly interacting Bose gas. H H' is neglected, we have a quadratic form, which still has to be diagonalized. The transformation proceeds in analogy to the theory of antiferromagnetic magnons. We introduce the ansatz l l 11 The transformation is known as the Bogoliubov transformation. This diagonalization method was originally introduced by T. Holstein and H. Primakoff (Phys. Rev. 58, 109S (1940)) for complicated spin-wave Hamiltonians and was rediscovered by N.N. Bogoliubov (J. Phys. (U.S.S.R.) 11,23 (1947)). 64 3. Bosons ak = Ukak + vka~ (3.2.12) + Vka_k at = Uk at with real symmetric coefficients, and demand that the operators a also satisfy Bose commutation relations (3.2.13) This is the case when (3.2.14) Proof: + VkUk' (-bk,-k') = 0 UkUk,bkk' + VkVk,(-bkk') = (u~ - v~)bk' [ak, ak' 1= UkVk,bk,-k' [ak,at,] = . The inverse of the transformation (3.2.12) reads (see Problem 3.3): ak = Ukak - at = vka~ (3.2.15) ukat - Vka_k . With the additional calculational step atak = u~atk at~k = u~atk aka_k = u~ak_ + v~a_k + v~ak_ + v~atk + ukv(at~ + ak1Lk) + ukvk(atak + a_k~) + ukv(a~_ + akat) , one obtains for the Hamiltonian 1 2 H= 2vN Vo + + 2:' (;: +nVk) k +; 2:'Vk [( u~ + v~) ( at~k + aka-k) + 2UkVk( atak + akat)] k (3.2.16) In order for the nondiagonal terms to disappear, we require 2 k 2 (2m - + nVk)UkVk + -2n Vk(Uk2 + Vk) = 0. (3.2.17) Together with u~ - v~ = 1 from (3.2.14), one now has a system of equations that allow the calculation of u~ and v~. It is convenient to introduce the definition 3.2 Weakly Interacting, Dilute Bose Gas [(;~ Wk" +nVk)' - (nVk),r From (3.2.14) and (3.2.17), one finds u~ Wk 2 _ + (~ 2Wk = -'~ -Wk \ and v~ nk~vf' (3.2.18) to be (Problem 3.4) + n Vk ) Uk v~ ~ [(;~) 65 + U: + ' (3.2.19) nVk) 2Wk Inserting (3.2.19) into the Hamiltonian yields: N2 2V , l ' k2 2 k 2m 2: wkatak H= -Yo - - 2: ( - +nVk -Wk)+ (3.2.20) k '" energy Eo ground-state '---v----' sum of oscillators ~ quasiparticles The Hamiltonian consists of the ground-state energy and a sum of oscillators of energy Wk. The excitations that are created by the are called quasiparticles. The ground state of the system 10) is fixed by the condition that no quasiparticles are excited, at ak 10) = 0 for all k. (3.2.21 ) We can now calculate the number of particles (not quasiparticles) outside the condensate N' = (012:' atak 10) = (01 2:' v~akt k k For a contact potential V(x) that N' n' == V = ~(An)3/2 3/2 37r 2 10) = 2:' v~ . (3.2.22) k = .XO(x), it follows by using (3.2.18) and (3.2.19) . (3.2.23) The expansion parameter is An, i.e., the strength of the potential times the density. If this expansion parameter is small, consistent with the assumptions made, the density of particles outside the condensate is low. The dependence on An is nonanalytic and thus cannot be expanded about An = O. Hence, 66 3. Bosons these results for condensed Bose systems cannot be obtained using straightforward perturbation theory for the initial Hamiltonian (3.2.2). The number of particles in the condensate is No = N - n'V. Its temperature dependence No(T) is studied in Problem 3.5. The ground-state energy (3.2.20) is composed of a term that would be the interaction energy if all particles were in the condensate, and a further negative term. Through the occupation of k =1= 0 Bose states in the ground state (see (3.2.22)), the kinetic energy is increased, whereas the potential energy is reduced. Excited states of the system are obtained by applying to the ground state 10). Their energy is Wk. For small k one finds from (3.2.18) at ck Wk = with C -- JnmVo. (3.2.24) Thus, the long wavelength excitations are phonons with linear dispersion. This value for the sound velocity also follows from the compressibility v __ ,,- -.L ov. v oP' (3.2.25) Here, p = mn is the mass density and the pressure at zero temperature is given by p= - oEo oV . (3.2.26) For large k, one obtains from (3.2.18) k2 Wk = 2m +nVk (3.2.27) . This corresponds to the dispersion relation for free particles whose energy is shifted by a mean potential of nVk (see Fig. 3.4). A comparison with the experimental excitation spectrum of He 4 is not justified on account of the restriction to weak interaction and low density; in particular, one cannot attempt to explain the rot on minimum (see Sect. 3.2.3) in terms of the k dependence of the potential, since this would require potential strengths outside the domain of validity of this theory (see Problem 3.6). When is applied to a state, one speaks of the creation of a quasiparticle with the wave vector k. We shall show furthermore that, for small k, the excitation of a quasiparticle corresponds to a density wave. To this end, we consider the operator for the particle number density at nk = 2:= abap+k ~ JNa(~k + ak) (3.2.28) p under the assumption of a macroscopic occupation of the k = 0 state. 3.2 Weakly Interacting, Dilute Bose Gas 67 Fig. 3.4. Excitations of the weakly interacting Bose gas k From Eq. (3.2.12) it follows that ak + a~k = (Uk + Vk)(O:k + O:~k) and therefore nk = Ak (O:k + O:~k) (3.2.29) From Eq. (3.2.19) the amplitude Ak takes the form Ak == v'No(Uk + Vk) = kV2 No mWk . From Eq. (3.2.29) one obtains the density operator p(x) = p(x) + pt (x) (3.2.30a) in which p(x) = Lk AkeikxO:k , from which it follows that p(x) ( o:t 10)) = 2)ik/ Ak/o:k/o:t 10) = eikx Ak 10) . X (3.2.30b) k' For a coherent state ICk) built out of quasi-particle excitations with wave vector k (3.2.31a) and in which Ck = ICkl e- iOk one gains p(x) ICk) = Akckeikx ICk) . From this it follows that the expectation value of the density (3.2.31b) so that a coherent state of this type represents a density wave. 68 3. Bosons Notes: (i) Second-order phase transitions are associated with a broken symmetry. In the well known case of a Heisenberg ferromagnet, this symmetry is the invariance of the Hamiltonian with respect to the rotation of all spins. In the ferromagnetic phase, where a finite magnetization is present, oriented, e.g., in the z direction, the rotational invariance is broken. In the case of the Bose-Einstein condensate the gauge invariance is broken, i.e., the invariance of the Hamiltonian with respect to transformation of the field operator 'I/J(x) -+ 'I/J'(x) = 'I/J(x)e ia (3.2.32) with a phase a. In the ground state 10), one has (01 'I/J(x) 10) -=I- 0 and the phase is fixed arbitrarily at a = O. (ii) For finite-ranged potentials, e.g., the spherical well of Problem 3.6, the Fourier transform falls off with increasing wave vector k, leading to a finite ground-state energy in (3.2.20). For the 8-function potential the Fourier transform is a constant, which leads to a divergence at the upper integration limit. To ensure that the ground-state energy Eo also remains finite for an effective contact potential, the potential strength A must be replaced by the (finite) scattering length a. In second-order Born approximation, the scattering length is given in terms of A by a m A {A~'m = 47fn2 1- V ~ k2 + 0('\ 2 ) } or, inversely, (3.2.33) (see Problem 3.8). Inserting this into (3.2.16) shows that Vo and Vk must be replaced, here and in all subsequent formulas, by 2a 47fn2a 47fn Vr0 - + -- { 1+ -m V L' -1 } k k2 (3.2.34a) and 47fn2a Vik - + --· m (3.2.34b) For the interaction of the excited particles it is sufficient to retain only terms up to first order in a. The value of the ground-state energy is then 2 2 3 _27fn Eo - - -aN - - { 1 +128 - - (a - N)1/2} . m V 15y7r V (3.2.35) 3.2 Weakly Interacting, Dilute Bose Gas 69 *3.2.3 Superfluidity Superfluidity refers to a state in which the fluid can flow past objects without exerting a drag and where objects can move through the fluid without slowing down. This property holds only up to a certain critical velocity which we will now relate to the quasiparticle spectrum. The excitation spectrum of real helium, as derived from neutron scattering measurements, displays, according to Fig. 3.5, the following characteristics. For small p, the excitation energy varies linearly with the momentum Ep = cp. (3.2.36a) ,, 20 ~ ~4> ~ c. 'S \,u ~~ , I" I I ~ 10,i § , '.g ~ I l' I I » ,--: . • I ... 0 I ... , 0 00 °tJ , ., I S I I I I °0~-2 Wave vector pili [A -1] Fig. 3.5. The quasiparticle excitations in superfluid He 4 . Phonons and rotons according to Henshaw and WOOdS 12 The excitations in this region are called phonons; their velocity - the sound velocity - is c = 238 m/s. The second characteristic feature of the excitation spectrum is the minimum at Po = 1.91 A-IlL Here, the excitations are referred to as rotons and can be described by the dispersion relation Ep = Ll + (Ipl - PO)2 2p, (3.2.36b) with the effective mass p, = O.16mHe and the energy gap Ll/k = 8.6K. The condensation of helium and the resulting quasiparticle dispersion relation ((3.2.36a,b), Fig. 3.5) has essential consequences for the dynamical behavior of He 4 in the He-II phase. It leads to superfluidity and to the two-fluid model. To see this, we consider the flow of helium through a tube in two different inertial frames. In frame K, the tube is at rest and the fluid moves with a velocity -v. In frame Ko, the helium is at rest and the tube moves with a velocity y (see Fig. 3.6). 12 D.G. Henshaw and A.D. Woods, Phys. Rev. 121, 1266 (1961) 70 3. Bosons If" " " " " "Hell System Ko (He-Ruhsystem) " u "" " " II" II Fig. 3.6. Superfluid helium in the rest frame of the tube (laboratory frame, K) and in the rest frame of the fluid, Ko. The total energies (E, Eo) and the total momenta (P, Po) in the two frames (K,Ko) are related to one another through a Galilei transformation: P = Po -Mv (3.2.37a) E = Eo - Po . v Mv 2 + -2- (3.2.37b) , where we have introduced One can derive (3.2.37a,b) by using the Galilei transformation of the individual particles Xi Pi = = XiO - vt Pio - mv. Thus, P = L Pi = L(PiO - my) = Po - Mv . The energy transforms as follows: E = L 2~P +L = L; (~O -v i = L i = Eo V(Xi - Xj) r (i,j) i -P70 2m P O· v - Po . v +L V(Xio - Xjo) (i,j) + -M2v 2 + L V( XiO - XjO ) (i,j) + -M2 v 2 In a normal fluid, any flow that might initially be present will be degraded by frictional losses. When viewed in the frame Ko , this means that, in the fluid, excitations are created which move with the wall of the tube, such that more and more fluid is pulled along with the moving tube. Seen from the tube frame K, the same process can be interpreted as a deceleration of the fluid flow. In order that such excitations actually occur, the energy of the 3.2 Weakly Interacting, Dilute Bose Gas 71 fluid must simultaneously decrease. We now have to examine whether, for the particular excitation spectrum of He-II, Fig. 3.5, the moving fluid can reduce its energy through the creation of excitations. Is it energetically favorable for quasiparticles to be excited? We first consider helium at the temperature T = 0, i.e., in the ground state. In the ground state the energy and momentum in the frame Ko are given by Eg and Po O. = (3.2.38a) Thus, in K, these quantities are E9 = Eg Mv +2 P = - Mv . and 2 (3.2.38b) If a quasiparticle with momentum P = nk and energy E(p) = the energy and momentum in the frame Ko have the values Eo = Eg + E(p) and Po = P , nwk is created, (3.2.38c) whence, from (3.2.37a,b), the energy in K follows as Mv 2 E=Eg+E(p)-p,v+2- and P=p-Mv. (3.2.38d) The excitation energy in K (the tube frame) is thus 11E = E(p) - P . v . (3.2.39) Here, 11E is the energy change in the fluid due to the creation of an excitation in the tube frame K. Only when 11E < 0 does the flowing fluid lose energy. Since E - pv has its smallest value when p is parallel to v, the inequality E (3.2.40a) v>p must be satisfied in order for an excitation to occur. From (3.2.40a) and the experimental excitation spectrum, one obtains the critical velocity (Fig. 3.7) Vc=(~) P ..- . ~60m/s. (3.2.40b) mm ....- ..- ..- ..vc=(£/P)min = ~/p """",.. ii p Fig. 3.7. Quasiparticles and critical velocity 72 3. Bosons If the flow velocity is smaller than V e , no quasiparticles are excited and the fluid flows unimpeded and loss-free through the tube. This is the phenomenon of superfluidity. The existence of a finite critical velocity is closely related to the form of the excitation spectrum, which has a finite group velocity at p = 0 and is everywhere greater than zero (Fig. 3.7). The value (3.2.40b) of the critical velocity is observed experimentally for the motion of ions in He-II. The critical velocity for flows in capillaries is much smaller than V e , since vortices already occur at lower velocities. Such excitations have not been considered here. Problems 3.1 Consider the following two-particle boson state a) Confirm the normalization condition (3.1.10). b) Verify the result (3.1.14) for the expectation value (21 n(x) 12) on the assumption that CP(XI, X2) ex CPI (XI)CP2(X2). 3.2 The Heisenberg model of a ferromagnet is defined by the Hamiltonian H = -~ L J(II-I'I)SI . SI' , 1,1' where I and I' are nearest neighbor sites on a cubic lattice. By means of the HolsteinPrimakoff transformation st = V2Scp(ni)ai , = V2Sa! cp(ni) , = S - ni, = Sf ±sy, cp(ni) = V1- n;j2S, SiSf with S;ni = a;ai and [ai,a}] = dij, [ai,aj] = 0 one can express the Hamiltonian in terms of Bose operators ai. a) Show that the commutation relations for the spin operators are satisfied. b) Write down the Hamiltonian to second order (harmonic approximation) in terms of the Bose operators ai by regarding the square-roots in the above transformation as a shorthand for the series expansion. c) Diagonalize H (by means of a Fourier transformation) and determine the dispersion relation of the spin waves (magnons). 3.3 Confirm the inverse (3.2.15) of the Bogoliubov transformation. 3.4 By means of the Bogoliubov transformation, the Hamiltonian of the weakly interacting Bose gas can be brought into diagonal form. One thereby finds the Problems 73 condition (3.2.17): k2 +nVk ) Uk V k+2"Vk n (2 2) ( 2m Uk+vk =0. Confirm the results (3.2.18) and (3.2.19). 3.5 Determine the temperature dependence of the number of particles in the condensate, No(T), for a contact potential Vk = A. a) Proceed by first calculating the thermodynamic expectation value of the particle number operator N = I:k a~k by rewriting it in terms of the quasiparticle operators where 'Y D:k. = {3~, Unb) One finds (in the continuum limit: k2 = k5 = 4mnA, (3 = 1/k B T, ioroo dy e-r:n- l' -& I:k --+ f (g:~3) and with y = xVX2 + 1. b) Show that, for low temperatures, the depletion of the condensate increases quadratically with temperature No(T) V = No(O) _ V ~(k T)2 12c B , j!if-. Also, discuss the limiting case of high temperatures and compare where c = the result obtained with the results from the theory of the Bose-Einstein condensation of noninteracting bosons below the transition temperature. Lit.: R.A. Ferrell, N. Menyhard, H. Schmidt, F. Schwabl and P. Szepfalusy, Ann. Phys. (N.Y.) 47, 565 (1968); Phys. Rev. Lett. 18, 891 (1967); Phys. Letters 24A, 493 (1967); K. Kehr, Z. Phys. 221, 291 (1969). 3.6 a) Determine the excitation spectrum Wk of the weakly interacting Bose gas for the spherical well potential V(x) = A'8(R - Ixl). Analyze the limiting case R --+ 0 and compare the result with the excitation spectrum for the contact potential Vk = A. The comparison yields A = A'R3. b) Determine the range of the parameter koR, where k5 = 4mnA, in which the excitation spectrum displays a "roton minimum". Discuss the extent to which this parameter range lies within or outside the range of validity of the Bogoliubov theory of weakly interacting bosons. Hints: Rewrite the spectrum in terms of the dimensionless quantities x = k/k o and y = koR and consider the derivative of the spectrum with respect to x. The condition for the derivative to vanish should be investigated graphically. 4; 3.7 Show that (3.2.11) yields the Hamiltonian (3.2.16), which in turn leads to (3.2.20). 74 3. Bosons 3.8 Ground-state energy for bosons with contact interaction Consider a system of N identical bosons of mass m interacting with one another via a two-particle contact potential, Pi + oX. E O(Xi E -2m N H = 2 Xj) . i<j 0=1 In the limit of weak interaction, the Bogoliubov transformation can be used to express the Hamiltonian in the form The ground-state energy 2 Eo = N oX. _ 2V E' (~ k 2m + noX. - Wk) diverges at the upper integration limit (ultraviolet divergence). The reason for this is the unphysical form of the contact potential. The divergence is removed by introducing the physical scattering length, which describes the s-wave scattering by a short-range potential (L.D. Landau and E.M. Lifshitz, Course of Theoretical Physics, Vol. 9, E.M. Lifshitz and E.P. Pitaevskii, Statistical Physics 2, Pergamon Press, New York, 1981, § 6). Show that the scattering amplitude f of particles in the condensate is given, in first-order perturbation theory, by a := - f(k 1 = k2 = k3 = k4 = 0) = 2 m oX. { 1 - V oX. """" 47rn ~ km2 2 } + O(oX.) . Eliminate oX. from the expression for the ground-state energy by introducing a. For small values of afro, where ro = (N/V)-1/3 is the mean separation of particles, show that the ground-state energy is given by Eo = N27rn2an m {I + ~ 15J7f (~)3/2} . ro Calculate from this the chemical potential p, c = J~, p = mn, p = _ aEo av' = !!f# and the sound velocity c 4. Correlation Functions, Scattering, and Response In the following, we shall investigate the dynamical properties of manyparticle systems on a microscopic, quantum-mechanical basis. We begin by expressing experimentally relevant quantities such as the inelastic scattering cross-section and the dynamical susceptibility (which describes the response of the system to time-dependent fields) in terms of microscopic entities such as the dynamical correlation functions. General properties of these correlation functions and their interrelations are then derived using the symmetry properties of the system, causality, and the specific definitions in terms of equilibrium expectation values. Finally, we calculate correlation functions for a few physically relevant models. 4.1 Scattering and Response Before entering into the details, let us make some remarks about the physical motivation behind the subject of this chapter. If a time-dependent field Eei(kx-wt) is applied to a many-particle system (solid, liquid, or gas), this induces a "polarization" (Fig. 4.1) P(k,w)ei(kx-wt) + P(2k, 2w)e i2 (kx-wt) + .... (4.1.1) The first term has the same periodicity as the applied field; in addition, nonlinear effects give rise to higher harmonics. The linear susceptibility is Fig. 4.1. An external field E(x, t) induces a polarization P(x, t) 76 4. Correlation Functions, Scattering, and Response defined by X (k )- ,w - i~o r P(k,w) E ' (4.1.2) which is a property of the unperturbed sample and must be expressible solely in terms of quantities that characterize the many-particle system. In this chapter we will derive general microscopic expressions for this type of susceptibility. Another possibility for obtaining information about a many-particle system is to carry out scattering experiments with particles, e.g., neutrons, electrons, or photons. The wavelength of these particles must be comparable with the scale of the structures that one wants to resolve, and their energy must be of the same order of magnitude as the excitation energies of the quasiparticles that are to be measured. An important tool is neutron scattering, since thermal neutrons, as available from nuclear reactors, ideally satisfy these conditions for experiments on solids. 1 Since neutrons are neutral, their interaction with the nuclei is short-ranged; in contrast to electrons, they penetrate deep into the solid. Furthermore, due to their magnetic moment and the associated dipole interaction with magnetic moments of the solid, neutrons can also be used to investigate magnetic properties. We begin by considering a completely general scattering process and will specialize later to the case of neutron scattering. The calculation of the inelastic scattering cross-section proceeds as follows. We consider a many-particle system, such as a solid or a liquid, that is described by the Hamiltonian Ho. The constituents (atoms, ions) of this substance are described by coordinates Xa which, in addition to the spatial coordinates, may also represent other degrees of freedom. The incident particles, e.g., neutrons or electrons, which are scattered by this sample, have mass m, spatial coordinate x, and spin ms· The total Hamiltonian then reads: p2 H = Ho + 2m + W({xa},x). (4.1.3) This comprises the Hamiltonian of the target, H o, the kinetic energy of the incident particle, and the interaction between this projectile particle and the target, W({xa},x). In second quantization with respect to the incident particle, the Hamiltonian reads: 1 The neutron wavelength depends on the energy according to ..\(nm) and thus ..\ = O.18nm for E = 25meV ~ 290K. = ~ yE(eY) 4.1 Scattering and Response 77 p2 H=Ho+-+ 2m k'k"a'a" (4.1.4) where at, 0"' (ak" 0"") creates (annihilates) a projectile particle. We write the eigenstates of Ho as In), i.e., Ho In) = En In) . Fig. 4.2. Inelastic scattering with momentum transfer k /;2 2 2 transfer hw = 2m (k 1 - k 2 ) (4.1.5) = kl - k2 and energy In the scattering setup sketched in Fig. 4.2, a particle with wave vector kl and spin msl is incident on a substance initially in the state Inl). Thus, the initial state of the system as a whole is Ikl' msl, nl). The corresponding final state is Ik 2,ms2,n2). Due to its interaction with the target, the incident particle is deflected, i.e., the direction of its momentum is changed and, for inelastic scattering, so is the length of its wave vector (momentum). If the interaction is spin dependent, the spin quantum number may also be changed. The transition probability per unit time can be obtained from Fermi's golden rule 2 r(k1,msl,nl --+ k 2,m s 2,n2) 27r 2 = Ii l(k2, ms2, n21 W Ikl' msl, nl)1 8(Enl - En2 + hw) . (4.1.6) Here, (4.1.7a) 2 See, e.g., QM I, Eq. (16.40) 78 4. Correlation Functions, Scattering, and Response and (4.1.7b) are the energy and momentum transfer of the projectile to the target, and (4.1.7c) the final energy of the particle. The matrix element in the golden rule becomes (4.1.8) We take the distribution of initial states to be p(nl) with Ln1 p(nt} = 1 and the distribution of the spin states of the particle to be ps(mst} with Lm 81 ps(mst} = 1. If only k2 is measured, and the spin is not analyzed, the transition probability of interest is (4.1.9) The differential scattering cross-section (effective target area) per atom is defined by d2 (7 dQd dQdE E = probablility of transition into dQdE/S . number of scatterers x flux of incident particles (4.1.10) Here, dQ is an element of solid angle and the flux of incident particles is equal to the magnitude of their current density. The number of scatterers is Nand the normalization volume is L3. The states of the incident particles are ./, () 'l'k1 X 1 ik = L3/2 e 1 X (4.1.11) . The current density follows as: j(x) = -ifi('IjJ*V'IjJ _ (V'IjJ*)'IjJ) = fiki , 2m mL3 (4.1.12) and for the differential scattering cross-section one obtains (4.1.13) since the number of final states, i.e., the number of k2 values in the interval d3 k 2 is d3 k 2 . With f = ~!, it follows that dE = fi 2 k2 dk 2 /m and d3 k 2 = ~ k2 dE dQ. U;J3 4.1 Scattering and Response 79 We thus find (m)2 k2 L6 d 2a 27rn2 k1 N dfldE - x L (4.1.14) p(n1)ps(ms I) l(k 1,m s1,n11 W Ik 2 ,m s2,n2)1 2 6(Enl - En2 + flw) . nl,n2 ffi s l,m s 2 We now consider the particular case of neutron scattering and investigate the scattering of neutrons by nuclei. The range of the nuclear force is R ~ 1O-12 cm and thus k1R ~ 10- 4 « 1 and, therefore, for thermal neutrons one has only s-wave scattering. In this case, the interaction can be represented by an effective pseudopotential W( {xa }, x) 27rn2 =- m L N (4.1.15) aa6(xa - x) , a=l to be used within the Born approximation, where aa are the scattering lengths of the nuclei. This yields: Here, we have used (4.1.17) and the fact that the interaction is independent of spin. Written out explicitly, the expression (4.1.16) assumes the form L aaaj3\. .. e- ikx " ... )( ... e ikx (3 ••• )6(En1 - En2 + flw) (4.1.16') aj3 and still has to be averaged over the various isotopes with different scattering lengths. One assumes that their distribution is random, i.e., spatially uncorrelated: (4.1.18) 80 4. Correlation Functions, Scattering, and Response This gives rise to a decomposition of the scattering cross-section into a coherent and an incoherent part 3 : d2 a dadE = AcohScoh(k,w) + AincSinc(k,w) . (4.1.19a) Here, the various terms signify (4.1.19b) and Scoh(k,w) = ~ LL p(nd (nIl e- ikx", In2) (n21 eikxi3 lnI) nln2 x 8(Enl - En2 + 1U.v) , af3 ~ Sinc(k,w) = LL 0: (4.1.20) p(nI) l(nIle- ikx", In2)1 2 nln2 x 8(Enl - En2 + 1U.v) , the suffices standing for coherent and incoherent. In the coherent part, the amplitudes stemming from the different atoms are superposed. This gives rise to interference terms which contain information about the correlation between different atoms. In the incoherent scattering cross-section, it is the intensities rather than the amplitudes of the scattering from different atoms that are added. It contains no interference terms and the information which it yields relates to the autocorrelation, i.e., the correlation of each atom with itself. For later use we note here that, for systems in equilibrium, = p(nI) e- f3Enl Z ' (4.1.21a) which corresponds to the density matrix of the canonical ensemble p = e- f3Ho /Z , Z = Tre- f3Ho . (4.1.21b) We shall also make use of the following representation of the delta function Jg: 8(w) = e iwt . (4.1.22) The coherent scattering cross-section contains the factor !. n J dt 27f = 2~n = _1_ 27fn 3 ei(Enl -En2 +nw)t/n (nIl e- ikx", In 2) J J dte iwt (nIl eiHot/ne-ikx"'e-iHot/n In 2) (4.1.23) dte iwt (nIl e-ikx",(t) In 2) . (4.1.24) See, e.g., L. van Hove, Phys. Rev. 95, 249 (1954) 4.1 Scattering and Response 81 Hence, by making use of the completeness relation one finds (4.1.25) The correlation functions in (4.1.25) are evaluated using the density matrix of the many-particle system (4.1.21), the thermal average of an operator 0 being defined by LT -(3Ho (0) = (nl 0 In) = Tr (pO) . (4.1.26) n One refers to Scoh(inc) (k, w) as the coherent (incoherent) dynamical structure function. Both contain an elastic (w = 0) and an inelastic (w i= 0) component. Using the density operator N p(x, t) = L o(x - xa(t)) (4.1.27) a=l and its Fourier transform (4.1.28) it follows from (4.1.25) that (4.1.29) Thus, the coherent scattering cross-section can be represented by the Fourier transform of the density-density correlation function, where hk is the momentum transfer and nw the energy transfer from the neutron to the target system. An important application is the scattering from solids to determine the lattice dynamics. The one-phonon scattering yields, as a function of frequency w, resonances at the values ±Wt, (k), ±wt2(k), and ±wl(k), the frequencies of the two transverse, and the longitudinal phonons. The width of the resonances is determined by the lifetime of the phonons. The background intensity is due to multiphonon scattering (see Sect. 4.7(i) and Problem 4.5). The intensity of the single-phonon lines also depends on the scattering geometry via the scalar product of k with the polarization vector of the phonons and via the Debye-Waller factor. As a schematic example of the shape of 82 4. Correlation Functions, Scattering, and Response Scoh(k,w) Wt W Fig. 4.3. Coherent scattering cross-section as a function of w for fixed momentum transfer k. Resonances (peaks) are seen at the transverse (±wt{k)) and longitudinal (±wl(k)) phonon frequencies, as well as at W = 0 the scattering cross section, we show in Fig. 4.3 Scoh for fixed k as a function of the frequency w. The resonances at finite frequencies are due to a transverse and a longitudinal acoustic phonon, and, furthermore, one sees a quasi-elastic peak at w = O. Quasi-elastic peaks may result from disorder and from relaxation and diffusion processes (Sect. 4.7(ii)). The coherent scattering cross-section is a source of direct information about density excitations, such as phonons in solids and fluids. The incoherent component is a sum of the scattering intensities of the individual scatterers. It contains information about the autocorrelations. For other scattering experiments (e.g., with photons, electrons, or atoms) one can likewise represent the scattering cross-section in terms of correlation functions of the many-particle system. We shall pursue the detailed properties of the differential scattering cross-section here no further. These preliminary remarks are intended mainly as additional motivation for the sections that are to follow, where we will see that the correlation functions and the susceptibility are related to one another. Causality will allow us to derive dispersion relations. Time-reversal invariance and translational invariance will yield symmetry relations, and from the static limit and the commutation relations we will derive sum rules. 4.2 Density Matrix, Correlation Functions The Hamiltonian of the many-particle system will be denoted by Ho and is assumed to be time independent. The formal solution of the Schrodinger equation in :t I'if, t) = Ho l'if, t) (4.2.1) 4.2 Density Matrix, Correlation Functions 83 is then I'l/J, t) Uo(t, to) = I'l/J, to) (4.2.2) Due to the time independence of H o, the unitary operator Uo(t, to) (with Uo(to, to) = 1) is given by Uo(t, to) e-iHo(t-to)/fi . = (4.2.3) The Heisenberg state (4.2.4) is time independent and the Heisenberg operators A(t) = UJ (t, to)AUo(t, to) = eiHo(t-to)/fi Ae-iHo(t-to)/fi , (4.2.5) corresponding to the Schrodinger operators A, B, .. , satisfy the equation of motion (Heisenberg equation of motion) d dt A(t) = ni [Ho, A(t)] (4.2.6) The density matrix of the canonical ensemble is e-{3Ho p=-- Z with the canonical partition function Z = The-{3Ho , (4.2.7) and for the grand canonical ensemble e-{3(Ho-I-'N) p= (4.2.8) Za with the grand canonical partition function Za = The-{3(Ho-I-'N) = L L e-{3(E N m rn (N)-I-'N) [= L e-{3(En- nJ-L)] N n Since Ho is a constant of motion, these density matrices are time independent, as indeed must be the case for equilibrium density matrices. The mean values in these ensembles are defined by (0) = Th(pO) . In particular, we now wish to investigate the correlation functions (4.2.9) 84 4. Correlation Functions, Scattering, and Response C(t, t') = (A(t)B(t')) = Tr(peiHotlli Ae-iHotllieiHot' Iii Be-iHot' Iii) Tr(p eiHo(t-t')/1i Ae-iHo(t-t')/1i B) = = C(t - t',O) (4.2.10) Without loss of generality, we have set to = 0 and used the cyclic invariance of the trace and also [p, Hol = O. The correlation functions depend only on the time difference; equation (4.2.10) expresses temporal translational invariance. The following definitions will prove to be useful: G~B(t) = (A(t)B(O)) , (4.2.11a) GAB(t) = (B(O)A(t)) . (4.2.11b) Their Fourier transforms are defined by G~B(W) = J eiwtG~B() dt (4.2.12) . By inserting (4.2.5) into (4.2.12), taking energy eigenstates as a basis, and introducing intermediate states by means of the closure relation 11. > 2::m 1m) (ml, we obtain the following spectral representation for GAB(w): G~B(W) L e-{3(E ~ = n -I-'Nn ) (nl A 1m) (ml Bin) n,m >: Xu GAB(W) = (En -Em n L e-{3(E ~ n +W ) -I-'Nn ) (4.2.13a) (nl B 1m) (ml A In) n,m X 8( Ern - En n ) +W. (4.2.13b) From this, it is immediately obvious that (4.2.14a) (4.2.14b) To derive the first relation, one compares G~B(-w) with (4.2.13a). The second follows if one exchanges n with m in (4.2.13b) and uses the 15-function. The latter relation is always applicable in the canonical ensemble and is valid in the grand canonical ensemble when the operators A and B do not change the number of particles. If, however, B increases the particle number by L1nB, then N m - N n = L1nB and (4.2.14b) must be replaced by 4.3 Dynamical Susceptibility Q< (w) -- Q>AB (w)e- f3 (nW- Jl £1n B ) . AB 85 (4.2.14b') Inserting A = Pk and B = P_ k into (4.2 .14a, b) yields the following relationship for the density-density correlation function: (4.2.15) For systems possessing inversion symmetry S (k, w) Scoh (k, -w) e - {31iw Scoh (k, w) . = = S ( - k, w), and hence (4.2.16) This relation implies that, apart from a factor ~ in (4.1.19), the anti-Stokes lines (energy loss by the sample) are weaker by a factor e-{3nw than the Stokes lines (energy gain)4. For T --70 we have Scoh(k,w < 0) --7 0, since the system is then in the ground state and cannot transfer any energy to the scattered particle. The above relationship expresses what is known as detailed balance (Fig. 4.4): Wn-+'P~ = Wn'-+P~ or W n -+ n , = Wn'-+ne-f3(En,-En) (4.2.17) E t pen Fig. 4.4. Illustration concerning detailed balance Here, W n -+ n , and Wn'-+n are the transition probabilities from the level n to the level n' and vice versa, and P~ and P~, are the equilibrium occupation probabilities. Detailed balance implies that these quantities are related to one another in such a way that the occupation probabilities do not change as a result of the transition processes. 4.3 Dynamical Susceptibility We now wish to derive a microscopic expression for the dynamical susceptibility. To this end, we assume that the system is influenced by an external 4 From the measurement of the ratio of the Stokes and anti-Stokes lines in Raman scattering the temperature of a system may be determined. 86 4. Correlation Functions, Scattering, and Response force F(t) which couples to the operator B.5 The Hamiltonian then has the form H Ho +H'(t) = H'(t) = (4.3.1a) -F(t)B . (4.3.1b) For t ~ to, we assume that F(t) = 0, and that the system is in equilibrium. We are interested in the response to the perturbation (4.3.1b). The mean value of A at time t is given by (A(t)) = Tr (ps(t)A) = Tr (U(t, to) ps(to) ut(t, to) A) = Tr (ps(to) Ut(t, to) A U(t, to)) (4.3.2) -f3 H o = T r ( T ut(t, to) A U(t, to)) , where the notation (A(t)) is to be understood as the mean value of the Heisenberg operator (4.2.5). Here we have introduced the time-evolution operator U(t, to) for the entire Hamiltonian H and inserted the solution ps(t) = U(t, to)ps(to)ut(t, to) of the von Neumann equation PS = -*[H, ps]. Then, using the cyclic invariance of the trace and assuming a canonical equilibrium density matrix at time to, we end up with the mean value of the operator A in the Heisenberg representation. The time-evolution operator U(t, to) can be determined perturbation theoretically in the interaction representation. For this, we need the equation of motion for U(t, to). From in-9t 1'ljJ, t) = H 1'ljJ, t), it follows that in-9tU(t, to) l'ljJo) = HU(t, to) l'ljJo) and, thus, (in :t U(t, to) - HU(t, to)) l'ljJo) = 0 for every l'ljJo), which yields the equation of motion in ! U(t, to) = (4.3.3) HU(t, to) We now make the ansatz U(t , t 0 ) = e-iHo(t-to)/nU'(t , t 0 ) . (4.3.4) This gives 5 Physical forces are real and observables, e.g., the density p(x), are represented by hermitian operators. Nonetheless, we shall also consider the correlation functions = p-k), since we may also be interested for nonhermitian operators such as Pk (p~ in the properties of individual Fourier components. F(t) is a c-number. 4.3 Dynamical Susceptibility il~U' dt 87 = eiHo(t-to)/n(_R0 + H)U , and thus iii :t U'(t, to) = H~(t)U', (4.3.5) to) where the interaction representation of H' (4.3.6) has been introduced. The integration of (4.3.5) yields for the time evolution operator in the interaction representation U'(t, to) the integral equation U'(t,to) = 1+ .'"1 i t dt'H~()U,o (4.3.7) to 1/£ and its iteration 1 i t dt' H~(t') U'(t, to) = 1 + .'" 1/£ + ('~)21 =T to t exp ( dt'i dt" H~(t')" 1: to 1 -~ tf + ... to dt' H~(t') (4.3.8) . Here, T is the time-ordering operator. The second representation of (4.3.8) is not required at present, but will be discussed in more detail in Part III. For the linear response, we need only the first two terms in (4.3.8). Inserting these into (4.3.2), we obtain, to first order in F(t), (A(t)) = (A)o + .~ i t dt' \ [eiHo(t-to)/n Ae-iHo(t-to)/n, H~(t')] 1/£ = to 1 i t dt' ([A(t), B(t')])oF(t') (A)o - .'" 1/£ 0 (4.3.9) to The subscript 0 indicates that the expectation value is calculated with the density matrix e-(3Ho / Z of the unperturbed system. In the first term we have exploited the cyclic invariance of the trace We now assume that the initial time at which the system is in equilibrium, with density matrix e-(3Ho /Z, lies in the distant past. In other words, we take the limit to --t -00, which, however, does not prevent us from switching on the force F(t') at a later instant. For the change in the expectation value due to the perturbation, we obtain 88 4. Correlation Functions, Scattering, and Response Ll(A(t)) = (A(t)) - (A)o = I: dt'xAB(t - t')F(t') (4.3.10) Here we have introduced the dynamical susceptibility, or linear response func- tion XAB(t - t') = ~e(t - t')([A(t), B(t')])o , (4.3.11) which is given by the expectation value of the commutator of the two Heisenberg operators A(t) and B(t') (with respect to the Hamiltonian Ho). The step function arises from the upper integration boundary in Eq. (4.3.9) and expresses causality. Within the equilibrium expectation value we can make the replacements A(t) -+ eiHot/1i Ae-iHot/1i and B(t) -+ eiHot/1i Be-iHot/1i . Equation (4.3.10) determines, to first order, the effect on the observable A of a force that couples to B. We also define the Fourier transform of the dynamical susceptibility (4.3.12) where z may be complex (see Sect. 4.4). In order to find its physical significance, we consider a periodic perturbation which is switched on very slowly (f-+O,f>O): (4.3.13) I: For this perturbation, it follows from (4.3.10) and (4.3.12) that Ll(A(t)) = = dt' (XAB(t - t')Fwe- iwt' XAB(w)Fwe- iwt + XABt (_w)F~eit + XABt(t - t')F~eiw ed' (4.3.14) The factor eft that appears in the intermediate step can be put equal to 1 since f -+ O. The effect of the periodic perturbation (4.3.13) on Ll(A(t)) is thus proportional to the force (including its periodicity) and to the Fourier transform of the susceptibility. Resonances in the susceptibility express themselves as a strong reaction to forces of the corresponding frequency. 4.4 Dispersion Relations The causality principle demands that the response of a system can only be induced by a perturbation occurring at an earlier time. This is the source of the step function in (4.3.11), i.e., 4.4 Dispersion Relations XAB(t) =0 for t < 0 . 89 (4.4.1) This leads to the theorem: XAB (z) is analytical in the upper half plane. Proof. XAB is only nonzero for t > 0, where it is finite. Thus, the factor e- 1m zt guarantees the convergence of the Fourier integral (4.3.12). For z in the upper half plane, the analyticity of XAB(Z) allows us to use Cauchy's integral theorem to write __ 1 XAB (Z) - 2' 1 7l'le d ,XAB(Z') z, z-z (4.4.2) Here, C is a closed loop in the analytic region. We choose the path shown in Im(z) L..-_ _ _.L...-_ _- - . . L . . -_ _ Re(z) Fig. 4.5. Integration path C for deriving the dispersion relation c Fig. 4.5; along the real axis, and around a semicircle in the upper half plane, with both parts allowed to expand to infinity. We now assume that XAB(Z) becomes sufficiently small at infinity that the semicircular part of the integration path contributes nothing. We then have XAB () Z For real Z = 1 00 1 -2. 7r1 d ,XAB(X') x, X - -00 (4.4.3) Z it follows from (4.4.3) that · XAB ( .) l' XAB () x = 11m x+ IE = 1m ~O = J [p_,_l_ + dX 2 : 7r1 X - .~ J dx' XAB(X'). -., 27l'x-~ i7rD(X' - X x)] XAB(X') , i.e., - ~p . XAB (x ) - 7r1 J dX ,XAB(X') , X - (4.4.4) X We encounter here the Cauchy principal value P J dx' f(x') == lim x' - X .-+0 (l x -. -00 dx' + 1 00 x+. dX') f(x') . x' - x 90 4. Correlation Functions, Scattering, and Response We then arrive at the dispersion relations (also known as Kramers-Kronig relations) R e XAB () W = -7rIpJdW,1mWXAB(W') , -w (4.4.5a) and XAB(W') Im XAB (W) -- --IpJdW,Re ---'-"':":'::"-'---'7r W' - W (4.4.5b) These relationships between the real and imaginary parts of the susceptibility are a consequence of causality. 4.5 Spectral Representation We define 6 the dissipative response X~B(t) = ;n ([A(t), B(O)]) (4.5.1a) and (4.5.1b) Given the Fourier representation of the step function 1 00 8(t) = lim €--+o+ -00 dw _e- iwt _ 27r W . I_ . } , (4.5.2) + IE we find XAB(W) = J dt eiwt 8(t) 2i = _1 7r = ~pJdw'XB(W) 7r 1 00 -00 dw' X~B(t) x"AB (w') w' - W - iE w'-w +iX" (w) AB (4.5.3) where, in expressions such as the second line of (4.5.3), it should always be understood that the limit E -+ 0+ is taken. This yields the following decomposition of XAB(W): (4.5.4) 6 Here, and below we omit the index 0 from the expectation value. The notation ( ) represents the expectation value with respect to the Hamiltonian Ho of the entire system without external perturbation. 4.6 Fluctuation-Dissipation Theorem 91 with X~B(W) = .!pJdw'X~B(W) . (4.5.5) W' - W 7r When X~B(w) is real, then, according to (4.5.5), so is X~B(w) and (4.5.4) represents the separation into real and imaginary parts. The relation (4.5.5) is then identical to the dispersion relation (4.4.5a). The question as to the reality of X~B(w) will be dealt with in Sect. 4.8. 4.6 Fluctuation-Dissipation Theorem With the definitions (4.5.1b) and (4.2.11) we find X~B(W) = 21/i (G~BW) G~B(W) - (4.6.1a) which, together with (4.2.14b), yields X~B(W) = 2~GB(W) (1- e-i3 Tiw ) (4.6.1b) These relations between G> and X" are known as the fluctuation-dissipation theorem. Together with the relation (4.5.3), one obtains for the dynamical susceptibility __1_1 XAB(W) - 2" 7rn 00 -00 dw ,G~B(w')1 W , - - e- i3hw') . W - IE (4.6.2) Classical limit f3/iw « 1 : The classical limit refers to the frequency and temperature region for which f3/iw « 1. The fluctuation-dissipation theorem (4.6.1) then simplifies to X~B(W) = f3; G~B(W) (4.6.3) . =I 0 only for f3/ilw'l « In the classical limit (i. e. G~B(w') (4.6.2) XAB(O) = f3 J ~ G~B(W') = f3G~B(t = 0) 1) one obtains from (4.6.4) Hence the static susceptibility (w = 0) is given in the classical limit by the equal-time correlation function of A and B divided by kT. The name fluctuation-dissipation theorem for (4.6.1) is appropriate since GAB(W) is a measure of the correlation between fluctuations of A and B, whilst X~B describes the dissipation. That X" has to do with dissipation can be seen as follows: For a perturbation of the form 92 4. Correlation Functions, Scattering, and Response (4.6.5) where F is a complex number, the golden rule gives a transition rate per unit time from the state n into the state m of _ 271' { rn-tm -r; 8(Em - En - 1iw)1 +8(Em - (ml A t F In) I2 En + 1iw)1 (ml AF* In) n· (4.6.6) The power ofthe external force ( = the energy absorbed per unit time), with the help of (4.6.1a) and (4.2.13a), is found to be W = L e- f3En -----z-rn-tm(Em - En) n,m (4.6.7) where a canonical distribution has been assumed for the initial states. We t (w) determines the energy absorption and, therehave thus shown that X~A fore, the strength of the dissipation. For frequencies at which X~At(w) is large, i.e., in the vicinity of resonances, the absorption per unit time is large as well. Remark. If the expectation values of the operators A and B are finite, in some of the relations of Chap. 4, it can be expedient to use the operators A(t) =A(t) - (A) and B(t) = B(t) - (B), in order to avoid contributions proportional to D(W), e. g. e(x, t) or ek(t). Since the commutator remains unchanged XAB(t) = XAB(t), X:';'B(t) = XAB(t) etc. hold. 4.7 Examples of Applications To gain familiarity with some characteristic forms of response and correlation functions, we will give these for three typical examples: for a harmonic crystal, for diffusive dynamics, and for a damped harmonic oscillator. (i) Harmonic crystal. As a first, quantum-mechanical example, we calculate the susceptibility for the displacements in a harmonic crystal. For the sake of simplicity we consider a Bravais lattice, i.e., a lattice with one atom per unit cell. We first recall a few basic facts from solid state physics concerning lattice dynamics 7 . The atoms and lattice points are labeled by vectors n = (nl' n2, n3) of natural numbers ni = 1, ... ,Ni , where N = NIN2N3 is the number of lattice points. The cartesian coordinates are characterized by the indices i = 1,2,3. We denote the equilibrium positions of the atoms (i.e., the lattice points) by an, so that the actual position of the atom n is 7 See, e.g., C. Kittel, Quantum Theory of Solids, 2nd revised printing, J. Wiley, New York, 1987 4.7 Examples of Applications 93 = an + un, where Un is the displacement from the equilibrium position. The latter can be represented by normal coordinates Qk,).. Xn (4.7.1) where M the mass of an atom, k the wave vector, and Ei(k,).) the components of the three polarization vectors, ). = 1,2,3. The normal coordinates can be expressed in terms of creation and annihilation operators at).. and ak,>' for phonons with wave vector k and polarization ). (4.7.2) with the three acoustic phonon frequencies representation ak,)..(t) = e-iwk,>.tak,)..(O) Wk,)..' Here, we use the Heisenberg . (4.7.3) After this transformation, the Hamiltonian takes the form H = 'L:nwk,).. (at)..ak,)" k,).. +~) (4.7.4) (ak,>, == ak,).. (0)). From the commutation relations of the Xn and their adjoint momenta one obtains for the creation and annihilation operators the standard commutator form, [ ak,).., at, ,)..'] = <>)..)..' <>kk' (4.7.5) [ak,).., ak',)..'] = [at).., at,,)..'] =0. The dynamical susceptibility for the displacements is defined by (4.7.6) and can be expressed in terms of (4.7.7) as xij(n-n',t) =2i8(t)x"i j (n-n',t). (4.7.8) The phonon correlation function is defined by (4.7.9) 94 4. Correlation Functions, Scattering, and Response For all of these quantities it has been assumed that the system is translationally invariant, i.e., one considers either an infinitely large crystal or a finite crystal with periodic boundary conditions. For the physical quantities of interest, this idealization is of no consequence. The translational invariance means that (4.7.6) and (4.7.7) depend only on the difference n - n'. The calculation of x"ij(n-n', t) leads, with the utilization of (4.7.1), (4.7.2), (4.7.3), and (4.7.5), to x"ij(n-n',t) = L 2~N eikan+ik'an'Ei(k,A)Ej(k', A') k,A k',>.' In the following, we shall make use of the fact that the polarization vectors for Bravais lattices are real. 8 For (4.7.6), this yields: and for the temporal Fourier transform L e''k(an-an' ) Ei(k, A) Ej(k, A) Jdt e'w. Slnwk,>. t . 00 .. , 1XtJ(n - n , w) = NM t k>' , Wk,>. . 0 (4.7.12) Using the equations (A.22), (A.23), and (A.24) from QM I, 00 J dseisz = 27f8+(z) = [7f8(Z) + iP (~)] o Z I dse- isz = 27fL(z) = [7f8(Z) - iP o (~)] = i lim _1_. E-+O Z + If (4.7.13) = . l' 1 -11m - - . - , E-+O Z - IE one obtains, for real z, 8 In non-Bravais lattices the unit cell contains r ~ 2 atoms (ions). The number of phonon branches is 3r, i.e., >. = 1, ... ,3r. Furthermore, the polarization vectors f(k, >.) are in general complex and in the results (4.7.11) to (4.7.18) the second factor fj( ... ,>') must be replaced by fj*( ... ,>.). 4.7 Examples of Applications ) Xi j ( n - n ',W = l'1m -1- <-+02NM x {W L eik(a -a n nf k,.>. + W~,.> ) fi(k, A)fj(k, A) Wk.>. ----'----'------'----'-- ' + if - W- 95 W~,.> (4.7.14a) + if } and for the spatial Fourier transform ij ( X ) _ '""' ~ n q, W - e -iqa n ) _ 1 '""' fi(q, A)fj(q, A) ij ( X n, W - 2M ~ W .>. W~,.> x {W + + if - W- W~,.> q,'>' + if} (4.7.14b) . For the decompositions xij(n - n',w) = x'ij(n - n',w) + ix"ij(n - n',w) (4.7.15a) and xij(q,w) = X'ij(q,w) + iX"ij(q,w) (4.7.15b) this leads to 1 '""' ik(an -an f)fi(k,A)fj(k,A) -~e -2NM ~.> ) X'iJ" ( n-n' w - , k,'>' ' (4.7.16a) n (4.7.16b) XIIiJ" ( n - n ' ) w - -7r- , '""' ~ - 2NM k,'>' e ik(an -a )fi(k,A)fj(k,A) nf x [8(w - Wk,.>.) - 8(w X"ij(q,W) Wk,.>. + Wk,.>.)] (4.7.17a) = Le-iqanxij(n,w) n = ~ L 2M.>. X fi(q,A)fj(q, A) wq ,.>. [8(w - wq ,.>.) - 8(w + wq ,.>.)] (4.7.17b) The phonon correlation function (4.7.9) can be either calculated directly, or determined with the help of the fluctuation-dissipation theorem from x"ij(n - n',w): 96 4. Correlation Functions, Scattering, and Response ef3 liw .. D'} (n - n' w) = 2fi , ef3 liw - = 2fi[1 = .. X"'} (n - n' w) 1 ' + n(w)] x"ij(n - 7rfi "'eik(an_an/)Ei(k,A)Ej(k,A) NMw Wk>. k,>. X n',w) {(I + nk,>.)O(W - (4.7.18a) ' Wk,>.) - nk,>.O(w + Wk,>.)} analogously, it also follows that Dij (q, w) = 2fi [1 + n(w)] X" ij (q, w) = 7rfi L Ei(q, A)Ej(q, A) w q >. >. ' x {(I + nq,>.)o(w - wq,>.) - nq,>.o(w + wq,>.)} M (4.7.18b) Here, n q,>. = (a~,>. aq,>.) = -e""nw;:--Q-\---1 f3 (4.7.19) is the average thermal occupation number for phonons of wave vector q and polarization A. The phonon resonances in Dij (q, w) for a particular q are sharp o-function-like peaks at the positions ±wq ,>.. The expansion of the density-density correlation function, which determines the inelastic neutron scattering cross-section, has as one of its contributions the phonon correlation function (4.7.18b). The excitations of the many-particle system (in this case the phonons) express themselves as resonances in the scattering crosssection. In reality, the phonons interact with one another and also with other excitations of the system, e.g, with the electrons in a metal. This leads to damping of the phonons. The essential effect of this is captured by replacing the quantity E by a finite damping constant l'(q, A). The phonon resonances in (4.7.18) then acquire a finite width. (ii) Diffusion. The diffusion equation for M(x, t) reads: M(x, t) = D'\7 2 M(x, t) (4.7.20) where D is the diffusion constant and M(x, t) can represent, for example, the magnetization density of a paramagnet. From (4.7.20) one readily finds 9 ,10 9 10 M(x, t} is a macroscopic quantity; from the knowledge of its dynamics the dynamical susceptibility can be deduced (Problem 4.1). The same is true for the oscillator Q (see Problem 4.2). Here, we have also used X' = Re X, X" = 1m X, which, according to Sect. 4.8, holds for Qt = Q and M_q = Mci. 4.7 Examples of Applications 97 iDq2 X(q,w) = X(q) w +'D 2 1 q , (Dq2)2 X (q,W) = X(q) W2 + (Dq2)2 1/ ( X q,w ) > G (q,w) = () Xq W Dq 2w 2 + (Dq2)2 2~ ( 4.7.21 ) D~ = X(q)1_e-(3 nw w2 + (Dq2)2 Figure 4.6 shows X'(q,w),XI/(q,w), and G>(q,w). One sees that X'(q,w) is symmetric in w, whereas X" (q, w) is antisymmetric. The form of G> (q, w) also depends on the value of f3nDq2, which in Fig. 4.6c is taken to be f3nDq2 = 0.1. In order to emphasize the different weights of the Stokes and anti-Stokes components, Fig. 4.6d is drawn for the value f3nDq2 = 1. However, it should be stressed that, for diffusive dynamics, this is unrealistic since, in the hydrodynamic regime, the frequencies are always smaller than kT. 0.6 1.0 0.8 x'(q,w) x(q) 0.3 0.6 x"(q,w) x(q) 0.0 0.4 -0.3 0.2 0.0 ·4 ·2 W/D q2 2 ·0.6 4 ·4 ·2 o 2 4 2 2 4 w/Dl (b) (a) 10 1.0 8 c>(q,w) 2nx(q) 6 c>(q,w) 2nx(q) 0.6 4 2 0 0.2 ·4 ·2 0 2 4 ·4 (c) ·2 o w/Dq w/Dl (d) Fig. 4.6. Diffusive dynamics: (a) Real part and (b) imaginary part of the dynamical susceptibility (4.7.21). The curves in (c) and (d) show C> divided by the static susceptibility as a function of ~; (c) for f3hDl = 0.1 and (d) for f3hDq2 = 1 98 4. Correlation Functions, Scattering, and Response (iii) Damped oscillator. We now consider a damped harmonic oscillator (4.7.22) with mass m, frequency wo, and damping constant "(. If, on the right-hand side of the equation of motion (4.7.22), one adds an external force F, then, in the static limit, one obtains !j = 1/mw5. Since this relationship defines the static susceptibility, the eigenfrequency of the oscillator depends on its mass and the static susceptibility X, according to w5 = ~x. From the equation of motion (4.7.22) with a periodic frequency-dependent external force one finds for the dynamical susceptibility 9,10 X(w) and for G>(w) X(w) = l/m. -w 2 + w5 - lW"( 1 -w 2 +w 2 X'(w) = - ( 2 2)2 0 2 2 m -w +wo +w"( "() 1 (4.7.23) w"( X w = m (-w2 +W5)2 +W2"(2 G>(w) _ 2fiw "( - m(l - e- f3 1l!..J) (-w 2 + w5)2 + w2"(2 These quantities, each divided by X = 1/mw5, are shown in Fig. 4.7 as functions of w/wo. Here, the ratio of the damping constant to the oscillator frequency has been taken as ,,(/wo = 0.4. One sees that X' and X" are symmetric and antisymmetric, respectively. Figure 4.7c shows G>(w) at (3fiw o = 0.1, whereas Fig. 4.7d is for (3fiw o = 1. As in Fig. 4.6c,d, the asymmetry becomes apparent when the temperature is lowered. The differences between the intensities of the Stokes and anti-Stokes lines can be used, for example, to determine the temperature of a sample by Raman scattering. * 4.8 Symmetry Properties 4.8.1 General Symmetry Relations In the two previous figures we have seen that X'(w) is symmetric and X"(w) antisymmetric, and that in G> (w) the Stokes line is stronger than the antiStokes line. We will now undertake a general investigation of the conditions under which these symmetry properties hold. The symmetry properties that will be discussed here are either of a purely mathematical nature and a direct consequence of the definitions and of the usual properties of commutators together with the dispersion relations and the relationships (4.2.14a,b), or they are of a physical nature and follow from the symmetry properties of the Hamiltonian, such as translational invariance, rotational invariance, inversion ·4.8 Symmetry Properties 99 2.0 2 1.0 X"(w) X'(w) 0 x x 0.0 ·2 -1.0 -2 0 -2 2 0 W/Wo 2 w/wo (b) (a) 30 4 3 20 G>(w) G>(w) 2nx 2hX 2 10 0 -2 0 0 2 w/wo -2 2 0 w/wo (c) (d) Fig. 4.7. X'(w),X"(w) and G>(w) for the harmonic oscillator -20 = 0.4. The two plots of G>(w) are for different values of f3fiwo, namely in (c) f3fiwo = 0.1 and in (d) f3fiwo = 1.0 symmetry, or time-reversal invariance. It follows from (4_6.1b) and (4.2.14b) that X~B( -w) = 21n, G~B( -w) [1 - ef3 IiW ] = 21n,e-f3IiwG~A(W) [1 - ef3 liw ] (4.8.1a) and a further comparison with (4.6.1b) yields: (4.8.1b) This relation also follows from the antisymmetry of the commutator; see (4.8.12b). 100 4. Correlation Functions, Scattering, and Response > When B = At, the correlation functions G~A (w) are real. t Proof: J J 00 dt e- iwt (A(O)A t (t)) -00 Jdte-iw(A~)O 00 00 = -00 dteiwt(A(t)At(O)) -00 (4.8.2) t (w) and X~A t (w) are also real and thus yield the For B = At, then X~A decomposition of XAA t into real and imaginary parts: 1m XAAt = X~At Re XAAt = X~At (4.8.3) These properties are satisfied by the density-density correlation function. The definitions of density correlation and density-response functions read: J 00 S(k,w) = dteiwtS(k,t) = J dt d3 xe- i(kx-wt)S(x, t) , (4.8.4a) -00 where S(x, t) = (p(x, t)p(O, 0)) (4.8.4b) denotes the correlation of the density operator (4.1.27). It follows with (4.1.28) that (4.8.4c) The susceptibility or response function is defined correspondingly through X(k, w) or J 00 X" (k, w) = dt e iwt 2111, ([Pk (t), P-k (0)]) . (4.8.5) -00 The relationship between the density correlation function and reads: Scoh (k, w) (4.8.6) Further symmetry properties result in the presence of space inversion symmetry. Since we then have X" ( ~ k, w) = X" (k, w), it follows from (4.8.1 b) that • 4.8 Symmetry Properties x" (k, -w) = -X" (k, w) 101 (4.8.7a) Thus X" is an odd function of wand, due to (4.8.3), is also real. Correspondingly, X' (k, w) is even: X'(k, -w) = X'(k,w) . (4.8.7b) This can be seen by means of the dispersion relation, since , J J J 00 X (k, -w) =P 00 dw' X" (k, w') =-P W' 7r +w dw' X" (k, -w') - '-'-'---'---'7r -00 W' +w -00 00 =P dw' x"~k,w') 7r w - = X'(k,w) . (4.8.8) W -00 For systems with inversion symmetry the density susceptibility can, according to (4.6.1a) and (4.2.14a), be represented in the form X" (k, w) = 2111, (S(k, w) - S(k, -w)) . (4.8.9) Inserting this into the dispersion relation, one finds 2! PJdw'S(k,w') [_,_1_ _-w-w ,I ] 00 X'(k,w) = W - W n7r -00 = 2.. p n7r J (4.8.10) 00 dw,w'S(k,w') . w,2 - w2 -00 From this one obtains the asymptotic behavior J 00 · X'(k) 11m ,w -- 2.. f> p w-tO n7r ,S(k,w') dW , W (4.8.11a) -00 J 00 lim w2X'(k,w) = - f>1 w-too n7r dw'w'S(k,w') (4.8.11b) -00 4.8.2 Symmetry Properties of the Response Function for Hermitian Operators 4.8.2.1 Hermitian Operators Examples of hermitian operators are the density p(x, t) and the momentum density P(x). For arbitrary, and in particular also for hermitian, operators A and B, one has the following symmetry relations: 102 4. Correlation Functions, Scattering, and Response (4.8.12a) (4.8.12b) This follows from the antisymmetry of the commutator. The relation for the Fourier transform is identical to (4.8.1 b). Likewise, from the definition (4.5.1a), one can conclude directly (4.8.13a) i.e., X~B(t - t') is imaginary (the commutator of two hermitian operators is antihermitian) and " (W )* = -XAB "( ) XAB -w (4.8.13b) Taken together, (4.8.12) and (4.8.13) yield: X~B(t (4.8.14a) - t')* = +X'JJA(t' - t) and (4.8.14b) Remark: For both the correlation function and the susceptibility, the translational invariance implies ~ -- ~G AB (X G A(x)B(x') , - X , ... ) (4.8.15a) and the rotational invariance (4.8.15b) It thus follows from (4.8.14b) for systems with spatial translational and ro- tational invariance that (4.8.16) is real and antisymmetric in w. For different operators, it is the behavior under the time-reversal transformation that determines whether or not X" is real. 4.8.2.2 Time Reversal, Spatial and Temporal Translations Time-reversal invariance Under the time-reversal operation (Sect. 11.4.2.3), an operator A(x, t) transforms as follows: * 4.8 Symmetry Properties A(x, t) -+ A'(x, t) EA = T A(x, t)T- 1 = EAA(x, -t) . 103 (4.8.17) is known as the signature and can take the following values: EA = EA = -1 (e.g., for velocity, angular momentum, and magnetic field). 1 (e.g., for position and for electric field) For the expectation value of an operator B one finds (001 B 1(0) = (TBaITa) = (TBT-1TaITa) = (Tal (TBT- 1)t ITa) (4.8.18a) Making use of (4.8.17), one obtains (T[A(x, t), B(x', t')]T- 1)t = = EAEB[A(x, -t), B(x', -t')]t -EAEB[A(x, -t), B(x', -t')] . (4.8.18b) For time-reversal-invariant Hamiltonians, this yields: " (t - t ') XAB = " ( t-' ) -EAEBXAB t (4.8.19a) and (4.8.19b) When EA = EB, then X~B(w) is symmetric under exchange of A and B, odd in w, and real. When EA = -EB, then X~B(w) is antisymmetric under the exchange of A and B, even in w, and imaginary. If a magnetic field is present, then its direction is reversed under a time-reversal transformation X~B(w;) = fAEBX~(W; = -EABX~( -B) -w; -B). (4.8.20) Finally, we remark that, from (4.8.13b) and (4.5.3), (4.8.21 ) This relation guarantees that the response (4.3.14) is real. 104 4. Correlation Functions, Scattering, and Response Translational invariance of the correlation function f(x, t; x', t') == (A(x, t)B(x', t')) = (T;lTaA(x, t)T;lTaB(x', t')T;lTa ) = (T;l A(x + a, t)B(x' + a, t')Ta ) . If the density matrix p commutes with Ta, i.e., [Ta, p] = 0, then, due to the cyclic invariance of the trace, it follows that (A(x, t)B(x', t')) = (A(x + a, t)B(x' + a, t')) x', t; 0, t') = f(x - where in the last step we have set a translational invariance together yield: (4.8.22) , = -x'. Thus, spatial and temporal f(x, t;x', t') = f(x - x', t - t') . (4.8.23) Rotational invariance A system can be translationally invariant without being rotationally invariant. When rotational invariance holds, then (for any rotation matrix R) f(x - x', t - t') = f(R(x - x'), t - t') = f(lx - x'l, t - t') , (4.8.24) independent of the direction. Fourier transformation for translation ally invariant systems yields: j(k,t;k',t') J =J = d3xd3x'e-ikx-ik'x'f(x,t;x',t') d3xd3x'e-ikx-ik'x' f(x - x', t - t'). Substituting y = x - x' leads to /(k, t; k', t') = = J J d3x' d3ye- ik (y+x')-ik'x' f(y, t - t') (21r)38(3)(k + k')/(k, t - t') If rotational invariance holds, then /(k, t - t') = /(Ikl, t - t') . (4.8.25) 4.8.2.3 The Classical Limit We have already seen (Eqs. (4.6.3),(4.6.4)) that, in the classical limit (tu.v« kT): X~B(W) = XAB(O) = ,8G~B(t ,8; G~B(W) and = 0) (4.8.26a) (4.8.26b) ·4.8 Symmetry Properties 105 From the time-reversal relation for XAB(w), Eq. (4.8.19b), it follows that (4.8.27) When fA = fB, then C~B(w) is symmetric in w, real, and symmetric upon interchange of A and B. (The latter follows from the fluctuation-dissipation theorem and from the symmetry of XAB(w)). When fA = -fB, then C~B is odd in w, antisymmetric upon interchange of A and B, and imaginary. For fA = fB, equation (4.8.26a) is equivalent to (4.8.28) The half-range Fourier transform of C~B(t), e(t)C~B satisfies in the classical limit C:tB(W) == i.e. the Fourier transform of 1 dteiwC~B() 00 roo dteiwt foo dw' e-iw'tC> (w') 211" AB foo dw' e -iw'tc>AB ( f dt eiwt foo d211"" . "-iw" . + If 211" _~ foo dw' C~B(W') 211" if i 2 foo dw' X"AB (w') = ___ = Jo -00 ~ OO = -00 -00 Ie t W _ w ') -00 = W' - -00 i: 211" (3 == 1I"i{3 -00 W - w'(w' - w - if) dW'XAB(W') (~, - w' _ i (3w (XAB(O) - XAB(W)) . ~ _ if) -_-W-1-_-I-f (4.8.29) 4.8.2.4 Kubo Relaxation Function The Kubo relaxation function is particularly useful for the description of the relaxation of the deviation 8(A(t)) after the external force has been switched off (see SM, Appendix H). The K ubo relaxation function of two operators A and B is defined by ¢AB(t) = ~ 1 00 dt' ([A(t'), B(O)])e- Et ' (4.8.30) and its half-range Fourier transform is given by (4.8.31) 106 4. Correlation Functions, Scattering, and Response It is related to the dynamical susceptibility via (4.8.32a) and (4.8.32b) The first relation follows from a comparison of (4.3.12) with (4.8.31) and the second from a short calculation (Problem 4.6). Equation (4.8.29) thus implies that, in the classical limit, (4.8.33) 4.9 Sum Rules 4.9.1 General Structure of Sum Rules We start from the definitions (4.5.1a,b) (4.9.1) and differentiate this n times with respect to time: Repeated substitution of the Heisenberg equation yields, for t = 0, (4.9.2) The right-hand side contains an n-fold commutator of A with Ho. If these commutators lead to simple expressions, then (4.9.2) provides information about moments of the dissipative part of the susceptibility. Such relations are known as sum rules. The i-sum rule: An important example is the i-sum rule for the densitydensity susceptibility, which, with the help of (4.8.9), can be represented as a sum rule for the correlation function J~wx"(k,) = J 2w;/iwS(k,w) = 2i/i([,ok(t),p-k(t)]) . 4.9 Sum Rules 107 The commutator on the right-hand side can be calculated with Pk = ik· jk, which yields, for purely coordinate-dependent potentials, the standard form of the fsum rule I dww k2 --S(k w) = - n 27r Ii ' 2m' (4.9.3) where n = {j is the particle number density. There are also sum rules that result from the fact that, in many cases ,11 in the limit k -+ and w -+ the dynamical susceptibility must transform into a susceptibility known from equilibrium statistical mechanics. Compressibility sum rule: As an example, we will use (4.8.11a) to give the compressibility sum rule for the density response function: ° lim k-+O pI ° dw ~ S(k,w) = 7r Ii w n(an) =n ap 2 "'T T (4.9.4) Here we have made use of the relationship x'(O,O) =~ (~N) = _n 3 p, = T,v _~: (~) (an) al ) T,N = n ap ( ap T,N T,N ' which derives from (4.8.26b) and from thermodynamics 12 . The static form factor is defined by (4.9.5) This determines the elastic scattering and is related to S (k, w) via I dw -S(k,w) = S(k) . 27r (4.9.6) The static form factor S(k) can be deduced from x-ray scattering. Equations (4.9.3), (4.9.4), and (4.9.6) provide us with three sum rules for the density correlation function. The sum rules give precise relationships between S (k, w) and static quantities. When these static quantities are known from theory or experiment and one has some idea of the form of S (k, w), it is then possible to use the sum rules to determine the parameters involved in S(k,w). We shall elucidate this for the example of excitations in superfluid helium. 11 12 P.C. Kwok and T.D. Schultz, J. Phys. C2, 1196 (1969) See, e.g., L.D. Landau and E.M. Lifschitz,Course of Theoretical Physics, Vol. 5. Statistical Physics 3rd edn. Part I, E.M. Lifshitz, L.P. Pitajevski, Pergamon, Oxford, 1980; F. Schwabl, Statistical Mechanics, Springer, Berlin Heidelberg, 2002, Eq. (3.2.10) 108 4. Correlation Functions, Scattering, and Response 4.9.2 Application to the Excitations in He II We approximate S(q, w) by an infinitely sharp density resonance (phonon, roton) and assume T = 0 so that only the Stokes component is present: (4.9.7) Inserting this into the f-sum rule (4.9.3) and the form factor (4.9.6) yields: (4.9.8) The f-sum rule (4.9.3) and compressibility sum rule (4.9.4) give, in the limit q -+ 0, ~ m (OP) q, Zq = 1fnnq , S(q) an T mST = nnq , 2msT (4.9.9) J where we have introduced the isothermal sound velocity ST = (~)T 1m. The relationship (4.9.8) between the energy of the excitations and the static form factor was first derived by Feynman 13 . Figure 4.8 shows experimental results for these two quantities. For small q, it is seen that S(q) increases linearly with q, yielding the linear dispersion relation in the phonon regime. The maximum of S(q) at q ~ 2A -1 leads to the roton minimum. Eq [K] Seq) 15 1.5 10 1.0 5 0.5 0 1 2 qA-1 (a) 0 2 4 qA-1 (b) Fig. 4.8. (a) The excitations of He II at low temperatures: (i) under vapor pressure, (ii) at 25.3 atm. (b) The static form factor 14 13 14 R. Feynman, Phys. Rev. B 94, 262 (1954) D.G. Henshaw, Phys. Rev. 119, 9 (1960); D.G. Henshaw and A.D.B. Woods, Phys. Rev. 121, 1266 (1961) Problems 109 Problems 4.1 Confirm the validity of Eq. (4.7.21) by adding an external magnetic field H(x, t) to the diffusion equation (4.7.20). 4.2 For the classical damped harmonic oscillator d2 ( dt 2 +, dtd + wg ) Q(t) = F(t)/m determine the following functions: X(w), X'(w) X"(w), and G>(w). Hint: Solve the equation of motion in Fourier space and determine the dynamical susceptibility from X( w) = ~i: j. 4.3 Prove the I-sum rule, / dw 27l'wX "( k,w ) = / dw wS ( 27l'h k,w ) 2 k n = 2m for the density-density correlation function. Hint: Calculate f,; ([Pk, p-k]). 4.4 Show, for B = At, that G~B(W), G~B(W), X~B(W), and X~B(W) are real. 4.5 Show that the coherent neutron scattering cross-section for harmonic phonons, Eqs. (4.7.1) ff., can be written as " -i(an-a",)k JCXl ~ iwt (kun(t)ku",(O» S coh (k ,w) -_ e -2W ~ N " LJe 27l'h e e n,m (4.9.10) -00 with the Debye-Waller factor e-2W = e-(kun (O))2) . (4.9.11) Expand the last exponential function in Scoh (k, w) as a Taylor series. The zerothorder term corresponds to elastic scattering, the first-order term to one-phonon scattering, and the higher-order terms to multiphonon scattering. 4.6 Derive the relation (4.8.32b) by suitable partial integration, and using ¢AB(t 00) = o. = Bibliography for Part I A.A. Abrikosov, L.P. Gorkov, and I.E. Dzyaloshinski, Methods of Quantum Field Theory in Statistical Physics (Prentice Hall, Englewood Cliffs 1963) L.E. Ballentine, Quantum Mechanics (Prentice Hall, Englewood Cliffs 1990) G. Baym, Lectures on Quantum Mechanics (Benjamin, London 1973) A.L. Fetter and J.D. Walecka, Quantum Theory of Many-Particle Systems (McGrawHill, New York 1971) A. Griffin, Excitations in a Bose-condensed Liquid, Cambridge University Press, Cambridge, 1993 G.D. Mahan, Many-Particle Physics (Plenum, New York 1983) P.C. Martin, Measurements and Correlation Functions (Gordon and Breach, New York 1968) P. Nozieres and D. Pines, The Theory of Quantum Liquids, Volume I, Normal Fermi Liquids (Benjamin, New York 1966) P. Nozieres and D. Pines, The Theory of Quantum Liquids, Volume II, Superfiuid Bose Liquids (Addison-Wesley, New York 1990) J.J. Sakurai, Modern Quantum Mechanics (Addison-Wesley, Redwood City 1985) E.P. Wigner, Group Theory and its applications to the Quantum Mechanics of Atomic Spectra (Academic, New York 1959) J.M. Ziman, Elements of Advanced Quantum Theory (Cambridge University Press, Cambridge 1969) Part II Relativistic Wave Equations 5. Relativistic Wave Equations and their Derivation 5.1 Introduction Quantum theory is based on the following axioms l : 1. The state of a system is described by a state vector I'l/J) in a linear space. 2. The observables are represented by hermitian operators A ... , and functions of observables by the corresponding functions of the operators. 3. The mean (expectation) value of an observable in the state I'l/J) is given by (A) = ('l/JI A I'l/J)· 4. The time evolution is determined by the Schrodinger equation involving the Hamiltonian H in 81 'l/J) = H I'l/J) 8t (5.1.1) 5. If, in a measurement of the observable A, the value an is found, then the original state changes to the corresponding eigenstate In) of A. We consider the Schrodinger equation for a free particle in the coordinate representation (5.1.2) It is evident from the differing orders of the time and the space derivatives that this equation is not Lorentz covariant, i.e., that it changes its structure under a transition from one inertial system to another. Efforts to formulate a relativistic quantum mechanics began with attempts to use the correspondence principle in order to derive a relativistic wave equation intended to replace the Schrodinger equation. The first such equation was due to Schrodinger (1926)2, Gordon (1926)3, and Klein (1927)4. This scalar wave equation of second order, which is now known as the Klein-Gordon equation, was initially dismissed, since it led to negative 1 2 3 4 See QM I, Sect. 8.3. E. Schrodinger, Ann. Physik 81, 109 (1926) W. Gordon, Z. Physik 40, 117 (1926) O. Klein, Z. Physik 41, 407 (1927) 116 5. Relativistic Wave Equations and their Derivation probability densities. The year 1928 saw the publication of the Dirac equation 5 . This equation pertains to particles with spin 1/2 and is able to describe many of the single-particle properties of fermions. The Dirac equation, like the Klein-Gordon equation, possesses solutions with negative energy, which, in the framework of wave mechanics, leads to difficulties (see below). To prevent transitions of an electron into lower lying states of negative energy, in 19306 Dirac postulated that the states of negative energy should all be occupied. Missing particles in these otherwise occupied states represent particles with opposite charge (antiparticles). This necessarily leads to a many-particle theory, or to a quantum field theory. By reinterpreting the Klein-Gordon equation as the basis of a field theory, Pauli and Weisskopf1 showed that this could describe mesons with spin zero, e.g., 7r mesons. The field theories based upon the Dirac and Klein-Gordon equations correspond to the Maxwell equations for the electromagnetic field, and the d'Alembert equation for the four-potential. The Schrodinger equation, as well as the other axioms of quantum theory, remain unchanged. Only the Hamiltonian is changed and now represents a quantized field. The elementary particles are excitations of the fields (mesons, electrons, photons, etc.). It will be instructive to now follow the historical development rather than begin immediately with quantum field theory. For one thing, it is conceptually easier to investigate the properties of the Dirac equation in its interpretation as a single-particle wave equation. Furthermore, it is exactly these single-particle solutions that are needed as basis states for expanding the field operators. At low energies one can neglect decay processes and thus, here, the quantum field theory gives the same physical predictions as the elementary single-particle theory. 5.2 The Klein-Gordon Equation 5.2.1 Derivation by Means of the Correspondence Principle In order to derive relativistic wave equations, we first recall the correspondence principle8 . When classical quantities were replaced by the operators energy and 5 6 7 8 P.A.M. Dirac, Proc. Roy. Soc. (London) AU7, 610 (1928); ibid. AU8, 351 (1928) P.A.M. Dirac, Proc. Roy. Soc. (London) A126, 360 (1930) W. Pauli and V. Weisskopf, Relv. Phys. Acta 7, 709 (1934) See, e.g., QM I, Sect. 2.5.1 5.2 The Klein-Gordon Equation momentum p n -----+ -:- V , 117 (5.2.1) 1 we obtained from the nonrelativistic energy of a free particle p2 (5.2.2) E=2m' the free time-dependent Schrodinger equation (5.2.3) This equation is obviously not Lorentz covariant due to the different orders of the time and space derivatives. We now recall some relevant features of the special theory of relativity.9 We will use the following conventions: The components of the space-time four-vectors will be denoted by Greek indices, and the components of spatial three-vectors by Latin indices or the cartesian coordinates x, y, z. In addition, we will use Einstein's summation convention: Greek indices that appear twice, one contravariant and one covariant, are summed over, the same applying to corresponding Latin indices. Starting from xl-'(s) = (et, x), the contravariant four-vector representation of the world line as a function of the proper time s, one obtains the fourvelocity xl-' (s). The differential of the proper time is related to dxo via ds = }1 - (vle)2 dxo, where (5.2.4a) is the velocity. For the four-momentum this yields: . pI-' = mexl-'(s) = 1 }1 - (vle)2 (me) = four-momentum = (Ele) mv p (5.2.4b) In the last expression we have used the fact that, according to relativistic dynamics, po = mel }1 - (vle)2 represents the kinetic energy ofthe particle. Therefore, according to the special theory of relativity, the energy E and the momentum Px, Py, pz transform as the components of a contravariant fourvector (5.2.5a) 9 The most important properties of the Lorentz group will be summarized in Sect. 6.1. 118 5. Relativistic Wave Equations and their Derivation The metric tensor 1 0 00 0-1 91-'v 0) 0 = ( 0 0 -1 0 o 0 (5.2.6) 0 -1 yields the covariant components (5.2.5b) According to Eq. (5.2.4b), the invariant scalar product of the fourmomentum is given by (5.2.7) with the rest mass m and the velocity of light c. From the energy-momentum relation following from (5.2.7), (5.2.8) one would, according to the correspondence principle (5.2.1), initially arrive at the following wave equation: (5.2.9) An obvious difficulty with this equation lies in the square root of the spatial derivative; its Taylor expansion leads to infinitely high derivatives. Time and space do not occur symmetrically. Instead, we start from the squared relation: (5.2.10) and obtain (5.2.11) This equation can be written in the even more compact and clearly Lorentzcovariant form (5.2.11') Here xl-' is the space-time position vector xl-' = (xo = ct,x) 5.2 The Klein~Gord Equation 119 and the covariant vector a =~ aXIL IL is the four-dimensional generalization of the gradient vector. As is known from electrodynamics, the d' Alembert operator D =: OJ.tOIL is invariant under Lorentz transformations. Also appearing here is the Compton wavelength !tlmc of a particle with mass m. Equation (5.2.11') is known as the Klein~ Gordon equation. It was originally introduced and studied by Schrodinger, and by Gordon and Klein. We will now investigate the most important properties of the Klein~ Gordon equation. 5.2.2 The Continuity Equation To derive a continuity equation one takes 'lj!* times (5.2.11') and subtracts the complex conjugate of this equation This yields 'lj!* 0ILoIL'lj! - 'lj!0ILoIL'lj!* = 0 OIL ('lj!* oIL'lj! - 'lj!oIL'lj!*) = 0 . Multiplying by 2!i' so that the current density is equal to that in the nonrelativistic case, one obtains !!.at (~'lj!*o 2mc2 at _ 'lj!o'lj!*)) at + V. ~ 2ml ['lj;*V'lj! - 'lj!V'lj!*] = o. (5.2.12) This has the form of a continuity equation p + div j = 0, (5.2.12') with density (5.2.13a) and current density j =~ 2ml ('lj!*V'lj! - 'lj!V'lj!*) (5.2.13b) 120 5. Relativistic Wave Equations and their Derivation Here, p is not positive definite and thus cannot be directly interpreted as a probability density, although ep(x, t) can possibly be conceived as the corresponding charge density. The Klein-Gordon equation is a second-order differential equation in t and thus the initial values of 'ljJ and ~ can be chosen independently, so that p as a function of x can be both positive and negative. 5.2.3 Free Solutions of the Klein-Gordon Equation Equation (5.2.11) is known as the free Klein-Gordon equation in order to distinguish it from generalizations that additionally contain external potentials or electromagnetic fields (see Sect. 5.3.5). There are two free solutions in the form of plane waves: 'ljJ(x, t) = ei(Et-p.x)/1i (5.2.14) with E = ±Vp2 C2 + m 2 c4 . Both positive and negative energies occur here and the energy is not bounded from below. This scalar theory does not contain spin and could only describe particles with zero spin. Hence, the Klein-Gordon equation was rejected initially because the primary aim was a theory for the electron. Dirac 5 had instead introduced a firstorder differential equation with positive density, as already mentioned at the beginning of this chapter. It will later emerge that this, too, has solutions with negative energies. The unoccupied states of negative energy describe antiparticles. As a quantized field theory, the Klein-Gordon equation describes mesons 7 . The hermitian scalar Klein-Gordon field describes neutral mesons with spin O. The nonhermitian pseudoscalar Klein-Gordon field describes charged mesons with spin 0 and their antiparticles. We shall therefore proceed by constructing a wave equation for spin-1/2 fermions and only return to the Klein-Gordon equation in connection with motion in a Coulomb potential (1f- -mesons). 5.3 Dirac Equation 5.3.1 Derivation of the Dirac Equation We will now attempt to find a wave equation of the form (5.3.1) Spatial components will be denoted by Latin indices, where repeated indices are to be summed over. The second derivative 22 in the Klein-Gordon tt 5.3 Dirac Equation 121 equation leads to a density p = ('lj;* ft'lj; - c.c.). In order that the density be positive, we postulate a differential equation of first order. The requirement of relativistic covariance demands that the spatial derivatives may only be of first order, too. The Dirac Hamiltonian H is linear in the momentum operator and in the rest energy. The coefficients in (5.3.1) cannot simply be numbers: if they were, the equation would not even be form invariant (having the same coefficients) with respect to spatial rotations. a k and {3 must be hermitian matrices in order for H to be hermitian, which is in turn necessary for a positive, conserved probability density to exist. Thus a k and {3 are N x N matrices and ~ ~ (J~) an N-component column v~m . We shall impose the following requirements on equation (5.3.1): (i) The components of'lj; must satisfy the Klein-Gordon equation so that plane waves fulfil the relativistic energy-momentum relation E2 = p 2c2 + m 2 c4 . (ii) There exists a conserved four-current whose zeroth component is a positive density. (iii) The equation must be Lorentz covariant. This meanS that it has the same form in all reference frames that are connencted by a Poincare transformation. The resulting equation (5.3.1) is named, after its discoverer, the Dirac equation. We must now look at the consequences that arise from the conditions (i)-(iii). Let us first consider condition (i). The two-fold application of H yields fP -Ii2 -'lj; ot2 = -Ii2 c2 L . l- + ala' --) O·O-'lj; -1 ( a'a 2 ' 1 ij Ii + ~c 3 3 L (a i {3 + {3a i ) Oi'lj; + {32m 2c4'lj; . (5.3.2) i=l Here, we have made use of OiOj = OjOi to symmetrize the first term on the right-hand side. Comparison with the Klein-Gordon equation (5.2.11') leads to the three conditions (5.3.3a) (5.3.3b) (5.3.3c) 122 5. Relativistic Wave Equations and their Derivation 5.3.2 The Continuity Equation The row vectors adjoint to 'lj; are defined by 'lj; t = ('lj;r, ... ,'lj;'N) . Multiplying the Dirac equation from the left by 'lj; t, we obtain in'lj;t~ = n::'lj;t(ioi'lj;+mc 2 'lj;t(3'lj;. ut (5.3.4a) 1 The complex conjugate relation reads: _inoy:t'lj; = _ ut ~c (Oi'lj;t) ait'lj; + mc2'lj;t(3t'lj;. (5.3.4b) 1 The difference of these two equations yields: :t ('lj;t'lj;) = -C((oi'lj;t)ait'lj;+'lj;taioi'lj;) + irr;t ('lj;t(3t'lj;-'lj;t(3'lj;) (5.3.5) In order for this to take the form of a continuity equation, the matrices a and (3 must be hermitian, i.e., ait = ai, (3t = (3 . (5.3.6) Then the density N p='lj;tL~o: (5.3.7a) 0:=1 and the current density jk == c'lj;tak'lj; (5.3.7b) satisfy the continuity equation a ot P + d'· 0. IVJ = (5.3.8) With the zeroth component of jI-', jO == cp, (5.3.9) we may define a four-current-density jI-' == (j0,jk) (5.3.9') and write the continuity equation in the form £1 'J-L _ uJ-L) - 1 0. 0 ~ at) O'k - 0 . + oxk) (5.3.10) The density defined in (5.3.7a) is positive definite and, within the framework of the single particle theory, can be given the preliminary interpretation of a probability density. 5.3 Dirac Equation 123 5.3.3 Properties of the Dirac Matrices The matrices a k , {3 anticommute and their square is equal to 1; see Eq. (5.3.3a-c). From (a k )2 = {32 = 1, it follows that the matrices a k and {3 possess only the eigenvalues ±l. We may now write (5.3.3b) in the form ak = -{3a k {3 . Using the cyclic invariance of the trace, we obtain Tra k = -Tr {3a k {3 = -Tra k{32 = -Tra k . From this, and from an equivalent calculation for {3, one obtains (5.3.11) Hence, the number of positive and negative eigenvalues must be equal and, therefore, N is even. N = 2 is not sufficient since the 2 x 2 matrices 1, ax, ay, a z contain only 3 mutually anticommuting matrices. N = 4 is the smallest dimension in which it is possible to realize the algebraic structure (5.3.3a-c). A particular representation of the matrices is (5.3.12) where the 4 x 4 matrices are constructed from the Pauli matrices 1 a = (01) 10 ' a 2 = (0 -i)° 3 ,a i = (1° 0) -1 (5.3.13) and the two-dimensional unit matrix. It is easy to see that the matrices (5.3.12) satisfy the conditions (5.3.3a-c): e.g., . .= (0 ° + (0 ° = °. a' {3 + {3a' ai _a i ) -ai ai ) The Dirac equation (5.3.1), in combination with the matrices (5.3.12), is referred to as the "standard representation" of the Dirac equation. One calls '¢ a four-spinor or spinor for short (or sometimes a bispinor, in particular when '¢ is represented by two two-component spinors). '¢ t is called the hermitian adjoint spinor. It will be shown in Sect. 6.2.1 that under Lorentz transformations spinors possess specific transformation properties. 5.3.4 The Dirac Equation in Covariant Form In order to ensure that time and space derivatives are multiplied by matrices with similar algebraic properties, we multiply the Dirac equation (5.3.1) by {3/ c to obtain 124 5. Relativistic Wave Equations and their Derivation (5.3.14) We now define new Dirac matrices ')'0 == (3 (5.3.15) These possess the following properties: is hermitian and ')'0 (')'k)t = (')'0)2 = _,),k and (')'k)2 = 11.. However, ')'k is antihermitian. -11.. Proof: (!'k) t = ak(3 = -(3ak = _,),k , (!'k) 2 = (3a k (3a k = -11. . These relations, together with ,),O,),k + ')'k')'O = (3(3a k + (3a k(3 = 0 ')'k')'l + ')'l')'k = (3a k(3a 1 + (3a 1(3a k = 0 and for k =I- l lead to the fundamental algebraic structure of the Dirac matrices (5.3.16) The Dirac equation (5.3.14) now assumes the form (. -1')'J.Lf)J.L me) 'ljJ = +T 0. (5.3.17) It will be convenient to use the shorthand notation originally introduced by Feynman: (5.3.18) Here, vJ.L stands for any vector. The Feynman slash implies scalar multiplication by ')'W In the fourth term we have introduced the covariant components of the ')' matrices (5.3.19) In this notation the Dirac equation may be written in the compact form (-l~+ . Tme) 'ljJ = O. (5.3.20) Finally, we also give the')' matrices in the particular representation (5.3.12). From (5.3.12) and (5.3.15) it follows that 5.3 Dirac Equation ,= (11. -11.0) ' o 125 (5.3.21) 0 Remark. A representation of the, matrices that is equivalent to (5.3.21) and which also satisfies the algebraic relations (5.3.16) is obtained by replacing ,-+ M,M- 1 , where M is an arbitrary nonsingular matrix. Other frequently encountered representations are the Majorana representation and the chiral representation (see Sect. 11.3, Remark (ii) and Eq. (l1.6.12a-c)). 5.3.5 Nonrelativistic Limit and Coupling to the Electromagnetic Field 5.3.5.1 Particles at Rest The form (5.3.1) is a particularly suitable starting point when dealing with the nonrelativistic limit. We first consider a free particle at rest, i.e., with wave vector k = o. The spatial derivatives in the Dirac equation then vanish and the equation then simplifies to ."a't/J at = f3 mc2 'P 0 ,. In (5.3.17') . This equation possesses the following four solutions 0'.(+) _ 'P2 - bnc2 _ e " t (~) 0 o (5.3.22) 0"(_) _ 'PI - imc2t e" (~) 1 o 0"(-) _ ,'P2 - (~) e imc2t nO· 1 The 't/J~ +), 't/J~ +) and 't/J~ -), 't/J~ -) correspond to positive- and negative-energy solutions, respectively. The interpretation of the negative-energy solutions must be postponed until later. For the moment we will confine ourselves to the positive-energy solutions. 5.3.5.2 Coupling to the Electromagnetic Field We shall immediately proceed one step further and consider the coupling to an electromagnetic field, which will allow us to derive the Pauli equation. 126 5. Relativistic Wave Equations and their Derivation In analogy with the nonrelativistic theory, the canonical momentum p is replaced by the kinetic momentum (p - ~ A), and the rest energy in the Dirac Hamiltonian is augmented by the scalar electrical potential etP, in ~ = (co. (p - ~ A) + f3mc 2 + etP) ¢ (5.3.23) . Here, e is the charge of the particle, i.e., e = -eo for the electron. At the end of this section we will arrive at (5.3.23), starting from (5.3.17). 5.3.5.3 Nonrelativistic Limit. The Pauli Equation In order to discuss the nonrelativistic limit, we use the explicit representation (5.3.12) of the Dirac matrices and decompose the four-spinors into two twocomponent column vectors <p and X (5.3.24) with (0- . 11" X) + etP (<p) 2 ( cP ) X + mc -X . ata (<p) In X = co-. 11" cP ' (5.3.25) where e c 1I"=p--A (5.3.26) is the operator of the kinetic momentum. In the nonrelativistic limit, the rest energy mc2 is the largest energy involved. Thus, to find solutions with positive energy, we write (5.3.27) where (~) are considered to vary slowly with time and satisfy the equation (5.3.25') In the second equation, nX and etPx may be neglected in comparison to 2mc 2 x, and the latter then solved approximately as 0-.11" x= --cp. 2mc (5.3.28) From this one sees that, in the nonrelativistic limit, X is a factor of order rv v / c smaller than cpo One thus refers to cp as the large, and X as the small, component of the spinor. 5.3 Dirac Equation 127 Inserting (5.3.28) into the first of the two equations (5.3.25') yields (5.3.29) To proceed further we use the identity u . au· b = a . b + iu . (a x b) , which follows from 10 , 11 u . 7r U . 7r = 7r 2 aia j = . + 1U . 7r 6ij X 7r + iEijkak, which in turn yields: = 7r 2 - en - c U . B . Here, we have used 12 with Bi = Cijk 8j Ak. This rearrangement can also be very easily carried out by application of the expression v x Acp + A x Vcp = V x Acp - Vcp x A = (V x A) cp . We thus finally obtain [1 ( . 8<p 1n-= - m 2m e) 2 ---u·B+e<I> en ] <p. p--A c 2mc (5.3.29') This result is identical to the Pauli equation for the Pauli spinor <p, as is known from nonrelativistic quantum mechanics 13 . The two components of <p describe the spin of the electron. In addition, one automatically obtains the correct gyromagnetic ratio g = 2 for the electron. In order to see this, we simply need to repeat the steps familiar to us from nonrelativistic wave mechanics. We assume a homogeneous magnetic field B that can be represented by the vector potential A: 10 Here, E ijk is the totally antisymmetric tensor of third rank I for even permutations of (123) E ijk 11 12 13 = { -1 for odd permutations of (123) o otherwise QM I, Eq.(9.18a) Vectors such as E, B and vector products that are only defined as three-vectors are always written in component form with upper indices; likewise the f tensor. Here, too, we sum over repeated indices. See, e.g., QM I, Chap. 9. 128 5. Relativistic Wave Equations and their Derivation B = curl A , 1 A = 2"B X (5.3.30a) x. Introducing the orbital angular momentum L and the spin 8 as L=xxp, 8= 1 -ncr 2 ' (5.3.30b) then, for (5.3.30a), it followS 14 ,15 that . o'P = ( -p2 - - e (L In2m 2me ot 2 e- A 2 + ecli ) + 28) . B + 2me2 'P. (5.3.31 ) The eigenvalues of the projection of the spin operator 8e onto an arbitrary unit vector e are ±n/2. According to (5.3.31), the interaction with the electromagnetic field is of the form H t = -JL. B III 2 e + __ A2 + ecli 2 2me (5.3.32) ' in which the magnetic moment JL e = JLorbit + JLspin = -2-(L + 28) me (5.3.33) is a combination of orbital and spin contributions. The spin moment is of magnitude e JLspin = (5.3.34) 9 2me 8 , with the gyromagnetic ratio (or Lande factor) (5.3.35) 9 = 2. For the electron, 2:'c = -1f! can be expressed in terms of the Bohr magneton J.LB = ~ = 0.927 x 1O-2o erg/G. We are now in a position to justify the approximations made in this section. The solution 'P of (5.3.31) has a time behavior that is characterby the Rydberg energy ized by the Larmor frequency or, for ecli = -~e6, (Ryex me2 a 2 , with the fine structure constant a = e~/n). For the hydrogen and other nonrelativistic atoms (small atomic numbers Z), me2 is very much larger than either of these two energies, thus justifying for such atoms the approximation introduced previously in the equation of motion for x. 14 15 See, e.g., QM I, Chap. 9. One finds -p. A - A· p = -2A· P since (p . A) = ~ (V . A) = O. = -2~ (B x x)· p = - (x X p). B = -L· B, 5.3 Dirac Equation 129 5.3.5.4 Supplement Concerning Coupling to an Electromagnetic Field We wish now to use a different approach to derive the Dirac equation in an external field and, to facilitate this, we begin with a few remarks on relativistic notation. The momentum operator in covariant and contravariant form reads: (5.3.36) Here, a", implies p o ---1L ax" and a'" -- ---1L ax,,· a' .~ 1 = Po = I n act- For the time and space components, this p = -PI = . a na InaXl = -:-I -axi . (5.3.37) The coupling to the electromagnetic field is achieved by making the replacement (5.3.38) where A'" = (<li,A) is the four-potential. The structure which arises here is well known from electrodynamics and, since its generalization to other gauge theories, is termed minimal coupling. This implies a ax'" .~ In-- ---+ 1I~- .~ a ax'" - eA c I' (5.3.39) - which explicitly written in components reads: { 1I~- .~ ~ ~ i a .~ a at ---+ I nat- ax' ---+ ~ ~ i ax' n;. e'¥ (5.3.39') +~ c Ai =~ ~ i ax' - ~ c Ai . For the spatial components this is identical to the replacement ~ V ---+ ~ V ~A or p ---+ p- ~A. In the noncovariant representation of the Dirac equation, the substitution (5.3.39') immediately leads once again to (5.3.23). If one inserts (5.3.39) into the Dirac equation (5.3.17), one obtains (-"1'" (ina", - ~A",) +mc) 1/J = 0, (5.3.40) which is the Dirac equation in relativistic covariant form in the presence of an electromagnetic field. 130 5. Relativistic Wave Equations and their Derivation Remarks: (i) Equation (5.3.23) follows directly when one multiplies (5.3.40), i.e. "Yo (inao - ~Ao) 'lj; = -"Y i (inai - ~A;) 'lj; + mc'lj; by "Yo: inao'lj; in ! 'lj; = a i (-inai - = ca· ~Ai) 'lj; + ~Ao'lj; + mc(3'lj; (p - ~ A) 'lj; + eP'lj; + mc (3'lj; . 2 (ii) The minimal coupling, i.e., the replacement of derivatives by derivatives minus four-potentials, has as a consequence the invariance of the Dirac equation (5.3.40) with respect to gauge transformations (of the first kind): (iii) For electrons, m = me, and the characteristic length in the Dirac equation equals the Compton wavelength of the electron Ac =~ me C = 3.8 x lO-llcm . Problems 5.1 Show that the matrices (5.3.12) obey the algebraic relations (5.3.3a-c). 5.2 Show that the representation (5.3.21) follows from (5.3.12). 5.3 Particles in a homogeneous magnetic field. Determine the energy levels that result from the Dirac equation for a (relativistic) particle of mass m and charge e in a homogeneous magnetic field B. Use the gauge AO = At = A 3 = 0, A2 = Ex. 6. Lorentz Transformations and Covariance of the Dirac Equation In this chapter, we shall investigate how the Lorentz covariance of the Dirac equation determines the transformation properties of spinors under Lorentz transformations. We begin by summarizing a few properties of Lorentz transformations, with which the reader is assumed to be familiar. The reader who is principally interested in the solution of specific problems may wish to omit the next sections and proceed directly to Sect. 6.3 and the subsequent chapters. 6.1 Lorentz Transformations The contravariant and covariant components of the position vector read: XI-' Xo = ct, Xl = -X, X2 = -y, X3 = -z contravariant covariant. (6.1.1) The metric tensor is defined by _ g - (g I-'v) _ - (g I-'v _ 01 -10 00 ) - ( 0 0 -1 o 0 0) 0 0 (6.1.2a) 0-1 and relates covariant and contravariant components (6.1.3) Furthermore, we note that (6.1.2b) i.e., 1000) I-' _ I-' _ 0100 (g v) - (8 v) - ( 0 0 1 0 0001 132 6. Lorentz Transformations and Covariance of the Dirac Equation The d' Alembert operator is defined by (6.l.4) Inertial frames are frames of reference in which, in the absence of forces, particles move uniformly. Lorentz transformations tell us how the coordinates of two inertial frames transform into one another. The coordinates of two reference systems in uniform motion must be related to one another by a linear transformation. Thus, the inhomogeneous Lorentz transformations (also known as Poincare transformations) possess the form (6.l.5) where AIL v and aIL are real. Remarks: (i) On the linearity of the Lorentz transformation: Suppose that x' and x are the coordinates of an event in the inertial frames I' and I, respectively. For the transformation one could write x' = f(x) . In the absence of forces, particles in I and I' move uniformly, i.e., their world lines are straight lines (this is actually the definition of an inertial frame). Transformations under which straight lines are mapped onto straight lines are affinities, and thus of the form (6.l.5). The parametric representation of the equation of a straight line x" = e" s + d lL is mapped by such an affine transformation onto another equation for a straight line. (ii) Principle of relativity: The laws of nature are the same in all inertial frames. There is no such thing as an "absolute" frame of reference. The requirement that the d'Alembert operator be invariant (6.l.4) yields (6.l.6a) or, in matrix form, (6.l.6b) 6.1 Lorentz Transformations 133 The relations (6.1.6a,b) define the Lorentz transformations. Definition: Poincare group == {inhomogeneous Lorentz transformation, al-' =I- o} The group of homogeneous Lorentz transformations contains all elements with al-' = 0. A homogeneous Lorentz transformation can be denoted by the shorthand form (A, a), e.g., translation group (1, a) rotation group (D,O) From the defining equation (6.1.6a,b) follow two important characteristics of Lorentz transformations: (i) From the definition (6.1.6a), it follows that (detA)2 = 1, thus detA = ±1. (6.1.7) (ii) Consider now the matrix element A = 0, p (6.1.6a) AOI-'gl-'v AOv = 1 = (AOO)2 - ~)AOk2 = °of the defining equation = 1. k This leads to or (6.1.8) ° The sign of the determinant of A and the sign of A O can be used to classify the elements of the Lorentz group (Table 6.1). The Lorentz transformations can be combined as follows into the Lorentz group C, and its subgroups or subsets (e.g., means the set of all elements Lt): ct Table 6.1. Classification of the elements of the Lorentz group proper orthochronous improper orthochronous' time-reflection type" space-time inversion type'" • spatial reflection P = COO 0) 0-1 0 0 0 0 -1 0 o 0 0-1 sgn AO o detA 1 1 -1 -1 1 -1 -1 1 Lt L~ + L:' L4+ ** time reflection T= OOO 0100 0010 0001 C ) ••• space-time inversion PT = COO 0) 0-1 0 0 0 0 -1 0 o 0 0-1 (6.1.9) 134 6. Lorentz Transformations and Covariance of the Dirac Equation C Lorentz group (L.G.) ct restricted L.G. (is an invariant subgroup) ct u c!. ct U ct Co = ct u C~ C!. = p. ct = C+ = Ct C~ orthochronous L.G. proper L.G. orthochronous L.G. =T.Ct ct =P.T.Ct The last three subsets of C do not constitute subgroups. c = Ct U T Ct = ct u P ct U T ct U PTct (6.1.10) c t is an invariant subgroup of C; T C t is a coset to ct. ct is an invariant subgroup of C; P ct, T ct, PTct are cosets of C with respect to ct. Furthermore, C t , C+, and Co are invariant subgroups of C with the factor groups (E, P), (E, P, T, PT), and (E, T). Every Lorentz transformation is either proper and orthochronous or can be written as the product of an element of the proper-orthochronous Lorentz group with One of the discrete transformations P, T, or PT. ct, the restricted Lorentz group = the proper orthochronous L. G. consists of all elements with det A = ° 1 and AO ~ 1; this includes: (a) Rotations (b) Pure Lorentz transformations (= transformations under which space and time are transformed). The prototype is a Lorentz transformation in the Xl direction LOo L01 00) (COSh1]-Sinh1] 0 0) -sinh1] cosh 1] 0 0 ( L10 L11 00 _ o o 010 001 - 0 0 _1 _ _ f3 J1-f32 _ _(3_ ( J1-(32 o o J1-f32 010 001 00) -1-00 J1-(32 , (6.1.11) 010 001 with tanh 1] = {3. For this Lorentz transformation the inertial frame ]' moves with respect to ] with a velocity v = c{3 in the Xl direction. 6.2 Lorentz Covariance of the Dirac Equation 135 6.2 Lorentz Covariance of the Dirac Equation 6.2.1 Lorentz Covariance and Transformation of Spinors The principle of relativity states that the laws of nature are identical in every inertial reference frame. We consider two inertial frames I and l' with the space-time coordinates x and x'. Let the wave function of a particle in these two frames be 'lj; and 'lj;', respectively. We write the Poincare transformation between I and l' as x' = Ax+a. (6.2.1) It must be possible to construct the wave function 'lj;' from 'lj;. This means that there must be a local relationship between 'lj;' and 'lj;: 'ljJ'(x') = F('ljJ(x)) = F('ljJ(A- 1(x' - a)) . (6.2.2) The principle of relativity together with the functional relation (6.2.2) necessarily leads to the requirement of Lorentz covariance: The Dirac equation in I is transformed by (6.2.1) and (6.2.2) into a Dirac equation in I'. (The Dirac equation is form invariant with respect to Poincare transformations.) In order that both 'ljJ and 'ljJ' may satisfy the linear Dirac equation, their functional relationship must be linear, i.e., 'ljJ'(x') = 8(A)'ljJ(x) = 8(A)'ljJ(A-1(x' - a)) . (6.2.3) Here, 8(A) is a 4 x 4 matrix, with which the spinor 'ljJ is to be multiplied. We will determine 8(A) below. In components, the transformation reads: 4 'ljJ~(x) L 8 !3(A)'lj;!3(A- 1(x' - = a a)) . (6.2.3') 13=1 The Lorentz covariance of the Dirac equation requires that 'lj;' obey the equation + m)'ljJ'(x') = 0 , (-hJLa~ (c=l, n=l) (6.2.4) where , a aJL = ax'JL . The 'Y matrices are unchanged under the Lorentz transformation. In order to determine 8, we need to convert the Dirac equation in the primed and unprimed coordinate systems into one another. The Dirac equation in the unprimed coordinate system (6.2.5) 136 6. Lorentz Transformations and Covariance of the Dirac Equation can, by means of the relation 8 _ 8x'v 8 _ AV 8' 8xJ1- - 8xJ1- 8x'v J1- v and S-l'ljJ'(X') = 'ljJ(x) , be brought into the form (6.2.6) After multiplying from the left by S, one obtains l (6.2.6') From a comparison of (6.2.6') with (6.2.4), it follows that the Dirac equation is form invariant under Lorentz transformations, provided S(A) satisfies the following condition: (6.2.7) It is possible to show (see next section) that this equation has nonsingular solutions for S(A). A wave function that transforms under a Lorentz transformation according to 'ljJ' = S'ljJ is known as a four-component Lorentz spinor. 6.2.2 Determination of the Representation S(A) 6.2.2.1 Infinitesimal Lorentz Transformations We first consider infinitesimal (proper, orthochronous) Lorentz transforma- tions A VJ1- = gV J1- + Llwv J1- (6.2.8a) with infinitesimal and antisymmetric Llw VJ1(6.2.8b) This equation implies that Llw VJ1- can have only 6 independent nonvanishing elements. These transformations satisfy the defining relation for Lorentz transformations A"~ 1 J1- gJ1-V AP v = gAP , (6.1.6a) We recall here that the A" Jl. are matrix elements that, of course, commute with the 'Y matrices. 6.2 Lorentz Covariance of the Dirac Equation 137 as can be seen by inserting (6.2.8) into this equation: g\gl-'''gP" + .::1wAp + .::1wPA + 0 ((.::1W)2) = gAP. (6.2.9) Each of the 6 independent elements of .::1wl-''' generates an infinitesimal Lorentz transformation. We consider some typical special cases: .::1w01 = -.::1w0 1 = -.::1{3: Transformation onto a coordinate system moving with velocity c.::1{3 in the x direction .::1w 12 = _.::1w 12 = .::1cp: Transformation onto a coordinate system that is rotated by an angle .::1cp about the z axis. (See Fig. 6.1) (6.2.10) (6.2.11) The spatial components are transformed under this passive transformation as follows: X'2 Xl + .1ipX2 = -.1ipX1 + X2 X'3 = X3 X'1 = (6.2.12) x t-~"' X 11 Fig. 6.1. Infinitesimal rotation, passive transformation It must be possible to expand 8 as a power series in .::1w"l-'. We write 8=1 + T, 8- 1 = 1 - (6.2.13) T , where T is likewise infinitesimal. We insert (6.2.13) into the equation for 8, namely 8- 1 "(1-'8 = AI-'",,(", and get (6.2.14) from which the equation determining T follows as (6.2.14') To within an additive multiple of 1, this unambiguously determines T. If there were two solutions, then the difference between them would commute with all "(1-', and thus be proportional to 1 (see Sect. 6.2.5, Property 6). 138 6. Lorentz Transformations and Covariance of the Dirac Equation Due to the invariance of the norm, the determinant of S must be unity2, and thus, to first order in ..dw l .W , we have det S = det(:n. + T) = det:n. + TrT = 1 + TrT = 1 . (6.2.15) It thus follows that TrT = o. (6.2.16) Equations (6.2.14') and (6.2.16) have the solution _ 1 T- A I.<V( 84.lW 'YJ1-'Yv - ) _ 'Yv'YJ1- - i A IU.! -44.lW aJ1-V, (6.2.17) where we have introduced the definition i aJ1-V = 2 bJ1-''Yv] . (6.2.18) Equation (6.2.17) can be derived by calculating the commutator of T with 'YJ1-; the vanishing of the trace is guaranteed by the general properties of the "I matrices (Property 3, Sect. 6.2.5). 6.2.2.2 Rotation About the z Axis We first consider the rotation R3 about the z axis as given by (6.2.11). According to (6.2.11) and (6.2.17), and with it follows that (6.2.20) By a succession of infinitestimal rotations we can construct the transformation matrix S for a finite rotation through an angle 79. This is achieved by decomposing the finite rotation into a sequence of N steps 79/N 2 'Ij;~S(3,1 The requirement that the norm of 'Ij; remain invariant implies that = 'Ij;~cr. , i.e., 'lj;tstS'lj; = 'lj;t'lj;, whence sts = 1 and thus (det S)* det S = 1. Therefore, to within a phase factor, which is set to unity, detS = 1. 6.2 Lorentz Covariance of the Dirac Equation 'IjJ'(x') = S'IjJ(x) = lim (1 N-too 1190'12 = e2 = (cos 139 + 2 Ni '!90'12)N 'IjJ(X) 'IjJ %+ ia 12 sin %) 'IjJ(x) . (6.2.21) For the coordinates and other four-vectors, this succession of transformations implies that J~o ,. x = '!9 ( lL + N 0 000) 010 ( 0 -1 0 0 ) ... o 000 0 000) 0010 =exp {'!9 ( 0-100 o 000 }x= '!9 (00 000) 010 0 -1 0 0 ) x 0 000 (lL + N (10 cos'!9sin'!90 0 00) 0-sin'!9cos'!90 0 0 01 x, (6.2.22) and is thus identical to the usual rotation matrix for rotation through an angle '!9. The transformation S for rotations (6.2.21) is unitary (S-l = st). From (6.2.21), one sees that S(27f) = S(47f) = -lL lL . (6.2.23a) (6.2.23b) This means that spinors do not regain their initial value after a rotation through 27f, but only after a rotation through 47f, a fact that is also confirmed by neutron scattering experiments3 . We draw attention here to the analogy with the transformation of Pauli spinors with respect to rotations: cp'(x') = e~w·O'cp(x) . (6.2.24) 6.2.2.3 Lorentz Transformation Along the xl Direction According to (6.2.10), .:1w0 1 = .:1{3 (6.2.25) and (6.2.17) becomes 1 T(L 1) = 2.:1{3'YO'Y1 1 = 2.:1{3a1 . (6.2.26) We may now determine S for a finite Lorentz transformation along the axis. For the velocity ~, we have tanh'T} = ~. 3 Xl H. Rauch et al., Phys. Lett. 54A, 425 (1975); S.A. Werner et al., Phys. Rev. Lett. 35, 1053 (1975); also described in J.J. Sakurai, Modern Quantum Mechanics, p.162, Addison-Wesley, Red Wood City (1985). 140 6. Lorentz Transformations and Covariance of the Dirac Equation The decomposition of 17 into N steps of N leads to the following transformation of the coordinates and other four-vectors: , . ( g+-I 17)I.L ( g+-I 17 )V1 ... ( g+-I 17 )VN-1 Xv xI.L= hm N--+= N V1 N V2 N v gI.Lv = tSI.L v , IV I.L = (~1 o TH) , 1000) 12= ( 0100 0000 0000 0 00 ' (6.2.27) The N-fold application of the infinitesimal Lorentz transformation L1 (~) = 11 + ~I then leads, in the limit of large N, to the Lorentz transformation (6.1.11) cosh 17 - sinh 17 0 0) L ( ) = 'fJ 1 = ( - sinh 17 cosh 17 0 0 117 e 0 0 10 . 01 o 0 (6.2.27') N We note that the N infinitesimal steps of add up to 17. However, this does not imply a simple addition of velocities. We now calculate the corresponding spinor transformation S(Ld = J~= (1 + ~ = 11 cosh ~(1) N = e!Q1 (6.2.28) ~ + a1 sinh ~ For homogenous restricted Lorentz transformations, S is hermitian (S(L1)t = S(Ld)· For general infinitesimal transformations, characterized by infinitesimal antisymmetric LlwI.LV, equation (6.2.17) implies that SeA) = 11 - ~aI.LvlwV . This yields the finite transformation (6.2.29a) 6.2 Lorentz Covariance of the Dirac Equation 141 (6.2.29b) with wJ.LV = _WVJ.L and the Lorentz transformation reads A = eW , where the matrix elements of ware equal to wJ.L v' For example, one can represent a rotation through an angle {) about an arbitrary axis ft as S -e~1?ft.E , (6.2.29c) where (6.2.29d) 6.2.2.4 Spatial Reflection, Parity The Lorentz transformation corresponding to a spatial reflection is represented by _ AJ.L v (~1 ~ ~) (6.2.30) 0 0 -1 0 o 0 0 -1 The associated S is determined, according to (6.2.7), from 4 S-l"{J.LS = AJ.Lv"{V = LgJ.Lv"{v = gJ.LJ.L"{J.L, (6.2.31) v=l where no summation over J.L is implied. One immediately sees that the solution of (6.2.31), which we shall denote in this case by P, is given by (6.2.32) Here, e icp is an unobservable phase factor. This is conventionally taken to have one of the four values ±1, ±i; four reflections then yield the identity n. The spinors transform under a spatial reflection according to (6.2.33) The complete spatial reflection (parity) transformation for spinors is denoted by (6.2.33') where p(O) causes the spatial reflection x -+ -x. From the relationship "{O == (3 = (~ _ ~) one sees in the rest frame of the particle, spinors of positive and negative energy (Eq. (5.3.22)) that are eigenstates of P - with opposite eigenvalues, i.e., opposite parity. This means that the intrinsic parities of particles and antiparticles are opposite. 142 6. Lorentz Transformations and Covariance of the Dirac Equation 6.2.3 Further Properties of S For the calculation of the transformation of bilinear forms such as )Ii, (x), we need to establish a relationship between the adjoint transformations 8 t and 8- 1 . Assertion: (6.2.34a) where b= ±1 1: lOr AOO { :2: +1 ::::; -1 (6.2.34b) . Proof. We take as our starting point Eq. (6.2.7) AI-' v real, (6.2.35) and write down the adjoint relation (6.2.36) The hermitian adjoint matrix can be expressed most concisely as (6.2.37) By means of the anticommutation relations, one easily checks that (6.2.37) We insert this into the left- and is in accord with ,ot = ,0, ,kt = the right-hand sides of (6.2.36) and then multiply by from the left- and right-hand side to gain _,k. ,0 ,OAl-'v'O,V,o,o =,ost,O,I-',08t-1,o AI-'V'V = 8- 1,I-'S = ,08 t ,O,I-'(r°8 t ,O)-1 , ,0. Furthermore, on the left-hand side we have made the since (r0)-1 = substitution AI-' v,v = 8- 1,1-'8. We now multiply by 8 and 8- 1: Thus, 8,° 8 t ,0 commutes with all ,I-' and is therefore a multiple of the unit matrix (6.2.38) which also implies that (6.2.39) 6.2 Lorentz Covariance of the Dirac Equation 143 and yields the relation we are seeking4 (6.2.34a) Since (!,O)t = 1'0 and S1'° st are hermitian, by taking the adjoint of (6.2.39) one obtains S1'° st = b*1'°, from which it follows that (6.2.40) b* = b and thus b is real. Making use of the fact that the normalization of S is fixed by det S = 1, on calculating the determinant of (6.2.39), one obtains b4 = l. This, together with (6.2.40), yields: b= ±1. (6.2.41) The significance of the sign in (6.2.41) becomes apparent when one considers st S = st 1'01'0 S = hO S-11'0 S = b1'° AO .d,l 3 = bAOon + L bAO k 1'01'k . k=l ~ (6.2.42) ak st S has positive definite eigenvalues, as can be seen from the following. Firstly, det st S = 1 is equal to the product of all the eigenvalues, and these must therefore all be nonzero. Furthermore, st S is hermitian and its eigenfunctions satisfy st S'lj;a = a'lj;a, whence and thus a > O. Since the trace of st S is equal to the sum of all the eigenvalues, we have, in view of (6.2.42) and using Tr a k = 0, ° Thus, bAo > O. Hence, we have the following relationship between the signs of AOO and b: A00 ~ A 00 :::; 1 for -1 for b= 1 (6.2.34b) b = -1 . For Lorentz transformations that do not change the direction of time, we have b = 1; while those that do cause time reversal have b = -1. 4 Lt Note: For the Lorentz transformation (restricted L.T. and rotations) and for spatial reflections, one can derive this relation with b = 1 from the explicit representations. 144 6. Lorentz Transformations and Covariance of the Dirac Equation 6.2.4 Transformation of Bilinear Forms The adjoint spinor is defined by (6.2.43) We recall that 'ljJ t is referred to as a hermitian adjoint spinoL The additional introduction of ij; is useful because it allows quantities such as the current density to be written in a concise form. We obtain the following transformation behavior under a Lorentz transformation: thus, (6.2.44) Given the above definition, the current density (5.3.7) reads: (6.2.45) and thus transforms as (6.2.46) Hence, jl-' transforms in the same way as a vector for Lorentz transformations without time reflection. In the same way one immediately sees, using (6.2.3) and (6.2.44), that ij;(x)'ljJ(x) transforms as a scalar: ij;'(x')'ljJ'(x') = bij;(X')S-lS'ljJ(x') = bij;(x)'ljJ(x) . (6.2.47a) We now summarize the transformation behavior of the most important bilinear quantities under orthochronous Lorentz transformations, i.e., transformations that do not reverse the direction of time: ij;'(x')'ljJ'(x') = ij;(x)'ljJ(x) ij;'(x')'yI-''ljJ'(x') = AI-' vij; (x),·('ljJ (x) ij;'(x')al-'v'ljJ'(x') = AI-' pAl' uij;(x)aPu'ljJ(x) scalar (6.2.47a) vector (6.2.47b) antisymmetric tensor (6.2.47c) ij;' (X')'y5/1-''ljJ' (x') = (det A) AI-' vij;(X)'y5/ v'ljJ(X) ij;'(X')'y5'ljJ'(X') = (detA)ij;(xh5'ljJ(x) where 15 = sign is -1. it°/ 1 / 2/ 3 . We recall that detA pseudovector (6.2.47d) pseudoscalar, (6.2.47e) = ±1; for spatial reflections the 6.2 Lorentz Covariance of the Dirac Equation 6.2.5 Properties of the 145 Matrices "y We remind the reader of the definition of "(5 from the previous section: (6.2.48) and draw the reader's attention to the fact that somewhat different definitions may also be encountered in the literature. In the standard representation (5.3.21) of the Dirac matrices, "(5 has the form 5 "(= (Oll) llO' The matrix (6.2.48') satisfies the relations "(5 (6.2.49a) and (6.2.49b) By forming products of "(p., one can construct 16 linearly independent 4 x 4 matrices. These are rS r~ T rp'v = II (6.2.50a) = "(p. (6.2.50b) r;; = r i = O'p.v = '2 bp.,"(v] (6.2.50c) (6.2.50d) "(5"(p. (6.2.50e) P = "(5· The upper indices indicate scalar, vector, tensor, axial vector (= pseudovector), and pseudoscalar. These matrices have the following properiies 5 : (6.2.51a) 2. For every r a except r S == 3. For a i=- S we have Tr r a ll, there exists a r b, such that = 0. Proof. Tr r a (r b)2 = -Tr rb r a rb = -Tr r a (r b)2 Since (rb)2 = ±1, it follows that Tr r a = -Tr r (6.2.51c) a, thus proving the assertion. 5 Only some of these properties will be proved here; other proofs are included as problems. 146 6. Lorentz Transformations and Covariance of the Dirac Equation 4. For every pair r a, r b a -=I b there is a r e -=I 1, such that (3 = ±1, ±i. Proof follows by considering the r. 5. The matrices r a are linearly independent. Suppose that ran = (3re , L: Xara = 0 with complex coefficients Xa. From property 3 above a one then has Tr L Xa ra = Xs =0. a Multiplication by ra and use of the properties 1 and 4 shows that subsequent formation of the trace leads to Xa = o. 6. If a 4 x 4 matrix X commutes with every "1 M , then X <X 1. 7. Given two sets of "I matrices, "I and "I', both of which satisfy {"1 M, "IV} = 2gMV , there must exist a nonsingular M "I'M = M"IM M- 1 (6.2.51d) . This M is unique to within a constant factor (Pauli's fundamental theorem). 6.3 Solutions of the Dirac Equation for Free Particles 6.3.1 Spinors with Finite Momentum We now seek solutions of the free Dirac equation (5.3.1) or (5.3.17) (-it? + m)'l{i(x) (6.3.1) = 0 . Here, and below, we will set fi = c = 1. For particles at rest, these solutions [see (5.3.22)] read: 'l{i(+) (x) = ur(m, 0) e- imt 'l{i(-)(x) = vr(m, O)eimt r = 1,2 (6.3.2) , for the positive and negative energy solutions respectively, with ",(m,O) v,(m, O) m' ~ m' ~ m ' ~ m' ~ ",(m, O) v,(m, O) (6.3.3) 6.3 Solutions of the Dirac Equation for Free Particles 147 and are normalized to unity. These solutions of the Dirac equation are eigenfunctions of the Dirac Hamiltonian H with eigenvalues ±m, and also of the operator (the matrix already introduced in (6.2.19)) a 12 _ ~[1 - 2 ,,' 2] _ - (a 3 0 ) 0 a3 (6.3.4) with eigenvalues +1 (for r = 1) and -1 (for r = 2). Later we will show that a 12 is related to the spin. We now seek solutions of the Dirac equation for finite momentum in the form 6 7jJ(+)(x) = ur(k) e- ik . x 7jJ(-)(x) = vr(k) eik .x positive energy (6.3.5a) negative energy (6.3.5b) with kO > O. Since (6.3.5a,b) must also satisfy the Klein-Gordon equation, we know from (5.2.14) that (6.3.6) or (6.3.7) where kO is also written as E; i.e., k is the four-momentum of a particle with mass m. The spinors ur(k) and vr(k) can be found by Lorentz transformation of the spinors (6.3.3) for particles at rest: We transform into a coordinate system that is moving with velocity -v with respect to the rest frame and then, from the rest-state solutions, we obtain the free wave functions for electrons with velocity v. However, a more straightforward approach is to determine the solutions directly from the Dirac equation. Inserting (6.3.5a,b) into the Dirac equation (6.3.1) yields: (6.3.8) Furthermore, we have ~ = k,.dJ,k"," = k/.L"~{r,} = k/.Lk"g/.L" . (6.3.9) Thus, from (6.3.6), one obtains (6.3.10) Hence one simply needs to apply (~ + m) to the ur(m, 0) and (~ - m) to the vr(m, 0) in order to obtain the solutions ur(k) and vr(k) of (6.3.8). The 6 We write the four-momentum as k, the four-coordinates as x, and their scalar product as k . x. 148 6. Lorentz Transformations and Covariance of the Dirac Equation normalization remains as yet unspecified; it must be chosen such that it is compatible with the solution (6.3.3), and such that ijj'lj; transforms as a scalar (Eq. (6.2.47a)). As we will see below, this is achieved by means of the factor 1/ J2m(m + E): ur(k) = ( E+m)l/2 2m Xr ~+m y'2m(m+E) ~ - +m vr(k) = ur(m, O) = ( (6.3.11a) u·k (2m(m + E))1/2 Xr vr(m,O) = J2m(m+E) (2m:~kE)1/ xrl Here, the solutions are represented by ur(m, 0) @ and X2 = m· . (6.3.11b) (E+m)l/2 2m with Xl = 1 = (~) Xr and vr(m, 0) = (~J In this calculculation we have made use of and From (6.3.11a,b) one finds for the adjoint spinors defined in (6.2.43) ~+m _ ur(m, 0) . / y2m(m+E) _ _ -~+m vr(k) = vr(m, 0)--;:::==== y'2m(m+E) _ ur(k) = (6.3.12a) (6.3.12b) Proof. Furthermore, the adjoint spinors satisfy the equations ur(k) (~- m) = 0 (6.3.13a) 6.3 Solutions of the Dirac Equation for Free Particles 149 and (6.3.13b) as can be seen from (6.3.10) and (6.3.12a,b) or (6.3.8). 6.3.2 Orthogonality Relations and Density We shall need to know a number of formal properties of the solutions found above for later use. From (6.3.11) and (6.2.37) it follows that: (6.3.14a) With ur(m, O)(~ + m)2us(m, 0) = ur(m, 0)(~2 + 2m~ + m 2)us(m, 0) = ur(m, 0)(2m2 + 2m~)us(, 0) = ur(m, 0)(2m2 + 2mko,,,o)u s (m, 0) = 2m(m + E)ur(m, O)us(m, 0) = 2m(m + E)8rs , _ _ ~2 _ m2 ur(k)vs(k) = ur(m, 0) 2m(m + E) vs(m, 0) (6.3.14b) (6.3.14c) = ur(m, 0) 0 vs(m, 0) = 0 and a similar calculation for Vr (k), equations (6.3.14a, b) yield the orthogonality relations ur(k) us(k) vr(k) vs(k) ur(k) vs(k) vr(k) us(k) o O. (6.3.15) Remarks: (i) This normalization remains invariant under orthochronous Lorentz transformations: t 08-18 Us = Ur - Us = Us:rs . u-Ir UsI = u rt 8t 'Y 08 Us = ur'Y (6.3.16) (ii) For these spinors, ijj(x)'lj;(x) is a scalar, (6.3.17) is independent of k, and thus independent of the reference frame. In general, for a superposition of positive energy solutions, i.e., for 150 6. Lorentz Transformations and Covariance of the Dirac Equation 2 2 '!fi(+l(x) = LCrUr , with L r=l r=l Icr l2 = 1, (6.3.18a) one has the relation 2 ~(+lx)'!fi = L ur(k)us(k)c; Cs = L Icr l2 = 1 . r,s (6.3.18b) r=l Analogous relationships hold for '!fie - l . (iii) If one determines ur(k) through a Lorentz transformation corresponding to -v, this yields exactly the above spinors. Viewed as an active transformation, this amounts to transforming ur(m, 0) to the velocity v. Such a transformation is known as a "boost". The density for a plane wave (c = 1) is P = jO = ~,0'!fi. This is not a Lorentz-invariant quantity since it is the zero-component of a four-vector: ~+l (xhO '!fi~+l(x) ur(kho us(k) {~"O} E _ = ur(k) us(k) = - ors (6.3.19a) vr(khO vs(k) _ {~"O} E = -vr(k)-2- vs(k) = - ors· (6.3.19b) = 2m ~-l(xhO '!fii-l(x) m = m m In the intermediate steps here, we have used us(k) us(k)~/m (Eqs. (6.3.8) and (6.3.13)) etc. = us(k) = (~/m)usk, Note. The spinors are normalized such that the density in the rest frame is unity. Under a Lorentz transformation, the density times the volume must remain constant. The volume is reduced by a factor (32 and thus the density must increase by the reciprocal factor ~ = .§... VI - yl-i32 'Tn We now extend the sequence of equations (6.3.19). For and '!fi~+l(x) '!fii-l(x) = e-i(kOxO-k.xlur(k) = ei(kOxO+k.xlvs(k) with the four-momentum ~ - l (xho '!fii+ l (x) (6.3.20) k = (kO, -k), one obtains = e-2ikoxo vr(khO us(k) = ~e-2ikOx 2 vr(k) (_ ~ m ,0 + ,a.!) us(k) m (6.3.19c) =0 since the terms proportional to ko cancel and since {kni, ,O} = o. In this sense, positive and negative energy states are orthogonal for opposite energies and equal momenta. 6.3 Solutions of the Dirac Equation for Free Particles 151 6.3.3 Projection Operators The operators (6.3.21) project onto the spinors of positive and negative energy, respectively: A_vr(k) A_ur(k) A+ur(k) = ur(k) A+vr(k) = 0 = vr(k) =0 . (6.3.22) Thus, the projection operators A±(k) can also be represented in the form r=1,2 A_(k) (6.3.23) L =- vr(k) Q9 vr(k) . r=1,2 The tensor product Q9 is defined by (6.3.24) In matrix form, the tensor product of a spinor a and an adjoint spinor Ii reads: The projection operators have the following properties: = A±(k) =2 A+(k) + A_(k) = 1 . Ai(k) (6.3.25a) Tr A±(k) (6.3.25b) (6.3.25c) Proof: A±(k)2 = (±~+m? = 2m(±~ = ~2±m+ 4m 2 + m) 4m 2 = m2±~+ 4m 2 = A±(k) 4m 2 4m Tr A±(k) = 2m = 2 The validity of the assertion that A± projects onto positive and negative energy states can be seen in both of the representations, (6.3.21) and (6.3.22), by applying them to the states ur(k) and vr(k). A further important projection operator, Pen), which, in the rest frame projects onto the spin orientation n, will be discussed in Problem 6.15. 152 6. Lorentz Transformations and Covariance of the Dirac Equation Problems 6.1 Prove the group property of the Poincare group. 6.2 Show, by using the transformation properties of xp" that ap, a/axP,) transforms as a contravariant (covariant) vector. == a/axp, (ap, == 6.3 Show that the N-fold application of the infinitesimal rotation in Minkowski space (Eq. (6.2.22)) iJ 00 000) 010 A=l+ N ( 0-100 o 000 leads, in the limit N -+ last step in (6.2.22)). 00, to a rotation about the z axis through an angle iJ (the 6.4 Derive the quadratic form of the Dirac equation [(ina - ~A r -i~e (aE + iEB) - m 2 c2 ] ~ =0 for the case of external electromagnetic fields. Write the result using the electromagnetic field tensor Fp,v = Ap"v - Av,p" and also in a form explicitly dependent on E and B. Hint: Multiply the Dirac equation from the left by "Iv (inav - ~Av) + me and, by using the commutation relations for the "I matrices, bring the expression obtained into quadratic form in terms of the field tensor [(ina - ~A r -~>p,v Fp,v - m 2 e2 ] ~ = O. The assertion follows by evaluating the expression ap,v Fp,v using the explicit form of the field tensor as a function of the fields E and B. 6.5 Consider the quadratic form of the Dirac equation from Problem 6.4 with the fields E = Eo (1,0,0) and B = B (0,0,1), where it is assumed that Eo/Be :s; 1. Choose the gauge A = B (0, x, 0) and solve the equation with the ansatz ~(x) = e-iEt/nei(kyY+kzz)cp(x)if> , where if> is a four-spinor that is independent of time and space coordinates. Calculate the energy eigenvalues for an electron. Show that the solution agrees with that obtained from Problem 5.3 when one considers the nonrelativistic limit, i.e., Eo/Be« 1. Hint: Given the above ansatz for~, one obtains the following form for the quadratic Dirac equation: [K(x, ax):ll. + M] cp(x)if> = 0 , Problems ax) 153 ax where K(x, is an operator that contains constant contributions, and x. The matrix M is independent of and x; it has the property M2 ex: 11. This suggests that the bispinor <P has the form <P = (11 + AM)<Po . Determine A and the eigenvalues of ]tf. With these eigenvalues, the matrix differential equation reverts into an ordinary differential equation of the oscillator type. ax 6.6 Show that equation (6.2.14') bl', T] = Llw I' " "(" is satisfied by T = ~Llw'h( - ,,(",,(I') . 6.8 Show that the relation st "(0 = b,,(o S-l is satisfied with b = 1 by the explicit representations of the elements of the Poincare group found in the main text (rotation, pure Lorentz transformation, spatial reflection). 6.9 Show that 1{J(Xh51/J(X) is a pseudoscalar, 1{J(Xh5,,(1'1/J(X) a pseudovector, and 1{J(x)(J"I'''1/J(x) a tensor. ra. 6.10 Properties of the matrices Taking as your starting point the definitions (6.2.50a-e), derive the following properties of these matrices: (i) For every (except there exists a such that = (ii) For every pair (a i= b) there exists a c i= 11 such that = (3 c with (3 = ±1, ±i. ra r S) r a, r b, rb r r arb rb ra. r arb r 6.11 Show that if a 4 x 4 matrix X commutes with all "(I', then this matrix X is proportional to the unit matrix. Hint: Every 4 x 4 matrix can, according to Problem 6.1, be written as a linear combination of the 16 matrices r a (basis!). 6.12 Prove Pauli's fundamental theorem for Dirac matrices: For any two fourdimensional representations "(I' and "(~ of the Dirac algebra both of which satisfy the relation bl""("} = 291''' there exists a nonsingular transformation M such that "(~ = M"(I'M- 1 . M is uniquely determined to within a constant prefactor. 154 6. Lorentz Transformations and Covariance of the Dirac Equation 6.13 From the solution of the field-free Dirac equation in the rest frame, determine the four-spinors 'Ij;±(x) of a particle moving with the velocity v. Do this by applying a Lorentz transformation (into a coordinate system moving with the velocity -v) to the solutions in the rest frame. 6.14 Starting from A+(k) = E ur(k) @ A_(k) ur(k) , r=1,2 =- E vr(k) @vr(k) , r=1,2 prove the validity of the representations for A±(k) given in (6.3.22). 6.15 (i) Given the definition P(n) = ~ (1 +)'5~, show that, under the assumptions n 2 = -1 and nJLkJL = 0, the following relations are satisfied (a) (b) (c) (d) °, [A±(k),P(n)] = A+(k)P(n) + A_(k)P(n) Tr [A±(k)P(±n)] = 1 P(n)2 + A+(k)P( -n) + A_(k)P( -n) = 1 = P(n) (ii) Consider the special case n = (0, Ez) where P(n) =~ (1 ~ 1~ 0'3 0'3 ) . , 7. Orbital Angular Momentum and Spin We have seen that, in nonrelativistic quantum mechanics, the angular momentum operator is the generator of rotations and commutes with the Hamiltonians of rotationally invariant (i.e., spherically symmetric) systems l . It thus plays a special role for such systems. For this reason, as a preliminary to the next topic - the Coulomb potential - we present here a detailed investigation of angular momentum in relativistic quantum mechanics. 7.1 Passive and Active Transformations For positive energy states, in the non-relativistic limit we derived the Pauli equation with the Lande factor 9 = 2 (Sect. 5.3.5). From this, we concluded that the Dirac equation describes particles with spin S = 1/2. Following on from the transformation behavior of spinors, we shall now investigate angular momentum in general. In order to give the reader useful background information, we will start with some remarks concerning active and passive transformations. Consider a given state Z, which in the reference frame I is described by the spinor 'IjJ(x). When regarded from the reference frame I', which results from I through the Lorentz transformation x' =Ax, (7.1.1) the spinor takes the form, passive with A . (7.1.2a) A transformation of this type is known as a passive transformation. One and the same state is viewed from two different coordinate systems, which is indicated in Fig. 7.1 by 'IjJ(x) ~ 'IjJ1(X'). On the other hand, one can also transform the state and then view the resulting state Zl exactly as the starting state Z from one and the same reference frame I. In this case one speaks of an active transformation. For vectors and scalars, it is clear what is meant by their active transformation 1 See QM I, Sect. 5.1 156 7. Orbital Angular Momentum and Spin (rotation, Lorentz transformation). The active transformation of a vector by the transformation A corresponds to the passive transformation of the coordinate system by A-I. For spinors, the active transformation is defined in exactly this way (see Fig. 7.1). The state Z', which arises through the transformation A-I, appears in [ exactly as Z in [', i.e., active with A-I l' ~-LI (7.1.2b) Fig. 7.1. Schematic representation of the passive and active transformation of a spinor; the enclosed area is intended to indicate the region in which the spinor is finite The state Z", which results from Z through the active transformation A, by definition appears the same in [' as does Z in [, i.e., it takes the form 'Ij;(x'). Since [ is obtained from [' by the Lorentz transformation A-I, in [ the spinor Z" has the form 'Ij;"(x) = S-I'lj;(Ax) , active with A . (7.1.2c) 7.2 Rotations and Angular Momentum Under the infinitesimal Lorentz transformation (7.2.1) a spinor 'Ij;(x) transforms as passive with A (7.2.2a) active with A-I. (7.2.2b) or 7.2 Rotations and Angular Momentum 157 We now use the results gained in Sect. 6.2.2.1 (Eqs. (6.2.8) and (6.2.13)) to obtain (7.2.3) Taylor expansion of the spinor yields (1 - L1w/Lvxvo/L) 'IjJ(x) , so that (7.2.3') We now consider the special case of rotations through L1cp, which are represented by (7.2.4) (the direction of L1cp specifies the rotation axis and tion). If one also uses lL1cpl the angle of rota- (7.2.5) (see Eq. (6.2.19)) one obtains for (7.2.3') (1 + = (1 = (1 = (1 + 'IjJ'(x) = -~ L1w ij ( Eijk L1(/' L1(i ( iL1cpk -~ Eijk Ek ( -~ + XiOj ) ) 'IjJ(x) Eijk Ek - XiOj) ) 'IjJ(x) 20kk E k - EijkXiOj) ) 'IjJ(x) (~Ek + Ekijx~ (7.2.6) OJ) ) 'IjJ(x) == (1 + iL1cpk Jk) 'IjJ(x) . Here, we have defined the total angular momentum 1 = Ek')··· x' -i o· + -21 E k . ) Jk (7.2.7) With the inclusion of Ii, this operator reads: Ii i J = x x - Vn. Ii + -2 1J ' (7.2.7') and is thus the sum of the orbital angular momentum L = x x p and the . Spill Ii "2 ~. '("1 The total angular momentum (= orbital angular momentum + spin) is the generator of rotations: For a finite angle cpk one obtains, by combining a succession of infinitesimal rotations, 158 7. Orbital Angular Momentum and Spin (7.2.8) The operator Jk commutes with the Hamiltonian of the Dirac equation containing a spherically symmetric potential cP(x) = cP(lxl) (7.2.9) A straightforward way to verify (7.2.9) is by an explicit calculation of the commutator (see Problem 7.1). Here, we consider general consequences resulting from the behavior, under rotation, of the structure of commutators of the angular momentum with other operators; Eq. (7.2.9) results as a special case. We consider an operator A, and let the result of its action on 'l/Jl be the spinor 'l/J2: It follows that ei<pkJk Ae-i<pkJk (ei<pkJ k 'l/Jl(X)) = (ei<pkJk 'l/J2(X)) or, alternatively, Thus, in the rotated frame of reference the operator is (7.2.10) Expanding this for infinitesimal rotations (cpk -+ L1cpk) yields: (7.2.11) The following special cases are of particular interest: (i) A is a scalar (rotationally invariant) operator. Then, A' (7.2.11) it follows that = A and from (7.2.12) The Hamiltonian of a rotationally invariant system (including a spherically symmetric cP(x) = cP(lxl)) is a scalar; this leads to (7.2.9). Hence, in spherically symmetric problems the angular momentum is conserved. (ii) For the operator A we take the components of a three-vector v . As a vector, v transforms according to vii = vi + Eijk L1cpJ v k . Equating this, component by component, with (7.2.11), Vi + (ijk L1cp j v k = Vi + *L1cpJ [Jj, Vi] which shows that (7.2.13) Problems 159 The commutation relation (7.2.13) implies, among other things, [J i , Jj] = inEijk Jk [Ji,Lj] = inEijkL k It is clear from the explicit representation Ek (7.2.14a) (7.2.14b) (~k: ) that the eigen- values of the 4 x 4 matrices Ek are doubly degenerate and take the values ±l. The angular momentum J is the sum of the orbital angular momentum L and the intrinsic angular momentum or spin S, the components of which have the eigenvalues ±~. Thus, particles that obey the Dirac equation have spin S = 1/2. The operator (.1!17)2 = ~n21l has the eigenvalue 3~2. The eigenvalues of L2 and L3 are n~l( + 1) and nml, where 1 = 0,1,2, ... and ml takes the values -l, -l + 1, ... ,l - 1, l. The eigenvalues of J2 are thus 2 j(j + 1), where j = 1 ± ~ for 1 i- 0 and j = ~ for 1 = O. The eigenvalues of J3 are nmj, where mj ranges in integer steps between - j and j. The operators J2, L2, 172, and J3 can be simultaneously diagonalized. The orbital angular momentum operators Li and the spin operators Ei themselves fulfill the angular momentum commutation relations. n Note: One is tempted to ask how it is that the Dirac Hamiltonian, a 4 x 4 matrix, can be a scalar. In order to see this, one has to return to the transformation (6.2.6'). The transformed Hamiltonian including a central potential p(lxl) (_i-yV 8~ + m + ep(lx'l)) = S( -i'"'( 8 v + m + ep(lxl))S-l has, under rotations, the same form in both systems. The property "scalar" means invariance under rotations, but is not necessarily limited to one-component spherically symmetric functions. Problems 7.1 Show, by explicit calculation of the commutator, that the total angular momentum commutes with the Dirac Hamiltonian for a central potential H = c (~ ci p k + f3mc) + eP(lxl) , 8. The Coulomb Potential In this chapter, we shall determine the energy levels in a Coulomb potential. To begin with, we will study the relatively simple case of the Klein-Gordon equation. In the second section, the even more important Dirac equation will be solved exactly for the hydrogen atom. 8.1 Klein-Gordon Equation with Electromagnetic Field 8.1.1 Coupling to the Electromagnetic Field The coupling to the electromagnetic field in the Klein-Gordon equation _It2 {P1jJ = _It2c2\J2nt. + m 2c4nt. at 2 'I' 'I' , i.e., the substitution .t. In ata ~ .t. In ata - e'l' , J.. -;V~A, It It e I I C leads to the Klein-Gordon equation in an electromagnetic field (8.1.1) We note that the four-current-density now reads: (8.1.2) with the continuity equation (8.1.3) One thus finds, in jO for example, that the scalar potential AO = ~ appears. 162 8. The Coulomb Potential 8.1.2 Klein-Gordon Equation in a Coulomb Field We assume that A and cJ> are time independent and now seek stationary solutions with positive energy 'lj;(x, t) = e-iEt/fi'lj;(x) with E>O. (8.1.4) From (8.1.1), one then obtains the time-independent Klein-Gordon equation (E - ecJ»2'lj; = c2 (~v _~A ) 2 'lj; + m 2c4 'lj; . For a spherically symmetric potential cJ>(x) ---+ cJ>(r) (r follows that (8.1.5) = Ixl) and A = 0, it (8.1.6) The separation of variables in spherical polar coordinates 'lj;(r, fJ, '1') = R(r)Yem(fJ, '1') , (8.1.7) where Yem(fJ, '1') are the spherical harmonic functions, already known to us from nonrelativistic quantum mechanics,l leads, analogously to the nonrelativistic theory, to the differential equation (8.1.8) Let us first consider the nonrelativistic limit. If we set E = mc 2 + E' and assume that E' - ecJ> can be neglected in comparison to mc2, then (8.1.8) yields the nonrelativistic radial Schrodinger equation, since the right-hand side of (8.1.8) becomes n21c2 ((mc 2)2 + 2mc2(E' - ecJ>(r)) + (E' - (8.1.9) 2m ~ fi2(E' - ecJ>(r))R(r). For a 7r- meson in the Coulomb field of a nucleus with charge Z, Ze~ (8.1.lOa) ecJ>(r) = - - . r Inserting the fine-structure constant a [ _~.!i ecJ>(r))2 - m 2c4 ) R(r) = ~, 2 it follows from (8.1.8) that ..!ir+ £(£+ 1) - Z2 a 2 _ 2ZaE _ E2 -m 2c4 ] R = O. r dr dr r2 ncr n2c2 (8.1.10b) 1 QM I, Chap. 5 8.1 Klein-Gordon Equation with Electromagnetic Field 163 Remark: The mass of the 7r meson is m,,- = 273m e and its half-life T,,- = 2.55 X 10- 8 s. Since the classical orbital period 1 estimated by means of the uncertainty m~ a 21 m a principle is approximately T::::; a.l~:; "-Ii ,,m;:Ii ::::; 10- s one can think of well-defined stationary states, despite the finite half-life of the 7r-. Even the lifetime of an excited state (see QM I, Sect. 16.4.3) 6.T::::; Ta- 3 ::::; 10- 15 is still much shorter than T,,- . 2 2 By substituting 2 u = 4(m2e4 - E2) 1i2 e2 ' \ _ 2EI' lieu ' p = ur I' = Za, /\ - (8. 1. 11 a-d) into (8.1.10b), we obtain [ d2 d(p/2)2 2,\ + p/2 - 1- £(£+1)-1'2] (p/2)2 pR(p) = O. (8.1.12) This equation has exactly the form of the nonrelativistic Schrodinger equation for the function u = pR, provided we substitute in the latter Po ---+ 2,\ £(£ + 1) ---+ £(£ + 1) _1'2 == £'(£' + 1) . (8.1.13a) (8.1.13b) Here it should be noted that £' is generally not an integer. Remark: A similar modification of the centrifugal term is also found in classical relativistic mechanics, where it has as a consequence that the Kepler orbits are no longer closed. Instead of ellipses, one has rosette-like orbits. The radial Schrodinger equation (8.1.12) can now be solved in the same way as is familiar from the nonrelativistic case: From (8.1.12) one finds for R(p) in the limits p -+ 0 and p -+ 00 the behavior / ' and e- p / 2 respectively. This suggests the following ansatz for the solution: pR(p) = (~) £' +1 e- p / 2 w(p/2). (8.1.14) The resulting differential equation for w(p) (Eq. (6.19) in QM I) is solved in terms of a power series. The recursion relation resulting from the differential equation is such that it leads to a function rv eP • Taken together with (8.1.14), this means that the function R(p) would not be normalizable unless the power series terminated. The condition that the power series for w(p) terminates yields 2 : Po = 2(N + £' + 1) , i.e., 2 Cf. QM I, Eq. (6.23). 164 8. The Coulomb Potential ,X=N+£'+I, (8.1.15) where N is the radial quantum number, N = 0,1,2, .... In order to determine the energy eigenvalues from this, one first needs to use equations (8.1.lIa and d) to eliminate the auxiliary quantity a 4E2')'2 4(m 2c4 - E2) h 2c2,X2 h 2c2 which then yields the energy levels as (8.1.16) Here, one must take the positive root since the rescaling factor is a > 0 and, as'x > 0, it follows from (8.1.lIc) that E > o. Thus, for a vanishing attractive potential (')' -+ 0), the energy of these solutions approaches the rest energy E = mc 2 . For the discussion that follows, we need to calculate £', defined by the quadratic equation (8.1.13b) £' =- ~ (±) j(f + ~ r- ')'2 . (8.1.17) We may convince ourselves that only the positive sign is allowed, i.e., ,X = N ~ + + j (£ + ~ r- ')'2 and thus (8.1.18) To pursue the parallel with the nonrelativistic case, we introduce the principal quantum number n=N+f+l, whereby (8.1.18) becomes (8.1.18') The principal quantum number has the possible values n = 1,2, ... ; for a given value of n, the possible values of the orbital angular momentum quantum numbers are f = 0,1, ... n - 1. The degeneracy that is present in the 8.1 Klein-Gordon Equation with Electromagnetic Field 165 nonrelativistic theory with respect to the angular momentum is lifted here. The expansion of (8.1.18') in a power series in ""'(2 yields: (n 3) ]+Oh) (1+ 3) + ""'(2- -""'(4 E=mc2 [ 1 - ---2n 2 2n 4 £ + 1.2 4 = mc2 - Ry RY""'(2 - -- n2 n3 4 -- - - £ 1.2 6 4n O(RY""'( ) , (8.1.19) with The first term is the rest energy, the second the nonrelativistic Rydberg formula, and the third term is the relativistic correction. It is identical to the perturbation-theoretical correction due to the relativistic kinetic energy, giving rise to the perturbation Hamiltonian Hl = - J:~ (12.5))3. It is this correction that lifts the degeneracy in £: ER=o - ER=n-l = 4RY""'(2 ---3- n n- 1 n- (see QM I, Eq. (8.1.20) -2 1· The binding energy Eb is obtained from (8.1.18') or (8.1.19) by subtracting the rest energy Further aspects: (i) We now wish to justify the exclusion of solutions £', for which the negative root was taken in (8.1.17). Firstly, it is to be expected that the solutions should go over continuously into the nonrelativistic solutions and thus that to each £ should correspond only one eigenvalue. For the time being we will There are a number of arguments for denote the two roots in (8.1.17) by excluding the negative root. The solution can be excluded on account of the requirement that the kinetic energy be finite. (Here only the lower limit is relevant since the factor e- p / 2 guarantees the convergence at the upper limit): £±. T J J rv - rv dr r2 {)2 R . R {)2r drr 2 (rRI-l)2 J J rv rv £'- dr r2 ({) R) 2 {)r drr2R'. This implies that £' > - ~ and, hence, only £'+ is allowed. Instead of the kinetic energy, one can also consider the current density. If solutions with 3 See also Remark (ii) in Sect. 10.1.2 166 8. The Coulomb Potential both £'t- and £'- were possible, then one would also have linear superpositions of the type 'l/J = 'l/Je + i'l/J£, . The radial current density for this wave function +- . IS jr = 2!i = ('l/J* Ii . 2mi 21 ! 'l/J - (:r 'l/J* ) 'l/J ) (a a) 'l/J£,+ ar'l/J£,- - 'l/J£,- ar'l/J£,+ £' +£' rv r+ - -1 1 = r2 . The current density would diverge as ~ for r -+ o. The current through the surface of an arbitrarily small sphere around the origin would then be f dilr2jr = constant, independent of r. There would have to be a source or a sink for particle current at the origin. The solution £'t- must certainly be retained as it is the one that transforms into the nonrelativistic solution and, hence, it is the solution with which must be rejected. One can confirm this conclusion by solving the problem for a nucleus of finite size, for which the electrostatic potential at r = 0 is finite. The solution that is finite at r = 0 goes over into the solution of the ~ problem corresponding to the positive sign. £'- (ii) In order that £' and the energy eigenvalues be real, according to (8.1.17) we must have 1 £+"2 > Za (8.1.21a) (see Fig. 8.1). This condition is most restrictive for s states, i.e., £ = 0 : Z < ~ 2a = 137 2 = 68.5 . (8.1.21b) J For "( > ~, we would have a complex value £' = - ~ + is' with s' = "(2 This would result in complex energy eigenvalues and furthermore, we would have R(r) rv r-!e±is'logr, i.e., the solution would oscillate infinitely many times as r -+ 0 and the matrix element of the kinetic energy would be divergent. The modification of the centrifugal term into (£(£ + 1) - (Za)2~ arises from the relativistic mass increase. Qualitatively speaking, the velocity does not increase so rapidly on approaching the center as in the nonrelativistic case, and thus the centrifugal repulsion is reduced. For the attractive (- /2) potential, classical mechanics predicts that the particles spiral into the center. When Za > £ + ~ > ..j£(£ + 1), the quantum-mechanical system becomes unstable. The condition Za < ~ can also be written in the form 2 1. Z n/:'~_c < ~m7r-c2, i.e., the Coulomb energy at a distance of a Compton wavelength _n_ = 1.4 x 10- 13 cm from the origin should be smaller than m1l'-c 8.1 Klein-Gordon Equation with Electromagnetic Field o L -_ _ ~ ____ ~ ________ 100 ~ __ 167 Fig. 8.1. Plot of E l • and E lp for a point-like nucleus according to Eq. (8.1.18) as a function of Z. The curves end at the Z value given by (8.1.21a). For larger Z, the energies become complex ~ 200 Z The solutions for the (-~) potential become meaningless for Z > 68. Yet, since there exist nuclei with higher atomic number, it must be possible to describe the motion of a 7[- meson by means of the Klein-Gordon equation. However, one must be aware of the fact that real nuclei have a finite radius which means that also for large Z, bound states exist. ll The Bohr radius for 7[- is a 7r _ = Zm/'i. 2_ eg =...!&....!!. cm ' where m _ Z i'::j 2xlOZ a = 0.5 x 10- 8 cm, the Bohr radius of "the electr;n, and m 7r - = 270me have been used. Comparison with the nuclear radius RN = 1.5 X 10- 13 A1/3 cm reveals that the size of the nucleus is not negligible 4 . For a quantitative comparison of the theory with experiments on 7r-mesonic atoms, one also has to take the following corrections into account: (i) The mass m7l" must be replaced by the reduced mass /-L = :;",~ . (ii) As already emphasized, one must allow for the finite size of the nucleus. (iii) The vacuum polarization must be included. This refers to the fact that the photon exchanged between the nucleus and the 7r-meson transforms virtually into an electron-positron pair, which subsequently recombines into a photon (see Fig. 8.2). + N (a) 7r N (b) Fig. 8.2. The electromagnetic interaction arises from the exchange of a photon C'Y) between the nucleus (N) and the 7r-meson (7r-). (a) Direct exchange; (b) with vacuum polarization in which a virtual electron-positron pair (e- -e+) occurs 4 The experimental transition energies for 7r-mesonic atoms, which lie in the xray range, are presented in D.A. Jenkins and R. Kunselman, Phys. Rev. Lett. 17, 1148 (1966), where they are compared with the result obtained from the Klein-Gordon equation. 168 8. The Coulomb Potential (iv) Since the Bohr radius for the 7r-, as estimated above, is smaller by approximately a factor 1/300 than that of the electron, and thus the probability of finding the 7r- in the vicinity of the nucleus is appreciable, one must also include a correction for the strong interaction between the nucleus and the 7r-. 8.2 Dirac Equation for the Coulomb Potential In this section, we shall determine the exact solution of the Dirac equation for an electron in a Coulomb potential V(r) = _ Ze5 . (8.2.1) r From (8.2.2) one finds, for A = 0 and e«P == -~ == V(r), the Dirac Hamiltonian + f3mc 2 + V(r) H = co:· p (8.2.3) and, with 'ljJ(x, t) = e-iEt/n'ljJ(x), the time-independent Dirac equation (co: . p + f3mc 2 + V(r))'ljJ(x) (8.2.4) = E'ljJ(x). Here too, it will turn out to be useful to represent H in spherical polar coordinates. To achieve this end, we first exploit all symmetry properties of H. The total angular momentum J from (7.2.7') (8.2.5) commutes with H. This implies that H, J2, and Jz have common eigenstates. Remarks: (i) The operators L z , E z , and L2 do not commute with H. (ii) For E = (~), it follows that (~E)2 = 31211 = ~ (1 + ~) diagonal. (iii) L2, E2, and L . E, like H,are scalars and thus commute with J. h,2 11 is 8.2 Dirac Equation for the Coulomb Potential 169 As a necessary prerequisite for an exact solution of the Dirac equation, we first discuss the Pauli spinors. As we know from nonrelativistic quantum mechanics 5, the Pauli spinors are common eigenstates of J2, J z , and L2 with the corresponding quantum numbers j, m, and £, where J = L + ~O' is now the operator of the total angular momentum in the space of two-component spinors. From the product states 1£, mj + 1/2) 1+) 1£, mj - 1/2) It) (8.2.6) or (in Dirac ket space or in the coordinate representation), one forms linear combinations that are eigenstates of J2 , J z , and L2. For a particular value of £, one obtains . J 1 = £+-2 and j = £- ~2' (8.2.7) The coefficients that appear here are the Clebsch-Gordan coefficients. Compared to the convention used in QM I, the spinors 'P;-;'!J now contain an additional factor -1. The quantum number £ takes the values £ = 0,1,2, ... , whilst j and mj have half-integer values. For £ = 0, the only states are 'P;;;j == 'P~j The states 'P;-;,!j only exist for £ > 0, since l a negative j. The spherical harmonics satisfy = ° would imply Y£:m = (-1)mYc,_m. The eigenvalue equations for (8.2.8) 'P;;;J are (henceforth we set Ii = 1): . 1 £=J-=f- 2 mj = -j, ... ,j . 5 QM I, Chap. 10, Addition of Angular Momenta (8.2.9) 170 8. The Coulomb Potential Furthermore, we have L . 0" ~Jmj (±) = (J 2 - L2 -~) = (j(j + 1) - = { -f f_ 1 } (±) 4 ~Jmj f(f + 1) - ~) (8.2.10) ~)!3 + (j + 1/2)} -1-(j+1/2) ~jm _ { -1 - ~)!j (±) for j 1 =f±"2. The following definition will prove useful (8.2.11) K=(l+L·O") whereby, according to (8.2.10), the following eigenvalue equation holds: K{f)(±) = TJm 3 ± (j + ~)2 (f)(±) TJm 3 == k{f)(±) TJm . (8.2.12) 3 The parity of Yfm can be seen from (8.2.13) D 3 ) t h ere are two P au1·· (+) an d ~jm' (-) ror each va1ue 0 f·) (12' 2'··. 1 spmors, ~jm3 whose orbital angular momenta f differ by 1, and which therefore have opposite parities. We introduce the notation ~jm· € - { f =)-2 . 1 (+) ~jm3 (8.2.14) - 3 (_) ~jm3 In place of the index (±), one gives the value of f, which yields the quantum number j by the addition (subtraction) of According to (8.2.13), ~;mj has parity (-1)€, i.e., !. (8.2.15) Remark: One may also write (f)(+) TJmj 0". x = -r- ~jm3 (-) (8.2.16) This relation can be justified as follows: The operator that generates ~)-:;j from ~)-:3 must be a scalar operator of odd parity. Furthermore, due to the difference L1f = 1, the position dependence is of the form Y1,mca, ~), and 8.2 Dirac Equation for the Coulomb Potential 171 thus proportional to x . Therefore, x must be multiplied by a pseudovector. The only position-independent pseudovector is u . A formal proof of (8.2.16) is left as an exercise in Problem 8.2. The Dirac Hamiltonian for the Coulomb potential is also invariant under spatial reflections, i.e., with respect to the operation (Eq. (6.2.33')) P = (3 p(O) where p(O) effects the spatial reflection6 x --t -x. One may see this directly by calculating (3p(O) H and making use of {3o. = -o.{3: [~o. (3p(O) = (3 = V [~o.( + (3m - ~a] - V) + (3m - nO. .V + {3m ~a 1f;(x) ~a ] 1f;( -x) (8.2.17) ] (3p(O)1f;(x) . Therefore, (3p(O) commutes with H such that [(3p(O) ,H] =0. (8.2.17') Since ({3p(O))2 = 1, it is clear that (3p(O) possesses the eigenvalues ±l. Hence, one can construct even and odd eigenstates of (3p(O) and H (8.2.18) Let us remark in passing that the pseudovector J commutes with (3p(O) . In order to solve (8.2.4), we attempt to construct the four-spinors from Pauli spinors. When 'P]m J appears in the two upper components, then, on account of {3, one must also in the lower components take the other Cbelonging to j, and hence, according to (8.2.16), u ,x'P]mj' This gives as solution ansatz the four-spinors 7 .,.£ _ 'PjmJ - ( iGe,(r) £ ) r 'P jm J FeJr(r) (u· x)CP]mJ . (8.2.19) These spinors have the parity (-1)£, since (8.2.20) 6 7 This can also be concluded from the covariance of the Dirac equation and the invariance of ~ under spatial reflections (Sect. 6.2.2.4). S'mce [J , u . x 1 -- 0 , l't' Jz .1'£ m .1'£ i S C1ear t h at J2 'i'jm J -_ j(j+l) 'i'jm J ' 172 8. The Coulomb Potential The factors ~ and i included in (8.2.19) will turn out to be useful later. In matrix notation the Dirac Hamiltonian reads H m= ( u.p Za r u·p ) (8.2.21 ) Za. -m- r In order to calculate H'ljJ]m' we require the following quantities 8 ; u·p fer) <P]mj = u·X u·X u·p fer) <P]m J u·X -r- (x·p + iu·L) fer) <P]mJ = 1)) fer) }<Pjm {Bf(r) ( (. = - l.u·X - r - r-a:;:- + 1 =t= J + 2" i J for j = t' ± 1/2 (8.2.22a) and (u·p)(u·x) fer) <P]mj = - ~ [r :r (j + ~)] +1± fer) <P]mJ (8.2.22b) for j = t' ± 1/2 . By means of (8.2.22a,b), the angle-dependent part of the momentum operator is eliminated, in analogy to the kinetic energy in nonrelativistic quantum mechanics. If one now substitutes (8.2.19), (8.2.21), and (8.2.22) into the time-independent Dirac equation (8.2.4), the radial components reduce to za) Gl·r ( )_ - dFlj(r) (E -m+r dr J za) (E +m+rrr r D ( 1) -Fij(r) r =t= (.J+2 for j = t' ± 1/2 ) J _ 1) -Glj(r) r (8.2.23) dGlj(r) =t= (.J+dr 2 for j = t' ± 1/2 . This system of equations can be solved by making the substitutions = m+E p = ra al a2 k=±(j+~) =m-E a 'Y = vm 2 - E2 = Vala2 = Za (8.2.24) with the condition E < m for bound states. One obtains 8 u. au· b = a . b + iu . a x b, =? u . xu· X = 1 (~ - x· 5-) = -~ r P · ~ r = ~ 1 V· ~ r = ~ I T r 8.2 Dirac Equation for the Coulomb Potential + ~) F - (0: 2_1.) G = 0 ( 5£ dp pap _ ~) p G - (0: 1+ 1.) F = ( 5£ dp ap 173 (8.2.25) 0. Differentiating the first equation and inserting it into the second, one sees that, for large p, F and G are normalizable solutions that behave like e- p • Thus, in (8.2.25) we make the ansatz F(p) = l(p)e- p ,G(p) = g(p)e- P (8.2.26) , which leads to kl (0:2 1 -1+-- - -'Y) g=O pap I 9 I (8.2.27) -g-r;- (0:1--;;:-+p'Y) 1=0. kg In order to solve the system (8.2.27), one introduces the power series: = 1= 9 pS(ao + a1 P + ... ) , ao i- 0 (8.2.28) pS (b o + b1 p + ... ) , bo i- 0 . Here, the same power 8 is assumed for 9 and 1 since different values would imply vanishing ao and bo, as can be seen by substitution into (8.2.27) in the limit p -+ O. For the solution to be finite at p = 0, 8 would have to be greater than, or equal to, 1. Our experience with the Klein-Gordon equation, however, prepares us to admit 8 values that are somewhat smaller than l. Substituting the power series into (8.2.27) and comparing the coefficients of ps+v-1 yields for v > 0: For v (8 + V + k)b v - (8 + V - = bv - 1 k)a v - a v -1 + 'Yav - 0:2 - a v -1 =0 (8.2.29a) 0:1 =0. (8.2.29b) a 'Yb v - - bv - a 1 0 one finds (8+k)bo +'Yao=0 (8-k)ao-'Yb o =0. (8.2.30) This is a system of recursion relations. The coefficients ao and bo differ from zero, provided the determinant of their coefficients in (8.2.30) disappears, i.e. _ (±) (k2 _ 8 - 'Y 2)1/2 . (8.2.31 ) The behavior of the wave function at the origin leads us to take the positive sign. Now, 8 depends only on k 2 , i.e., only on j. Thus, the two states of opposite parity that belong to j turn out to have the same energy. A relationship 174 8. The Coulomb Potential between a v and bv is obtained by multiplying the first recursion relation by a, the second by Ct2, and then subtracting (8.2.32) where we have used Ctl Ct2 = a 2 . In the following we may convince ourselves that the power series obtained, which do not terminate, lead to divergent solutions. To this end, we investigate the asymptotic behavior of the solution. For large 1/ (and this is also decisive for the behavior at large r) it follows from (8.2.32) that al/bv = Ct21/av , thus Ct2 bv = - a v , a and from the first recursion relation (8.2.29a) whence we finally find 2 2 1/ 1/ b v = - b v - 1 , a v = - av-l and thus, for the series, The two series would approach the asymptotic form e2p . In order for the solution (8.2.26) to remain well-behaved for large p, the series must terminate. Due to the relation (8.2.32), when a v = 0 we also have bv = 0 and, according to the recursion relations (8.2.29), all subsequent coefficients are also zero, since the determinant of this system of equations does not vanish for 1/ > o. Let us assume that the first two vanishing coefficients are aN+! = bN +! = O. The two recursion relations (8.2.29a,b) then yield the termination condition (8.2.33) N is termed the "radial quantum number". We now set and apply the termination condition (8.2.33) 1/ = N in (8.2.32) bN [a(8+N+k)+Ct2/+a(8+N-k)- :: 'Y] =0, i.e., with (8.2.24) 2a(8 + N) = 'Y(Ctl - Ct2) = 2E'Y . (8.2.34) 8.2 Dirac Equation for the Coulomb Potential 175 We obtain E from this and also see that E > O. According to (8.2.24), the quantity a also contains the energy E. In the following we reintroduce c and, from (8.2.34), obtain 2(m 2c4 - E 2)1/2 (s + N) = 2E"(. Solving this equation for E yields the energy levels: ,,(2-! 2 E = mc [1 + (s + N)2 ] (8.2.35) . It still remains to determine which values of k (according to (8.2.12) they are integers) are allowed for a particular value of N. For N = 0, the recursion relation (8.2.30) implies: "( bo ao s +k and from the termination condition (8.2.33), we have bo a2 = - - <0. ao a - Since, as implied by(8.2.31), s < that bo ao {<> Ikl, it follows from the first of these relations 0 for k > 0 0 for k < 0 whilst from the second relation it always follows that 1& < 0, i.e., for k < 0 ao we arrive at a contradiction. Thus, for N = 0, the quantum number k can only take positive integer values. For N > 0, all positive and negative integer values are allowed for k. With the definition of the principal quantum number n= N + Ikl = and the value s = levels E ·=mc2 n,) = mc 2 N . 1 + J + "2 Jk 2 - "(2 (8.2.36) from (8.2.31), equation (8.2.35) yields the energy [ (n - Ikl + Jk [+ ( 2]-! Za 1+ 1 n- (j + ~) 2 - (Za)2 ) )2]-! . Za J + (j + ~)2 - (Za)2 (8.2.37 ) 176 8. The Coulomb Potential Before we discuss the general result, let us look briefly at the nonrelativistic limit together with the leading corrections. This follows from (8.2.37) by expanding as a power series in Z a: (Za)4 --- ( _ 1 . _ ~) J +~ 4n + O((Za)6)} . (8.2.38) This expression agrees with the result obtained from the perturbationtheoretic calculation of the relativistic corrections (QM I, Eq. (12.5)). We now discuss the energy levels given by (8.2.37) and their degeneracies. For the classification of the levels, we note that the quantum number k = = 'P~=r-!. ± (j + ~) introduced in (8.2.12) belongs to the Pauli spinors Instead of k, one traditionally uses the quantum number £. Positive k is thus associated with the smaller of the two values of £ belonging to the particular j considered. The quantum number k takes the values k = ±1, ±2, ... , and the principal quantum number n the values n = 1,2, .... We recall that for N = 0 the quantum number k must be positive and thus from (8.2.36), we have k = n and, consequently, £ = n - 1 and j = n - ~. Table 8.1 summarizes the values of the quantum numbers k, j, j + ~ and £ for a given value of the principal quantum number n. Table 8.2 gives the quantum numbers for n = 1,2, and 3 and the spectroscopic notation for the energy levels n L j . It should be emphasized that the orbital angular momentum L is not conserved and that the quantum number £ is really only a substitute for k, introduced to characterize parity. 'P);;3 ±(n - 1) n k ±1 ±2 j 1/2 3/2 n-1/2 j + 1/2 1 2 n £ 0 1 1 2 n-2 n-1 n N Ikl k j £ 1 0 1 1 1/2 0 181 / 2 2 1 1 1 1 +1 -1 1/2 1/2 0 1 281 / 2 0 2 2 3/2 1 2P3 / 2 2 2 1 1 1 -1 1/2 1/2 0 1 381 / 2 1 1 2 2 2 -2 3/2 3/2 1 2 3P3 / 2 3D 3 / 2 0 3 3 5/2 2 3D5/ 2 3 2P1/2 3P1/ 2 Table 8.1. Values of the quantum numbers k,j,j + ~, and £ for a given principal quantum number n n-1 Table 8.2. The values of the quantum numbers; principal quantum number n, radial quantum number N, k, angular momentum j and £ 8.2 Dirac Equation for the Coulomb Potential n=31_ k=1 177 k=3 D5/2 k =-2 k=2 P3/2 D3/2 k =-1 51/ 2 P 1/ 2 k=2 P 3/ 2 n = 2k=1 {_ k =-1 5 1/ 2 P 1/ 2 k=1 n = 1 { - - - - 51 / 2 Fig. 8.3. The energy levels of the hydrogen atom according to the Dirac equation for values of the principal quantum number n = 1,2, and 3 Figure 8.3 shows the relativistic energy levels of the hydrogen atom according to (8.2.37) for the values n = 1,2, and 3 of the principal quantum number. The levels 2S1/ 2 and 2P1/2, the levels 3S1/ 2 and 3P1/ 2 , the levels 3P3 / 2 and 3D 3 / 2, etc. are degenerate. These pairs of degenerate levels correspond to opposite eigenvalues of the operator K = 1 + L·u, e.g., 2P3 / 2 has the value k = 2, whereas 2D 3 / 2 possesses k = -2. The only nondegenerate levels are lS1 / 2 , 2P3 / 2 , 3D 5 / 2 , etc. These are just the lowest levels for a fixed j, or the levels with radial quantum number N = 0, for which it was shown in the paragraph following (8.2.35) that the associated k can only be positive. The lowest energy levels are given in Table 8.3. The energy eigenvalues for N = 0 are, according to (8.2.37) and (8.2.36), E=mc 2 1 [1+ k2~/,]- =mc2 [1+ n2~/,]- 1 =mc2 J1-/,2jn 2 . (8.2.39) Table 8.3. The lowest energy levels E n .j /mc 2 n £ j 151 / 2 1 0 2" 251 / 2 2 0 2" 2 P 1/ 2 2 1 2" 2 P3 / 2 2 1 2" 1 1 1 3 )1- (Za)2 V1+v'~(Za)2 V1+v'~(Za)2 ~)4 - (Za)2 8. The Coulomb Potential 178 n=2, t-O,' ", fine structure + Lamb shift + hyperfine • I ~ \, • 2 I ""2 f.---...- - -..•."..:... ! 23.6 Fig. 8.4. Splitting of the energy levels of the hydrogen atom (MHz) due to the relativistic terms (fine structure, (Fig. 8.3)), the Lamb shift and the hyperfine structure Figure 8.4 shows how the level n = 2, l = 1 (a single level according to the Schrodinger equation) splits according to Dirac theory (8.2.37) to yield the fine structure. Further weaker splitting, due to the Lamb shift and the hyperfine structure9 , is also shown. It should be noted that all levels are still (2j + I)-fold degenerate since they do not depend on the quantum number mj. This degeneracy is a general consequence of the spherical symmetry of the Hamiltonian (see the analogous discussion in QM I, Sect. 6.3). The fine-structure splitting between the 2P3/2 and the 2P1 / 2 and 2S1 / 2 levels is 10950 MHz ~ 0.45 x 1O- 4 eV. As has already been mentioned, it is usual to make use of the nonrelativistic notation to classify the levels. One specifies n, j, and where is the index of the Pauli spinor, which really only serves to characterize the parity. The 2 Sl/2 and 2 P 1 / 2 states are degenerate, as in first-order perturbation theory. This is not surprising since they are the two eigenstates of opposite parity for the same N and j. The 2 P3 / 2 state has a higher energy than the 2 P 1 / 2 state. The energy difference arises from the fine-structure splitting caused by the spin-orbit interaction. In general, for a given n, the states with larger j have a higher energy. The ground-state energy e, E1 = mc2 VI - (Za)2 = mc2 ( (Za)2 (Za)4 ) 1 - - 2 - - - 8 - ... e (8.2.40) is doubly degenerate, with the two normalized spinors 'lfIn=l,j=!,Tn J =! (r, {J, '1') (2mZa)3/2 1 +1 ( )'y-1 2r(I + 2,,) 2mZar J47f x e -TnZar (8.2.4Ia) ( i(~:r) 01 ) i(1-1) cos {J sin {J ei<p and 9 Section 9.2.2 and QM I, Chap. 12 Problems (2mZo:)3/2 v'41f 1+1' ( )1- 1 2r(1 + 2')') 2mZo:r ----za 179 (8.2.41b) ~ .G _iCP) ( i(1-1) Slllve -i(1-1) cos rJ Za with l' = VI - Z 2 0:2 and the gamma function r(x). The normalization is given by J d3x'IjJtn-1 '-1 -±1 (rJ, cp)'ljJn=l 1'=1 m =±1 (rJ, cp) = 1. The two - ')-2,m]2 ' 2' J 2 spinors possess the quantum numbers mj = +1/2 and mj = -1/2. They are constructed from eigenfunctions of the orbital angular momentum: Yoo in the components 1 and 2 and Y1 ,m=O,±1 in the components 3 and 4. In the nonrelativistic limit 0: --+ 0, l' --+ 1, ~ ---+ 0 these solutions reduce to the Schr6dinger wave functions multiplied by Pauli spinors in the upper two components. The solution (8.2.41) displays a weak singularity r1-1 = rv'1-Z 2a L l ~ r-Z 2 a 2 /2. However, this only has a noticeable effect in a very tiny region: r < 1 -e~ 2 2mZo: 1O-16300/z 2 = ----2mZo: Furthermore, for real nuclei with a finite radius, this singularity no longer occurs. For Zo: > 1, l' becomes imaginary and the solutions are therefore oscillatory. However, all real nuclei have Zo: < 1 and, furthermore, this limit is shifted for finite-sized nuclei. Problems 8.1 Demonstrate the validity of the relation 8.2 Prove the relation (8.2.16) (+) U·X (_) 'PJm] = -r- 'Pjm] that was given in connection with the solution of the Dirac equation for the hydrogen atom, ISO S. The Coulomb Potential Hint: Make use of the fact that c.p;-;J,j the commutator [0" L, ""/] (result: ~ anticommutator. is an eigenfunction of (r2 0" 0' . L and calculate V - (0" x)(x· V) - 0" x)) or the 8.3 Derive the recursion relations (S.2.29a,b) for the coefficients a y and by. 8.4 Calculate the ground-state spinors of the hydrogen atom from the Dirac equation. 8.5 A charged particle is moving in a homogeneous electromagnetic field B = (0,0, E) and E = (Eo, 0, 0). Choose the gauge A = (0, Ex, 0) and, taking as your starting point the Klein-Gordon equation, determine the energy levels. 9. The Foldy-Wouthuysen 'Transformation and Relativistic Corrections 9.1 The Foldy-Wouthuysen Transformation 9.1.1 Description of the Problem Beyond the Coulomb potential, there are other potentials for which it is also important to be able to calculate the relativistic corrections. Relativistic corrections become increasingly important for nuclei of high atomic number, and it is exactly these, for which the nuclear diameter is no longer negligible, in which the potential deviates from the 1/1' form. The canonical transformation of Foldy und Wouthuysen 1 transforms the Dirac equation into two decoupled two-component equations. The equation for the components 1 and 2 becomes identical to the Pauli equation in the nonrelativistic limit; it also contains additional terms that give rise to relativistic corrections. The energies for these components are positive. The equation for the components 3 and 4 describes negative energy states. From the explicit solutions given in previous sections, it is evident that for positive energies the spinor components 1 and 2 are large, and the components 3 and 4 small. We seek a transformation that decouples the small and large components of the spinor from one another. In our treatment of the nonrelativistic limit (Sect. 5.3.5), we achieved this decoupling by eliminating the small components. We now wish to investigate this limit systematically and thereby derive the relativistic corrections. According to a classification that is now established in the literature, the Dirac Hamiltonian contains terms of two types: "odd" operators which couple large and small components (ci, 'Ii, '15) and "even" operators which do not couple the large and small components (n, (J, E). The canonical (unitary) transformation that achieves the required decoupling may be written in the form (9.1.1) where, in general, S can be time dependent. From the Dirac equation, it then follows that 1 L.L. Foldy and S.A. Wouthuysen, Phys. Rev. 78, 29 (1950) 182 9. The Foldy-Wouthuysen Transformation and Relativistic Corrections 1'801, t'f/ -'8 - 1 te -iSol.' 'f/ - 'Ie - is 8 01.' t'f/ + 1. (8te -is) 01.' 'f/ -- Hoi, 'f/ -- H e -iSol.' 'f/ (9.1.2a) and, thus, we have the equation of motion for 'ljJ': (9.1.2b) with the Foldy-Wouthuysen-transformed Hamiltonian (9.1.2c) The time derivative on the right-hand side of this equation only acts on e- iS . One endeavors to construct S such that H' contains no odd operators. For free particles, one can find an exact transformation, but otherwise one has to rely on a series expansion in powers of ~ and, by successive transformations, satisfy this condition to each order of~. In fact, each power of ~ corresponds to a factor ~ in the atomic domain this is approximately equal to Sommerfeld's fine-structure constant a, since, from Heisenberg's uncertainty relation, we have ~ ::::::: crn~x ::::::: c!a = a. '::c '" ; 9.1.2 Transformation for Free Particles For free particles, the Dirac Hamiltonian simplifies to (9.1.3) H = a ·p+(3m with the momentum operator p = -iV. Since {a, (3} = 0, the problem is analogous to that of finding a unitary operator that diagonalizes the Pauli Hamiltonian (9.1.4a) so that, after transformation, H contains only II and {Yz. This is achieved by a rotation about the y axis through an angle {}o determined by (Bx, By, 0): (9.1.4b) This equation suggests the ansatz e ±·S 1 = e ±(3~19) Ipi p = cos {} ± (3 a . p -I-p-Isin {} . (9.l.5) Here, S is time independent. The last relation results from the Taylor expansion of the exponential function and from (a. p)2 = aia j pipi = ~{ai, a j } pipi = Jijpipl = p2 ((3 a . p)2 = (3a. p (3a. p = _(32 (a . p)2 = _p2 . (9.1.6a) (9.1.6b) 9.1 The Foldy-Wouthuysen Transformation 183 Inserting (9.1.5) into (9.1.2c), one obtains H' as H' ei3T.;f!?(o . P = + (3m) = e,BT.;-fD (cos'/?+ ,B~i Ipi p sin 19 ) (0· p+ ,Bm) = e2 ,BT.;-fD(0. p + ,Bm) = o· p (cos 2'19 - (cos '19 - (30· P sin '19) = (cos 219 + ,B~i p sin 219 ) (0· p + ,Bm) 1:1 sin 2'19 ) + (3m (cos 2'19 + 1:1 sin 2'19 ) . (9.1.7) The requirement that the odd terms disappear yields the condition tan 2'19 I~ , from whence it follows that .0 sin 21f = tan 2'19 ---=---,-,- (1 = p + tan 2 2'19)1/2 Substituting this into (9.1.7) finally yields: H' (3m (m E = + p. P) = (3Jp2 + m2 mE . (9.1.8) Thus, H' has now been diagonalized. The diagonal components are nonloca1 2 Hamiltonians ±Jp 2 + m 2 . In our first attempt (Sect. 5.2.1) to construct a nonrelativistic theory with a first order time derivative, we encountered the operator J p 2 + m 2 • The replacement of J p 2 + m 2 by linear operators necessarily leads to a four-component theory with negative as well as positive energies. Even now, H' still contains the character of the four-component theory due to its dependence on the matrix (3, which is different for the upper and lower components. Such an exact transformation is only feasible for free particles. 9.1.3 Interaction with the Electromagnetic Field Of primary interest, of course, is the case of non-vanishing electromagnetic fields. We assume that the potentials A and If> are given, such that the Dirac Hamiltonian reads: H =0 = . (p - eA) + (3m + elf> (3m+£+O. (9.1.9a) (9.1.9b) Here, we have introduced a decomposition into a term proportional to (3, an even term £, and an odd term 0: 2 They are nonlocal because they contain derivatives of all orders. In a discrete theory, the nth derivative signifies an interation between lattice sites that are n units apart. 184 9. The Foldy-Wouthuysen Transformation and Relativistic Corrections £ = eP and 0 = a (p - eA) . (9.1.10) These have different commutation properties with respect to (3: (3£ = £(3, (30 = -0(3 . (9.1.11) The solution in the field-free case (9.1.5) implies that, for small fJ, i.e., in the nonrelativistic limit, . 18 a·p = (3 - - fJ Ipi p rv (3a . 2m We can thus expect that successive transformations of this type will lead to an expansion in ~. In the evaluation of H', we make use of the Baker-Hausdorff identity 3 H' = H i2 i3 + i[8,H] + "2[8, [8,H]] + "6[8, [8, [8,Hlll + 1'4 1" 2 . 1 . + 24 [8, [8, [8, [8, H]lll- 8 . (9.1.12) - "2[8,8]- "6[8, [8,8]] , given here only to the order required. The odd terms are eliminated up to order m -2, whereas the even ones are calculated up to order m -3. In analogy to the procedure for free particles, and according to the remark following Eq. (9.1.11), we write for 8: 8 = -i(30/2m . (9.1.13) For the second term in (9.1.12), we find i[8, H] = -0 + 2(3 [0, £] m + ~(302 m , (9.1.14) obtained using the straightforward intermediate steps [(30, (3] = (30(3 - (3(30 = -20 [(30, £] = (3[0, £] [(30, 0] = (30 2 - 0(30 = 2(30 2 . (9.1.15) Before calculating the higher commutators, let us immediately draw attention to the fact that the first term in (9.1.14) cancels out the term 0 in H. Hence, the aim of eliminating the odd operator 0 by transformation has been attained; although new odd terms have been generated, e.g., the second term in (9.1.14), these have an additional factor m-l. We now address the other terms in (9.1.12). The additional commutator with i8 can be written immediately by using (9.1.14), (9.1.15), and (9.1.11): 3 eABe- A = B + [A,B] + ... + ~ [A, [A, ... ,[A,B] ... ]] + ... 9.1 The Foldy-Wouthuysen Transformation 185 and likewise, i3 3![S, [S, [S,Hlll = 03 1 4 {J 6m 2 - 6m 3 {J0 - 48m 3 [0,[0,[0,£]]]. For the odd operators, it is sufficient to include terms up to order m- 2 and hence the third term on the right-hand side may be neglected. The next contributions to (9.1.12), written only up to the necessary order in 11m, are: i4 4[[S, [S, [S, [S, Hllll i{JO 2m -8 All in all, one obtains for H': H , = {Jm + {J ( -0 4 2 2m {J [//'\ C] +v,c- 2m - -0 ) 8m 3 +£ - 3 . · - 1 [0, [0,£]] - - i[0,0] 8m 2 8m 2 0 + -i{JO == {Jm+£ " +0 . (9.1.16) 3m 2 2m Here, £ and all even powers of 0 have been combined into a new even term £', and the odd powers into a new odd term 0'. The odd terms now occur only to orders of at least ~. To reduce them further, we apply another Foldy-Wouthuysen transformation S' = -i{J 0' = 2m -i{J 2m (~[O - 2m' £] _ 0 3 3m 2 + i{JO) (9.1.17) 2m This transformation yields: H" = eiS ' (H' - Wt)e -is' = {Jm + £' + ~ 2m [0' , £'] + i{JO' 2m (9.1.18) == {Jm + £' + 0" . Since 0' is of order 11m, in 0" there are now only terms of order 11m2 . This transformation also generates further even terms, which, however, are all of higher order. For example, {JO,2/2m = {Je 2E2/8m 3 '" {Je 4 /m 3 r 4 '" Ry oA . By means of the transformation S"=~ .{J//'\" 2m ' (9.1.19) 186 9. The Foldy-Wouthuysen Transformation and Relativistic Corrections the odd term Gil '" G (~2) = eiS" (H" Hili is also eliminated. The result is the operator = {3m + £' - iot)e- iS " (9.1.20) which now only consists of even terms. In order to bring the Hamiltonian Hili into its final form, we have to substitute (9.1.10) and rewrite the individual terms as follows: 2nd term of Hili: G2 2m = 1 2m (a . (p - eA)) 2 = 1 2 e 2m (p - eA) - 2m E . B , (9.1.21a) since aia j = a i {32a j = -,,/,,(j = -~ = 8ij (hi, "(j} + b i ,,,(jD = _gi j + iE:jk~7 + iE: ijk Ek and the mixed term with E: ijk yields: -e (piAj + AipJ) iE: ijk Ek = -ie ((piAj) + Ajpi + AipJ) E: ijk Ek =-e(oiAj)E:ijkEk = -eB·E. 5th term of Hili: Evaluation of the second argument of the commutator gives ([G,£]+iO) = [ai(pi_eAi),ep] -ieaiAi = -iea i (Oi P + Ai) = iea i Ei . It then remains to compute [G, a . E] = aia j (pi - eAi)Ej - a j Ej ai(pi - eAi) = (pi _ eAi)Ei _ Ei(pi _ eAi ) +iE: ijk Ek(pi _ eAi)Ej _ iE: jik Ek Ej(pi _ eAi) = (piEi) + E. V x E - 2iE· Ex (p - eA) . Hence, the 5th term in Hili reads: ie - 8m 2 [G, a . E] =- e. ie 8m 2 dlV E - 8m 2 E . V x E e - 4m 2 E· Ex (p - eA) . (9.1.21b) Inserting (9.1.10) and (9.1.21a,b) into (9.1.20), one obtains the final expression for Hili: 9.2 Relativistic Corrections and the Lamb Shift Hili = {3 (m + (p - eA? - _l_[(p - eA? - eE. B]2) 2m 8m 3 ie e - -{3E·B- -E·curIE 2m 8m 2 e e - 4m2E . Ex (p-eA)- 8m 2 divE. 187 + ecjj (9.1.22) The Hamiltonian Hili no longer contains any odd operators. Hence, the components 1 and 2 are no longer coupled to the components 3 and 4. The eigenfunctions of H"' can be represented by two-component spinors in the upper and lower components of 'ljJ', which correspond to positive and negative energies. For 'ljJ' = (b), the Dirac equation in the Foldy-Wouthuysen representation acquires the following form: . a<p 1- at 1 2 e p4 = { m + ecjj + -(p - eA) - - 0 " . B - 3 2m 2m 8m (9.1.23) - _e_O". Ex (p - eA) _ _ e_ diVE} <p. 4m 2 8m 2 Here, <p is a two-component spinor and the equation is identical to the Pauli equation plus relativistic corrections. The first four terms on the right-hand side of (9.1.23) are: rest energy, potential, kinetic energy, and coupling of the magnetic moment J.L = 2~ 0" = 2 2~ S to the magnetic field B. As was discussed in detail in Sect. 5.3.5.2, the gyromagnetic ratio (Lande factor) is obtained from the Dirac equation as 9 = 2. The three subsequent terms are the relativistic corrections, which will be discussed in the next section. Remark. Equation (9.1.23) gives only the leading term that follows from 0 4 , which is still contained in full in (9.1.22), namely p4. The full expression is _L04 = _L((p _ eA)2 _ eEB? = _L[(p _ eA)4 + e 2B2 8m 3 8m 3 8m 3 +eE . f).B - 2eE . B(p - eA)2 - 2ieO"j V Bj (p - eA)] . + It should also be noted that, in going from (9.1.22) to (9.1.23), it has been assumed that curl E = o. 9.2 Relativistic Corrections and the Lamb Shift 9.2.1 Relativistic Corrections We now discuss the relativistic corrections that emerge from (9.1.22) and (9.1.23). We take E = -Vcjj(r) = -~x and A = O. Hence, curlE = 0 and 1 acjj 1 acjj E·E x p= - - - E · x x p= ---E·L. rar rar (9.2.1 ) 188 9. The Foldy-Wouthuysen Transformation and Relativistic Corrections Equation (9.1.23) contains three correction terms: (p2)2 = --3 relativistic mass corrrection 8m e 18p H2 = - - - - u . L spin-orbit coupling 4m 2 r 8r H3 = _ _ e_ divE = _e_ V 2 p (x) Darwin term. 8m 2 8m 2 HI (9.2.2a) (9.2.2b) (9.2.2c) Taken together, these terms lead to the perturbation Hamiltonian _ (p2)2 1 1 8V 1i2 2 . H 1 +H2 +H3 - 3 2 + -22--8 4 u . L8+22 - VV(x), 8 mc mc r r mc (9.2.2d) (V = eP). The order of magnitude of each of these corrections can be obtained from the Heisenberg uncertainty relation Ry x (~J2 = Ry (~r = Ryo? = mc2 a 4 , where a = e6 (= e6/ lie) is the fine-structure constant. The Hamiltonian (9.2.2d) gives rise to the fine structure in the atomic energy levels. The perturbation calculation of the energy shift for hydrogen-like atoms of nuclear charge Z was presented in Chap. 12 of QM I; the result in first-order perturbation theory is i'1E ._ 1 n,J-i±"2'£ = Ry z2 (Za)2 n2 n2 {~ 4 n_} . _ _ j +~ (9.2.3) The energy eigenvalues depend, apart from on n, only on j. Accordingly, the (n = 2) levels 281/ 2 and 2Pl/2 are degenerate. This degeneracy is also present in the exact solution of the Dirac equation (see (8.2.37) and Fig. 8.3). The determination of the relativistic perturbation terms HI, H 2 , and H3 from the Dirac theory thus also provides a unified basis for the calculation of the fine-structure corrections 0 (a 2 ) • Remarks: (i) An heuristic interpretation of the relativistic corrections was given in QM I, Chap. 12. The term HI follows from the Taylor expansion of the relativistic kinetic energy y' p 2 + m 2 . The term H2 can be explained by transforming into the rest frame of the electron. Its spin experiences the magnetic field that is generated by the nucleus, which, in this frame, orbits around the electron. The term H3 can be interpreted in terms of the "Zitterbewegung", literally "trembling motion", a fluctuation in the position of the electron with an amplitude 6x = fic/m. (ii) The occurrence of additional interaction terms in the Foldy-Wouthuysen representation can be understood as follows: An analysis of the transformation from the Dirac representation 'ljJ to 'ljJ' shows that the relationship is nonlocal4 4 L.L. Foldy and S.A. Wouthuysen, Phys. Rev. 78, 29 (1950) 9.2 Relativistic Corrections and the Lamb Shift 'l/J'(x) = ! 189 d3 x' K(x,x')'l/J(x') , where the kernel of the integral K(x, x') is of a form such that, at the position x, 'l/J' (x) consists of 'l/J contributions stemming from a region of size rv Ac around the point x; here, Ac is the Compton wavelength of the particle. Thus, the original sharply localized Dirac spinor transforms in the Foldy-Wouthuysen representation into a spinor which seems to correspond to a particle that extends over a finite region. The reverse is also true: The effective potential that acts on a spinor in the Foldy-Wouthuysen representation at point x consists of contributions from the original potential A(x), cp(x) averaged over a region around x. The full potential thus has the form of a multipole expansion of the original potential. This viewpoint enables one to understand the interaction of the magnetic moment, the spin-orbit coupling, and the Darwin term. (iii) Since the Foldy-Wouthuysen transformation is, in general, time dependent, the expectation value of Hili is generally different to the expectation value of H. In the event that A(x) and cp(x) are time independent, i.e., time-independent electromagnetic fields, then S is likewise time independent. This means that the matrix elements of the Dirac Hamiltonian, and in particular its expectation value, are the same in both representations. (iv) An alternative method 5 of deriving the relativistic corrections takes as its starting point the resolvent R = H-';cLz of the Dirac Hamiltonian H. This is analytic in ~ at c = 00 and can be expanded in ~. In zeroth order one obtains the Pauli Hamiltonian, and in O( ~) the relativistic corrections. 9.2.2 Estimate of the Lamb Shift There are two further effects that also lead to shifts and splitting of the electronic energy levels in atoms. The first is the hyperfine interaction that stems from the magnetic field of the nucleus (see QM I, Chap. 12), and the second is the Lamb shift, for which we shall now present a simplified theory.6 The zero-point fluctuations of the quantized radiation field couple to the electron in the atom, causing its position to fluctuate such that it experiences a smeared-out Coulomb potential from the nucleus. This effect is qualitatively similar to the Darwin term, except that the mean square fluctuation in the electron position is now smaller: We consider the change in the potential due to a small displacement Ox: (9.2.4) Assuming that the mean value of the fluctuation is (ox) an additional potential 5 6 = 0, we obtain F. Gesztesy, B. Thaller, and H. Grosse, Phys. Rev. Lett. 50, 625 (1983) Our simple estimate of the Lamb shift follows that of T.A. Welton, Phys. Rev. 74, 1157 (1948). 190 9. The Foldy-Wouthuysen Transformation and Relativistic Corrections 1 L1HLam b = (v(x + ox) - V(x)) = 6((OX)2)V 2V(X) 1 = 6(( OX)2) 411" Zane 0(3) (x) . (9.2.5) The brackets () denote the quantum-mechanical expectation value in the vacuum state of the radiation field. In first order perturbation theory, the additional potential (9.2.5) only influences s waves. These experience an energy shift of L1ELamb = 211"Zanc I 2) 2 3 \ (ox) 11/In,£=0(0) I = (2meZa)3 Zac I(OX)2)0 12n2 n3 \ where we have used 1/In,£=0(0) = £,0 , J.rr (m:~Z)3/2. (9.2.6) The energy shift for the p, d, . .. electrons is much smaller than that of the s waves due to the fact that they have 1/1(0) = 0, even when one allows for the finite extent of the nucleus. A more exact theory of the Lamb shift would include, not only the finite size of the nucleus, but also the fact that not all contributing effects can be expressed in the form L1 V, as is assumed in this simplified theory. We now need to estimate ((ox)2), i.e., find a connection between ox and the fluctuations of the radiation field. To this end, we begin with the nonrelativistic Heisenberg equation for the electron: mox = eE. (9.2.7) The Fourier transformation J 00 ox(t) = dw . t 211" e-'w oXw (9.2.8) -00 yields (9.2.9) -ex) -ex) Due to the invariance with respect to translation in time, this mean square fluctuation is time independent, and can thus be calculated at t = o. From (9.2.7) it follows that e Ew oXw = - - - . mw 2 (9.2.10) For the radiation field we use the Coulomb gauge, also transverse gauge, div A = O. Then, due to the absence of sources, we have 9.2 Relativistic Corrections and the Lamb Shift E(t) 1 . = --A(O, t) . 191 (9.2.11) C The vector potential of the radiation field can be represented in terms of the creation and annihilation operators at,.>.. (ak,.>..) for photons with wave vector k, polarization A, and polarization vector ck,.>..(A = 1,2)1: A(x , t) = ""' Jh27rC (a g ei(kx-ckt) ~ Vk k,.>.. k,.>.. + atk,.>.. g*k,.>.. e-i(kX-Ckt») k,.>.. (9.2.12) The polarization vectors are orthogonal to one another and to k. From (9.2.12), one obtains the time derivative -~A(Ot)="'Jh27rCika C ' C ~ Vk g k,.>.. k,.>.. e-ickt_at g* k,.>.. k,.>.. eickt ) k,.>.. and the Fourier-transformed electric field J 00 Ew = dteiwtE(t) -00 (9.2.13) Now, by making use of (9.2.9), (9.2.10), and (9.2.13), we can calculate the mean square fluctuation of the position of the electron The expectation value is finite only when the photon that is annihilated is the same as that created. We also assume that the radiation field is in its ground state, i.e., the vacuum state 10). Then, with ak,.>..at,.>.. = 1 + at,.>..ak,.>.. and ak,.>.. 10) = 0, it follows that 7 QM I, Sect. 16.4.2 192 9. The Foldy-Wouthuysen Transformation and Relativistic Corrections = ~ e2 1f 1tc (~) 2 mc (9.2.14) J dk , k fr where we have also made the replacement Lk ---+ I (g:~3. The integral 00 dk ~ is ultraviolet (k ---+ (0) and infrared (k ---+ 0) divergent. In fact, there are good physical reasons for imposing both an upper and a lower cutoff on this integral. The upper limit genuinely remains finite when one takes relativistic effects into account. The divergence at the lower limit is automatically avoided when the electron is treated, not with the free equation of motion (9.2.7), but quantum mechanically, allowing for the discrete atomic structure. In the following, we give a qualitative estimate of both limits, beginning with the upper one. As a result of the "Zitterbewegung" (the fluctuation in the position of the electron), the electron is spread out over a region the size of the Compton wavelength. Light, because its wavelength is smaller than the Compton wavelength, causes, on average, no displacement of the electron, since the light wave has as many peaks as troughs within one Compton wavelength. Thus, the upper cutoff is given by the Compton wavelength ~, or by the corresponding energy m. For the lower limit, an obvious choice is the Bohr radius (Zo:m)-l, or the corresponding wave number Zo:m. The bound electron is not influenced by wavelengths greater than a = (Zo:m)-l. The lowest frequency of induced oscillations is then Zo:m. Another plausible choice is the Rydberg energy Z 2 0:2m with the associated length (Z 2 0:2 m )-1, corresponding to the typical wavelength of the light emitted in an optical transition. Light oscillations at longer wavelengths will not influence the bound electron. In a complete quantum-electro dynamical theory, of course, there are no such heuristic arguments. If we take the first of the above estimates for the lower limit, it follows that 10 wJmaxdw .!. = Jm dw.!. = log ~ w w , Zo: Zorn Wrnin and thus, from (9.2.6) and (9.2.14), AE L.I 2 _ (2mcZo:)3 Zo:c ~ e (~)2l Lamb - 12~ n n3 ~ 1fnC _1_ s: mc 8Z 4 0: 3 1 1 -o:2mc2J/! 0 . = - - - log 3 Zo: 2 ' 31fn This corresponds to a frequency shift 8 8 T.A. Welton, Phys. Rev. 74, 1157 (1948) og Z 0: u/!,O (9.2.15) Problems LlVLamb 193 = 667 MHz for n = 2, Z = 1, £ = O. The experimentally observed shift 9 is 1057.862±0.020 MHz. The complete quantum-electro dynamical theory of radiative corrections yields 1057.864 ± 0.014 MHz.lO In comparison with the Darwin term, the radiative corrections are smaller by a factor a log The full radiative corrections also contain a(Za)4 terms, which are numerically somewhat smaller. Levels with £ -=1= 0 also display shifts, albeit weaker ones than the s levels. Quantum electrodynamics allows radiative corrections to be calculated with remarkable precision 10 ,1l. This theory, too, initially encounters divergences: The coupling to the quantized radiation field causes a shift in the energy of the electron that is proportional (in the nonrelativistic case) to p2, i.e., the radiation field increases the mass of the electron. What one measures, however, is not the bare mass, but the physical (renormalized) mass which contains this coupling effect. Such mass shifts are relevant to both free and bound electrons and are, in both cases, divergent. One now has to reformulate the theory in such a way that it contains only the renormalized mass. For the bound electron, one then finds only a finite energy shift, namely, the Lamb shiftll. In the calculation by Bethe, which is nonrelativistic and only contains the self-energy effect described above, one finds a lower cutoff of 16.6 Ry and a Lamb shift of 1040 MHz. Simply out of curiosity, we recall the two estimates preceding Eq. (9.2.15) for the lower cutoff wave vector: If one takes the geometrical mean of these two values, for Z = lone obtains a logarithmic factor in (9.2.15) of log 16.l5 a 2 , which in turn yields LlE = 1040 MHz. In conclusion, it is fair to say that the precise theoretical explanation of the Lamb shift represents one of the triumphs of quantum field theory. ±. Problems 9.1 Verify the expressions given in the text for i i3 2[S, [S,H]] , 6[S,[S,[S,Hlll, with H 9 10 11 = a(p - eA) + (3m + eP 1 24[S,[S,[S,[S,H]lll and S = -2!n(30, where 0 (9.2.16) == a(p - eA). The first experimental observation was made by W.E. Lamb, Jr. and R.C. Retherford, Phys. Rev. 72, 241 (1947), and was refined by S. Triebwasser, E.S. Dayhoff, and W.E. Lamb, Phys. Rev. 89, 98 (1953) N.M. Kroll and W.E. Lamb, Phys. Rev. 75, 388 (1949); J.B. French and V.F. Weisskopf, Phys. Rev. 75, 1240 (1949); G.W. Erickson, Phys. Rev. Lett. 27, 780 (1972); P.J. Mohr, Phys. Rev. Lett. 34, 1050 (1975); see also Itzykson and Zuber, op. cit p. 358 The first theoretical (nonrelativistic) calculation of the Lamb shift is due to H.A. Bethe, Phys. Rev. 72, 339 (1947). See also S.S. Schweber, An Introduction to Relativistic Quantum Field Theory, Harper & Row, New York 1961, p. 524.; V.F. Weisskopf, Rev. Mod. Phys. 21, 305 (1949) 194 9. The Foldy-vVouthuysen Transformation and Relativistic Corrections 9.2 Here we introduce, for the Klein-Gordon equation, a transformation analogous to Foldy-Wouthuysen's, which leads to the relativistic corrections. (a) Show that the substitutions x = ~ (<p _ ~ a<p) 2 mat and allow the Klein-Gordon equation aat<p _ (V2 2 2 - - m 2) <p to be written as a matrix equation .ap _ 1Ft - where P = H p a (~) and Ho = - (-i -i) i~ + 0-n m. (b) Show that in the two-component formulation, the Klein-Gordon equation for particles in an electromagnetic field, using the minimal coupling (p -+ 7l" = P - eA), reads: 2 . ap = 1Ft { - ( -11 -11) 2m 7r + ( 01 -10) m + eV(x) } p(x). (c) Discuss the nonrelativistic limit of this equation and compare it with the corresponding result for the Dirac equation. Hint: The Hamiltonian of the Klein-Gordon equation in (b) can be brought intotheformH=O+c+1]mwith1]= (1 0) ,O=P2rn= (0 1) 71"2 0-1 -10 71"2 2rn,and 2 C = eV + 1] :{rn . Show, in analogy to the procedure for the Dirac equation, that, in the case of static external fields, the Foldy-Wouthuysen transformation p' = eiSp yields the approximate Schrodinger equation i 8!:t' = H'p', with H' =1] ( 7r2 m+2m 7r4) -3 + . . . 8m + eV + 1 --4 32m [7r 2 ,[7r 2 ,eVll + ... The third and the fifth term represent the leading relativistic corrections. In respect to their magnitudes see Eq. (8.1.19) and the remark (ii) in Sect. 10.1.2. 10. Physical Interpretation of the Solutions to the Dirac Equation In interpreting the Dirac equation as a wave equation, as has been our practice up to now, we have ignored a number of fundamental difficulties. The equation possesses negative energy solutions and, for particles at rest, solutions with negative rest mass. The kinetic energy in these states is negative; the particle moves in the opposite direction to one occupying the usual state of positive energy. Thus, a particle carrying the charge of an electron is repelled by the field of a proton. (The matrix /3 with the negative matrix elements /333 and /344 multiplies m and the kinetic energy, but not the potential term eP in Eq. (9.1.9).) States such as these are not realized in nature. The main problem, of course, is their negative energy, which lies below the smallest energy for states with positive rest energy. Thus, one would expect radiative transitions, accompanied by the emission of light quanta, from positive energy into negative energy states. Positive energy states would be unstable due to the infinite number of negative-energy states into which they could fall by emitting light - unless, that is, all of these latter states were occupied. It is not possible to exclude these states simply by arguing that they are not realized in nature. The positive energy states alone do not represent a complete set of solutions. The physical consequence of this is the following: When an external perturbation, e.g., due to a measurement, causes an electron to enter a certain state, this will in general be a combination of positive and negative energy states. In particular, when the electron is confined to a region that is smaller than its Compton wavelength, the negative energy states will contribute significantly. 10.1 Wave Packets and "Zitterbewegung" In the previous sections, we for the most part investigated eigenstates of the Dirac Hamiltonian, i.e., stationary states. We now wish to study general solutions of the time-dependent Dirac equation. We proceed analogously to the nonrelativistic theory and consider superpositions of stationary states for free particles. It will emerge that these wave packets have some unusual properties as compared to the nonrelativistic theory (see Sect. 10.1.2). 196 10. Physical Interpretation of the Solutions to the Dirac Equation 10.1.1 Superposition of Positive Energy States We shall first superpose only positive energy states d3p m 'lj1C+)(x) = J (27l')3 E . L b(p, r)ur(p)e- 1PX (10.1.1) r=1,2 and investigate the properties of the resulting wave packets. Here, ur(p) are the free spinors of positive energy and b(p, r) are complex amplitudes. The factor C27r)3 E is included so as to satisfy a simple normalization condition. We note in passing that E * is a Lorentz-invariant measure where, as always, = J p 2 + m 2 . We show this by the following rearrangement: ! ! 00 = d3 p dpo 8(p~ - E2) = ! (10.1.2) d 4 p8(p2 _ m 2) . -00 Both d 4 p and the 8-function are Lorentz covariant. The d 4 p = det A d 4 p' = ± d4 p' transforms as a pseudoscalar, where the Jacobi determinant det A would equal 1 for proper Lorentz transformations. The density corresponding to (10.1.1) is given by jC+)O(t,x) = 'lj1C+)t(t,x)'lj1C+)(t,x) . (1O.1.3a) Integrated over all of space Jd3Pd3pl m 2 ""'b*( )b( 1 ') Jd 3XJ'C+)o( t,x )-Jd3 X (27l')6 EE' f;: p,r p ,r x ut(p)url (p')eiCE-E')t-iCP-pl)x = L (1O.1.3b) d3p m 2 J (27l')3 E Ib(p,r)1 = 1 , r the density in the sense of a probablity density is normalized to unity. Here, we have used f d3 x eiCp-pl)x = (27l')3 8(3 ) (p - pi) and the orthogonality relation (6.3.19a)1. The time dependence disappears and the total density is time independent. This equation determines the normalization of the amplitudes b(p, r). We next calculate the total current, which is defined by JC+) = 1 J ut(p) Uri (p) d3 xjC+)(t, x) = J = Ur(p)"/ Uri (p) = ~ d3 x 'lj1C+)t(t,x)o: 'lj1C+)(t, x) . 8rr l (10.1.4) 10.1 Wave Packets and "Zitterbewegung" 197 In analogy to the zero component, one obtains J Jd(27r)6 Jf d pd PEE' Lb (p,r)b(p ,r) (+) _ 3' ~ 3 __X_ 3 - * 2 , , r,r' x ut(p)o Uri (p')ei(E-E')t-i(p-p')x = J(~:)3 L r,r' ~: (10.1.4') b*(p,r)b(p,r')ut(p)our,(p), For further evaluation, we need the Gordon identity (see Problem 10.1) (10.1.5) Taken in conjunction with the orthonormality relations for the (6.3.15), ur(k)us(k) = 8rs , equation (10.1.4') yields: J(+) Ur 3 =" / (27r)3 d p m Ib( )1 2 P = /P\. ~ E p,r E \E/ given in (10.1.6) r This implies that the total current equals the mean value of the group velocity Va = oE oJp2 +m2 op = op p (10.1.7) = E . So far, seen from the perspective of nonrelativistic quantum mechanics, nothing appears unusual. 10.1.2 The General Wave Packet However, on starting with a general wave packet and expanding this using the complete set of solutions of the free Dirac equation, the result contains negative energy states. Let us take as the initial spinor the Gaussian ¢(O, x) = 1 (27rd 2)3/4 eixpo-x2/4d2 w (10.1.8) ' i.e., at time zero there are only components where, for example, w = (~), with positive energy, and where d characterizes the linear dimension of the wave packet. The most general spinor can be represented by the following superposition: ¢(t, x) = J(~3 ~ L (b(p, r)ur(p)e- ipx + d*(p, r)vr(p)e iPX ) r (10.1.9) 198 10. Physical Interpretation of the Solutions to the Dirac Equation We also need the Fourier transform of the Gaussian appearing in the initial spinor (10.1.8) (10.1.10) In order to determine the expansion coefficients b(p, r) and d(p, r), we take the Fourier transform at time t = 0 of 'IjJ(0, x) and insert (10.1.8) and (10.1.10) on the left-hand side of (10.1.9) (87rd2 )3/4 e -(P-PO)2 d 2 w = ~ 2)b(p, r)ur(p) + d*(p, r)vr(p)) , (10.1.11) r where p = (pO, -p). After multiplying (10.1.11) by u~(p) thogonality relations (6.3.19a-c) and v~(p), the or- yield the Fourier amplitudes b(p, r) = (87rd 2)3/4 e-(p-po)2d 2 ut(p)w d*(p, r) = (87rd 2)3/4 e-(p-po)2d 2 vt(p)w , (10.1.12) both of which are finite. We have thus demonstrated the claim made at the outset that a general wave packet contains both positive and negative energy components. We now wish to study the consequences of this type of wave packet. For the sake of simplicity, we begin with a nonpropagating wave packet, i.e., Po = o. Some of the modifications arising when Po =I=- 0 will be discussed after Eq. (1O.1.14b). Since we have assumed w = (~), the representation (6.3.11a,b) implies, for the spinors U r and Vr of free particles, the relation d* (p, r) / b(p, r) rv J~IE . If the wave packet is large, d » ~ , then Ipi ;S d- 1 « m and thus d*(p) « b(p). In this case, the negative energy components are negligible. However, when we wish to confine the particle to a region of dimensions less than a Compton wavelength, d « ~, then the negative energy solutions play an important role: i.e., d*/b rv 1. 10.1 Wave Packets and "Zitterbewegung" 199 The normalization is time independent as a result of the continuity equation. The total current for the spinor (10.1.9) reads: +i L [b* (p, r )d* (p, r')e2iEtur (p)(7iO Vr ' (p) (10.1.13) r,r' The first term is a time-independent contribution to the current. The second term contains oscillations at frequencies greater than 2rr;.,c 2 = 2 X 10 21 s- 1 . This oscillatory motion is known as "Zitterbewegung". In this derivation, in addition to (10.1.5), we have used (10.1.14a) from which it follows that u~(p)aiVr'P = ur(P)·. ./vr,(p) = 2~[(pi-)UrPV+7l/t'] (1O.1.14b) For the initial spinor (10.1.S) with w = (i;) and Po = 0, the first term of (1O.1.14b) contributes nothing to Ji(t) in (10.1.13). If the spinor w also contains components 3 and 4, or if po =P 0, there are also contributions from Zitterbewegung to the first term of (1O.1.14b). One obtains an additional term (see Problem 10.2) to (10.1.13) AJi (t) _ / L.l - d3 p m (S d2 )3/2 -2(p_po)2 d 2 2iEt i t _1_ ( 2 (2'if)3 E 'if e e p w 2m 2 p _ mpl' ) rOW. (10.1.13') The amplitude of the Zitterbewegung is obtained as the mean value of x: (x) = = J J d3x1j;t(t, x) x 1j;(t, x) (10.1.15a) d3x1j;t(0,x)eiHtxe-iHt1j;(0,x). In order to calculate (x), we first determine the temporal variation of (x), since this can be related to the current, which we have already calculated: 200 10. Physical Interpretation of the Solutions to the Dirac Equation ! !J J =J (x) = d3x¢t(O,x)eiHtxe-iHt¢(O,x) d3x¢t(t,x) i[H,x] ¢(t,x) = (1O.1.15b) d3x ¢t (t, x) Q. ¢(t, x) == J(t) . tV In evaluating the commutator we have used H = Q.. +f3m. The integration of this relation over the time from to t yields, without (10.1.13'), i i (x) = (x )t=O + +~ J(~:3 J ° d3p mpi "" 2 2 (21l')3 E2 ~ [lb(p,r)1 + Id(p,r)1 ] t r 2;2 [b*(p,r)d*(p,r')e2iEtur(p)O'iOVrl(p) + b(p, r )d(p, r')e-2iEtvrl (p )O'iOUr(p)] (10.1.16) . rv rv rv '::c The mean value of Xi contains oscillations with amplitude 1; ~ = 3.9 x 10- 11 cm. The Zitterbewegung stems from the interference between components with positive and negative energy. Remarks: (i) If a spinor consists not only of positive-energy, but also of negative energy states, Zitterbewegung follows as a consequence. If one expands bound states in terms of free solutions, these also contain components with negative energy. An example is the ground state of the hydrogen atom (8.2.41). (ii) A Zitterbewegung also arises from the Klein-Gordon equation. Here too, wave packets with linear dimension less than the Compton wavelength Ac 71'- = ~, m,,_ contain contributions from negative energy solutions, which fluctuate over a region of size Ac 71'-. The energy shift in a Coulomb potential (Darwin term), however, is a factor Q: smaller than for spin-~ particles. (See Problem 9.2)2. *10.1.3 General Solution of the Free Dirac Equation in the Heisenberg Representation The existence of Zitterbewegung can also be seen by solving the Dirac equation in the Heisenberg representation. The Heisenberg operators are defined by 2 An instructive discussion of these phenomena can be found in H. Feshbach and F. Villars, Rev. Mod. Phys. 30, 24 (1985) 10.1 Wave Packets and "Zitterbewegung" 201 (10.1.17) which yields the equation of motion dO(t) dt = ~ iii [O( ) H] t, (10.1.18) . We assume that the particle is free, i.e., that A = 0 and tP = O. In this case, the momentum commutes with H = co:· p + f3mc 2 (10.1.19) , that is, dp(t) = 0 dt ' (10.1.20) which implies that p(t) dx(t) = p 1 v(t) = ---;It do: 1 iii [o:(t) , H] = iii [x(t), = const. H] = In addition, we see that co:(t) (1O.1.21a) and dt = 2 = iii (cp - H o:(t)) . (10.1.21b) Since H = const (time independent), the above equation has the solution v(t) = co:(t) = cH-1p + e 2;{(t (0:(0) - cH-1p) (10.1.22) Integration of (10.1.22) yields: x(t) = x(O) + -c 2 p H t + -lie 2iH (2;Ht e-"- - 1) (0:(0) - -C P ) H (10.1.23) For free particles, we have o:H + H 0: = 2cp , and hence (0: - C;) H + H (0: - C;) = 0 . (10.1.24) In addition to the initial value x(O), the solution (10.1.23) also contains a term linear in t which corresponds to the group velocity motion, and an oscillating term that represents the Zitterbewegung. To calculate the mean value I 'ljIt (0, x)x(t)'ljI(O, x)d3 x, one needs the matrix elements of the operator 0:(0) - fl. This operator has nonvanishing matrix elements only between states of identical momentum. The vanishing of the anticommutator (10.1.24) implies, furthermore, that the energies must be of opposite sign. Hence, we find that the Zitterbewegung is the result of interference between positive and negative energy states. 202 10. Physical Interpretation of the Solutions to the Dirac Equation *10.1.4 Potential Steps and the Klein Paradox One of the simplest exactly solvable problems in nonrelativistic quantum mechanics is that of motion in the region of a potential step (Fig. 10.1). If the energy E of the plane wave incident from the left is smaller than the height Vo of the potential step, i.e., E < Vo, then the wave is reflected and penetrates into the classically forbidden region only as a decaying exponential e- Kx3 with K, = J2m(Vo - E). Hence, the larger the energy difference Vo - E, the smaller the penetration. The solution of the Dirac equation is also relatively easy to find, but is not without some surprises. We assume that a plane wave with positive energy is incident from the left. After separating out the common time dependence e- iEt , the solution in region I (cf. Fig.1O.1) comprises the incident wave (10.1.25) and the reflected wave (10.1.26) i.e., 7f!I(X 3 ) = 7f!in(X3) + 7f!refl(X 3 ). The second term in (10.1.26) represents a reflected wave with opposite spin and will turn out to be zero. In region II, we make a similar ansatz for the transmitted wave • 1. ( 'YII X 3) =_ (3) -_ ceiqx 3 .1. 'Ytrans X ( ~ E-Vo+m o ) + d eiqX3 ( ~ ) • -q E-Vo+m (10.1.27) Va f - - - - - - - E ---------'--------- x3 II I Fig. 10.1. A potential step of height Va 10.1 Wave Packets and "Zitterbewegung" 203 The wave vector (momentum) in this region is given by (10.1.28) and the coefficients a, b, c, d are determined from the requirement that 'lj; be continuous at the step. If the solution were not continuous, then, upon inserting it into the Dirac equation, one would obtain a contribution proportional to 8(x 3 ). From this continuity condition, 'lj;r(O) = 'lj;u(O), it follows that (1O.1.29a) l+a=c, 1- a = rc, with q E+m - kE-Va+m' (1O.1.29b) r=------ and (1O.1.29c) b=d=O. The latter relation, which stems from components 2 and 4, implies that the spin is not reversed. As long as IE - Val < m, i.e. -m + Va < E < m + Va, the wave vector q to the right of the step is imaginary and the solution in that case decays exponentially. In particular, when E, Va « m, then the solution 'lj;trans rv e-iQix3 rv e- mx3 is localized to within a few Compton wavelengths. However, when the height of the step Va becomes larger, so that finally Va ~ E + m, then, according to (10.1.28), q becomes real and one obtains an oscillating transmitted plane wave. This is an example of the Klein paradox. The source of this initially surprising result can be explained as follows: In region I, the positive energy solutions lie in the range E > m, and those with negative energy in the range E < -m. In region II, the positive energy solutions lie in the range E > m + Va, and those with negative energy in the range E < -m + Va. This means that for Va > m the solutions hitherto referred to as "negative energy solutions" actually also possess positive energy. When Va becomes so large that Va > 2m (see Fig. 10.2), the energy of these "negative energy solutions"in region II eventually becomes larger than m, and thus lies in the same energy range as the solutions of positive energy in region I. The condition for the occurrence of oscillatory solutions given after Eq. (10.1.29c) was Va ~ E+m, where the energy in region I satisfies E > m. This coincides with the considerations above. Instead of complete reflection with exponential penetration into the classically forbidden region, one has a transition into negative energy states for E > 2m. For the transmitted and reflected current density one finds, 4r (1 + r)2 ' j~efl = Jin (1 +- r) 1_jt~ans . 2 = 1 r Jin (10.1.30) 204 10. Physical Interpretation of the Solutions to the Dirac Equation ° for positive q and thus the However, according to Eq. (1O.1.29b), r < reflected current is greater than the incident current. If one takes the positive square root for q in (10.1.28), according to (1O.1.29b), r < 0, and consequently the flux going out to the left exceeds the (from the left) incoming flux. This comes about because, for Va> E+m, the group velocity 1 Va = E - Va q has the opposite sense to the direction of q. That is, wave-packet solutions of this type also contain incident wave packets coming form the right of the step. If one chooses for q in (10.1.28) the negative square root, r > 0, one obtains the regular reflection behavior3 . .,jL---L---L-.L.-L-L-_ Va +m r----- VaVa- m Fig. 10.2. Potential step and energy ranges for Va > 2m. Potential step (thick line) and energy ranges with positive and negative energy (right- and left-inclined hatching). To the left of the step, the energies E and E' lie in the range of positive energies. To the right of the step, E' lies in the forbidden region, and hence the solution is exponentially decaying. E lies in the region of solutions with negative energy. The energy E" lies in the positive energy region, both on the right and on the left. 10.2 The Hole Theory In this section, we will give a preliminary interpretation of the states with negative energy. The properties of positive energy states show remarkable 3 H.G. Dosch, J.H.D. Jensen and V.L. Muller, Physica Norvegica 5, 151 (1971); B. Thaller, The Dirac Equation, Springer, Berlin, Heidelberg, 1992, pp. 120,307; W. Greiner, Theoretical Physics, Vol. 3, Relativistic Quantum Mechanics, Wave Equations, 2nd edn., Springer, Berlin, Heidelberg, 1997 10.2 The Hole Theory 205 agreement with experiment. Can we simply ignore the negative energy states? The answer is: No. This is because an arbitrary wave packet will also contain components of negative energy V r . Even if we have spinors of positive energy, U r , to start with, the interaction with the radiation field can cause transitions into negative energy states (see Fig. 10.3). Atoms, and thus all matter surrounding us, would be unstable. E Fig. 10.3. Energy eigenvalues of the Dirac equation and conceivable transitions A way out of this dilemma was suggested by Dirac in 1930. He postulated that all negative energy states be considered as occupied. Thus, particles with positive energy cannot make transitions into these states because the Pauli principle forbids multiple occupation. In this picture, the vacuum state consists of an infinite sea of particles, all of which are in negative energy states (Fig. 10.4). Fig. 10.4. Filled negative energy states (thick line): (a) vacuum state, (b) excited state b) a) An excited state of this vacuum arises as follows: An electron of negative energy is promoted to a state of positive energy, leaving behind a hole with charge -(-eo) = eo! (Fig. 10.4 b). This immediately has an interesting consequence. Suppose that we remove a particle of negative energy from the vacuum state. This leaves behind a hole. In comparison to the vacuum state, this state has positive charge and positive energy. The absence of a negative energy state represents an antiparticle. For the electron, this is the positron. Let us consider, for example, the spinor with negative energy Vr=I (p')e ip' x = VI (p')ei(Ep/t-p/x) . 206 10. Physical Interpretation of the Solutions to the Dirac Equation This is an eigenstate with energy eigenvalue - Epl, momentum -p' and spin in the rest frame ~E3 1/2. When this state is unoccupied, a positron is present with energy Epl, momentum p', and spin ~E3 -1/2. (An analogous situation occurs for the excitation of a degenerate ideal electron gas, as discussed at the end of Sect. 2.1.1.) The situation described here can be further elucidated by considering the excitation of an electron state by a photon: The 'Y quantum of the photon with its energy nw and momentum nk excites an electron of negative energy into a positive energy state (Fig. 10.5). In reality, due to the requirements of energy and momentum conservation, this process of pair creation can only take place in the presence of a potential. Let us look at the energy and momentum balance of the process. Fig. 10.5. The photon I excites an electron from a negative energy state into a positive energy state, i.e., e+ + e- ,-+ The energy balance for pair creation reads: nw = Eel. pas. energy - Eel. neg. energy = Ep - ( - Epl) = Eel. + Epos. (10.2.1 ) The energy of the electron is Eel. = y'p 2C2 + m 2 c4, and the energy of the positron Epos. = y'p'2C 2 + m 2 c4 . The momentum balance reads: nk-p' = p or nk= p+p', (10.2.2) i.e., (photon momentum) = (electron momentum) + (positron momentum). It turns out, however, that this preliminary interpretation of the Dirac theory still conceals a number of problems: The ground state (vacuum state) has an infinitely large (negative) energy. One must also inquire as to the role played by the interaction of the particles in the occupied negative energy states. Furthermore, in the above, treatment, there is an asymmetry between electron and positron. If one were to begin with the Dirac equation of the positron, one would have to occupy its negative energy states and the electrons would be holes in the positron sea. This interpretation necessarily describes a many-body system. A genuinely adequate description only becomes possible through the quantization of the Dirac field. The original tendency was to view the Dirac equation as a generalization of the Schrodinger equation and to interpret the spinor 'ljJ as a sort of wave Problems 207 function. However, this leads to insurmountable difficulties. For example, even the concept of a probability distribution for the localization of a particle at a particular point in space becomes problematic in the relativistic theory. Also connected to this is the fact that the problematic features of the Dirac single-particle theory manifest themselves, in particular, when a particle is highly localized in space (in a region comparable to the Compton wavelength). The appearance of these problems can be made plausible with the help of the uncertainty relation. When a particle is confined to a region of size Llx, it has, according to Heisenberg's uncertainty relation, a momentum spread Llp > liLlx- 1 . If Llx < then the particle's momentum, and thus energy, uncertainty becomes '::c' Thus, in this situation, the energy of a single particle is sufficient to create several other particles. This, too, is an indication that the single-particle theory must be replaced by a many-particle theory, i.e. a quantum field theory. Before finally turning to a representation by means of a quantized field, in the next chapter, we shall first investigate further symmetry properties of the Dirac equation in connection with the relationship of solutions of positive and negative energy to particles and antiparticles. Problems 10.1 Prove the Gordon identity (10.1.5), which states that, for two positive energy solutions of the free Dirac equation, U r (p) and Uri (p), 10.2 Derive Eq. (10.1.13) and the additional term (10.1.13'). 10.3 Verify the solution for the potential step considered in conjunction with the Klein paradox. Discuss the type of solutions obtained for the energy values E' and E" indicated in Fig. 10.2. Draw a diagram similar to Fig. 10.2 for a potential step of height 0 < Vo < m. 11. Symmetries and Further Properties of the Dirac Equation *11.1 Active and Passive Transformations, Transformations of Vectors In this and the following sections we shall investigate the symmetry properties of the Dirac equation in the presence of an electromagnetic potential. We begin by recalling the transformation behavior of spinors under passive and active transformations, as was described in Sect. 7.1. We will then address the transformation of the four-potential, and also investigate the transformation of the Dirac Hamiltonian. Consider the Lorentz transformation x' = Ax+a (11.1.1) from the coordinate system [ into the coordinate system I'. According to Eq. (7.l.2a), a spinor 1jJ(x) transforms under a passive transformation as (11.1.2a) where we have written down only the homogeneous transformation. An active transformation with A-I gives rise to the spinor (Eq. (7.1.2b)) (11.1.2b) The state Z", which is obtained from Z through the active transformation A, appears by definition in [' as the state Z in [, i.e., 1jJ(x'). Since [ is obtained from [' by the Lorentz transformation A-I, we have (Eq. (7.1.2c)) 1jJ"(x) = S-I1jJ(Ax) . (11.1.2c) For a passive transformation A, the spinor transforms according to (11.1.2a). For an active transformation A, the state is transformed according to (11.1.2c)1. We now consider the transformation of vector fields such as the fourpotential of the electromagnetic field: 1 For inhomogeneous transformations (A, a), one has (A, a) -1 = (A -1, - A -1 a) and in the arguments of Eq. (11.1.2a-c) one must make the replacements Ax ---+ Ax+a and A -1X ---+ A -1(X - a). 210 11. Symmetries and Further Properties of the Dirac Equation The passive transformation of the components of a vector AI-'(x) under a Lorentz transformation X'I-' = AI-'vxv takes the form (11.1.3a) The inverse of the Lorentz transformation may be established as follows: AAI-' gl-'V APv = gAP ===} AAV A pv = 8AP ===} A AvAPv = 8A' P Since the right inverse of a matrix is equal to its left inverse, together with Eq. (11.1.1), this implies and so, finally, the inverse of the Lorentz transformation (11.1.4) For an active transformation, the entire space, along with its vector fields, is transformed and then viewed from the original coordinate system I. For a transformation with A, the resulting vector field, when viewed from I', is of the form AI-'(X') (see Fig. 11.1). The field transformed actively with A, which we denote by A"I-'(x), therefore takes the form (11.1.3c) I Fig. 11.1. Active transformation of a vector with the Lorentz transformation A For the sake of completeness, we also give the active transformation with respect to the Lorentz transformation A-I, which leads to the form (11.1.3b) We now investigate the transformation of the Dirac equation in the presence of an electromagnetic field AI-' with respect to a passive Lorentz transformation: Starting from the Dirac equation in the system I 11.1 *Active and Passive Transformations, Transformations of Vectors (-y1L(i81L - eAIL(x)) - m) 'ljJ(x) = 0 , 211 (l1.1.5a) one obtains the transformed equation in the system I': (-y1LW~ - eA~(x') - m) 'ljJ'(x') (l1.1.5b) = 0 . Equation (1l.1.5b) is derived by inserting into (l1.1.5a) the transformations 81L == 8~1L AVIL8~; = AIL(x) = AVILA' V(x') and 'ljJ(x) = S-l'ljJ'(X') , which yields (r,ILAVW~ - eA~(x') - m) S-l'ljJ'(x') = o. Multiplying by S, (S'Y IL AVILS-1(W~ - eA~(x') - m)'ljJ'(x') = 0 , and making use of 'YIL AVIL = S-l'Y v S finally yields the desired result (-yVW~ - eA~(x') - m)'ljJ'(x') = 0 . Transformation of the Dirac equation with respect to an active Lorentz transformation, viz: 'ljJ" (x) = S-l'ljJ(Ax) (l1.1.2c) with A"IL(X) = At AV(Ax) . (l1.1.3c) Starting from (-y1L(WIL - eAIL(x)) - m)'ljJ(x) = we take this equation at the point x' a _ ax a - A 1.18 ax'" - ax'" ax IL v, (l1.1.5a) 0, = Ax, and taking note of the fact that V V - (-y1L(iA:8v - eAIL(Ax)) - m)'ljJ(Ax) = 0. Multiplying by S-l(A), (S-1'YILS(iA:8v - and using S-l'YILSA: (-yV(Wv - eA~(x) eAIL(Ax)) - m)S-l'ljJ(Ax) = 0 , = AIL,,.'Y''' A: = 'YaJ; together with Eq. (11.1.4) yields: - m)'ljJ"(x) = 0. (11.1.6) If'ljJ(x) satisfies the Dirac equation for the potential AIL(x), then the transformed spinor 'ljJ"(x) satisfies the Dirac equation with the transformed potential A~ (x ). In general, the transformed equation is different to the original one. The two equations are the same only when A~(x) = AIL(x). Then, 'ljJ(x) and'ljJ"(x) obey the same equation of motion. The equation of motion remains invariant under any Lorentz transformation L that leaves the external potential unchanged. For example, a radially symmetric potential is invariant under rotations. 212 11. Symmetries and Further Properties of the Dirac Equation 11.2 Invariance and Conservation Laws 11.2.1 The General Transformation We write the transformation 'ljJ"(x) = S-l'ljJ(Ax) in the form 'ljJ" = T'ljJ, (11.2.1) where the operator T contains both the effect of the matrix S and the transformation of the coordinates. The statement that the Dirac equation transforms under an active Lorentz transformation as above (Eq. (11.1.6)) implies for the operator (11.2.2) that TD(A)T- 1 = D(A") , (11.2.3) since (D(A) - m)'ljJ = 0 =} T(D(A) - m)'ljJ = T(D(A) - m)T-1T'ljJ = (D(A") - m)T'ljJ =0. As the transformed spinor T'ljJ obeys the Dirac equation (D(A") -m)T'ljJ = 0, and this holds for every spinor, equation (11.2.3) follows. If A remains unchanged under the Lorentz transformation in question (A" = A), it follows from (11.2.3) that T commutes with D(A): [T, D(A)] = (11.2.4) 0. One can construct the operator T for each of the individual transformations, to which we shall now turn our attention. 11.2.2 Rotations We have already found in Chap. 7 that 2 for rotations (11.2.5) with It 2 It i J=-.E+xx-V. 2 The difference in sign compared to Chap. 7 arises because there the active transformation A-I was considered. 11.2 Invariance and Conservation Laws 213 The total angular momentum J is the generator of rotations. If one takes an infinitesimal cpk, then, from (11.2.2) and (11.2.4) and after expansion of the exponential function, it follows that for a rotationally invariant potential A, [D(A), J] = 0 . (11.2.6) Since [hOOt, ,i,k] = 0 and [hOOt, x x V] = 0, equation (11.2.6) also implies that [J,H] = 0, (11.2.7) where H is the Dirac Hamiltonian. 11.2.3 Translations For translations we have S 'lj;//(x) = 'lj;(x + a) = = 1l and ea~l'j;(x) , (11.2.8) and thus the translation operator is (11.2.9) where PIL = iOlL is the momentum operator. The momentum is the generator of translations. The translational invariance of a problem means that [D(A),PIL] = 0 and since [hOOt,piLl (11.2.10) = 0, this also implies that [pIL,H] =0. (11.2.11 ) 11.2.4 Spatial Reflection (Parity Transformation) We now turn to the parity transformation. The parity operation P, represented by the parity operator P, is associated with a spatial reflection. We use prO) to denote the orbital parity operator, which causes a spatial reflection p(O)'lj;(t, x) = 'lj;(t, -x) . (11.2.12) For the total parity operator in Sect. 6.2.2.4, we found, to within an arbitrary phase factor, P = ,Op(O) . (11.2.13) We also have pt = P and p2 = 1. If AIL(X) is invariant under inversion, then the Dirac Hamiltonian H satisfies [P,H] =0. (11.2.14) There remain two more discrete symmetries of the Dirac equation, charge conjugation and time-reversal invariance. 214 11. Symmetries and Further Properties of the Dirac Equation 11.3 Charge Conjugation The Hole theory suggests that the electron possesses an antiparticle, the positron. This particle was actually discovered experimentally in 1933 by C.D. Anderson. The positron is also a fermion with spin 1/2 and should itself satisfy the Dirac equation with e --+ -e. There must thus be a connection between negative energy solutions for negative charge and positive energy solutions carrying positive charge. This additional symmetry transformation of the Dirac equation is referred to as "charge conjugation", C. The Dirac equation of the electron reads: (i~ - e4- - m)'lj; = 0, e = -eo, eo = 4.8 x 10- lO esu (11.3.1) and the Dirac equation for an oppositely charged particle is (i~ + e4- - (11.3.2) m )'lj;c = 0 . We seek a transformation that converts 'lj; into 'lj;c. We begin by establishing the effects of complex conjugation on the first two terms of (11.3.1): (i8J.L)* = -i8J.L (11.3.3a) (AJ.L)* = AJ.L , (11.3.3b) as the electromagnetic field is real. In the next section, in particular, it will turn out to be useful to define an operator K o that has the effect of complex conjugating the operators and spinors upon which it acts. Using this notation, (11.3.3a,b) reads: (11.3.3') Thus, when one takes the complex conjugate of the Dirac equation, one obtains (11.3.4) In comparison with Eq. (11.3.1), not only is the sign of the charge opposite, but also that of the mass term. We seek a nonsingular matrix C,!o with the property (11.3.5) With the help of this matrix, we obtain from (11.3.4) C,!o (-(i8J.L = (i~ + eAJ.LhJ.L* + e4- - - m) (C,!O)-lC,!°'lj;* m)(C,!°'lj;*) =0. Comparison with (11.3.2) shows that (11.3.6) 11.3 Charge Conjugation 215 (11.3.7) since ifiT = (1/Jt"(0f = "(OT 1/Jt T = "(01/J* . (11.3.8) Equation (11.3.5) can also be written in the form C- 1 ,,(J],C = _,,(J], T (11.3.5') In the standard representation, we have "(OT = ,,(0, "(2T = "(2, "(IT = _,,(1, = _,,(3 , and, hence, C commutes with "(I and "(3 and anticommutes with "(0 and "(2. From this, it follows that "(3 T C = i"(2"(0 = _C- 1 = -ct = (11.3.9) _CT , so that 1/Jc = i"(21/J* (11.3.7') . The full charge conjugation operation C = C"(oKo = (11.3.7") i"(2 K o consists in complex conjugation K o and multiplication by C"(o. If 1/J(x) describes the motion of a Dirac particle with charge e in the potential AJ],(x), then 1/Jc describes the motion of a particle with charge -e in the same potential AJ], (x). Example: For a free particle, for which AJ], = 0, (11.3.10) and therefore, ("l-t ~ C,o ("l-)), ~ h' ("l-)), ~ (2~* (~}_im' ~ "l+) (11.3.10') The charge conjugated state has opposite spin. We now consider a more general state with momentum k and polarization along n. With respect to the projection operators, this has the property 3 3 rI = "(I-'nl-' , nl-' pen) = ~(1 space-like unit vector n2 = nl-'nl-' = -1 and nl-'kl-' = o. + "(5r1) projects onto the positive energy spinor u(k,n), which is polarized along n in the rest frame, and onto the negative energy spinor v(k, n), which is polarized along -no k = Ak, n = An, k = operators A±(k) == (±~ (m, 0, 0, 0), n = (0, n) (see Appendix C). The projection + m)/2m were introduced in Eq. (6.3.21). 216 11. Symmetries and Further Properties of the Dirac Equation (11.3.11) with c = ±1 indicating the sign of the energy. Applying charge conjugation to this relation, one obtains m) * (1 \'Y5'¢) 'l/J* w C'Yo (c + m) (C'Yo) C'Yo (1 + 'Y5'¢*) (C'Yo) (-c~; m) (1 +2'Y5'¢) 'l/Jc , 'l/Jc = olj;T = = = c'l (c~2: 2m (11.3.11') * -1 2 -1 C'Yo'l/J * where we have used 1'5 = 1'5 and {C'Yo, 'Y5} = O. The state 1/Jc is characterized by the same four-vectors, k and n, as 'l/J, but the energy has reversed its sign. Since the projection operator !(1 + 'Y5,¢) projects onto spin ±! along n, depending on the sign of the energy, the spin is reversed under charge conjugation. With regard to the momentum, we should like to point out that, for free spinors, complex conjugation yields e- ikx -+ eikx , i.e., the momentum k is transformed into - k. Thus far, we have discussed the transformation of the spinors. In the qualitative description provided by the Hole theory, which finds its ultimate mathematical representation in quantum field theory, the non-occupation of a spinor of negative energy corresponds to an antiparticle with positive energy and exactly the opposite quantum numbers to those of the spinor (Sect. 10.2). Therefore, under charge conjugation, the particles and antiparticles are transformed into one another, having the same energy and spin, but opposite charge. Remarks: (i) The Dirac equation is obviously invariant under simultaneous transformation of 7j; and A, 7j; --+ 7j;c = 'f/cCiV Ait --+ A~ = -Alt· With respect to charge conjugation, the four-current density jl' transforms according to . lit ·c -. t -T = 7j;,lt7j; --+ lit = 7j;c,lt7j;c = 7j; C ,O,ltC'Ij; = 7j;T ,o( -Cho'ltciV = 'lj;T C,ltCiV = 7j;T,~ iV = 7j;"'(,It)j3""Sp7j;; = 7j;,~3(It)" = if;'lt7j; . c-number Dirac field one thus obtains j~ = jlt. In the quantized form, For the and if; become anticommuting fields, which leads to an extra minus sign: j~ = -jlt . 7j; (11.3.12) Then, under charge conjugation, the combination ej . A remains invariant. As we shall see explicitly in the case of the Majorana representation, the form of the charge conjugation transformation depends on the representation. 11.4 Time Reversal (Motion Reversal) 217 (ii) A Majorana representation is a representation of the "/ matrices with the property that ,,/0 is imaginary and antisymmetric, whilst the ,,/k are imaginary and symmetric. In a Majorana representation, the Dirac equation (i"{"'8", - m)'IjJ =0 is a real equation. If 'Ij; is a solution of this equation, then so is (11.3.13) In the Majorana representation, the solution related to 'Ij; by charge conjugation is, to within an arbitrary phase factor, given by (11.3.13), since the Dirac equation for the field 'IjJ, (11.3.14) also leads to (11.3.14') The spinor 'IjJ is the solution of the Dirac equation with a field corresponding to charge e and the spinor 'ljJc is the solution for charge -e. A spinor that is real, i.e., 'IjJ* = 'IjJ, is known as a Majorana spinor. A Dirac spinor consists of two Majorana spinors. An example of a Majorana representation is the set of matrices 0 (2 ( a2 0 "/0 = "/2 = .I (11.0 ) ,,,/1 = . ( I 0 (1) 0 ' a1 (11.3.15) -11.0 ) ,,,/3 = I. ( 3 a03 (0 ) . Another example is given in Problem 11.2. 11.4 Time Reversal (Motion Reversal) Although the more appropriate name for this discrete symmetry transformation would be "motion reversal" , the term "time reversal transformation" is so well established that we shall adopt this practice. It should be emphasized from the outset that the time-reversal transformation does not cause a system to evolve backwards in time, despite the fact that it includes a change in the time argument of a state t --+ -to One does not need clocks that run backwards in order to study time reversal and the invariance of a theory under this transformation. What one is really dealing with is a reversal of the motion. In quantum mechanics the situation is further complicated by a formal difficulty: In order to describe time reversal, one needs antiunitary operators. In this section we first study the time-reversal transformation in classical mechanics and nonrelativistic quantum mechanics, and then turn our attention to the Dirac equation. 218 11. Symmetries and Further Properties of the Dirac Equation 11.4.1 Reversal of Motion in Classical Physics Let us consider a classical system invariant under time translation, which is described by the generalized coordinates q and momenta p. The timeindependent Hamiltonian function is H(q,p). Hamilton's equations of motion are then . aH(q,p) ap . aH(q,p) p=aq q= (11.4.1) At t = 0, we assume the initial values (qO,Po) for the generalized coordinates and momenta. Hence, the solution q(t),p(t) of Hamilton's equations of motion must satisfy the initial conditions q(O) = qo p(O) = Po. (11.4.2) Let the solution at a later time t = tl > 0 assume the values (l1.4.3a) The motion-reversed state at time tl is defined by (l1.4.3b) If, after this motion reversal, the system retraces its path, and after a further time t, returns to its time reversed initial state, the system is said to be timereversal or motion-reversal invariant (see Fig. 11.2). To test time-reversal invariance there is no need for running backwards in time. In the definition which one encounters above, only motion for the positive time direction arises. As a result, it is possible to test experimentally whether a system is timereversal invariant. q2 Fig. 11.2. Motion reversal: Shown (displaced for clarity) are the trajectories in real space: (0, iI) prior to reversal of the motion, and (iI , 2iI) after reversal 11.4 Time Reversal (Motion Reversal) 219 Let us now investigate the conditions for time-reversal invariance and find the solution for the motion-reversed initial state. We define the functions q'(t) = q(2tl - t) p'(t) = -P(2tl - t) . (11.4.4) These functions obviously satisfy the initial conditions q'(tl) = q(tl) = ql p'(tl ) = -p(h) = -Pl· (11.4.5) (11.4.6) At time 2tI they have the values q'(2h) = q(O) = qo p'(2h) = -p(O) = -Po, (11.4.7) Le., the motion-reversed initial values. Finally, they satisfy the equation of motion4 aH(q(2h - t),p(2h - t)) ap(2h - t) aH(q'(t), -p'(t)) ap'(t) "( ) _ '(2 _) __ aH(q(2h - t),p(2tl - t)) pt-ph t>l() uq 2tI - t q'(t) = -q(2tl - t) = aH(q'(t), -p'(t)) aq' (t) (1l.4.8a) (11.4.8b) The equations of motion of the functions q'(t),p'(t) are described, according to (11.4.8a,b), by a Hamiltonian function H, which is related to the original Hamiltonian by making the replacement p -+ -p: H = H(q,-p). (11.4.9) Most Hamiltonians are quadratic in p (e.g., that of particles in an external potential interacting via potentials), and are thus invariant under motion reversal. For these, H = H(q,p), and q'(t),p'(t) satisfy the original equation of motion evolving from the motion-reversed starting value (ql, -pd to the motion-reversed initial value (qo, -Po) of the original solution (q(t),p(t)). This implies that such classical systems are time-reversal invariant. Motion-reversal invariance in this straightforward fashion does not apply to the motion of particles in a magnetic field, or to any other force that varies linearly with velocity. This is readily seen if one considers Fig. 11.3: In a homogeneous magnetic field, charged particles move along circles, the 4 The dot implies differentiation with respect to the whole argument, e.g., q(2tl - t) == aWtl~N. 220 11. Symmetries and Further Properties of the Dirac Equation ® B Fig. 11.3. Motion reversal in the presence of a magnetic field B perpendicular to the plane of the page. The motion is reversed at an instant when the particle is moving in exactly the x-direction o .. x sense of motion depending on the sign of the charge. Thus, when the motion is reversed, the particle does not return along the same circle, but instead moves along the upper arc shown in Fig. 11.3. In the presence of a magnetic field, one can only achieve motion-reversal invariance if the direction of the magnetic field is also reversed: B -+ -B, (11.4.10) as can be seen from the sketch, or from the following calculation. Let the Hamiltonian in cartesian coordinates with no field be written H = H(x,p), which will be assumed to be invariant with respect to time reversal. The Hamiltonian in the presence of an electromagnetic field is then H = e c H(x, p - -A(x)) + ecP(x) , (11.4.11) where A is the vector potential and cP the scalar potential. This Hamiltonian is no longer invariant under the transformation (11.4.4). However, it is invariant under the general transformation x'(t) p'(t) A'(x,t) cP'(x, t) = = = = X(2tl - t) -P(2tl - t) -A(x,2tl -t) cP(x, 2tl - t) . (11.4.12a) (11.4.12b) (11.4.12c) (11.4.12d) Equations (11.4.12c) and (11.4.12d) imply a change in the sign of the magnetic field, but not of the electric field, as can be seen from B = curl A -+ curl A' =-B 1 a ( ) -+-VCP, +--A 1 a '( xt ) E=-VcP+--Axt cat' cat' 1 a C 2tl - t = - VcP + - a( ) A(x, 2tl - t) = E . (11.4.13a) 11.4 Time Reversal (Motion Reversal) 221 We note in passing that when the Lorentz condition 1 a --4>+ VA = 0 cat (11.4.13b) holds, it also holds for the motion-reversed potentials. Remark: In the above description we considered motion in the time interval [0, h], and then allowed the motion-reversed process to occur in the adjoining time interval [h,2tll. We could equally well have considered the original motion in the time interval [-tl, tIl and, as counterpart, the motion-reversed process also lying in the time interval between -tl and tl: q"(t) = q(-t) p"(t) = (11.4.14) -p(-t) with the initial conditions q"( -h) = q(h) , p"( -h) = -p(td (11.4.15) and final values, q"(td p"(td = q( -td , = -pC-h) (11.4.16) . (q"(t),p"(t)) differs from (q'(t),p'(t) in Eq. (11.4.4) only by a time translation of 2h; in both cases, time runs in the positive sense -tl to tl. 11.4.2 Time Reversal in Quantum Mechanics 11.4.2.1 Time Reversal in the Coordinate Representation Following these classical mechanical preparatory remarks, we now turn to nonrelativistic quantum mechanics (in the coordinate representation). The system is described by the wave function 'l/J(x, t), which obeys the Schrodinger equation .a'l/J(X, t) = Hol,( ) at <p x, t . (11.4.17) I Let us take the wave function at time t 'l/J(x,O) = 'l/Jo(x) . = 0 to be given by 'l/Jo(x), i.e., (11.4.18) This initial condition determines 'l/J(x, t) at all later times t. Although the Schrodinger equation enables one to calculate 'l/J(x, t) at earlier times, this is usually not of interest. The statement that the wave function at t = 0 is 222 11. Symmetries and Further Properties of the Dirac Equation '¢o(x) implies that a measurement that has been made will, in general, have changed the state of the system discontinuously. At the time h > 0 we let the wave function be (11.4.19) What is the motion reversed system for which an initial state '¢l (x) evolves into the state '¢o(x) after a time tl? Due to the presence of the first order time derivative, the function '¢(X,2tl - t) does not satisfy the Schrodinger equation. However, if, in addition, we take the complex conjugate of the wave function ,¢' (x, t) = '¢* (x, 2tl - t) == Ko'¢(x, 2tl - t) , (11.4.20) this satisfies the differential equation (11.4.21) and the boundary conditions '¢' (x, tt) = '¢~ (x) '¢' (x, 2td = '¢o (x) . (l1.4.22a) (11.4.22b) Proof. Omitting the argument x, we have 5 .8'1j;'(t) _ .8'1j;*(2h -t) _ _ T/ .8'1j;(2h -t) _ at - 1 8t .nOl at - 1 = KoH'Ij;(2h - t) = H*'Ij;*(2h - t) T/ .8'1j;(2tl-t) 8( -t) .nOl = H*'Ij;'(t) . Here, H* is the complex conjugate of the Hamiltonian, which is not necessarily identical to Ht. For the momentum operator, for example, we have (11.4.23) When the Hamiltonian is quadratic in p, then H* = H and thus the system is time-reversal invariant. We now calculate the expectation values of momentum, position, and angular momentum (the upper index gives the time and the lower index the wave function): (p)~ (x)~ 5 j x '¢* ~v'¢ = ('¢,x'¢) = jd3X,¢*(x,t)x,¢(x,t) = ('¢, p'¢) = d3 The operator Ko has the effect of complex conjugation. (l1.4.24a) (l1.4.24b) 11.4 Time Reversal (Motion Reversal) 223 (l1.4.24c) (l1.4.24d) (11.4.24e) These results are in exact correspondence to the classical results. The mean value of the position of the motion-reversed state follows the same trajectory backwards, the mean value of the momentum having the opposite sign. Here, too, we can take 'ljJ(x, t) in the interval [-tl' tIl and likewise, 'ljJ'(x, t) = Ko'ljJ(x, -t) (11.4.25) in the interval [-h, tl], corresponding to the classical case (11.4.14). In the following, we will represent the time-reversal transformation in this more compact form. The direction of time is always positive. Since K5 = 1, we have Kol = Ko. Due to the property (11.4.23), and since the spatial coordinates are real, we find the following transformation behavior for x, p, and L: KoxKOI = x (l1.4.25'c) KopKol =-p (l1.4.25'd) KoLKol = -L. (l1.4.25'e) 11.4.2.2 Antilinear and Antiunitary Operators The transformation 'ljJ -+ 'ljJ'(t) = Ko'ljJ( -t) is not unitary. Definition: An operator A is antilinear if (11.4.26) Definition: An operator A is antiunitary if it is antilinear and obeys (A'ljJ,Aip) = (ip,'ljJ) . (11.4.27) Ko is evidently antilinear, and, furthermore, (11.4.28) 224 11. Symmetries and Further Properties of the Dirac Equation Hence, Ko is antiunitary. If U is unitary, uut = UtU as follows: = 1, then U Ko is antiunitary, which can be seen UKO(al7/Jl + a27/J2) = U(ai K o7/Jl + a;Ko7/J2) = ai UKo7/Jl + a; UKo7/J2 (UKo7/J, UKo<.p) = (Ko7/J, UtUKo<.p) = (Ko7/J, Ko<.p) = (<.p,7/J). The converse is also true: Every anti unitary operator can be represented in the form A = U Ko. Proof: We have = 1. Let A be a given antiunitary operator; we define U = AKo. The operator U satisfies K5 U(al7/Jl + a27/J2) = = AKo(al7/Jl (al AKo7/Jl + a27/J2) = A(ai K o7/Jl + a;Ko7/J2) + a2 AKo7/J2) = (a 1 U7/Jl + a2 U7/J2) , and, hence, U is linear. Furthermore, (U<.p,U7/J) = (AKo<.p, A Ko7/J) = (A<.p*,A7/J*) = (7/J*,<.p*) = J d3 x7/J<.p* = (<.p,7/J) , and thus U is also unitary. From U proving the assertion. = AKo it follows that A = U K o, thus Notes: (i) For antilinear operators such as K o , it is advantageous to work in the coordinate representation. If the Dirac bra and ket notation is used, one must bear in mind that its effect is dependent on the basis employed. If la) = J d 3 Ie) (ela), then in the coordinate representation, and insisting that Ko Ie) = Ie), e (11.4.29) For the momentum eigenstates this implies that since (elp) = e iPe and (elp)* = e- iPe . If one chooses a different basis, e.g., In) and postulates Ko In) = In), then Ko la) will not be the same as in the basis of position eigenfunctions. When we have cause to use the Dirac notation in the context of time reversal, a basis of position eigenfunctions will be employed. (ii) In addition, the effect of antiunitary operators is only defined for ket vectors. The relation (al (L Ib)) = ((al L) Ib) = (al Lib) , valid for linear operators, does not hold in the antiunitary case. This stems from the fact that a bra vector is defined as a linear functional on the ket vectors. 6 6 See, e.g., QM I, Sect. 8.2, footnote 2. 11.4 Time Reversal (Motion Reversal) 11.4.2.3 The Time-Reversal Operator r 225 in Linear State Space A. General properties, spin 0 Here, and in the next section, we describe the time-reversal transformation in the linear space of the ket and bra vectors, since this is frequently employed in quantum statistics. We give a general analysis of the condition of time reversal and also consider particles with spin. It will emerge anew that time reversal (motion reversal) cannot be represented by a unitary transformation. We denote the time-reversal operator by 7. The requirement of time-reversal invariance implies e- iHt 711/1(t)) = 711/1(0)) , (11.4.30) i.e., Hence, if one carries out a motion reversal after time t and allows the system to evolve for a further period t, the resulting state is identical to the motionreversed state at time t = O. Since Eq. (11.4.30) is valid for arbitrary 1"p(0)), it follows that whence (11.4.31 ) Differentiating (11.4.31) with respect to time and setting t = 0, one obtains /iH = -iH7. (11.4.32) One might ask whether there could also be a unitary operator 7 that satisfies (11.4.32). If 7 were unitary and thus also linear, one could then move the i occurring on the left-hand side in front of the 7 and cancel it to obtain 7H+H7=0. Then, for every eigenfunction 1/1E with we would also have H71/1E = -E71/1E . For every positive energy E there would be a corresponding solution 71/1E with eigenvalue (-E). There would be no lower limit to the energy since 226 11. Symmetries and Further Properties of the Dirac Equation there are certainly states with arbitrarily large positive energy. Therefore, we can rule out the possibility that there exists a unitary operator T satisfying (11.4.31). According to a theorem due to Wigner 7 , symmetry transformations are either unitary or anti unitary, and hence T can only be antiunitary. Therefore, TiH = -iTH and (11.4.33) TH -HT=O. Let us now consider a matrix element of a linear operator B: (aIBI,6) = (B t al,6) = = (T,6IBt~la) (T,6ITB t a) = (T,6I TBt~l ITa) or = (aIB,6) = (TB,6ITa) = (TB~l,6I a) = (T,61 TB~lIa) (11.4.34) If we assume that B is hermitian and (11.4.35) which is suggested by the results of wave mechanics (Eq. then follows that (al B 1,6) = EB (1l.42a~e), it (T,61 B ITa) The quantity E B is known as the "signature" of the operator B. Let us take the diagonal element (al B la) = EB (Tal B ITa) Comparing this with (1l.4.24c---€) and of the operators Tx~l Tp~l TL~l = yields the transformation X (l1.4.36a) =-p (l1.4.36b) = i.e., Ex = 1, Ep of the first two. 7 (l1.425'c~e) -L, = -1, and (1l.4.36c) EL = -1. The last relation is also a consequence E.P. Wigner, Group Theory and its Applications to Quantum Mechanics, Academic Press, p. 233; V. Bargmann, J. Math. Phys. 5,862 (1964) 11.4 Time Reversal (Motion Reversal) 227 Remark: If the relations (11.4.36) are considered as the primary defining conditions on the operator 7, then, by transforming the commutator [x,p) = i, one obtains Ti7- 1 = 7[x,p) 7- 1 = [x, -p) = -i . This yields 7i7- 1 = -i, which means that 7 is antilinear. We now investigate the effect of I on coordinate eigenstates where eis real. Applying I Ie), defined by to this equation and using (11.4.36a), one obtains Hence, with unchanged normalization, I factor. The latter is set to 1: Ie) equals Ie) to within a phase (11.4.37) Then, for an arbitrary state 1'ljJ), the antiunitarity implies 11'ljJ) = I = f d3~'ljJ(e) Ie) = f d3~'ljJ*(e) f d3~'ljJ*(e)/ (11.4.38) Ie) . Hence, the operator I is equivalent to Ko (cf. Eq. (11.4.29)): (11.4.39) I=Ko · For the momentum eigenstates, it follows from (11.4.38) that Ip) = lip) = f d3~ f d3~e-ip eiPe B. Nonrelativistic spin-~ Ie) (11.4.40) Ie) = I-p) particles Up to now we have considered only particles without spin. Here, we will, in analogy to the orbital angular momentum, extend the theory to spin-~ particles. We demand for the spin operator that 18/- 1 = -8. (11.4.41) 228 11. Symmetries and Further Properties of the Dirac Equation The total angular momentum (11.4.42) then also transforms as TJ'"r- 1 = -J . (11.4.43) For spin-! we assert that the operator T is given by (11.4.44) The validity of this assertion is demonstrated by the fact that the proposed form satisfies Eq. (11.4.41) in the form T8 = -8T: for the x and z components and for the y component For the square of T, from (11.4.44) one gets 'f2 = -iO"yKo( -iO"yKo) = -iO"yi( -O"y)K5 = +i20"~ = -1. (11.4.45) For particles with no spin, 'f2 = K5 = 1. For N particles, the time-reversal transformation is given by the direct product Iii -i7rS(N) IIiK T =e -i7rS(l) Y ••• e Y 0, (11.4.46) is the y component of the spin operator of the nth particle. The where S~n) square of T is now given by (11.4.45') In this context, it is worth mentioning Kramers theorem. 8 This states that the energy levels of a system with an odd number of electrons must be at least doubly degenerate whenever time-reversal invariance holds, i.e., when no magnetic field is present. Proof: From (T'ljJ, T r.p) = (r.p, 'ljJ) it follows that 8 H.A. Kramers, Koninkl. Ned. Wetenschap. Proc. 33, 959 (1930) 11.4 Time Reversal (Motion Reversal) 229 (Tt/J,'¢) = (T¢, 72'¢) = -(7'¢, '¢) . Thus, (7,¢, '¢) = 0, i.e., 7'1j; and '¢ are orthogonal to one another. In addition, from H'¢ = E'¢ and (11.4.33), it follows that H(7'¢) = E(7'¢) . The states '¢ and 7'¢ have the same energy. However, the two states are also distinct: If it were the case that 7'¢ = a,¢, this would imply J2'¢ = a*7'¢ = laI 2 ,¢, which would contradict the fact that J2 = -1. However complicated the electric fields acting on the electrons may be, for an odd number of electrons this degeneracy, at least, always remains. It is referred to as "Kramers degeneracy". For an even number of electrons, J2 = 1, and in this case no degeneracy need exist unless there is some spatial symmetry. 11.4.3 Time-Reversal Invariance of the Dirac Equation We now turn our attention to the main topic of interest, the time-reversal invariance of the Dirac equation. The time-reversal transformation 7 = T7(0) , where 7(0) stands for the operation t -+ -t and T is a transformation still to be determined, associates to the spinor '¢(x, t) another spinor ,¢'(x, t) = T7(0),¢(x, t) = T'¢(x, -t) , (11.4.47) which also satisfies the Dirac equation. If, at a time -h, the spinor is of the form '¢(x, -h) and evolves, according to the Dirac equation, into the spinor '¢(x, td at time h, then the spinor ,¢'(x, -h) = T'¢(x, tl) at time -tl evolves into ,¢'(x, h) = T'¢(x, -td at time h (see Fig. 11.4). tl 'Ij;'(h) = T'Ij;(-td 'Ij;( -iI) Fig. 11.4. Illustration of time reversal for the spinors 'Ij; and 'Ij;' (space coordinates are suppressed) 230 11. Symmetries and Further Properties of the Dirac Equation Applying i a~:, to the Dirac equation ,(0) (0. (-iV - t) = eA(x, t)) +,Bm + eAo(x, t)~(x, t) (11.4.48) i.e., making the replacement t --+ -t, yields: i a~;t) eA(x, -t)) +,Bm + eAo(x, (0. (-iV - = -t)~(x, -t) . (11.4.49) Since, in wave mechanics, the time-reversal transformation is achieved by complex conjugation, we set T= ToKo where To is to be determined. We now apply T to Eq. (11.4.3). The effect of Ko is to replace i by -i, and one obtains i a~'J: t) = T(o . (-iV - eA(x, -t)) +,Bm + eAo(x, _t)T-l~'(x, t) . (11.4.49') The motion-reversed vector potential appearing in this equation is generated by current densities, the direction of which is now reversed with respect to the original unprimed current densities. This implies that the vector potential changes its sign, whereas the zero component remains unchanged with respect to motion reversal A'(x, t) = -A(x, -t), A'o(x, t) Hence, the Dirac equation for i a~'i: t) = (0. ~' = AO(x, -t) . (11.4.50) (x, t) (-iV - eA'(x, t)) +,Bm + eA~(x, t)~'(x, t) , (11.4.51) is obtained when ToT- 1 = T satisfies the condition -0 and T,BT- 1 = ,B , (11.4.52) where the effect of Ko on i in the momentum operator has been taken into account. With T = ToKo, the last equation implies (11.4.52') where we have chosen the standard representation for the Dirac matrices in which,B is real. Since £Yl and £Y3 are real, and £Y2 imaginary, we have 11.4 Time Reversal (Motion Reversal) , '-1 , '-1 , '-1 TOa1TO TOa2TO TOa 3TO , = -a1 = a2 = -a3 231 (11.4.52") '-1 Tof3To = f3 , which can also be written in the form {To, ad = {To, a3} = 0 (11.4.52 111 ) [To, a2] = [To, f3] = 0 . From (11.4.52") one finds the representation (11.4.53) and hence, (11.4.53') The factor i in (11.4.53) and (11.4.53') is arbitrary. Proof: To satisfies (11.4.52"'), since, e.g., {To, ctd = ctlct3ctl + ctlalct3 = o. The total time-reversal transformation, T = TO KoT(O) , can be written in the form 'lj;' (x, t) = i'l,),3 Ko'lj;(x, -t) = i,.l,..,l'lj;* (x, -t) = hI ')'3 ')'0 iV (x, -t) = i')'2 ')'5 iV (x, -t) (11.4.47') and, as required, 'lj;'(x, t) satisfies the Dirac equation io'lj;,~:t) = (0:' (-iVeA'x,t)+f3m~lj;. (11.4.51') The transformation of the current density under time reversal follows from (11.4.47') as j'1-' = i[;'(x, t hI-' 'lj;' (x, t) = i[;(x, -thl-''I/J(x, -t) . (11.4.54) The spatial components of the current density change their signs. Equations (11.4.54) and (11.4.50) show that the d'Alembert equation for the electromagnetic potential Ov ovAl-' = j I-' is invariant under time reversal. In order to investigate the physical properties of a time-reversed spinor, we consider a free spinor 232 11. Symmetries and Further Properties of the Dirac Equation (11.4.55) with momentum p and spin orientation n (in the rest frame). The timereversal operation yields: T1jJ (fP2: m) (11 +2'51ft) 1jJ = T = To ( f P:: = (f~2:m) m) (11 +;51ft*) 1jJ* (x, -t) (11 +2'5~) (11.4.56) T1jJ, where p = (pO, -p) and fi = (no, -n). Here, we have used (11.4.52'). The spinor T1jJ has opposite momentum -p and opposite spin -no We have thus far discussed all discrete symmetry transformations of the Dirac equation. We will next investigate the combined action of the parity transformation P, charge conjugation C, and time reversal T. The successive application of these operations to a spinor 1jJ(x) yields: 1jJPCT(X') PC,oKoToK o1jJ(x', -t') = ,0i{2,O,O Koi{l,3 K o1jJ( -x') = = i,51jJ( -x') (11.4.57) . If one recalls the structure of,5 (Eq. (6.2.48')), it is apparent that the consequence of the C part of the transformation is to transform a negative-energy electron spinor into a positive-energy positron spinor. This becomes obvious when one begins with a spinor of negative energy and a particular spin orientation (-n), which hence satisfies the projection relation 1jJ(x) = (-~:m) Since {,5"Jl} (11+2'51ft)1jJ(X). = 0, it follows from (11.4.57) and (11.4.58) that .(p+m) .1. . 5 .1.( 'f'PCT (x' ) =1, 'f' -x ') =1 = (11.4.58) ~ (ll-151ft) .1. ( ') 2 ,5'f'PCT-X (p :mm) (ll -2'51ft) 1jJPCT(X') . (11.4.59) If 1jJ(x) is an electron spinor with negative energy, then 1jJPCT(X) is a positron spinor of positive energy. The spin orientation remains unchanged. 9 With 9 To determine the transformation behavior of the quanta from this, one must think of positrons in the context of Hole theory as unoccupied electron states of negative energy. Therefore, under peT, electrons are transformed into positrons with unchanged momentum and opposite spin. 11.4 Time Reversal (Motion Reversal) 233 regard to the first line of (11.4.59), one can interpret a positron spinor with positive energy as an electron spinor with negative energy that is multiplied by i!5 and moves backwards in space and time. This has an equivalent in the Feynman diagrams of perturbation theory (see Fig. 11.5). Electron ~ t Fig. 11.5. Feynman propagators for electrons Positron (arrow pointing upwards, i.e., in positive time direction) and positrons (arrow in negative time direction) ~e~ /+ e~ ~+ /ee~ b) a) T ~e~ e/ e) /+ Fig. 11.6. The effect of (a) the parity transformation P, (b) charge conjugation C, and (e) the time-reversal transformation T on an electron and a positron state. The long arrows represent the momentum, and the short arrows the spin orientation. These diagrams represent the transformations not of the spinors, but of the particles and antiparticles, in the sense of Hole theory or in quantum field theory Figure 11.6,a-c illustrates the effect of the transformations P, C, and T on an electron and a positron. According to the Dirac theory, electrons and positrons possess opposite parity. The effect of a parity transformation on a state containing free electrons and positrons is to reverse all momenta while leaving the spins unchanged, and additionally multiplying by a factor (-1) for every positron (Fig. 11.6a). Up until 1956 it was believed that a 234 11. Symmetries and Further Properties of the Dirac Equation spatial reflection on the fundamental microscopic level, i.e., a transformation of right-handed coordinate systems into left-handed systems, would lead to an identical physical world with identical physical laws. In 1956 Lee and Yang lO found convincing arguments indicating the violation of parity conservation in nuclear decay processes involving the weak interaction. The experiments they proposed 11 showed unambiguously that parity is neither conserved in the (3 decay of nuclei nor in the decay of 7r mesons. Therefore the Hamiltonian of the weak interaction must, in addition to the usual scalar terms, contain further pseudoscalar terms which change sign under the inversion of all coordinates. This is illustrated in Fig. 11. 7 for the experiment of Wu et al. on the (3 decay of radioactive 6000 nuclei into 6CNi. In this process a neutron within the nucleus decays into a proton, an electron, and a neutrino. Only the electron ((3 particle) can be readily observed. The nuclei possess a finite spin and a magnetic moment which can be oriented by means of a magnetic field. It is found that the electrons are emitted preferentially in the direction opposite to that of the spin of the nucleus. The essential experimental fact is that the direction of the velocity of the (3 particle v (3 (a polar vector) is determined by the direction of the magnetic field B (an axial vector), which orients the nuclear spins. Since the inversion P leaves the magnetic field B unchanged, while reversing v(3, the above observation is incompatible with a universal inversion symmetry. Parity is not conserved by the weak interaction. However, in all processes involving only the strong and the electromagnetic interactions, parity is conserved. 12 Under charge conjugation, electrons and positrons are interchanged, whilst the momenta and spins remain unchanged (Fig. 11.6b). This is because charge conjugation, according to Eqs. (11.3.7') and (11.3.11'), transforms the spinor into a spinor with opposite momentum and spin. Since the antiparticle (hole) corresponds to the nonoccupation of such a state, it again has opposite values and hence, in total, the same values as the original particle. Even the charge-conjugation invariance present in the free Dirac theory is not strictly valid in nature: it is violated by the weak interaction. 12 The time-reversal transformation reverses momenta and spins (Fig. 11.6c). The free Dirac theory is invariant under this transformation. In nature, time reversal invariance holds for almost all processes, whereby one should note that time reversal interchanges the initial and final states. It was in the decay processes of neutral K mesons that effects violating T invariance were first observed experimentally. 10 11 12 T.D. Lee and C.N. Yang Phys. Rev. 104, 254 (1956) C.S. Wu, E. Ambler, R.W. Hayward, D.D. Hoppes, and R.P. Hudson, Phys. Rev. 105, 1413 (1957); R.L. Garwin, L.M. Ledermann, and M. Weinrich, Phys. Rev. 105, 1415 (1957) A more detailed discussion of experiments testing the invariance of the electromagnetic and strong interactions under C, P, and CP, and their violation by the weak interaction, can be found in D.H. Perkins, Introduction to High Energy Physics, 2nd ed., Addison-Wesley, London, 1982. 11.4 Time Reversal (Motion Reversal) B B 235 Vf3 p e Co vf3 (a) (b) Fig. 11.7. Schematic representation of the parity violation observed in the ,B-decay experiment of Wu et al. The figure shows the current circulating in a toroidal coil generating the magnetic field B, which in turn orients the magnetic moment J.L of the cobalt nucleus and the associated angular momentum I, together with the velocity Vf3 of the ,B particle (electron). The (3 particles are emitted preferentially in the direction opposite to that of f..t. Thus configuration (a) corresponds to the experimental result, whereas configuration (b) is not observed The invariances C, P, and T are all violated individually in nature. 12 In relativistic field theory with an arbitrary local interaction, however, the product 8 = PCT must be an invariance transformation. This theorem, which is known as the PCT theorem 13 ,14, can be derived from the general axioms of quantum field theory 15. The PCT theorem implies that particles and antiparticles have the same mass and, if unstable, the same lifetime, although the decay rates for particular channels are not necessarily the same for particles and antiparticles. * 11.4.4 Racah Time Reflection Here, we determine the spinor transformation corresponding to a pure time reflection. According to Eq. (6.1.9), this is described by the Lorentz transformation J.L _ Av- ( -1000) 0100 0010 . 0001 (11.4.60) One readily sees that the condition for the spinor transformation (6.2.7) 13 14 15 G. Liiders, Dan. Mat. Fys. Medd. 28, 5 (1954); Ann. Phys. (N.Y.) 2, 1 (1957); W. Pauli, in Niels Bohr and the development of physics, ed. by W. Pauli, L. Rosenfeld, and V. Weisskopf, McGraw Hill, New York, 1955 The Lagrangian of a quantum field theory with the properties given in Sect. 12.2 transforms under e as £(x) -+ £( -x), so that the action S is invariant. R.F. Streater and A.S. Wightman peT, Spin Statistics and all that, W.A. Benjamin, New York, 1964; see also Itzykson, Zuber, op. cit., p. 158. 236 11. Symmetries and Further Properties of the Dirac Equation "(I1-SR = Al1-v SR,,(v is satisfied by16 (11.4.61) Hence, the transformation for the spinor and its adjoint has the form ,¢'=SR'¢ i{;' = '¢ t s1"(0 = -'¢ t "(0 Si/ = -i{;Si/ , (11.4.62) in agreement with the general result, Eq. (6.2.34b), b = -1 for time reversal, where Si/ = -"(3"(2"(1. The current density thus transforms according to (11.4.63) Hence, jl1- transforms as a pseudovector under Racah time reflection. The vector potential AI1- (x), on the other hand, transforms as (11.4.64) Thus, the field equation for the radiation field (11.4.65) is not invariant under this time reflection One can combine the Racah transformation with charge conjugation: ,¢'(x, t) = SR'¢c(X, -t) = S(T)i{;T(x, -t) . Here, the transformation matrix S(T) is related to SR and C (11.4.66) == i"(2"(0 This is the motion-reversal transformation (= time-reversal transformation), Eq. (11.4.47'). The Dirac equation is invariant under this transformation. *11.5 Helicity The helicity operator is defined by h(k) = 17· k, where 16 k = k/lkl (11.5.1) is the unit vector in the direction of the spinor's momentum. SR is known as the Racah time reflection operator, see J.M. Jauch and F. Rohrlich, The Theory of Photons and Electrons, p. 88, Springer, New York, 1980 *11.5 Helicity 237 Since E . k commutes with the Dirac Hamiltonian, there exist common eigenstates I7 of E . k and H. The helicity operator h(k) has the property h2 (k) = 1, and thus possesses the eigenvalues ±l. The helicity eigenstates with eigenvalue +1 (spin parallel to k) are called right-handed, and those with eigenvalue -1 (spin antiparallel to k) are termed left-handed. One can visualize the states of positive and negative helicity as analogous to rightand left-handed screws. From Eq. (6.3.11a), the effect of the helicity operator on the free spinor ur(k) is: E.kur(k) = E·k ((¥ft ~. ) 1 [2m(m+E)p = (_E~t "k~: ------,-1 u 'Pr (11.5.2) ) . k 'Pr [2m(m+E)p with 'PI = (~) and 'P2 = (~), and an analogous expression for the spinors vr(k). The Pauli spinors 'Pr are eigenstates of O'z and thus the ur(k) and vr(k) in the rest frame are eigenstates of Ez (see Eq. (6.3.4)). As an example of a simple special case, we now consider free spinors with wave vector along the z axis. Thus k = (0,0, k), and the helicity operator is (11.5.3) Furthermore, from Eq. (11.5.2) one sees that the spinors ur(k) and vr(k) are eigenstates of the helicity operator. According to Eqs. (6.3.11a) and (6.3.11b), the spinors for k = (O,O,k), i.e., for k' = (~k2 +m2,0,0,k) (to distinguish it from the z component, the four-vector is denoted by k'), are 17 In contrast to the nonrelativistic Pauli equation, however, the Dirac equation has no free solutions that are eigenfunctions of E . n with an arbitrarily oriented unit vector n. This is because, except for n = ±k, the product E . n does not commute with the free Dirac Hamiltonian. 238 11. Symmetries and Further Properties of the Dirac Equation with N = (E2~m) 1/2, and satisfy {~ ~ Ezur(k') = ±ur(k') for r = Ezvr(k') = ±vr(k') for r = {~ ~ (11.5.5) The letter R indicates right-handed polarization (positive helicity) and L left-handed polarization (negative helicity). For k in an arbitrary direction, the eigenstates u(R), u(L) with eigenvalues +1, -1 are obtained by rotating the spinors (11.5.4). The rotation is through an angle 1) = arccos about the axis defined by the vector (-ky, kx, 0). It causes the z axis to rotate into the k direction. According to (6.2.21) and (6.2.29c), the corresponding spinor transformation reads: M (11.5.6) Therefore, the helicity eigenstates of positive energy for a wave vector k are and Corresponding expressions are obtained for spinors with negative energy (Problem 11.4). *11.6 Zero-Mass Fermions (Neutrinos) 239 *11.6 Zero-Mass Fermions (Neutrinos) Neutrinos are spin-~ particles and were originally thought to be massless. There is now increasing experimental evidence that they possess a finite albeit very small mass. On neglecting this mass, which is valid for sufficiently high momenta, we may present the standard description by the Dirac equation having a zero mass term p1jJ = 0, (11.6.1) where PJ1- = iaJ1- is the momentum operator. In principle, one could obtain the solutions from the plane waves (6.3.lla,b) or the helicity eigenstates by taking the limit m ---+ 0 in the Dirac equation containing a mass term. One merely needs to split off the factor 1/..jiii and introduce a normalization different to (6.3.19a) and (6.3.19b), for example ur(k)'lus(k) = 2Eors (11.6.2) vr(k)'lvs(k) = 2Eors . However, it is also interesting to solve the massless Dirac equation directly and study its special properties. We note at the outset that in the representation based on the matrices a and (3 (5.3.1), for the case of zero mass, (3 does not appear. However, one could also realize three anticommuting matrices using the two-dimensional representation of the Pauli matrices, a fact that is also reflected in the structure of (11.6.1). In order to solve (11.6.1), we multiply the Dirac equation by "l,o = -i,1'l,3 . With the supplementary calculation '/'l'/ = -i'll'l'/ = -i'/'/'l'l = +i'l'l = (123 = 17 1 , ",,/'l'l = -i'l'l'l'l = i'l'l = (112 = 173 ,'/'-'/,'/ = "{5 , ( _pi 17i + PO"(5)'Ij; = 0 one obtains (11.6.3) Inserting into (11.6.3) plane waves with positive (negative) energy 1jJ(x) = e~ikx1jJ() = e~i(kOx-.)1jJ , (11.6.4) this yields ~. k1jJ(k) = k O,51jJ(k) . (11.6.5) From (11.6.1) it follows that p21jJ(x) = 0 and hence, k 2 = 0 or k O = E = Ikl for solutions of positive (negative) energy. With the unit vector k = k/lkl, Eq. (11.6.5) takes the form 240 11. Symmetries and Further Properties of the Dirac Equation E· k:t/J(k) = ±,'/'t/J(k) . (11.6.6) The matrix ,),5, which anticommutes with all ')'1-' , commutes with E and thus has joint eigenfunctions with the helicity operator E . k. The matrix ')'5 is also termed the chirality operator. Since (,),5)2 = 1, the eigenvalues of ')'5 are ±1, and since 'It ')'5 = 0, they are doubly degenerate. The solutions of Eq. (11.6.6) can thus be written in the form 't/J(x) = { e -ikx u± (k) e ikx v±(k) . 2 WIth 0 k = 0, k = Ikl > 0 , (11.6.7) where the u± (v±) are eigenstates of the chirality operator (11.6.8) The spinors u+ and v+ are said to have positive chirality (right-handed), and the spinors u_ and v_ to have negative chirality (left handed). Using the standard representation ')'5 = (~ ~ ), Eq. (11.6.8) yields (11.6.9) Inserting(11.6.9) into the Dirac equation (11.6.6), one obtains equations determining a± (k) : (11.6.10) Their solutions are (cf. Problem 11.7) (Si:O;~ ( ) -sin*e~) (11.6.11a) , cos ~ 2 (11.6.11b) where {) and <p are the polar angles of k. These solutions are consistent with the m --+ 0 limit of the helicity eigenstates found in (11.5.7). The negative energy solutions v±(k) can be obtained from the u±(k) by charge conjugation (Eqs. (11.3.7) and (11.3.8)): = i')' 2u:'(k) = -u+(k) v_(k) = Cur(k) = i')2u~(k = -u_(k) v+(k) = Cu~(k) (11.6.11c) (11.6.11d) i.e., in (11.6.9), b±(k) = -a±(k). It is interesting in this context to go from the standard representation of the Dirac matrices to the chiral representation, which is obtained by the transformation *11.6 Zero-Mass Fermions (Neutrinos) 'ljJch = Ut'ljJ ,ILch 241 (l1.6.12a) = UL,ILU 1 (1l.6.12b) 5 (l1.6.12c) U=J2(l+ I ). The result is (Problem 1l.8): ,och == (3ch = kch =I = I k _,5 = ( _ ~ -~ ) (l1.6.13a) (0 cr k ) _crk 0 (1l.6.13b) (l1.6.13c) (l1.6.13d) crch = ~ 2 [",Ch ",ch] 10,,, (1l.6.13e) crch = ~ 2 [",ch ",ch] I, , IJ (l1.6.13f) 0, 'J In the chiral representation, (l1.6.13e,f) are diagonal in the space ofbispinors, i.e., the upper components (1,2) and the lower components (3,4) of the spinor transform independently of one another under pure Lorentz transformations and under rotations (see (6.2.29b)). This means that the four-dimensional representation of the restricted Lorentz group is reducible to two twodimensional representations. More precisely, the representation 18 of the group SL(2,C) can be reduced to the two nonequivalent representations D(~'O) and D(O, ~). When the parity transformation P, which is given by P = ei'P,ochpo (see (6.2.32)), is present as a symmetry element, then the four-dimensional representation is no longer reducible, i.e., it is irreducible. In the chiml representation, the Dirac equation takes the form .ct (-iOo + icrkfA)'ljJ~h - m1/J~h (-iOo - icrka)'ljJ~h - m1/J~h =0 =0, (11.6.14) where we have set 'ljJch = (~:). Equations (1l.6.14) are identical to the equations (A. 7), but have been obtained in a different way. For m = 0 the two equations decouple and one obtains 18 The group SL(2,C) is homomorphic to the group £~ corresponding to the twovalued nature of the spinor representations. For useful group theoretical background we recommend V. Heine, Group Theory in Quantum Mechanics, Pergamon Press, Oxford (1960), and R.F. Streater and A.S. Wightman, peT, Spin Statistics and all that, Benjamin, Reading (1964). 242 11. Symmetries and Further Properties of the Dirac Equation (1l.6.15a) and (1l.6.15b) These are the two Weyl equations. A comparison of these with (5.3.1) shows that (11.6.15) contains a two-dimensional representation of the a matrices. As mentioned at the beginning of this section, when (3 is absent, the algebra of the Dirac a matrices {ai, aj} = 26ij can be realized by the three Pauli a i matrices. The two equations (11.6.15a,b) are not individually parity invariant and in the historical development were initially heeded no further. In fact, it has been known since the experiments of Wu et al. 19 that the weak interaction does not conserve parity. Since the chirality operator in the chiral representation is of the form X5 h = (~ _ ~ ) , 'lj;ch spinors of the form 'ljJ = ( 6 ) have positive chirality, whilst those of the form 'ljJ = ('Ij;~h) have negative chirality. Experimentally, it is found that only neutrinos of negative chirality exist. This means that the first of the two equations (11.6.15) is the one relevant to nature. The solutions of this equation are of the form 'ljJ~h(+) (x) = e-ik,xu(k) and 'ljJ~h(-)x = eik,xv(k) with ko > 0, where u and v are now two-component spinors The first state has positive energy and, as directly evident from (11.6.15a), negative helicity since the spin is antiparallel to k. We call this state the neutrino state and represent it pictorially by means of a left-handed screw (Fig. 11.8a). Of the solutions shown in (1l.6.9), this is u_(k). The momentum is represented by the straight arrow. E = ko = Ikl E= -ko = -Ikl E = ko = Ikl Fig. 11.8. (a) Neutrino state with negative helicity, (b) neutrino state with negative energy and positive helicity, (c) antineutrino with positive helicity The solution with negative energy 'ljJ~h(-) has momentum -k, and hence positive helicity; it is represented by a right-handed screw (Fig. 1l.8b). This 19 See references on p. 234. * 11.6 Zero-Mass Fermions (Neutrinos) 243 solution corresponds to v_(k) in Eq. (1l.6.9). In a hole-theoretical interpretation, the antineutrino is represented by an unoccupied state v_(k). It thus has opposite momentum (+k) and opposite spin, hence the he Ii city remains positive (Fig. 1l.Sc). Neutrinos have negative helicity, and their antiparticles, the antineutrinos, have positive helicity. For electrons and other massive particles, it would not be possible for only one particular helicity to occur. Even if only one helicity were initially present, one can reverse the spin in the rest frame of the electron, or, for unchanged spin, accelerate the electron in the opposite direction, in either case generating the opposite helicity. Since massless particles move with the velocity of light, they have no rest frame; for them the momentum k distinguishes a particular direction. Fig. 11.9. The effect of a parity transformation on a neutrino state Figure 1l.9 illustrates the effect of a parity transformation on a neutrino state. Since this transformation reverses the momentum whilst leaving the spin unchanged, it generates a state of positive energy with positive helicity. As has already been stated, these do not exist in nature. Although neutrinos have no charge, one can still subject them to charge conjugation. The charge conjugation operation C connects states of positive and negative chirality and changes the sign of the energy. Since only lefthanded neutrinos exist in nature, there is no invariance with respect to C. However, since the parity transformation P also connects the two types of solution (in the chiral representation ')'0 is nondiagonal), the Weyl equation is invariant under CPo In the chiral representation, C reads 0) = ( o ia2 C = (-ia2 Hence, the effect of C P is for chirality ')'5 = ± 1. -100 0) 010 0 00 1 0 000 -1 244 11. Symmetries and Further Properties of the Dirac Equation Problems 11.1 Show that equation (11.3.5) implies (11.3.5'). 11.2 In a Majorana representation of the Dirac equation, the "I matrices - indicated here by the subscript M for Majorana - are purely imaginary, "I~ * = -"I~ , J-t = 0, 1,2,3. A special Majorana representation is given by the unitary transformation "I~ = U "II' U t = U t = U- 1 = with U (a) Show that (n + "(2). ~"I0 1 = "I 3 = "I2 "I3 = "1M 2 "1M = -"I 2 = (00' 2 _0'2) "1M 0 2 "I 1 = (i0'30 i0'0) 3 (-iO'l 0 0. ) -10' 1 (b) In Eq. (11.3.14') it was shown that, in a Majorana representation, the charge conjugation transformation (apart from an arbitrary phase factor) has the form 'lj;ft = 'Ij;'M. Show that application of the transformation U to Eq. (11.3.7') 'lj;c =i"(2'1j; leads to 'lj;ft = -i'lj;M . 11.3 Show that, under a time-reversal operation the Dirac theory satisfies j'1'(x, t) T, the four-current-density jl' in = jl'(x, -t) . 11.4 Determine the eigenstates of helicity with negative energy: (a) as in (11.5.7) by applying a Lorentz transformation to (11.5.4); (b) by solving the eigenvalue equation for the helicity operator :E . k and taking the appropriate linear combination of the energy eigenstates (6.3.11b). 11.5 Show that :E. k commutes with ("(I'kl' ± m). 11.6 Prove the validity of Eq. (11.5.7). 11.7 Show that (11.6.11) satisfies the equation (11.6.10). 11.8 Prove the validity of (11.6.13). Bibliography for Part II H.A. Bethe and R. Jackiw, Intermediate Quantum Mechanics, Benjamin, London, 1968 J.D. Bjorken and S.D. Drell, Relativistic Quantum Mechanics, McGraw-Hill, New York, 1964 C. Itzykson and J.-B.Zuber, Quantum Field Theory, McGraw-Hill, New York, 1980 A. Messiah, Quantum Mechanics, Vol. II, North Holland, Amsterdam, 1964 J.J. Sakurai, Advanced Quantum Mechanics, Addison-Wesley, London, 1967 S.S. Schweber, An Introduction to Relativistic Quantum Field Theory, Harper & Row, New York, 1961 Part III Relativistic Fields 12. Quantization of Relativistic Fields This chapter is dedicated to relativistic quantum fields. We shall begin by investigating a system of coupled oscillators for which the quantization properties are known. The continuum limit of this oscillator system yields the equation of motion for a vibrating string in a harmonic potential. This is identical in form to the Klein-Gordon equation. The quantized equation of motion of the string and its generalization to three dimensions provides us with an example of a quantized field theory. The quantization rules that emerge here can also be applied to non-material fields. The fields and their conjugate momentum fields are subject to canonical commutation relations. One thus speaks of "canonical quantization". In order to generalize to arbitrary fields, we shall then study the properties of general classical relativistic fields. In particular, we will derive the conservation laws that follow from the symmetry properties (Noether's theorem). 12.1 Coupled Oscillators, the Linear Chain, Lattice Vibrations 12.1.1 Linear Chain of Coupled Oscillators 12.1.1.1 Diagonalization of the Hamiltonian We consider N particles of mass m with equilibrium positions that lie on a periodic linear chain separated by the lattice constant a. The displacements along the direction of the chain from the equilibrium positions an are denoted by ql, ... ,qN (Fig.12.1a), and the momenta by PI, ... ,PN. It is assumed that each particle is in a harmonic potential and, additionally, is harmonically coupled to its nearest neighbors (Fig.12.1b). The Hamiltonian then reads: ~ H = ~ 1 2 2m Pn mil2 + -2- (qn -qn-I) 2 mil5 2 + -2-qn· (12.1.1) n=1 Here, il 2 characterizes the strength of the harmonic coupling between nearest neighbors, and il5 the harmonic potential of the individual particles (see Fig.12.1b). Since we will eventually be interested in the limiting case of an 250 12. Quantization of Relativistic Fields •• • •• • • n-1 (a) n n+1 (b) Fig. 12.1. Linear chain: (a) displacement of the point masses (large dots) from their equilibrium positions (small dots); (b) potentials and interactions (represented schematically by springs) infinitely large system in which the boundary conditions play no part, we will choose periodic boundary conditions, i.e., qo = qN. The x coordinates Xn are represented as Xn = an + qn = na + qn and, from the commutation relations [Xn, Pm] = ii5nm , etc. (h = 1), we have for the canonical commutation relations of the qn and Pn (12.1.2) The Heisenberg representation, qn(t) = eiHt qn e- iHt , Pn(t) = eiHt Pn e- iHt , (12.1.3a) (12.1.3b) yields the two equations of motion . 1 qn(t) = - Pn(t) m (12.1.4a) Pn(t) (12.1.4b) and mqn(t) = mfP(qn+1 (t) = + qn-1 (t) - 2qn(t)) - mD5 qn(t) . On account of the periodic boundary conditions, we are dealing with a translationally invariant problem (invariant with respect to translations by a). The Hamiltonian can therefore be diagonalized by means of the transformation (Fourier sum) qn 1 = (mN)1/2 Pn = (~) """' ika L;: e n Q k 1/22: e- ikan P k . (12.1.5a) (12.1.5b) k The variables Qk and Pk are termed the normal coordinates and normal momenta, respectively. We now have to determine the possible values of k. To this end, we exploit the periodic boundary conditions which demand that qo = qN, i.e., 1 = eikaN ; hence, we have kaN = 21[£ and thus 12.1 Coupled Oscillators, the Linear Chain, Lattice Vibrations k = 27fC 251 (12.1.6) Na' 2: ± are equivalent to where C is an integer. The values k = 27r<fv!N) = 7;~ k = 7;~ since, for these k values, the phase factors e ikan are equal and, thus, so are the qn and Pn. The possible k values are therefore reduced to those given by: N N N-2 for even N - - <C <- for odd N --<-C- < - ' 2 2 2 - 2 ' N-1 N C=O,±l, ... '±-2-'2 N-1 N-1 C = O,±l, ... '±-2- In solid state physics, this reduced interval of k values is also known as the first Brillouin zone. The Fourier coefficients in (12.1.5) satisfy the following orthogonality and completeness relations: Orthogonality relation " ~ N L eikane-ik'an = L1(k - k') (12.1. 7a) n=l _ 1 for k - k - { ° , 27f =- a h, h integer otherwise. In this form, the orthogonality relation is valid for any value of k = 7;~. When k is restricted to values in the first Brillouin zone, the generalized Kronecker delta L1(k - k') becomes bkk' . Completeness relation,' (12.1.7b) Here, the summation variable k is restricted to the first Brillouin zone. (For a proof, see Problem 12.1). The inverse of (12.1.5) reads: Qk = .(Iff L e-ikanqn (12.1.8a) n (12.1.8b) 252 12. Quantization of Relativistic Fields Since the operators qn and Pn are hermitian, it follows that (12.1.9) Remark. When N is even, f appears. For k = = ,~ e i1m = (_l)n. i:a . 1f = 1f and ~ are equivalent and hence only we have Qk = Qk and P k = ~P , since ei;;'an 1f = The commutation relations for the normal coordinates and momenta are obtained from (12.1.2) with the result (12.1.10) Transforming (12.1.1) into normal coordinates according to (12.1.5a,b) yields the Hamiltonian in the form (12.1.11) r where the square of the vibration frequency as a function of k reads: wk = Q2 (2 sin k2a + Q5 (12.1.12) (Problem 12.3). The quantity (Qa)2 is known as the stiffness constant. Thus, in Fourier space, one obtains uncoupled oscillators with the frequency Wk = [It should be noted, however, that the terms in (12.1.11) are of the form QkQ-k etc., so that the oscillators with wave numbers k and -k are still interdependent.] The frequency is depicted as a function of k (dispersion relation) in Fig.12.2. In the language of lattice vibrations, Q o = 0 leads to acoustic, and finite Q o to optical, phonons. In order to diagonalize H in Eq.(12.1.11), one introduces creation and annihilation operators: JWI. no =0 7r a Fig. 12.2. The phonon frequencies for no =1= 0 and no = 0 12.1 Coupled Oscillators, the Linear Chain, Lattice Vibrations 253 = y'L k (Wk Qk + iP~) (12.1.13a) 1 ( t. ) at_ k - y' 2W k w k Q k - IPk (12.1.13b) ak The inverse of this transformation is given by Qk = + a~k y' 2W k ak (12.1.14a) and (12.1.14b) The commutation relations for the normal coordinates (12.1.10) lead to (Problem 12.5) [ak ,at,] = Ok,k" [ak,ak'] = [at, at,] = o. (12.1.15) By inserting (12 .1.14a, b) into (12.1.11), one obtains (12.1.16) a Hamiltonian for N uncoupled oscillators. The summation extends over all N wave numbers in the first Brillouin zone, since H L: Wk t t k - ak)(a_ k = "21 '"""' 2(a- ak) w~ (ak + a_k)(a t t + 2Wk k + a-k) t t 4: L...J Wk ( a-ka_t k + akak + ak a tk + a_ka-k -_1,""", k -a-kak - al~k = t + aka-k + a~kl) "21,""", L...Jwk(akak +ak a tk ) = k '"""' L...JWk (t akak +"21) (12.1.17) k The energy eigenstates and eigenvalues for the individual oscillators are known. The ground-state energy of the oscillator with the wave vector k is ~Wk. The nth excited state of the oscillator with wave vector k is obtained by the n-fold application of the operator at, having energy (nk + ~) Wk. The fact that the eigenvalues of the Hamiltonian are, up to the zero-point energy, integer multiples of the eigenfrequencies leads quite naturally to a particle interpretation, although we are dealing here not with material particles but rather with excited states (quasiparticles). In the case of the elastic chain considered here, these quanta are known as phonons. The occupation numbers are 0, 1, 2, ... , hence the quanta are bosons. The operator creates a phonon with wave vector k and frequency (energy) Wk, whilst ak annihilates a phonon with wave vector k and frequency (energy) Wk. at 254 12. Quantization of Relativistic Fields Hence, the eigenstates of the Hamiltonian (12.1.17) are of the following form: In the ground state 10) , which is determined by the equation ak 10) 0, for allk, = (12.1.18a) no phonons are present. Its energy represents the zero-point energy (12.1.18b) (12.1.19a) with energy E = L nkWk + Eo . (12.1.19b) k The occupation numbers take the values nk = 0, 1,2, ... and k runs through the N values of the first Brillouin zone; the nk are not bounded from above. The operator nk = atak is the occupation number operator for phonons with the wave vector k. From (12.1.19c) it follows that ak i I· .. at I··· ,nkp' .. ) = Jnk; I· .. ,nkp"') = Jnki ,nk i - 1, ... ) , + 11··· ,nki + 1, ... ) (12.1.19d) Remark. Let us emphasize that the commutation relations (12.1.2) and (12.1.15) are valid even when nonlinear terms are present in the Hamiltonian, since they are a consequence of the general canonical commutation relations of position and momentum operators. 12.1.1.2 Dynamics Equation (12.1.16) expresses the Hamiltonian of the linear chain in diagonal form. In fact, H is time independent, so that its various representations (12.1.1), (12.1.11), and (12.1.16) are valid at all times. The essential features of the dynamics are most readily described in the Heisenberg picture. Starting from 12.1 Coupled Oscillators, the Linear Chain, Lattice Vibrations qn = L eikanQk _1_ = = "" 7 1 .J2WkmN (eikana L + a~k) _l_eikan (a k .JmN k .J2Wk _1_ .JmN k k + e-ikanat) k 255 (12.1.20) , we define the Heisenberg operator qn(t) = eiHt qn(O) e- iHt = eiHt qn e- iHt . (12.1.21) By solving the equation of motion, or by using eiHt ak e -iHt 1 [·Ht [·Ht , ak 11 + ... = ak + [·Ht 1 ,ak1+ I" 1 ,1 2. = ak + [iwkt alak' akl + ~2. [iHt, [iHt, ak]] + ... 1 = ak - iWktak + 2! [iwktalak, -iwktakl + ... = ak (1 - iWkt + ~! (-iwkt)2 (12.1.22) + ... ) = ake-iwkt , one obtains for the time dependence of the displacements qn(t) = "" 7 1 (ei(kan-wkt)a .J2WkmN k + e-i(kan-wkt)at) k (12.1.23) Concerning its structure, this solution is identical to the classical solution, although the amplitudes are now the annihilation and creation operators. We will discuss the significance of this solution only in the context of the continuum limit, which we shall now proceed to introduce. 12.1.2 Continuum Limit, Vibrating String Here, we shall treat the continuum limit for the vibrating string. In this limit the lattice constant becomes a -+ 0 and the number of oscillators N -+ 00 , whilst the length of the string L = aN remains finite (Fig. 12.3). m -- • • • a • Fig. 12.3. Concerning the continuum limit of the linear chain (see text) The density p = ~ and stiffness constant v 2 = (na)2 must also remain constant. The positions of the lattice points x = na are then continuously distributed. We also introduce the definitions 256 12. Quantization of Relativistic Fields m)I/2 q(X) = qn ( -;; (12.1.24a) p(X) = Pn(ma)-1/2 . (12.1.24b) The equation of motion (12.1.4b) ijn = D2(qn+l + qn-l - 2qn) - D5 qn becomes "( ) _ n2 q x, t - J& a (q(x 2 + a, t) - q(x, t)) - (q(x, t) - q(x - a, t)) a 2 (12.1.25) - D5q(x, t) and, in the limit a -+ 0, one has ::2 ij(x, t) - v 2 q(x, t) + D5 q(x, t) = (12.1.26) 0. The form of this equation is identical to that of the one-dimensional KleinGordon equation. For Do = 0, i.e., in the absence of a harmonic potential, Eq. (12.1.26) is the equation of motion for a vibrating string, as is known from classical mechanics. In the continuum limit, the Hamiltonian (12.1.1) takes the form .m L = a--+O,hN --+00 H (1 Pn2+ -2-(qn mD2 2 mD5 2) - qn-l) + -2-qn n -2 m (1 · L a - - P2 + -mD2 mD5 11m = a--+O,N--+oo - a 2 (qn - qn_l)2 + - q2) 2ma n 2a a 2a n n ~ ! L <Ix ~ [P(X)' + v' (::)' + {Ii q(x)'1' (12.1,27) L where En a ... -+ J dx .... The commutators of the displacements and the o momenta are obtained from (12.1.2) and (12.1.24a,b): [q(x),p(x')] = lim a--+O,N --+00 . 11m a--+O,N --+00 ( -m)I/2 (ma)-1/2 [qn,Pn'] a .6nn ,=.lu'( X 1-- a (12.1.28a) - X ') and [q(x), q(X')] = [P(X),p(X')] = 0 . (12.1.28b) Next, we will derive the representation in terms of normal coordinates. From (12.1.6), it follows that 12.1 Coupled Oscillators, the Linear Chain, Lattice Vibrations 2~£, k = where £ is an integer with - 00 :S £ :S 257 (12.1.29) 00 . For a string of finite length, the Fourier space remains discrete in the continuum limit, although the number of wave vectors and thus normal coordinates is now infinite. From (12.1.5a,b) we have 1 ""' q ( x ) = £1/2 ~e ikxQ (12.1.30a) k k P(X ) = 1 £1/2 ""' ~e -ikxp k (12.1.30b) k (12.1.31 ) whereby, in the limit a ---+ 0, equation (12.1.12) reduces to w~ = v 2 k 2 + n5 . (12.1.32) The commutation relations for the normal coordinates (12.1.10) remain unchanged: (12.1.33) The transformation to creation and annihilation operators (12.1.14a,b), and also the expression for the Hamiltonian in terms of these quantities (12.1.16) remain correspondingly unchanged. The representation of the displacement field in terms of creation and annihilation operators now takes the form q(x) = _1_ ""' eikx a k £1/ 2 7 t + a_ k .J2Wk = _1_ ""' (eikXa £1/2 ~ k k + e-ikxa t ) k _1_ (12.1.34) I2W-=- V.Gwk and, from (12.1.23), its time dependence is given by q(x t) , = _1_ ""' (ei(kX-wkt)a k £1/2 ~ k + e-i(kX-wkt)at) _1_ k I2W-=V.Gwk . (12.1.35) We finally obtain for the Hamiltonian H = ~ Wk (ata k + ~) , (12.1.36) which is positive definite. The functions ei(kx-wkt) and e-i(kx-wk t ) appearing in (12.1.35) are solutions of the free field equation (12.1.26), which, in connection with the Klein-Gordon equation, we had interpreted as solutions with 258 12. Quantization of Relativistic Fields positive and negative energy. In the quantized theory, these solutions appear as amplitude functions, prefactors of the annihilation and creation operators in the expansion of the field operators. The sign of the frequency dependence is of no significance for the value of the energy. This is determined by the Hamiltonian (12.1.36), which is positive definite: there are no states of negative energy. The direct analogy to the vibrating string relates to the real Klein-Gordon field. The complex field will be treated in Eq. (12.1.47a,b) and Sect. 13.2. 12.1.3 Generalization to Three Dimensions, Relationship to the Klein-Gordon Field 12.1.3.1 Generalization to three dimensions It is now straightforward to generalize the above results to three dimensions. We consider a discrete three-dimensional cubic lattice. Rather than taking an elastic lattice, which would have three-dimensional displacement vectors, we shall assume instead that the displacements are only along one dimension (scalar). In the continuum limit, the one-dimensional coordinate x must be replaced by the three-dimensional vector x x -+x, and the field equation for the displacement q(x, t) reads: ij(x, t) - v 2 Llq(x, t) + ng q(x, t) = (12.1.37) 0. Introducing the substitutions v -+ c, (x, t) == x, and q(x, t) -+ ¢(x) , (12.1.38) we obtain (12.1.39) which is precisely the Klein-Gordon equation (5.2.11'). The representation of the solution of the Klein-Gordon equation in terms of annihilation and creation operators (12.1.35), the commutation relations (12.1.15), (12.1.28), and the Hamiltonian (12.1.36) can all be directly translated into three dimensions: (12.1.40) (12.1.41a) 12.1 Coupled Oscillators, the Linear Chain, Lattice Vibrations [¢(x, t), ¢(x', t)] = i8(3)(x - x'), [¢(x, t), ¢(x', t)] = [¢(x, t), ¢(x', t)] = 0, 259 (12.1.41b) and H = ~Wk . (ata +~) k (12.1.42) Inspired by these mechanical analogies, we arrive at a completely new interpretation of the Klein-Gordon equation. Previously, in Sect. 5.2, an attempt was made to use the Klein-Gordon equation as a relativistic replacement for the Schrodinger equation and to interpret its solutions as probability amplitudes in the same way as for the Schrodinger wave functions in coordinate space. However, ¢(x, t) is not a wave function but an operator in Fock space. This field operator is represented as a superposition of single-particle solutions of the Klein-Gordon equation with amplitudes that are themselves operators. The effect of these operators is to create and annihilate the quanta (elementary particles) that are described by the field. The term Fock space describes the state space spanned by the multi-boson states (12.1.43a) where 10) is the ground state (= vacuum state) of the field. The energy of this state is E = L mvk ( nk + ~) . (12.1.43b) k In equation (12.1.40) the field operator was split into positive and negative frequency parts, ¢+(x) and ¢-(x). This notation originates from the positive and negative energy solutions. Due to the hermiticity of the field operator ¢(x), we have ¢+t = ¢-, and in the expansion (12.1.40) we encounter the sum of ak and This hermitian (real) Klein-Gordon field describes uncharged mesons, as our subsequent investigations will reveal. at. 12.1.3.2 The infinite-volume limit Until now, we have based our studies on a finite volume with linear extension L. In order to formulate relativistically invariant theories, it is necessary to include all of space. We thus take the limit L --+ 00. In this limit, the previously discrete values of k move arbitrarily close together, such that k too becomes a continuous variable. The sums over k are replaced by integrals according to L k (27f)3 ... --+ L J d 3 \ (27f) ... 260 12. Quantization of Relativistic Fields Using the definition (12.1.44) one obtains the field operator from (12.1.39) as -00 (12.1.45) where the k integration extends in all three spatial dimensions from -00 to +00. The commutation relations for the creation and annihilation operators now read: [a(k), at(k')] r = Okk' (2~ = o(k - k'), (12.1.46) [a(k), a(k')] = 0 ,[at(k), at(k')] = 0 . Proof: The complex Klein-Gordon field is not hermitian and therefore the expansion coefficients (operators) of the solutions with positive and negative frequency are independent of one another 1 " ¢ ( x, t ) -_ L3/2 ~ k ~ 1 y2Wk (-ik.X e ak t)· + eik.Xbk (12.1.47a) Here, k· x = Wkt - k· x is the scalar product of four-vectors. The operators and bk have the following significance: ak a k (aD bk (bt) annihilates (creates) a particle with momentum k and annihilates (creates) an antiparticle with momentum k and opposite charge, as will be discussed more fully in subsequent sections. From (12.1.47a), one obtains the hermitian conjugate of the field operator as ¢t(x t) = _1_" _1_ (e-ik.Xb k , L3/2 ~ k y~wk '2w:- t) + eik.Xak · (12.1.47b) 12.2 Classical Field Theory 261 12.2 Classical Field Theory 12.2.1 Lagrangian and Euler-Lagrange Equations of Motion 12.2.1.1 Definitions In this section we shall study the basic properties of classical (and, in the main, relativistic) field theories. We consider a system described by fields cPr(x), where the index r is a number which labels the fields. It can refer to the components of a single field, e.g., the radiation field A""(x) or the fourspinor 'l/J(x), but it can also serve to enumerate the different fields. To begin with, we define a number of terms and concepts. We assume the existence of a Lagrangian density that depends on the fields cPr and their derivatives cPr,,.,, == O,."cPr == ()~,. cPr. The Lagrangian density is denoted by (12.2.1) The Lagrangian is then defined as (12.2.2) The significance of the Lagrangian in field theory is completely analogous to that in point mechanics. The form of the Lagrangian for various fields will be elucidated in the following sections. We also define the action S(J!) = J d4 x £(cPr, cPr,,.,,) = J dxO L(xO) , (12.2.3) n where d4 x = dxo d3 x == dxo dx l dx 2 dx 3 . The integration extends over a region J! in the four-dimensional space-time, which will usually be infinite. We shall use the same notation as in Part II on relativistic wave equations, where we set the speed of light c = 1 and, thus, XO = t. 12.2.1.2 Hamilton's principle in point mechanics As has already been mentioned, the definitions and the procedure needed here are analogous to those of point mechanics with n degrees of freedom. We briefly remind the reader of the latter l , 2. The Lagrangian of a system of particles with n degrees of freedom with generalized coordinates qi, i = 1, ... ,n has the form: 1 2 H. Goldstein, Classical Mechanics, 2nd ed., Addison-Wesley, Reading, Mass., 1980 L.D. Landau and E.M. Lifshitz, Course of Theoretical Physics, Vol. 1, Pergamon, Oxford, 1960 262 12. Quantization of Relativistic Fields (12.2.4) The first term is the kinetic energy, and the second the negative potential energy due to the interactions between the particles and any external conservative forces. The action is defined by I t2 8 = dtL(t) . (12.2.5) tl The equations of motion of such a classical system follow from Hamilton's action principle. This states that the actual trajectory qi (t) of the system is such that the action (12.2.5) is stationary, i.e., 88=0, (12.2.6) where variations in the trajectory qi(t) + 8qi(t) between the initial and final times tl and t2 are restricted by (see Fig.12.4) (12.2.7) Fig. 12.4. Variation of the solution in the time interval between tl and t2. Here, q(t) stands for {qi(t)} The condition that the action is stationary for the actual trajectory implies (12.2.8) The second term on the last line vanishes since, according to (12.2.7), the variation 8q(t) must be zero at the endpoints. In order for 88 to vanish for all 8qi(t), we have the condition 12.2 Classical Field Theory aL d aL _ 0 aqi(t) - dt aili(t) - , i = 1, ... ,n. 263 (12.2.9) These are the Euler-Lagrange equations of motion, which are equivalent to Hamilton's equations of motion. We now proceed to extend these concepts to fields. 12.2.1.3 Hamilton's principle in field theory In field theory the index i is replaced by the continuous variable x. The equations of motion (= field equations) are obtained from the variational principle 08=0. (12.2.10) To this end, we consider variations of the fields (12.2.11) which are required to vanish on the surface r(n) of the space-time region n: 0<Pr(X) = 0 on r(n) . (12.2.12) In analogy to (12.2.9), we now calculate the change in the action (12.2.3) Here, we employ the summation convention for the repeated indices rand 11 and have also used 3 0<Pr,1-' = a~" o<pr. The last term in Eq.(12.2.13) can be re-expressed using Gauss's theorem as the surface integral J a.c dal-'~o<Pr (12.2.14) =0, o/r,p, r(n) where da I-' is the 11 component of the element of surface area. The condition that 08 in Eq.(12.1.13) vanishes for arbitrary nand o<pr yields the EulerLagrange equations of field theory a.c a a.c _ 8¢;. - axl-' a<Pr,1-' 3 OcPr(X) = cP~(X) °, r = 1,2, ... - cPr(X) and thus a~" OcPr(x) (12.2.15) = cP~,/L(x) - cPr,/L(X) = OcPr,/L(X) . 264 12. Quantization of Relativistic Fields Remark. So far we have considered the case of real fields. Complex fields can be treated as two real fields, for the real and imaginary parts. It is easy to see that this is equivalent to viewing ¢(x) and ¢*(x) as independent fields. In this sense, the variational principle and the Euler-Lagrange equations also hold for complex fields. We now introduce two further definitions in analogy to point particles in mechanics. The momentum field conjugate to ¢r(x) is defined by 7f r 8L 81:. (x) = -.- = - .. 8¢r(x) 8¢r(x) (12.2.16) The definition of the Hamiltonian reads: (12.2.17) where the ¢r have to be expressed in terms of the The Hamiltonian density is defined by 7fr . (12.2.18) The Hamiltonian can be expressed in terms of the Hamiltonian density as (12.2.19) The integral extends over all space. H is time independent since I:. does not depend explicitly on time. 12.2.1.4 Example: A real scalar field To illustrate the concepts introduced above, we consider the example of a real scalar field ¢( x). For the Lagrangian density we take the lowest powers of the field and its derivatives that are invariant under Lorentz transformations (12.2.20) where m is a constant. The derivatives of I:. with respect to ¢ and ¢,fJ- are 81:. 2 8¢ = -m ¢, from which one obtains for the Euler-Lagrange equation (12.2.15) (12.2.21) or, in the form previously employed, 12.2 Classical Field Theory 265 (12.2.21') Thus, Eq. (12.2.20) is the Lagrangian density for the Klein-Gordon equation. The conjugate momentum for this field theory is, according to (12.2.16), n(x) = ¢(x) , (12.2.22) and, from (12.2.18), the Hamiltonian density reads: H(x) 1 = "2 [n 2(x) + (V¢)2 + m2¢2(x)] . (12.2.23) If we had included higher powers of ¢2 in (12.2.20), for example ¢4, the equation of motion (12.2.21') would have contained additional nonlinear terms. Remarks on the structure of the Lagrangian density (i) The Lagrangian density may only depend on ¢r(x) and ¢r,p,(x); higher derivatives would lead to differential equations of higher than second order. The Lagrangian density can depend on x only via the fields. An additional explicit dependence on x would violate the relativistic invariance. (ii) The theory must be local, i.e., £(x) is determined by ¢r(x) and ¢r,p,(X) at the position x. Integrals over £(x) would imply nonlocal terms and could lead to acausal behavior. (iii) The Lagrangian density £ is not uniquely determined by the action, nor even by the equations of motion. Lagrangian densities that differ from one another by a four-divergence are physically equivalent £'(x) = £(x) + 8"F"(x) . (12.2.24) The additional term here leads in the action to a surface integral over the three-dimensional boundary of the four-dimensional integration region. Since the variation of the field vanishes on the surface, this can make no contribution to the equation of motion. (iv) £ should be real (in quantum mechanics, hermitian) or, in view of remark (iii), equivalent to a real £. This ensures that the equations of motion and the Hamiltonian, when expressed in terms of real fields, are themselves real. £ must be relativistically invariant, i.e., under an inhomogeneous Lorentz transformation x -+ x' Ax + a = (12.2.25) £ must behave as a scalar: £(¢~x'), ¢~,p(x') = £(¢r(x), ¢r,p,(x)) . (12.2.26) Since d4 x = dx odx 1 dx 2dx 3 is also invariant, the action is unchanged under the Lorentz transformation (12.2.25) and the equations of motion have the same form in both coordinate systems and are thus covariant. 266 12. Quantization of Relativistic Fields 12.3 Canonical Quantization Our next task is to quantize the field theory introduced in the previous section. We will allow ourselves to be guided in this by the results of the mechanical elastic continuum model (Sect. 12.1.3) and postulate the following commutation relations for the fields ¢r and the momentum fields 7rr: = ic5rs c5(x - x') , [¢r(x,t),¢s(x',t)] = [7rr (x,t),7rs (x',t)] = O. [¢r(x, t), 7rs (x', t)] (12.3.1 ) These are known as the canonical commutation relations and one speaks of canonical quantization. For the real Klein-Gordon field, where according to (12.2.22) 7r(x) = ¢(x), this also implies [¢(x, t), ¢(x', t)] = ic5(x - x') , [¢(x,t),¢(x',t)] = [¢(x,t),¢(x',t)] = o. (12.3.2) In view of the general validity of (12.1.28) and (12.1.41b), one postulates also the canonical commutation relations for interacting fields. 12.4 Symmetries and Conservation Laws, Noether's Theorem 12.4.1 The Energy-Momentum Tensor, Continuity Equations, and Conservation Laws The invariance of a system under continuous symmetry transformations leads to continuity equations and conservation laws. The derivation of these conservation laws from the invariance of the Lagrangian density is known as Noether's theorem (see below). Continuity equations can also be derived in an elementary fashion from the equations of motion. This will be illustrated for the case of the energymomentum tensor, which is defined by (12.4.1) The energy-momentum tensor obeys the continuity equation 4 (12.4.2) Proof. Differentiation of TJ.LV yields: 4 In the next section, we shall derive this continuity equation from space-time translational invariance, whence, in analogy to classical mechanics, the term energy-momentum tensor will find its justification. 12.4 Symmetries and Conservation Laws, Noether's Theorem TI-'II I-' = (aa aa.c ) ¢r,1I xl-' ¢r,1-' , = a.c II a.c aA. 'Pr' + ~'Pr A. a.c ¢r'~ a¢r,1-' + A. II I-' - o/T,J..L o/r - a".c a" r J.w = 267 (12.4.3) 0, where we have used the Euler-Lagrange equation (12.2.15) and a".c a" ¢r + a~/L a" ¢r,1-' to obtain the second identity. If a four-vector gl-' satisfies a continuity equation gi g~1-' = 0 , (12.4.4) then, assuming that the fields on which gl-' depends vanish rapidly enough at infinity, this leads to the conservation of the space integral of its zero component (12.4.5) Proof. The continuity equation, together with the generalized Gauss divergence theorem, leads to (12.4.6) This holds for every four-dimensional region fl with surface (J. One now chooses an integration region whose boundary in the spatial directions extends to infinity. In the time direction, it is bounded by two three-dimensional surfaces (Jl(XO = tl) and (J2(XO = t2) (Fig.12.5). In the spatial directions, ¢r and ¢r,1-' are zero at infinity: 0= f 0"1 d 3x gO - f d 3x gO = f d 3x gO(x, h) - f d 3x gO(x, t2) 0'"2 thus, -~r'Ul Fig. 12.5. Diagram relating to the derivation of the conservation law (see text) 268 12. Quantization of Relativistic Fields (12.4.7a) or, alternatively, (12.4.7b) Applying this result to the continuity equation for the energy-momentum tensor (12.4.1) leads to the conservation of the energy-momentum four-vector (12.4.8) The components of the energy-momentum vector are (12.4.9) and Pj -- fd 3 X 7fr ()8<Pr X £1 j = 1,2,3. (12.4.10) UXj The zero component is equal to the Hamiltonian (operator), and the spatial components represent the momentum operator of the field. 12.4.2 Derivation from Noether's Theorem of the Conservation Laws for Four-Momentum, Angular Momentum, and Charge 12.4.2.1 Noether's theorem Noether's theorem states that every continuous transformation that leaves the action unchanged leads to a conservation law. For instance, the conservation offour-momentum and of angular momentum follows from the invariance of the Lagrangian density £. under translations and rotations, respectively. Since these form continuous symmetry groups, it is sufficient to consider infinitesimal transformations. We therefore consider the infinitesimal Lorentz transformation XJ.L -+ x~ <Pr(x) -+ <p~(x') = xJ.L + bxJ.L = xJ.L + ..1wJ.Ll/xl/ + bJ.L = <Pr(x) +~ ..1wJ.L1/ S~: <Ps(x) . (12.4.11a) (12.4.11b) Here x and x' represent the same point in space time referred to the two frames of reference, and <Pr and <p~ are the field components referred to these coordinate systems. The quantities which appear in these equations should be understood as follows: The constant bJ.L causes an infinitesimal displace- 12.4 Symmetries and Conservation Laws, Noether's Theorem 269 ment. The homogeneous part of the Lorentz transformation is given by the infinitesimal antisymmetric tensor LlwJLv = -Llw vw The coefficients Sf:: in the transformation (12.4.11b) of the fields are antisymmetric in /1 and v and are determined by the transformation properties of the fields. For example, in the case of spinors (Eqs.(6.2.13) and (6.2.17)), we have ~2 A oWwJLV SJLV,J" _ rs 'i"s - _ ~ 4 oWwJLV A JLV ,J" O'rs 'i"s , (12.4.12a) i.e., (12.4.12b) where rand s( = 1, ... ,4) label the four components of the spinor field. Vector fields transform under a Lorentz transformation according to Eq. (11.1.3a) and thus we have S~: = g~ g~ - g~ g~ (12.4.12c) , where the indices r, s take the values 0, 1, 2, 3. In Eqs. (12.4.12a,b) summation over the repeated indices /1, v, and s is implied. As has already been emphasized, the invariance under the transformation (12.4.11a,b) means that the Lagrangian density has the same functional form in the new coordinates and fields as it did in the original ones: (12.4.13) From Eq. (12.4.13), the covariance of the equations of motion follows. The variation of ePr (X), for unchanged argument, is defined by (12.4.14) Furthermore, we define the total variation (12.4.15) which represents the change due to the form and the argument of the function. These two quantities are related by LlePr(x) = (eP~x') - ePr(X')) + (ePr(x') - ePr(X)) = JePr(X') + 88 ePr Jxv + 0(J 2 ) XV = JePr(X) (12.4.16) 8ePr 2 + -8 Jxv + O(J ) , Xv where 0(J 2 ) stands for terms of second order, which we neglect. In correspondence with Eq.(12.4.16), the difference between the Lagrangian densities in the coordinate systems] and I', i.e., the total variation of the Lagrangian density - which vanishes according to (12.4.13) - can be rewritten as 270 12. Quantization of Relativistic Fields o= £(<p~X'), <P~,I'(X) = £(<p~X'), o£ + ~oxl' = - £(<Pr(X), <Pr,I'(X)) ... ) - £(<Pr(X'), ... ) + (£(<Pr(X'), ... ) - £(<Pr(X), ... )) 8£ uxl' + 0(02) . (12.4.17) The first term on the right-hand side of (12.4.17) is obtained as 8£ o£ = 8A. o<pr o/r + ~O<Pr,1' 8£ o/r,p. = 8£ (8 8£) 8<pr o<pr - 8xI' 8<pr,1' o<pr = 8=1' {8~1' [L1<Pr - ~!: 8 + 8xI' (8£ ) 8<Pr,1' o<Pr OXY]} , where the Euler-Lagrange equation was used to obtain the second line and Eq.(12.4.16) to perform the last step. Together with >:>8 (£gl-'Y oX y uxl' ), ~£ uxl-' oxl-' = >:>8 (£oxl') = uxl-' Eq. (12.4.17) leads to the continuity equation gl-',I-' = 0 (12.4.18a) for the four-vector 8£ L1A. _ TI-'Y ox gl-' =- 8A. ~r Y (12.4.18b) • o/r,p. Here, gl-' depends on the variations L1<pr and oX y , and, according to the choice made, results in different conservation laws. Equations (12.4.18a) and (12.4.18b), which lead to the conserved quantities (12.4.5), amongst others, represent the general statement of Noether's theorem. 12.4.2.2 Application to Translational, Rotational, and Gauge Invariance We now analyze the result of the previous section for three important special cases. (i) Pure translations: For translations we have (12.4.19a) and, hence, (12.4.11b) gives <p~(x') L1<pr = O. <Pr(X); therefore, (12.4.19b) 12.4 Symmetries and Conservation Laws, Noether's Theorem 271 Noether's theorem then reduces to the statement g/-L = - T/-LV 8v , and since the four displacements 8v are independent of one another, one obtains the four continuity equations T/-LV ,/-L = 0 (12.2.31 ) for the energy-momentum tensor T/-LV ,v = 0,1,2,3, defined in (12.4.3). For v == 0, one obtains the continuity equation for the four-momentum-density p/-L = T0/-L, and for v = i that for the quantities Ti/-L. The conservation laws Ti/-L ,/-L = 0 contain as zero components, the spatial momentum densities pi and as current densities, the components of the so-called stress tensor Tij. (See also the discussion that follows Eq.(12.4.7b).) (ii) For rotations we have, according to (12.4.11a,b), (12.4.20a) and Ll</>r ~LlWvITS: = (12.4.20b) </>s . From (12.4.18b), it then follows that /-L - 1 a.c 9 = "2~WVIT SVIT "- It rs 'l's - T/-LV o/r,p, ~WVITX. IT It (12.4.21) Using the definition M/-LVIT = a.c SVIT "- (x) a"rs'l-'s + (xVT/-L IT _ xITT/-L V) 'Pr,J-t (12.4.22) , equation (12.4.21) can be re-expressed in the form 9 /-L _ 1 a.c SVIT "- -"2 ~ rs 'l-s~WVIT It o/r,p. - "21 T/-LV ~WVITX It IT -"21 T/-LIT ~WITVX = ~ 2 ( a"-a.c SVIT "- + xVT/-LIT _ XITT/-L V) Llw VIT rs'l-'s 'f'r,/-L -- ~M/-LVIT 2 It V (12.4.20') Itw VIT· ~ Since the six nonvanishing elements of the antisymmetric matrix LlwvIT are independent of one another, it follows that the quantities M/-LVIT satisfy the six continuity equations a/-LM/-LVIT =0. (12.4.23) This yields the six quantities M VIT = = J J d3x MO vIT d3x (7rAx)S~:</>s+vTOI-V (12.4.24) 272 12. Quantization of Relativistic Fields For the spatial components, one obtains the angular-momentum operator (12.4.25) Here, the angular-momentum vector ([1,12,13) == (M 23 , M31, M12) is conserved. The sum of the second and third terms in the integral represents the vector product of the coordinate vector with the spatial momentum density and can thus be considered as the angular momentum of the field. The first term can be interpreted as intrinsic angular momentum or spin (see below (13.3.13') and (E.31c)). The space-time components (Oi) MOi = J d 3x MOOi can be combined into the three-component boost vector (boost generator) K = (MOl, M02, M03) . (12.4.26) (iii) Gauge transformations (gauge transformation of the first kind). As a final application of Noether's theorem we consider the consequences of gauge invariance . Assuming that the Lagrangian density contains a subset of fields ¢r and ¢t only in combinations of the type #(x)¢r(x) and ¢t,{I(x)¢,,:{I(x), then it is invariant with respect to gauge transformations of the first kind. These are defined by ¢r(x) --t ¢~(x) eic¢r(x) ~ = (1 + ie)¢r(x) ¢t(x) --t ¢t' (x) = e-ic¢t(x) ~ (1- ie)¢t(x) , (12.4.27) where f is an arbitrary real number. The coordinates are not transformed and hence, according to Eq. (12.4.14), 8¢r(x) = ie ¢r(X) (12.4.28) 8¢t(x) = -ie ¢t(x) and [ef. (12.4.16)] L1¢r(x) = 8¢r(x) , L1¢t(x) = 8¢t(x) . (12.4.29) The four-current-density follows from Noether's theorem (12.4.18b) as {I g ex: ~ Be.,J.. le'f'r 'f'r,{1 Be ( . ),J..t + --t-110 'f'r' B¢r,{1 i.e., B~ gJ.L(x) = i ( Be ¢r ¢t) B¢r,J.L B¢r,J.L gO(x) = i (7f r (X)¢r(x) - 7ft(x)¢t(x)) satisfies a continuity equation. This implies that (12.4.30) 12.4 Symmetries and Conservation Laws, Noether's Theorem 273 (12.4.31) is conserved. Thus, in quantized form, dQ dt = 0, [Q, H] = (12.4.32) 0. The quantity q will turn out to be the charge. We can already see this by calculating the commutator of Q and CPr with the commutation relations (12.3.1) : [Q, CPr (X)] = -iq J d3 x' ~ CPs(X') = -qCPr(X) . (12.4.33) -ibsrb(x' - x) If IQ') is an eigenstate of Q, Q IQ') = Q'IQ') , then CPr(x) IQ') cpt(x) IQ') (12.4.34) is an eigenstate with the eigenvalue Q' - q and is an eigenstate with the eigenvalue Q' + q, as follows from (12.4.33): (QCPr(X) - CPr(x)Q) IQ') = -qCPr(X) IQ') QCPr(x) IQ') - CPr(x)Q' IQ') = -qCPr(x) IQ') QCPr(x) IQ') = (Q' - q)CPr(x) IQ') . (12.4.35) Hence, by using complex, i.e., nonhermitian, fields, one can represent charged particles. The conservation of charge is a consequence of the invariance under gauge transformations of the first kind (i.e., ones in which the phase is independent of x). In theories in which the field is coupled to a gauge field, one can also have gauge transformations of the second kind 'ljJ ---+ 'ljJ' = 'ljJeia(x) , AM ---+ Alit = AM + ~aM(x). 12.4.2.3 Generators of Symmetry Transformations in Quantum Mechanics We assume that the Hamiltonian H is time independent and consider constants of the motion that do not depend explicitly on time. The Heisenberg equations of motion dA(t) = i[H, A(t)] dt (12.4.36) imply that such constants of the motion commute with H [H,A] =0. (12.4.37) 274 12. Quantization of Relativistic Fields Symmetry transformations can in general be represented by unitary, or, in the case of time reversal, by ant iunit ary, transformations l . In the case of a continuous symmetry group, every element of which is continuously connected with the identity, e.g., rotations, the transformations are represented by unitary operators. This means that the states and operators transform as 1'Ij;) -t 1'Ij;') = U 1'Ij;) (12.4.38a) A-tA' = UAUt. (12.4.38b) and The unitarity guarantees that transition amplitudes and matrix elements of operators remain invariant, and that operator equations are covariant, i.e., the equations of motion and the commutation relations have the same form, regardless of whether they are expressed in the original or in the transformed operators. For a continuous transformation, we can represent the unitary operator in the form (12.4.39) where Tt = T and a is a real continuous parameter. The hermitian operator T is called the generator of the transformation. For a = 0, we have U(a = 0) = 1. For an infinitesimal transformation (a -t 8a), it is possible to expand U as (12.4.39') and the transformation rule for an operator A has the form A' = A + 8A = and thus 8A = (1 + i8aT)A(1- i8aT) i 8a [T, A] . + O(8a 2 ) (12.4.37b') When the physical system remains invariant under the transformation considered, then the Hamiltonian must remain invariant, 8H = 0, and from (12.4.37b') it follows that [T,H] =0. (12.4.40) Since T commutes with H, the generator of the symmetry transformation is a constant of the motion. Conversely, every conserved quantity CO generates a symmetry transformation through the unitary operator (12.4.41 ) 1 E.P. Wigner, Group Theory and its Application to the Quantum Mechanics of Atomic Spectra, Academic Press, New York, 1959, Appendix to Chap. 20, p. 233; V. Bargmann, J. Math. Phys. 5, 862 (1964) Problems 275 taO = 0, and hence since CO commutes with H on account of [H, CO] = U HUt = H, signifying that H is invariant. Not unnaturally, this is exactly the same transformation from which one derives the corresponding conserved four-current-density, which satisfies a continuity equation. This can be confirmed explicitly for PI-I, Q, and MI-IV. See Problem 13.2(b) for the KleinGordon field and 13.10 for the Dirac field. See also Problems 13.5 and 13.12 referring to the charge conjugation operator. The boost vector, Ki Ki = tpi - J == MO i , (12.4.26) d 3 x (xiTOO(x, t) - 7r(X)S~!1>sx (12.4.42) is a constant, but it depends explicitly on time. From the Heisenberg equation of motion K = 0 = i[H, K] + P, it follows that K does not commute with H [H,K] = iP. (12.4.43) For the Dirac-field one finds Ki = tpi - J d 3 x (x i1l(X) - ~7]J(x)'iP (12.4.44) Problems 12.1 Prove the completeness relation (12.1.7b) and the orthogonality relation (12.1.7b). 12.2 Demonstrate the validity of the commutation relation (12.1.10). 12.3 Show that the Hamiltonian (12.1.1) for the coupled oscillators can be transformed into (12.1.11) and gives the dispersion relation (12.1.12). 12.4 Prove the inverse transformations given in (12.1.14a,b). 12.5 Prove the commutation relations for the creation and annihilation operators (12.1.15). d'tt° 12.6 Prove the conservation law (12.4. 7b) by calculating using the threedimensional Gauss's law and by using in the definition of GO the integral over all space. 276 12. Quantization of Relativistic Fields 12.7 The coherent states for the linear chain are defined as eigenstates of the annihilation operators ak. Calculate the expectation value of the operator qn(t) = L y'2~ k [ei(kna-wkt)ak(O) + e-i(kna-wkt)al(O)] mWk for coherent states. 12.8 Show for vector fields As, s = 0,1,2,3, the validity of Eq. (12.4.12c). 13. Free Fields We shall now apply the results of the previous chapter to the free real and complex Klein-Gordon fields, as well as to the Dirac and radiation fields. We shall thereby derive the fundamental properties of these free field theories. The spin-statistics theorem will also be proved. 13.1 The Real Klein-Gordon Field Since the Klein-Gordon field was found as the continuum limit of coupled oscillators, the most important properties of this quantized field theory have already been encountered in Sects. 12.1 and 12.2.1.4. Nevertheless, here we shall once more present the essential relations in a closed, deductive manner. 13.1.1 The Lagrangian Density, Commutation Relations, and the Hamiltonian The Lagrangian density of the free real Klein-Gordon field is of the form (13.1.1) The equation of motion (12.2.21) reads: (13.1.2) The conjugate momentum field follows from (13.1.1) as 8£ . 7r(X) = - . = ¢(X) . 8¢ (13.1.3) The quantized real Klein-Gordon field is represented by the hermitian operators and The canonical quantization prescription (12.3.1) yields for the Klein-Gordon field 278 13. Free Fields [ ¢(x, t), ¢(x', t)] = i 8(x - x') [¢(X,t),¢(X',t)] = [¢(X,t),¢(X',t)] o. (13.1.4) Remarks: (i) Since ¢(x) transforms as a scalar under Lorentz transformations and possesses no intrinsic degrees offreedom, the coefficients Sf: in (12.4.11b) and (12.4.25) are zero. The spin of the Klein-Gordon field is therefore zero. (ii) Since the field operator ¢ is hermitian, the.c of Eq. (13.1.1) is not gauge invariant. Thus the particles described by ¢ carry no charge. (iii) Not all electrically neutral mesons with spin 0 are described by a real Klein-Gordon field. For example, the Ko meson has an additional property known as the hypercharge Y. At the end of the next section we will see that the Ko, together with its antiparticle Ko, can be described by a complex Klein-Gordon field. (iv) For the case of quantized fields, too, it is still common practice to speak of real and complex fields. The expansion of ¢( x) in terms of a complete set of solutions of the KleinGordon equation is of the form (13.1.5) with (13.1.6) where ¢+ and ¢- represent the contributions of positive (e- ikx ) and negative frequency (e ikx ), respectively. Inverting (13.1.5) yields: ak = at = J2~ J J2~ J + i¢(x, 0)) Wk d 3 x eikx (Wk¢(X, 0) Wk d 3 x e- ikx (Wk¢(X, 0) - i¢(x, 0)) (13.1.5') From the canonical commutation relations of the fields (13.1.4), one obtains the commutation relations for the ak and at: (13.1. 7) These are the typical commutation relations for uncoupled oscillators, i.e., for bosons. The operators (13.1.8) 13.1 The Real Klein-Gordon Field 279 have the eigenvalues nk = 0,1,2, ... and can thus be interpreted as occupation-number, or particle-number, operators. The operators ak and at annihilate and create particles with momentum k. From the energy-momentum four-vector (12.4.8), one obtains the Hamiltonian of the scalar field as f d3X~ = f d3X~ H= [J>2(X) [1T2(x) + (V¢(X))2 +m2¢2(x)] + (V¢(X))2 + m2¢2(x)] (13.1.9) , and the momentum operator of the Klein-Gordon field as (13.1.10) Remark. The quantum-mechanical field equations also follow from the Heisenberg equations, and the commutation relations (13.1.4) and (12.3.1): J>(x) = i[H, ¢(x)] = 7r(x) (13.1.11) 7i-(x) = i[H, 1T(X)] = (V2 - m2)¢(x) , (13.1.12) from which we have, in accordance with Eq. (13.1.2), ¢(x) = (V 2 - m 2)¢(x) . (13.1.13) Substitution of the expansion (13.1.5) yields Hand P as (takak +"21) _ ,...1 ~ "2Wk (t aka k + aka kt) -_ " ~wk H- k P=L~k(at+) (13.1.14) k k k (atk+~). (13.1.15) The state of lowest energy, the ground state or vacuum state 10), is characterized by the fact that it contains no particles, i.e., nk = 0, or ak 10) = 0 for all k. (13.1.16a) Thus ¢+(x) 10) = 0 for all x. (13.1.16b) The energy of the vacuum state EO="2~Wk' 1 (13.1.17) 280 13. Free Fields also known as the zero-point energy, is divergent. In itself, this is not a problem, since only energy differences are measurable and these are finite. However, it is desirable and possible to eliminate the zero-point energy from the outset by the use of normal ordering of operators. In a normal ordered product all annihilation operators are placed to the right of all creation operators. For bosons, we illustrate the definition of normal order, symbolized by two colons: ... :, by means of the following examples: (i) (ii) : a k1 a k2 at3 : atak + akat = at3 a k1 ak2 (13.1.18a) (13.1.18b) . - 2atak and (iii) : ¢(x)¢(y) : = : (¢+(x) = : + ¢-(x))(¢+(y) + ¢-(y)) : ¢+(x)¢+(y) : + : ¢+(x)¢-(y) : + : ¢-(x)¢+(y) : +: ¢-(x)¢-(y) = + ¢-(y)¢+(x) +¢-(x)¢+(y) + ¢-(x)¢-(y). ¢+(x)¢+(y) (13.1.18c) One treats the Bose operators in a normal product as if they had vanishing commutators. The order of the creation (annihilation) operators among themselves is irrelevant since their commutators are all zero. The positive frequency parts are placed to the right of the negative frequency parts. The vacuum expectation value of any normal product vanishes. We now redefine the Lagrangian density and the observables such as energy-momentum vector, angular momentum, etc. as normal products :. This means, for example, that the momentum operator (13.1.10) is replaced by p =- J d3 x : ¢(x)V¢(x) : (13.1.10') It follows from this that the energy-momentum vector, instead of being given by (13.1.14) and (13.1.15), now takes the form (13.1.19) This no longer contains any zero-point terms. We shall illustrate this for the Hamiltonian operator H. In the calculation leading to (13.1.14), the first step involved no permutation of operators. If the original H is now replaced by : H :, then, corresponding to example (ii) above, normal ordering gives H = L:k wkatak, i.e., the zero component of (13.1.19). The normalized particle states and their energy eigenvalues are: 13.1 The Real Klein-Gordon Field The vacuum 10) single-particle states two-particle states 281 Eo = 0 at 10) at at 10) for arbitrary kl Ek -=I=- k2 = Wk Ek 1 ,k 2 = Wk 1 + Wk 2 (atr 10) for arbitrary k ~ One obtains a general two-particle state by a linear superposition of these states. As a result of (13.1.7), we have atat 10) = at2at 10) . The particles described by the Klein-Gordon field are bosons: each of the occupation numbers takes the values nk = 0,1,2, .... The operator nk = atak is the particle-number operator for particles with the wave vector k whose eigenvalues are the occupation numbers nk. We now turn to the angular momentum of the scalar field. This singlecomponent field contains no intrinsic degrees of freedom and the coefficients Srs in Eq. (12.4.25) vanish, Srs = O. The angular momentum operator (12.4.23) therefore contains no spin component; it comprises only orbital angular momentum J = f f =: d3xx X (13.1.20) P(x) d3xx x ¢(x)~vcp The spin of the particles is thus zero. Since the Lagrangian density (13.1.1) and the Hamiltonian (13.1.9) are not gauge invariant, there is no charge operator. The real Klein-Gordon field can only describe uncharged particles. An example of a neutral meson with zero spin is the 1fo. 13.1.2 Propagators For perturbation theory, and also for the spin-statistic theorem to be discussed later, one requires the vacuum expectation values of bilinear combinations of the field operators. To calculate these, we first consider the commutators [cp+(x),cp+(y)] = [IP-(x),cp-(y)] = 0 A.+(x) A.-(y)] ['I' , 'I' 1 " " ( 1,)1/2 [ak' akt,] = 2V ~ ~ k' WkWk k = J ~ 2 d 3k e-ik(x-y) (21f)3 Wk e-ikx+ik'y ,kO=Wk. (13.1.21) 282 13. Free Fields Using the definitions ,1±(x) i J J A( ) _ 1 LlX 2i d3 k (27r)3 ~ = =F"2 e'fikx ko d 3k 1 (-ikx e (27r)3 Wk = Wk -e ikx) (13.1.22a) (13.1.22b) one can represent the commutators as follows: [¢+(x), (p-(y)] = i,1+(x - y) (13.1.23a) [¢-(x),¢+(y)] =i,1-(x-y) = -i,1+(y-x) (13.1.23b) [¢(x), ¢(y)] = [¢+(x), ¢-(y)] + [¢-(x), ¢+(y)] (13.1.23c) =i,1(x-y). We also have the obvious relations ,1(x - y) = ,1+ (x - y) ,1-(x) = + ,1-(x - -,1+( -x) . y)) (13.1.24a) (13.1.24b) In order to emphasize the relativistic covariance of the commutators of the field, it is convenient to introduce the following four-dimensional integral representations: ,1±(x) = - J J d4k e- ikx (27r)4 k2 _ m2 (13.1.25a) c± ,1(x) = - d4k e- ikx (27r)4 k2 _ m2 ' (13.1.25b) c for which the contours of integration in the complex ko plane are shown in Fig. 13.1. Fig. 13.1. Contours of integration C± and C in the complex ko plane for the propagators Ll±(x) and Ll(x) The expressions (13.1.25a,b) can be verified by evaluating the path integrals in the complex ko plane using the residue theorem. The integrands are proportional to [(ko - wk)(ko + Wk)]-l and have poles at the positions ±Wk. 13.1 The Real Field Klein~Gord 283 Depending on the integration path, these poles may contribute to the integrals. The right-hand sides of (13.1.25a,b) are manifestly Lorentz covariant. This was shown in (10.1.2) for the volume element, and for the integrand is self-evident. We now turn to the evaluation of the vacuum expectation values and propagators. Taking the vacuum expectation value of (13.1.23a) and using ¢ 10) = 0, one obtains iLl+(x - x') = (01 [¢+(x), ¢~(x')lIO = = (01 ¢(x )¢(x') 10) . (01 ¢+(x)~' 10) (13.1.26) In perturbation theory (Sect. 15.2) we will encounter time-ordered products of the perturbation Hamiltonian. For their evaluation we will need vacuum expectation values of time-ordered products. The time-ordered product T is defined for bosons as follows: T "'(x)"'(x') = {¢(x)¢(X') 'f' 'f' ¢(x')¢(x) t > t' t < t' = 8(t - t')¢(x)¢(x') + 8(t' - (13.1.27) t)¢(x')¢(x) . ImkO Fig. 13.2. Contour of integration in the ko plane for the Feynman propagator ,1F(X} The Feynman propagator is defined in terms of the expectation value of the time-ordered product: iLlF(x - x') == (01 T(¢(x)¢(x')) 10) = i (8(t - t')Ll+(x - x') - 8(t' - t)Ll~(x (13.1.28) - x')) . This is related to Ll±(x) through LlF(x) = ±Ll±(x) for t ~ 0 and has the integral representation d4k e~ikx LlF(x) = (271")4 k 2 _ m 2 . f (13.1.29) (13.1.30) CF The latter can be seen by adding an infinite half-circle to the integration contour in the upper or lower half-plane of ko and comparing it with 284 13. Free Fields Eq.(13.1.25a). The integration along the path CF defined in Fig. 13.2 is identical to the integration along the real ko axis, whereby the infinitesimal displacements TJ and c in the integrands serve to shift the poles ko = ±(wk-iTJ) = ±( Jk2 + m 2 - iTJ) away from the real axis: LlF x = () = lim 7)--+0+ lim g--+O+ J J e- ikx d4k -(271-)4 k5 - (Wk - iTJ)2 d4k e- ikx (271')4 .,...,,-----:0:--k 2 - m 2 ic -- + (13.1.31) . As a preparation for the perturbation-theoretical representation in terms of Feynman diagrams, it is useful to give a pictorial description of the processes represented by propagators. Plotting the time axis to the right in Fig. 13.3, Xl Xl ... "- "- time a) "- ·x ... x'- .- ~ time b) .- .- • ... Fig. 13.3. Propagation of a particle (a) from Xl to X and (b) from x to Xl diagram (a) means that a meson is created at x' and subsequently annihilated at x, i.e., it is the process described by (01 ¢(x)¢(x' ) 10) = iLl+(x - x'). Diagram (b) represents the creation of a particle at x and its annihilation at x', i.e., (01 ¢(X')¢(X) 10) = -iLl-(x - x'). Both processes together are described by the Feynman propagator for the mesons of the Klein-Gordon field, which is thus often called, for short, the meson propagator. As an example, we consider the scattering of two nucleons, which are represented in Fig. 13.4 by the full lines. The scattering arises due to the Fig. 13.4. Graphical representation of the meson propagator .1 F (x - Xl). In the first diagram, a meson is created at Xl and annihilated at x. In the second diagram, a meson is created at x and annihialted at Xl. Full lines represent nucleons, and dashed lines mesons 13.2 The Complex Klein-Gordon Field 285 exchange of meSOnS. The two processes are represented jointly, and independently of their temporal sequence, by the Feynman propagator. 13.2 The Complex Klein-Gordon Field The complex Klein-Gordon field is very similar to the real Klein-Gordon field, except that now the particles created and annihilated by the field carry a charge. Our starting point is the Lagrangian density (13.2.1) In line with the remark following Eq. (12.2.24), ¢(x) and ¢t(x) are treated as independent fields. Hence, we have, for example, aci>!~x) = ¢,/l-(x), and, from the Euler-Lagrange equations (12.2.15), the equations of motion and (13.2.2) The conjugate fields of ¢(x) and ¢t(x) are, according to (12.2.16), 71'(x)=¢t(x) and 71't(x)=¢(x). (13.2.3) Since the complex Klein-Gordon field also behaves as a scalar under Lorentz transformations, it has spin = O. Due to the gauge invariance of £, this field possesses an additional conserved quantity, namely the charge Q. The equal time commutators of the fields and their adjoints are, according to canonical quantization (12.3.1), [¢(x,t),¢t(x',t)] =i8(x-x') (13.2.4) [¢t(x,t),¢(x',t)] =i8(x-x') and [¢(x, t), ¢(x', t)] = [¢(x, t), ¢t (x', t)] = [¢(x,t),¢(x',t)] = [¢(x,t),¢t(x',t)] = O. The solutions of the field equations (13.2.2) for the complex Klein-Gordon field are also of the form e±ikx, so that the expansion of the field operator takes the form where, in contrast to the real Klein-Gordon field, the amplitudes are now independent of one another. From (13.2.5a) we have bt and ak 286 13. Free Fields (13.2.5b) In (13.2.5a,b), the operators ¢(x) and ¢t(x) are split into their positive (e- ikx ) and negative (e ikx ) frequency components. Taking the inverse of the Fourier series (13.2.5a,b) and using (13.2.4), one finds the commutation relations [ak , at,] = [b k , bt,] = 8kk, [ak , ak,] (13.2.6) = [b k , bk,] = [a k , bk,] = [a k , bt,] = 0 . One now has two occupation-number operators, for particles a and for particles b (13.2.7) and The operators at ,ak create and annihilate particles of type a, whilst bt , bk create and annihilate particles of type b, in each case the wave vector being k. The vacuum state 10) is defined by for all k , (13.2.8a) or, equivalently, for all x . (13.2.8b) One obtains for the four-momentum pI-' = Lkl-'(nak+nbk) , (13.2.9) k whose zero component, having kO = Wk, represents the Hamiltonian. On account of the invariance of the Lagrangian density under gauge transformations of the first kind, the charge (13.2.10) is conserved. The corresponding four-current-density is of the form jl-'(x) = -iq (: a¢t ¢ _ a¢ ¢t :) axl-' axl-' (13.2.11) and satisfies the continuity equation j~1-' = o. (13.2.12) 13.3 Quantization of the Dirac Field 287 Substituting the expansions (13.2.5a,b) into Q, one obtains Q= q L (nak - nbk) . (13.2.13) k The charge operator commutes with the Hamiltonian. The a particles have charge q, and the b particles charge -q. Except for the sign of their charge, these particles have identical properties. The interchange a +-+ b changes only the sign of Q. In relativistic quantum field theory, every charged particle is automatically accompanied by an antiparticle carrying opposite charge. This is a general result in field theory and also applies to particles with other spin values. It is also confirmed by experiment. An example of a particle-antiparticle pair are the charged 7r mesons 7r+ and 7r- which have charges +eo and -eo. However, the charge need not necessarily be an electrical charge: The electrically neutral KO meson has an antiparticle kO, which is also electrically neutral. These two particles carry opposite hypercharge: Y = 1 for the KO and Y = -1 for the kO, and are described by a complex Klein-Gordon field. The hypercharge 1 is a chargelike intrinsic degree of freedom, which is related to other intrinsic quantum numbers, namely the electrical charge Q, the isospin I z , the strangeness S, and the baryon number N, by and S=Y-N. The hypercharge is conserved for the strong, but not for the weak interactions. However, since the latter is weaker by a factor of about 10- 12 , the hypercharge is very nearly conserved. The electrical charge is always conserved perfectly! The physical significance of the charge of a free field will become apparent when we consider the interaction with other fields. The sign and magnitude of the charge will then play a role. 13.3 Quantization of the Dirac Field 13.3.1 Field Equations The quantized Klein-Gordon equation provides a description of mesons and, simultaneously, difficulties in its interpretation as a quantum-mechanical wave equation were overcome. Similarly, we shall consider the Dirac equation (5.3.20) as a classical field equation to be quantized: 1 See, e.g., E. Segre, Nuclei and Particles, 2nd ed., Benjamin/Cummins, London (1977), O. Nachtmann, Elementary Particle Physics, Springer, Heidelberg (1990) 288 13. Free Fields (ho - m)'l/J = 0 and i{;(i,·/a + m) = 0 . (13.3.1) The arrow above the 0 in the second equation signifies that the differentiation acts to the left on the i{;. The second equation is obtained by taking the adjoint of the first, and using the relations i{; = 'l/Jt'Y0 and 'Y°'Yt'Yo = 'Yw A possible Lagrangian density for these field equations is .c = i{;(x) (hMoM- m)'l/J(x) , (13.3.2) which may be verified from and (13.3.3) The Lagrangian density (13.3.2) is not real, but differs from a real one only by a four-divergence: .c = .c* = 4[i{;'YMOM'l/J - (oMi{;hM'l/J] - mi{;'l/J + 40M(i{;'YM'l/J) -4 [(oM'l/JthhMt'Yo'l/J - 'l/Jt'YhMtOM'Y°'l/J] -mi{;'l/J + = -4 (4 0M(i{;'Y M'l/J))t [(oMi{;hM'l/J - i{;'YMOM'l/J] - mi{;'l/J - (4 0M(i{;'YM'l/J)) . (13.3.4) The first three terms in (13.3.4), taken together, are real and could also be used as the Lagrangian density, since the last non-real term is a fourdivergence and makes no contribution to the Euler-Lagrange equations of motion. The conjugate fields following from (13.3.2) are: (13.3.5) Here, there is already an indication that the previously applied canonical quantization is not going to work for the Dirac equation because [i{;,,(x) , 7f,,(x')] = i{;" . 0 - o· i{;" = 0 # 8(x - x') . Furthermore, particles with S = ~ are fermions and not bosons and, in the nonrelativistic limit, these were quantized by means of anticommutation relations. The Hamiltonian density resulting from (13.3.2) is 13.3 Quantization of the Dirac Field . 7r a Wa _. t . - . I-' - £ - l'IjJaWa - W(l'Y 81-' - m)w = 289 (13.3.6) 8j W+ m~w -i~'Yj and the Hamiltonian reads: (13.3.7) 13.3.2 Conserved Quantities For the energy-momentum tensor (12.4.1) one obtains from (13.3.2) TI-'v = (8V~) 8£ 8 (81-' W) = 0 + ~hl-' = i~'Y1-8VW + 8£ 8v'IjJ _ I-'V £ 8(8I-'W) 8 vW- I-'V~(h8g . 9 - m)w (13.3.8) the first Since the Lagrangian density does not contain the derivative 81-'~ term in (13.3.8) vanishes. To obtain the final line, we have made use of the fact that the Lagrangian density vanishes for every solution of the Dirac equation. The order of the factors in (13.3.8) is arbitrary. As long as we are dealing only with a classical field theory, the order is irrelevant. Later, we shall introduce normal ordering. According to (12.4.8), the momentum density follows from (13.3.8) as (13.3.9) and the momentum as (13.3.10) The zero component, in particular, is given by (13.3.11) This result is identical to the Hamiltonian H of Eq. (13.3.7), as can be seen by using the Dirac equation. Finally, we consider the angular momentum determined by (12.4.24): If, in the general relation M V" = J d3x (7r r (x) S~': ¢s + xVTo" - x"TO V) , one substitutes the spinor field for ¢s, Eq. (13.3.5) for 7rr , and Eq. (12.4.12b) for S, one then obtains 290 13. Free Fields M IIU J =J d3 x = (i'l/Jl (-~) d 3 x'l/J t (J~'l/3 +Xlli'l/JtaU'l/J-XUi'l/Jtall'l/J) (iXlIa U- ixua ll + ~ (JIIU) 'l/J. (13.3.12) For the spatial components, this yields: a ·1 i a ·1 _i axi x ' - - -xJ , ax j +"21 (J'J.. ) 'l/J, ~ v orbital angular momentum (13.3.13) '-v-" spin which can be combined to form the angular momentum vector (M23, M31 , M12) M J d 3 x'l/J t (x) (xx ~V+E) (13.3.13') 'l/J(x). The first term represents the orbital angular momentum and the second the spin, with E represented by the Pauli spin matrices in (6.2.29d). 13.3.3 Quantization It will prove useful here to modify the definition of plane wave spinors. Instead of the spinors vr(k), r = 1,2, we will now adopt the notation W r (k) = { V2 (k) for r = 1 -vl(k)forr=2, (13.3.14) where the vr(k) are given in Eq. (6.3.11b), and hence ur(k) = (E+m)! 2m (~ 1 wr(k) = _(E2:m) with i(J2 == ( _ ~ ~) and Xl 2 xr) (13.3.15a) m+EXr , (:+~) = (~), X2 = (~). (13.3.15b) This definition is motivated by the ideas of Hole theory (Sect. 10.2) and implies that relations involving the spin have the same form for both electrons and positrons. An electron with spinor U 1 (m, 0) and a positron with spinor W 1 (m, 0) both have spin 2 2 i.e., for the operator ~E3 they have the eigenvalues ±1 and the effect of ~ E on electron and positron states is of the same form. Given this definition, the charge conjugation operation C transforms the spinors ur(k) into wr(k) and vice versa: ±~, 13.3 Quantization of the Dirac Field = 1,2 r = 1,2. Cur(k) = i,lur(k)* = wr(k), r Cwr(k) = i"?wr(k)* = ur(k), 291 (13.3.15c) In the new notation, the orthogonality relations (6.3.15) and (6.3.19a-c) acquire the form ur(k)us(k) = 6rs wr(k)ws(k) = -6rs ur(k)ws(k) = 0 wr(k)us(k) = 0 (13.3.16) and ur(k)'lws(k) = 0 (13.3.17) wr(k)'lus(k) = 0, k = (ko, -k) . Relations that are bilinear in vr(k), e.g., the projections (6.3.23), have the same form in wr(k). We now turn to the representation of the field as a superposition of free solutions in a finite volume V: (13.3.18a) with (13.3.19) where the last line indicates the decomposition into positive and negative frequency contributions. In classical field theory, the amplitudes brk and d rk are complex numbers, as in (10.1.9), and hermitian conjugation becomes sim= d;k' Below, we shall quantize 'ljJ(x) and ply complex conjugation, i.e., d~k i[;(x), and then the amplitudes brk and d rk will be replaced by operators. The relations (13.3.18a,b) are written in such a way that they also remain valid as an operator expansion. For the adjoint field (the adjoint field operator) i[;(x) = 'ljJt(x)')'o, one obtains from (13.3.18a) (13.3.18b) Inserting (13.3.18a,b) into (13.3.10) yields for the momentum pM = L kM (b~kr - drk~) k,r as the following algebra shows: , (13.3.20) 292 13. Free Fields (13.3.21) In the first term after the last identity we have immediately set e±i(ko-kb)xo = 1, on account of Okk', (and thus kb = Jk' + m 2 = ko). The orthogonality relations for the u and w yield the assertion (13.3.20). In quantized field theory, brk and d rk are operators. What are their algebraic properties? These we can determine using the result (13.3.20) for the Hamiltonian H = pO = L ko (b~kr - drk~) . (13.3.22) k,r If, as in the Klein-Gordon theory, commutation rules of the form [d rk , d~k'] = 8kk, , held, there would be no lower bound to the energy. (It would not help to replace dt in the expansion of the field by an annihilation operator en as H would still not be positive definite.) A system described by this Hamiltonian would would not be stable; the excitation of particles by the operator d~k reduce its energy! The way out of this dilemma is to demand anticommutation rules: {brk , b~'k } { d rk , d~'k } = 8rr ,8kk, = 8rr,8kk, (13.3.23) That anticommutation rules should apply for fermions comes as no surprise in view of the nonrelativistic many-particle theory discussed in Part 1. The second term in (13.3.22) then becomes -drk~ = d~kr-1, so that the creation 13.3 Quantization of the Dirac Field 293 of a d particle makes a positive energy contribution. The anticommutation relations (13.3.23) imply that each state can be, at most, singly occupied, i.e., the occupation-number operators n~ = b~kr and n~ = d~kr have the eigenvalues (occupation numbers) n~d) = 0,1. To avoid zero-point terms, we have also introduced normal products for the Dirac field. The definition of the normal ordering for fermions reads: All annihilation operators are written to the right of all creation operators, whereby each permutation contributes a factor (-1). Let us illustrate this definition with an example: : 1/Ja 1/J(3 : =: (1/J~ = 1/J~t + 1/J~) (1/Jt + 1/Jt3) : -1/Jt3~ + 1/J~t + 1/J~t3 (13.3.24) . All observables, e.g., (13.3.10) or (13.3.22) are defined as normal products, i.e., the final Hamiltonian is defined by H = : Horiginal : and P = : P original :, where Horiginal and P original refer to the expressions in (13.3.22) and (13.3.10). We thus have =L H Ek (b~kr + d~kr) (13.3.25) k,r =L P (b~kr k + d~kr) (13.3.26) . k,r The operators b~k (b rk ) create (annihilate) an electron with spinor ur(k)e- ikx and the operators d~k (d rk ) create (annihilate) a positron in the state wr(k)e ikx . From (13.3.25) and (13.3.26) and the corresponding representation of the angular momentum operator, it is clear that the d particles now already called positrons - have the same energy, momentum, and spin degrees of freedom as the electrons. For their complete characterization, we still have to consider their charge. 13.3.4 Charge We start from the general formula (12.4.31) which yields for the charge Q J J Q = -iq = q d3 x(7r1/J - if;if) d3 = -iq J d3 xi1/Jl1/Ja (13.3.27a) x if;l'o1/J . The associated four-current-density is of the form jI.L(x) = q : if;(x),),I.L1/J(x) : (13.3.27b) and satisfies the continuity equation j~ = O. (13.3.27c) 294 13. Free Fields Setting q = -eo for the electron, we obtain from (13.3.27a) the modified definition of Q in terms of normal products Q = -eo = -eo J d3 x : 1f;(xho1jJ(x) : == L L k (b~kr - d~kr) J d3 xjO(x) (13.3.28) . r=1,2 In the last identity the expansions (13.3.18a) and (13.3.18b) have been inserted and evaluated as in Eq.(13.3.21). The difference in sign between (13.3.28) and the Hamiltonian (13.3.25) stems from the fact that in (13.3.21) the differential operators 81-' produce different signs for the positive and negative frequency components, which, however, are compensated by the anticommutation. From (13.3.28), it is already evident that the d particles, i.e., the positrons, have opposite charge to the electrons. This is further confirmed by the following argument: [Q, b~k] = -eob~k [Q,d~k] = eod~k' (13.3.29) We consider a state IlP), which is taken to be an eigenstate of the charge operator with eigenvalue q: Q IlP) = q IlP) (13.3.30) . From (13.3.29) it then follows that Qb~k Qd~k IlP) = (q - eo)b~k IlP) (q + eo)d~k IlP) IlP) = The state b~k IlP) has the charge (q - eo) and the state d~k (q + eo). Hence, using (13.3.18b), we also conclude that Q1f;(x) IlP) = (q - eo)1f;(x) IlP) . (13.3.31 ) IlP) the charge (13.3.32) The creation of an electron or the annihilation of a positron reduces the charge by eo. The vacuum state 10) has zero charge. The charge operator, as a conserved quantity, commutes with the Hamiltonian and is time independent. As can be seen directly from (13.3.28) and (13.3.26), it also commutes with the momentum vector P, which can be written [Q,PI-'] =0. (13.3.33) Hence, there exist joint eigenfunctions of the charge and the momentum operators. In the course of our attempt, in Part II, to construct a relativistic 13.3 Quantization of the Dirac Field 295 wave equation, and to interpret 'IjJ in analogy to the Schrodinger wave function as a probability amplitude, we interpreted jO = 'IjJ t 'IjJ as a positive density; H, however, was indefinite. In the quantum field theoretic form, Q is indefinite, which is acceptable for the charge, and the Hamiltonian is positive definite. This leads to a physically meaningful picture: 'IjJ(x) is not the state, but rather a field operator that creates and annihilates particles. The states are given by the states in Fock space, i.e., 10), b~k 10), b~lk d~2k 10) , b~lk b~2kd3 10), etc. The operator b~k=O with r = l(r = 2) creates an electron at rest with spin in the z direction Sz = ~(sz = -~). Likewise, d~k=O creates a positron at rest with Sz = ~(sz = -~). Correspondingly, b~k (d~k) creates an electron (positron) with momentum k, which, in its rest frame, possesses the spin ~ for r = 1 and -~ for r = 2 (see Problem 13.11). *13.3.5 The Infinite-Volume Limit When using the Dirac field operators, we will always consider a finite volume, i.e., in their expansion in terms of creation and annihilation operators we have sums rather than integrals over the momentum. The infinite-volume limit will only be introduced in the final results; for example, in the scattering crosssection. However, there are some circumstances in which it is convenient to work with an infinite volume from the outset. Equation (13.3.18a) then becomes 2 (13.3.34) The annihilation and creation operators are related to their finite-volume counterparts by (13.3.35) These operators thus satisfy the anticommutation relations {br(k), b~,(k')} = (27r)3kobrrlbC3l(k - k') {dr(k), d~,(k')} = (27r)3kobrrlbC3l(k - k') , (13.3.36) and all other anticommutators vanish. The momentum operator has the form (13.3.37) 2 The factor Vm in (13.3.34) is chosen, as in (13.3.18a), in order to cancel the corresponding factor 1/ Vm in the spinors, so that the limit m --+ 0 exists. 296 13. Free Fields We have {pIL, bt(k)} = kILbt(k) , {PIL, br(k)} = -kILbr(k) , {pIL, dt(k)} = kILdt(k) , {PIL, dr(k)} = -kILdt(k) . (13.3.38) From (13.3.38), one sees directly that the one-electron (positron) state bt(k) 10) (dt(k) 10)) possesses momentum klL. 13.4 The Spin Statistics Theorem 13.4.1 Propagators and the Spin Statistics Theorem We are now in a position to prove the spin statistics theorem, which relates the values of the spin to the statistics (i.e., to the commutation properties and, hence, to the possible occupation numbers). By way of preparation we calculate the anticommutator of the Dirac field operators, where a and f3 stand for the spinor indices 1, ... ,4. Making use of the anticommutation relations (13.3.23), the projectors (6.3.23), and Eq. (6.3.21) we find {1jJa(X), X ~al (x')} = ~ L k L k' (Em; I) k k (ura(k)urlal(k')e-ikXeikIXI = ~ L~ 1/2 L L b"rrlb"kk l r r' + wra(k)wrlal(k')eikXe-ikIXI) (e-ik(X-X I) Lura(k)iiral(k) V k Ek r + eik(x-x' ) ~ = J d3k3 ~ (e-ik(X-X I) (~ (27r) Ek + eik(x-x' ) (~ = (ifA + m) 'I' = (if) I ~ aa 2 Wra(k)Wral(k)) m) - m) ) + 2m (13.4.1) aa' 2m ow' 3 d k ~ (e-ik(X-X I) _ eik(X-X I)) (27r)3 Ek J + m)aaliLl(x - x') , where the function (13.1.22b) Ll(x - x') =~ 21 J d 3k ~ (e-ik(X-X I) _ eik(X-X I)) (27r)3 Ek (13.4.2) has already been encountered in (13.1.23c) as the commutator of free bosons, namely 13.4 The Spin Statistics Theorem 297 [¢(x), ¢(x')] = iLl (x - x') . The anticommutator of the free field operators thus has the form (13.4.1') In order to proceed further with the analysis, we require certain properties of Ll(x), which we summarize below. Properties of Ll(x) (i) It is possible to represent Ll (x) in the form (13.4.3a) with (ii) Ll( -x) = -Ll(x) . (13.4.3b) This can be seen directly from Eq.(13.4.3a). (iii) (13.4.3c) The functions Ll(x),..:1+ (x), ,1- (x) are solutions of the free Klein-Gordon equation, since they are linear superpositions of its solutions. The propagator LlF(x) and the retarded and advanced Green's functions 3 LlR(x), LlA(x) satisfy the inhomogeneous Klein-Gordon equation with a source 8(4)(x) : (D + m 2)Ll F (x) = -8(4)(x). (iv) (13.4.3d) This follows by taking the derivative of (13.4.2). (v) Ll( x) is Lorentz invariant. This can be shown by considering a Lorentz transformation A Ll(Ax) = !! i d4k 8(k 2 _ m2)E(ko)e-ik.Ax . (2'71l Using k· Ax = A-1k· x and the substitution k' = A-1k, we have d4k = d4k' and k'2 = k 2. Furthermore, the 8 function in (13.4.3a) vanishes for space-like vectors 3 The retarded and advanced Green's functions are defined by LlR(x) and LlA(x) == -B( -xo) Ll(x). == B(xo)Ll(x) 298 13. Free Fields k, i.e., k 2 < O. Since for time-like k and orthochronous Lorentz transformations E(kO') = E(kO), it follows that .:1(Ax) = .:1(x) . (13.4.3e) For A E'c"- on the other hand, .:1(Ax) = -.:1(x). Fig. 13.5. Minkowski diagram: past and future light cone of the origin, and a space-like vector (outside the light cone), are shown (vi) For space-like vectors one has .:1 ( -x) = Ll(x) . (13.4.3f) Proof. The assertion is valid for purely space-like vectors as seen from the representation (13.4.3a) with the substitutions x -+ -x' and k -+ -k'. However, all space-like vectors can be transformed into purely space-like vectors by means of an orthochronous Lorentz transformation (Fig. 13.5). (vii) Thus, by combining (13.4.3b) and (13.4.3f), it follows for space-like vectors that Ll(x) = 0 for x2 < 0 . (13.4.3g) For purely space-like vectors this can be seen directly from the definition (13.4.2) of Ll(x). We now turn to the proof of the spin statistics theorem. First, we show that two local observables of the type if;(x)'ljJ(x), etc., commute for space-like separations, e.g.: 13.4 The Spin Statistics Theorem 299 [1/J(x)1P(x),1/J(x')1P(x')] = 1/Ja(x) [1Pa(X),1/J(x')1P(x')] + [1/Ja(X), 1/J(x')1P(x')] 1Pa(x) = 1/Ja(x) ({ 1Pa(x), 1/Ji3(x')} 1Pi3(x') -1/Ji3(x') {1Pa(X), 1Pi3(x')}) + ({ 1/Ja(x), 1/Ji3(x')} 1Pi3(x') (13.4.4) -1/Ji3(x') {1/Ja(X), 1Pi3(x')}) 1Pa(x) = 1/Ja(x) (i~ + m)ai3iLl(x - +1/Ji3(x') (( -i~ X')) 1Pi3(x') + m)i3aiLl(x - X')) 1Pa(X). Because of (13.4.3g), this commutator vanishes for space-like separations. Thus, causality is satisfied since no signal can be transmitted between x and x' when (x - X,)2 < 0. 4 What would have been the result if we had used commutators instead of anticommutators for the quantization? Apart from the absence of a lower energy bound, we would encounter a violation of causality. We would then have (13.4.5a) where Ll 1 (x - x') = ;. 21 J d3 k ~ (2'n/ ko (e-ik(X-X') + eik(X-X'») (13.4.5b) The function Lll (x) = Ll+(x) - Ll_ (x) is an even solution ofthe homogeneous Klein-Gordon equation, which does not vanish for space-like separations (x - x,)2 < O. Likewise, for space-like separations, (i~ + m)iLll(x - x') =I- O. For this kind of quantization, local operators at the same point in time but different points in space would not commute. This would amount to a violation of locality or microcausality. Based on these arguments, we can formulate the spin statistics theorem as follows: Spin statistics theorem: Particles with spin ~ and, more generally, all particles with half-integral spin are fermions, whose field operators are quantized by anticommutators. Particles with integral spin are bosons, and their field operators are quantized by commutators. Remarks: (i) Microcausality: Two physical observables at positions with a space-like separation must be simultaneously measurable; the measurements cannot 4 If it were possible to transmit signals between space-time points with space-like separations, then this could only occur with speeds greater than the speed of light. In a different coordinate system, this would correspond to a movement into the past, i.e., to acausal behavior. 300 13. Free Fields influence one another. This property is known as micro causality. Without it, space-like separations would have to be linked by a signal which, in violation of special relativity, would need to travel faster than light in order for the observables to influence one another. This situation would also apply at arbitrarily small separations, hence the expression micro causality. Instead of microcausality, the term locality is also used synonymously. We recall the general result that two observables do not interfere (are simultaneously diagonalizable) if, and only if, they commute. (ii) The prediction of the spin statistics theorem for free particles with spin S = 0 can be demonstrated in analogy to (13.4.5): Commutation rules lead to [1>(x) , 1>(x')] = iLl(x - x'). The fields thus satisfy microcausality and, by calculating the commutators of products, one can show that the observables 1>(x)2 etc. also do. If, on the other hand, one were to quantize the Klein-Gordon field with Fermi commutation relations, then, as is readily seen, neither [1>(x),1>(x')]+ nor [1>(x),1>(x')L could possess the microcausality property [1>(x),1>(x')]± = 0 for (x - X,)2 < O. Therefore, also composite operators would violate the requirement of microcausality. (iii) Perturbation theory leads one to expect that the property of microcausality can be extended from the free propagator to the interacting case5 . For the interacting Klein-Gordon field one can derive the spectral representation J 00 (01 [1>(x), 1>(x')] 10) = da 2Q(a 2)d(x - x', a) (13.4.6) o for the expectation value of the commutator6. Here d(x - x', a) is the free commutator given in Eq. (13.4.2) with explicit reference to the mass, which in Eq. (1.3.4) has been integrated over. Hence, micro causality is also fulfilled for the interacting field. If on the other hand the KleinGordon field had been quantized using Fermi anti-commutation rules one would find J 00 (01 {1>(x),1>(x')} 10) = da 2Q(a 2)dl(x-x',a) , (13.4.7) o where Lll(x - x',a) given in Eq. (13.4.5b) does not vanish for space-like separations. Microcausality would then be violated. Analogously one obtains for fermions, if they are quantized with commutators, a spectral representation containing Lll' which is again a contradiction to microcausality. 5 6 A general proof for interacting fields on the basis of axiomatic field theory can be found in R.F. Streater, A.S. Wightman, peT, Spin & Statistics and all that, W.A. Benjamin, New York, 1964, p. 146 f. J.D. Bjorken and S.D. Drell, Relativistic Quantum Fields, McGraw Hill, New York, 1965, p. 171. 13.4 The Spin Statistics Theorem 301 (iv) The reason why the observables of the Dirac field can only be bilinear quantities if;'l/J as well as powers or derivatives thereof is the following. The field 'l/J(x) itself is not measurable, since it is changed by a gauge transformation of the first kind, and observables may only be gauge-invariant quantities. Measurable quantities must, as the Lagrangian density, remain unchanged under a gauge transformation. Furthermore, there are no other fields that couple to 'l/J(x) alone. The electromagnetic vector potential AI-" for example, couples to a bilinear combination of'l/J. Another reason why 'l/J(x) is not an observable quantity follows from the transformation behavior of a spinor under a rotation through 271", Eq. (6.2.23a). Since the physically observable world is unchanged by a rotation through 271", whereas a spinor 'l/J becomes -'l/J, one must conclude that spinors themselves are not directly observable. This does not contradict the fact that, under rotation in a spatial sub domain, one can observe the phase change of a spinor with respect to a reference beam by means of an interference experiment, since the latter is determined by a bilinear quantity (see the remarks and references following Eq. (6.2.23a)). 13.4.2 Further Properties of Anticommutators and Propagators of the Dirac Field We summarize here for later use a few additional properties of anticommutators and propagators of the Dirac field. According to Eq. (13.4.1) and using the properties (13.4.3d) and (13.4.3f) of Ll(x), the equal time anticommutator of the Dirac field is given by Multiplying this by 'Y~, (3 and summing over 0/ yields: (13.4.8) Hence, i'l/J t is thus sometimes called the anticommutating conjugate operator to 'l/J(x). Fermion propagators In analogy to (13.1.23a-c), one defines for the Dirac field (13.4.9a) 302 13. Free Fields ['ljJ(x),,,j}(x')]+ =is(x-x'). (13.4.9b) The anticommutator (13.4.9b) has already been calculated in (13.4.1). From this calculation one sees that is+(x-x') (is-(x-x')) is given by the first (second) term in the penultimate line of (13.4.1). Hence, on account of (13. 1. lOac), we have (13.4.lOa) S(x) = S+(x) + S-(x) = (i¢J + m)L1(x) . (13.4.lOb) Starting from the integral representation (13.1.25a,b) of Ll± and ..1, one obtains from (13.4.9a,b) s± x - ( ) - 1 d4p . P+ m _ _ e- 1PX c± (27f)4 p2 - m 2 - - 1 c± d4p e- ipx (13.4.lla) ----- (27f)4 P- m and (13.4.llb) where we have used (p ± m)(p =F m) = p2 - m 2 . The paths C± and Care the same as defined in Fig. 13.1. For Fermi operators one also introduces a time-ordered product. The definition of the time-ordered product for Fermi fields reads: T ('ljJ(x)"j}(x')) == {'ljJ(~)"}X for t > t' -'ljJ(x')'ljJ(x) for t < t' == e(t - t')'ljJ(x)"j}(x') - e(t' - t)"j}(x')'IjJ(x) . (13.4.12) For use in the perturbation theory to be developed later, we also introduce the following definition of the Feynman fermion propagator (01 T('ljJ(x)"j}(x')) 10) == iSF(x - x') . (13.4.13) For its evaluation, we note that (01 'ljJ (x)"j) (x') 10) = (OI'ljJ+(x)"j}-(x') 10) = (01 ['IjJ+(x),,,j}-(x')]+ 10) = is+(x - x') (13.4.14a) and, similarly, (01 "j}(x')'ljJ(x) 10) = is-(x - x') , (13.4.14b) from which it follows that the Feynman fermion propagator (see Problem 13.18) is SF(X) = e(t)s+(x) - e( -t)S-(x) = (i"(1-'81-' + m)LlF(x) . (13.4.15) Problems 303 By exploiting (13.1.31), one can also write the Feynman fermion propagator in the form SF(X) J = d4 p (2.n/ e~ipx P+ m . . p2 - m 2 + IE (13.4.16) Problems 13.1 Confirm the validity of (13.1.5'). 13.2 (a) For the scalar field, show that the four-momentum operator pJ10 J d3 x{Jr¢)'J1o - =: ot.c} : can be written in the form (13.1.19) pJ10 = L kJ10atak . k (b) Show that the four-momentum operator is the generator of the translation operator: eia"p" F(¢(x»e-ia"p" = ¢(x + a) . 13.3 Confirm formula (13.1.25a) for ,1±(x). 13.4 Confirm the formula (13.1.31) for ,1F(X), taking into account Fig. 13.2. 13.5 Verify the commutation relations (13.2.6). 13.6 For the quantized, complex ation is defined by Klein~Gord field, the charge conjugation oper- ¢'(x) = e¢(x)e t = 'f/e¢t(x) , where the charge conjugation operator e is unitary and leaves the vacuum state invariant: e 10) = 10). (a) Show for the annihilation operators that eake t = 1]ebk , ebke t = 'f/~ak and derive for the single-particle states la, k) == at 10), Ib, k) == bt 10) , the transformation property e la, k) = 'f/~ Ib, k), e Ib, k) = 1]e la, k) . 304 13. Free Fields (b) Also show that the Lagrangian density (13.2.1) is invariant under charge conjugation and that the current density (13.2.11) changes sign: ej(x)e t = -j(x). Hence, particles and antiparticles are interchanged, with the four-momentum remaining unchanged. (c) Find a representation for the operator e. 13.7 Show for the Klein-Gordon field that: = -i81t¢(x) = iV¢(x) . [p1t,¢(x)] [P,¢(x)] 13.8 Derive the equations of motion for the Dirac field operator 'l/;(x), starting from the Heisenberg equations of motion with the Hamiltonian (13.3.7). 13.9 Calculate the expectation value of the quantized angular momentum operator (13.3.13) in a state with a positron at rest. 13.10 Show that the spinors ur(k) and wr(k) transform into one another under charge conjugation. 13.11 Consider an electron at rest and a stationary positron 10) Ie 'f ,k = O,s ) = {b!k=O t dsk=o 10) Show that and J31e'f , k = 0,s)- ±.!2 le'f ' k = 0 " s) where (U)rs are the matrix elements of the Pauli matrices in the Pauli spinors Xr and XS. 13.12 Prove that the momentum operator of the Dirac field (13.3.26) pit = Lk,r kit [b~kr + d~kr] is the generator of the translation operator: eia",p'" 'l/;(x)e-ial'pl' = 'l/;(x + a) . Problems 305 13.13 From the gauge invariance of the Lagrangian density, derive the expression (13.3.27b) for the current-density operator of the Dirac field. 13.14 Show that the operator of the charge conjugation transformation e = ele2 V~k A(b~rk e1 = exp [-i L k,r e2 = exp [i; Lk,r V~k - (b~r d~rk)] - dt)(bkr - dkr)] transforms the creation and annihilation operators of the Dirac field and the field operator as follows: ebkr et = 'T/edkr , ed~rt = 'T/~br , e7jJ(x)e t = 'T/eCiV(x), where the transpose only refers to the spinor indices, and C = i'l'l. The factor el yields the phase factor 'T/e = eiA . The transformation e exchanges particles and antiparticles with the same momentum, energy, and helicity. Show also that the vacuum is invariant with respect to this transformation and that the current density jP- = e : i[r·t7jJ : changes sign. 13.15 (a) Show for the spinor field that Q = -eo LL k (b~kr - d~kr ) r (Eq. (13.3.28)), by starting from Q = -eo J 3 - 0 d x : 7jJ(x)oy 7jJ(x) (b) Show, furthermore, that [Q, b~k] = -eob~k and [Q, dq = eod~k (Eq.(13.3.29)). 13.16 Show that (13.4.2) can also be written in the form (13.4.3a). 13.11 Show that LlF(X), LlR(X), and LlA(X) satisfy the inhomogeneous KleinGordon equation (EJp.f)P- + m 2 )Ll F (x) = -8(4)(x) 13.18 Prove equation (13.4.15). . 306 13. Free Fields 13.19 Show for the Dirac-field that: [PI-','¢(x)] = -j{)I-''¢(x) [P, ,¢(x)] = iV'¢(x) . 14. Quantization of the Radiation Field This chapter describes the quantization of the free radiation field. Since, for certain aspects, it is necessary to include the coupling to external current densities, a separate chapter is devoted to this subject. Starting from the classical Maxwell equations and a discussion of gauge transformations, the quantization will be carried out in the Coulomb gauge. The principal aim in this chapter is to calculate the propagator for the radiation field. In the Coulomb gauge, one initially obtains a propagator that is not Lorentz invariant. However, when one includes the effect of the instantaneous Coulomb interaction in the propagator and notes that the terms in the propagator that are proportional to the wave vector yield no contribution in perturbation theory, one then finds that the propagator is equivalent to using a covariant one. The difficulty in quantizing the radiation field arises from the massless nature of the photons and from gauge invariance. Therefore, the vector potential AI"(x) has, in effect, only two dynamical degrees of freedom and the instantaneous Coulomb interaction. 14.1 Classical Electrodynamics 14.1.1 Maxwell Equations We begin by recalling classical electrodynamics for the electric and magnetic fields E and B. The Maxwell equations in the presence of a charge density p( x, t) and a current density j (x, t) read 1 : 1 Here and in the following we shall use rationalized units, also known as HeavisideLorentz units. In these units the fine-structure constant is 2 Q = 4;~C ·2 = 1~7' whereas in Gaussian units it is given by Q = ~, i.e., eo = eov::r;;:. Correspondingly, we have E = EGaus/~ and B = BGauss/v::r;;:, and the Coulomb law 2 . V(x) = 47r1:-x'l. In the followmg, we shall also set It = c = 1. 308 14. Quantization of the Radiation Field V·E=p (14.l.1a) oB xE=-- V (14.l.1b) at V·B=O (14.l.1c) oE . =Ft+ J . V xB (14.1.1d) If we introduce the antisymmetric field tensor 0 Ex FI-'v = ( -Ex 0 Ey EZ) Bz -By , -Ey -Bz 0 Bx -Ez By -Bx 0 (14.1.2) whose components can also be written in the form Ei = F Oi Bi = ~fijk 2 (14.1.3) P. Jk , the Maxwell equations then acquire the form ovFI-'U = jl-' OA FI-'v + 01-' F VA + au FAI-' = (14.1.4a) and 0, (14.1.4b) where the four-current-density is jl-' = (p,j) , (14.1.5) satisfying the continuity equation j~1-' (14.1.6) = O. The homogeneous equations (14.1.1b,c) or (14.1.4b) can be satisfied automatically by expressing FI-'v in terms of the four-potential AI-': (14.1.7) The inhomogeneous equations (14.1.1a,d) or (14.1.4a) imply that DAI-' - ol-'ouAV = jl-' . (14.1.8) In Part II, where relativistic wave equations were discussed, jl-'(x) was the particle current density. However, in quantum field theory, and particularly in quantum electrodynamics, it is usual to use jl-'(x) to denote the electrical current density. In the following, we will have, for the Dirac field for example, jl-'(x) = ei{;(xhl-''Ij;(x) , where e is the charge of the particle, i.e., for the electron e = -eo. 14.2 The Coulomb Gauge 309 14.1.2 Gauge Transformations Equation (14.1.8) is not sufficient to determine the four-potential uniquely, since, for an arbitrary function '\(x), the transformation (14.1.9) leaves the electromagnetic field tensor FfJ- V and hence, also the fields E and B, as well as Eq. (14.1.8) invariant. One refers to (14.1.9) as a gauge transformation of the second kind. It is easy to see that not all components of AfJare independent dynamical variables and, by a suitable choice of the function '\(x), one can impose certain conditions on the components AfJ-' or, in other words, transform to a certain gauge. Two particularly important gauges are the Lorentz gauge, for which one requires AfJ- =0 ,1-' ' (14.1.10) and the Coulomb gauge, for which V·A=O (14.1.11) is specified. Other gauges include the time gauge AO = 0 and the axial gauge A3 = O. The advantage of the coulomb gauge is that it yields only two transverse photons, or, after an appropriate transformation, two photons with helicity ±1. The advantage of the Lorentz gauge consists in its obvious Lorentz invariance. In this gauge, however, there are, in addition to the two transverse photons, a longitudinal and a scalar photon. In the physical results of the theory these latter photons, apart from mediating the Coulomb interaction, will play no role. 2 14.2 The Coulomb Gauge We shall be dealing here mainly with the Coulomb gauge (also called transverse or radiation gauge). It is always possible to transform into the Coulomb gauge. If AfJ- does not satisfy the Coulomb gauge, then one takes instead the gauge-transformed field AfJ- + 81-',\, where ,\ is determined through V2,\ = -V· A. In view of the Coulomb gauge condition (14.1.11), for the zero components (p, = 0), equation (14.1.8) simplifies to (85 - V 2 )Ao - 8 0 (80 A o - V· A) = jo , and due to (14.1.11) we thus have V2AO = -jo· 2 (14.2.1) The most important aspects of the covariant quantization by means of the Gupta~Bler method are summarized in Appendix E. 310 14. Quantization of the Radiation Field This is just the Poisson equation, well known from electrostatics, which has the solution A°( t) jd3x' jO(x',t) I 'I . = x, (14.2.2) 4 7fX-X Since the charge density jO (x) depends only on the matter fields and their conjugate fields, Eq. (14.2.2) represents an explicit solution for the zero components of the vector potential. Therefore, in the Coulomb gauge, the scalar potential is determined by the Coulomb field of the charge density and is thus not an independent dynamical variable. The remaining spatial components Ai are subject to the gauge condition (14.1.11), and there are thus only two independent field components. We now turn to the spatial components of the wave equation (14.1.8), thereby taking account of (14.1.11), (14.2.3) From (14.2.2), using the continuity equation (14.1.6) and partial integration, we have (14.2.4) is a short-hand notation for the integral over the Coulomb Green's where -~ function 3 . If we insert (14.2.4) into (14.2.3), we obtain DA. J - ·trans = Jj - (!<. _8jC:A) V2 UJk . (14.2.5) Jk· The wave equation for Aj (14.2.5) contains the transverse part of the current density jyans. The significance of the transversality will become more evident later on when we work in Fourier space. 3 In this way, the solution of the Poisson equation 2 . -p IS V if> = represented as if> = 1 V --2 P _ = J d 3 xlp(x/,t) 47r IX - x, I };. For the special case p(x, t) = -8 3 (x), we have 2 3 V if> = 8 (x) 1 3 and thus if> = V28 (x) = - Jd47rlx8_(xx'I/) = - 47rlxl 1 . 3XI 3 14.3 The Lagrangian Density for the Electromagnetic Field 311 14.3 The Lagrangian Density for the Electromagnetic Field The Lagrangian density of the electromagnetic field is not unique. One can derive the Maxwell equations from .c = -~Fl'vIV - jl-'AI-' 4 (14.3.1) with FI-'v = AI-',v - Av,w This is because 8v ~8A 1-', V = 8v (-~) 4 (FI-'V - FVI-') X 2 = -8v FI-'v (14.3.2) and 8.c .1-' -=-] 8AI-' yield the Euler-Lagrange equations 8vFI-'V =jl-', (14.3.3) i.e. (14.1.4a). As was noted before Eq. (14.1.7), equation (14.1.4b) is automatically satisfied. From (14.3.1), one finds for the momentum conjugate to AI-' III-' = 8~ = _Fl-'o . (14.3.4) 8AI-' Hence, the momentum conjugate to Ao vanishes, II o = 0 and (14.3.5) The vanishing of the momentum component II o shows that it is not possible to apply the canonical quantization procedure to all four components of the radiation field without modification. Another Lagrangian density for the four-potential AI-'(x), which leads to the wave equation in the Lorentz gauge, is .cL = -~AI',v"V - jl-'AI-' . (14.3.6) Here, we have · IIr = 8A.cLI-' = -AI-',o = -AI-' , (14.3.7) and the equation of motion reads: DAI-' =jl-'. (14.3.8) 312 14. Quantization of the Radiation Field This equation of motion is only identical to (14.1.8) when the potential AJL satisfies the Lorentz condition (14.3.9) The Lagrangian density LL of equation (14.3.6) differs from the L of (14.3.1) in the occurrence of the term - ~ (0). A >.) 2, which fixes the gauge: . 2 -"41 FwP'v- I2> (0). A ) - LL = . JJLAJL . (14.3.6') This can easily be seen when one rewrites LL as follows: LL = -~(A 4 JL,V - A V,JL )(AJL,V - AV,JL) - ~o'A> 2 = -~A 2 JL,v AJL,V + ~A 2 JL,V AV,JL - o'A>~ 2 = -~AJL,v'V A 0" A 0" AO" -J' JL AJL AO" - J' JL AJL - jJLAJL . In the last line, a total derivative has been omitted since it disappears in the Lagrangian through partial integration. If one adds the term _~(0).A>2 to the Lagrangian density, one must choose the Lorentz gauge in order that the equations of motion be consistent with electrodynamics DAJL = JJL . Remarks: (i) Unlike the differential operator in (14.1.8), the d'Alembert operator appearing in (14.3.8) can be inverted. (ii) With or without the term fixing the gauge, the longitudinal part of the vector potential 8.\A.\ satisfies the d'Alembert equation o (8.\A.\) =0. This also holds in the presence of a current density j1-'(x). 14.4 The Free Electromagnatic Field and its Quantization When JJL = 0, i.e., in the absence of external sources, the solution of the Poisson equation which vanishes at infinity is AD = 0, and the electromagnetic fields read: E=-A.., B=VxA. (14.4.1) From the Lagrangian density of the free radiation field L = _~FJLV 4 FJLv = ~(E2 2 - B2) , (14.4.2) 14.4 The Free Electromagnatic Field and its Quantization 313 where (14.1.2) is used to obtain the last expression, one obtains the Hamiltonian density of the radiation field as . . 2 1 2 2 1-£"( = IIJ Aj - ,c = E - 2" (E - B ) - ~(E2 -2 (14.4.3) + B2) . Since the zero component of AIL vanishes and the spatial components satisfy the free d'Alembert equation and V·A = 0, the general free solution is given by 2 AIL(x) = "" ~ ~ k where ko = k J 21k lV (e-ikXfILk,A a kA +eikxfILk,A *atkA ) 1 A=1 , (14.4.4) Ikl, and the two polarization vectors have the properties . €k,A = 0 0 fk,A -- €k,A . €k,N = 8AN 0 (14.4.5) . In the classical theory, the amplitudes akA are complex numbers. In (14.4.4), we chose a notation such that this expansion also remains valid for the quantized theory in which the akA are replaced by the operators akA' The form of (14.4.4) guarantees that the vector potential is real. Remarks: (i) The different factor in QM I Eq. (16.49) arises due to the use there of Gaussian units, in which the energy density, for example, is given by 1-£ = 8~ (E2 + B 2). (ii) In place of the two photons polarized transversely to k, one can also use helicity eigenstates whose polarization vectors have the form IL fp _ ,±1 - ' R(p) ( 0) ±i~ 1/~ ~ , (14.4.6) where R(p) is a rotation that rotates the z axis into the direction of p. (iii) The first attempt at quantization could lead to [Ai(x,t),Aj(x',t)] =i8ij 8(x-x') i.e., (14.4.7) This relation, however, contradicts the condition for the Coulomb gauge 8i Ai = 0 and the Maxwell equation 8i Ei = O. 314 14. Quantization of the Radiation Field We will carry out the quantization of the theory in the following way: It is already clear that the quanta of the radiation field - the photons - are bosons. This is a consequence of the statistical properties (the strict validity of Planck's radiation law) and of the fact that the intrinsic angular momentum (spin) has the value S = 1. The spin statistics theorem tells us that this spin value corresponds to a Bose field. Therefore, the amplitudes of the field, the akA' are quantized by means of Bose commutation relations. We begin by expressing the Hamiltonian (function or operator) (14.4.3) in terms of the expansion (14.4.4). Using the fact that the three vectors k, 10k" and Ek2 form an orthogonal triad, we obtain (14.4.8) We postulate the Bose commutation relations [akA' at, A'] = 8AA' 8kk' [akA' ak,A'] = and (14.4.9) [aL, at, A'] = 0 . The Hamiltonian (14.4.8) then follows as (14.4.8') The divergent zero-point energy appearing here will be eliminated later by a redefinition of the Hamiltonian using normal ordering. We now calculate the commutators of the field operators. Given the definition 2 A I-tV = w """' (14.4.10) J.L v Ek,A Ek,A , A=1 it follows from (14.4.5) that (14.4.11) and 2. Aij ij = """' W Eik,A €Ik,A = 8 - kik j k2 -- A=1 (k, Ek,A' oX = 8ij ). = 1,2 form an orthogonal triad, i.e., kikj + L~=1 E~'A{ 14.4 The Free Electromagnatic Field and its Quantization 315 For the commutator, we now have ' )t ] [A i( x, t ) , A'j( x, """" ~ ~ = kA k'A' 1~ {-ikX i j * ('k')" , e e ik'x' Ek,AEk',A' 1 0 UAA'ukk' 2Vvkk' -eikx e - ik' x' i * j ( . k' ) , , Ek,A Ek , ,A' -1 0 UAA,ukk' = -2i L (i j EkAEk' ',A * eik(x-x') i· j } +EkA k , "e ',E A -ik(X-X')) kA =~ ~ (6 ij _ k~j) (eik(X-X') =i (6 ij _ 8;:) ~ = (8 ij _ 8:) 6(x-x'). i + e -ik(X-X')) eik(x-x') The commutator of the canonical variables thus reads: . .. ] [A"(x,t),A1(X',t) (.. (N)j) 8(x-x') =i 8"1- V2 (14.4.12a) or, on account of (14.4.1), . t), E1.(x', t) ] = [A"(x, -i j ( 8"1.. - aia V2 ) 8(x - x') (14.4.12b) and is consistent with the transversality condition that must be fulfilled by A and E. For the two remaining commutators we find [Ai(x,t),Aj(x',t)] = 0 [Ai(x,t),lV(X',t)] = o. (14.4.12c) (14.4.12d) These quantization properties are dependent on the gauge. However, the resulting commutators for the fields E and B are independent of the gauge chosen. Since E = -A. and B = curiA, one finds (14.4.12f) Whereas the commutator (14.4.12b) contains the nonlocal term V- 2 , the commutators (14.4.12e,f) of the fields E and B are local. 316 14. Quantization of the Radiation Field In order to eliminate the divergent zero-point energy in the Hamiltonian, we introduce the following definition H =: where ko = we have p = : ~ J d 3 x (E2 Ikl. + B2) : = L ko aLa kA (14.4.13) k,A Similarly, for the momentum operator of the radiation field, J d 3 xE xB:= LkaLa >.· k (14.4.14) k,A The normal ordered product for the components of the radiation field is defined in exactly the same way as for Klein-Gordon fields. 14.5 Calculation of the Photon Propagator The photon propagator is defined by iDj;,V(x - x') = (01 T(AJL(x)AV(X')) 10) (14.5.1) In its most general form, this second-rank tensor can be written as Dj:(x) = gJLV D(x 2 ) - f)JLf)v D(l)(x 2 ) , (14.5.2) where D (x 2 ) and D (l) (x 2 ) are functions of the Lorentz invariant x 2 . In momentum space, (14.5.2) yields: Dj;,V(k) = gJLV D(k 2 ) + kJLk VD(l)(k 2 ) • (14.5.3) In perturbation theory, the photon propagator always occurs in the combination j JLDj;,v (k)jv, where j JL and jv are electron-positron current densities. As a result of current conservation, f)JLjJL = 0, in Fourier space we have (14.5.4) and, hence, the physical results are unchanged when one replaces Dj:(k) by (14.5.5) where the XJL(k) are arbitrary functions of k. Once a particular gauge is specified, e.g., the Coulomb gauge, the resulting Dj;,V(k) is not of the Lorentz-invariant form (14.5.3); the physical results are, however, the same. The change of gauge (14.5.5) can be carried out simply with a view to convenience. We will now calculate the propagator in the Coulomb gauge and then deduce from this other equivalent representations. It is clear that D(k 2 ) in (14.5.3) is of the form 2 1 D(k ) ex k2 ' 14.5 Calculation of the Photon Propagator 317 since D';V(k) must satisfy the inhomogeneous d'Alembert equation with a four-dimensional O-source. We can adopt the same relations as for the KleinGordon propagators, but now it is also necessary to introduce the polarization vectors of the photon field. Introducing, in addition to (14.5.1), iD~V(x - x') = (01 AM(X)AV(x') 10) , (14.5.6a) we obtain from (13.1.25a) and (13.1.31) (14.5.6b) and Di(x - x') = 8(t t')D~V(x = -iJ d3 k _1_ ~ (27r)3 21kl ~ - x') - 8(t' EM EV t)D~V(x - x') (8(x O_ x,O)e-ik(x-x') k,'\ k,'\ +8(x'o _ xO)eik(X-X'») , (14.5.6c) i.e., d4 k -ik(x-x') DMV(x-x')=lim J _ _ AMV(k)e . F c--+o (27r)4 k 2 + IE (14.5.6d) Here, we have 2 AMV (k) = I: E/:,'\ Ek,'\ (14.5.7) ,\=1 with the components Making use of the tetrad = n M == (1,0,0,0) E~(k) Enk) = (0, Ek,d E~(k) , = (0, k/lkl) = E~(k) = (0, Ek,2) k M- (nk)nM 1/2 ' ((kn)2 - k 2) (14.5.8) 318 14. Quantization of the Radiation Field one can also write AIL'" in the form AIL"'(k) =_ IW _ g (klL - (kn)nIL) (k'" - (kn)n"') (kn)2 - k2 + nIL n '" kILk'" - (kn) (nILk'" + kILn"') k 2nILn'" = _gIL (kn)2 _ k2 - (kn)2 _ k2 '" (14.5.9) As was noted in connection with Eq. (14.5.5), the middle term on the second line of (14.5.9) makes no contribution in perturbation theory and can thus be omitted. The third term in (14.5.9) makes a contribution to the Feynman propagator iDF(x - x') (14.5.6d) of the form, -lim <-+0 J d4 k · , i k2 _ _ e-1k(X-X ) - - - - nIL n'" (2n)4 k 2 + if k 2 J d 3 k eik(x-x') J:( 0 -u x (2n)3 k2 ,0 nILn'" . 0 = -I8(x - x ) 4nx-x I 'I . IL '" = -In n X ,0) (14.5.10) In perturbation theory, this term is of no consequence in comparison to the Coulomb interaction, which appears explicitly when one works in the Coulomb gauge. To see this in more detail, we must also consider the Hamiltonian. The Lagrangian density .c (14.3.1) .c = -!FIL",FIL'" - jlLAIL 4 can also be written in the form .c = ~(E2 (14.3.1) - B2) - jlL AIL , (14.5.11) where E = E tr +El (14.5.12a) with the transverse and longitudinal components E tr =-A (14.5.12b) and El = -VAo. (14.5.12c) In the Lagrangian, the mixed term J d3 xEtr. El = J xA· d3 VA o , vanishes, as can be seen by partial integration and use of V . A the Lagrangian density (14.5.11) is equivalent to = O. Thus, 14.5 Calculation of the Photon Propagator 319 (14.5.13) For the momentum conjugate to the electromagnetic potential A, this yields: rrtr == 8~ 8A = -A.. (14.5.14) This, in turn, yields the Hamiltonian density H = H, = ~(rt)2 + Hint + ~(V X A)2 - ~(El)2 + j/1A/1 , (14.5.15) where the first two terms are the Hamiltonian density of the radiation field (14.4.3), and 1 1 2 Hint = -2(E) . + J/1A/1 is the interaction term. It will be helpful to separate out from the interaction term Hint the part corresponding to the Coulomb interaction of the charge density HCoul = 1 1 2 . 0 -2(E) + JoA . (14.5.16) When integrated over space, this yields: (14.5.17) which is exactly the Coulomb interaction between the charge densities jo(x, t). Thus, the total interaction now takes the form Hint = HCoul - J d 3 x j(x, t)A(x, t) . (14.5.18) The propagator of the transverse photons (14.5.6d), together with the Coulomb interaction, is thus equivalent to the following covariant propagator: D F/1 V( X ) _ - -g /1V l'1m €-tO J 4 k e- ikx -d- 4 -2-- . (271") k + if ( 14.5.19 ) As already stated at the beginning of this chapter, there are various ways to treat the quantized radiation field: In the Coulomb gauge, for every wave vector, one has as dynamical degrees of freedom the two transverse photons and, in addition, there is the instantaneous Coulomb interaction. Neither of 320 14. Quantization of the Radiation Field these two descriptions on its own is covariant, but they can be combined to yield a covariant propagator, as in Eq. (14.5.19). In the Lorentz gauge, one has four photons that automatically lead to the covariant propagator (14.5.19) or (E.1Ob). As a result of the Lorentz condition, the longitudinal and scalar photons can only be excited in such a way that Eq. (E.20a) is satisfied for every state. They therefore make no contribution to any physically observable quantities, except the Coulomb interaction, which they mediate. Problems 14.1 Derive the commutation relations (E.ll) from (E.8). 14.2 Calculate the energy-momentum tensor for the radiation field. Show that the normal ordered momentum operator has the form p =: ! d3 xE x B = LkaL,a kA · k,A 14.3 Using the results of Noether's theorem, deduce the form of the angular momentum tensor of the electromagnetic field starting from the Lagrangian density r _ .1..-- _ ~Flw 4 IW' (a) Write down the orbital angular momentum density. (b) Give the spin density. (c) Explain the fact that, although S = 1, only the values ±1 occur for the projection of the spin onto the direction k. 15. Interacting Fields, Quantum Electrodynamics 15.1 Lagrangians, Interacting Fields 15.1.1 Nonlinear Lagrangians We now turn to the treatment of interacting fields. When there are nonlinear terms in the Lagrangian density, or in the Hamiltonian, transitions and reactions between particles become possible. The simplest example of a model demonstrating this is a neutral scalar field with a self-interaction, (15.1.1) This so-called </>4 theory is a theoretical model whose special significance lies in the fact that it enables one to study the essential phenomena of a nonlinear field theory in a particularly clear form. The division of </> into creation and annihilation operators shows that the </>4 term leads to a number of transition processes. For example, two incoming particles with the momentum vectors kl and k2 can scatter from one another to yield outgoing particles with the momenta k3 and k4' the total momentum being conserved. As another example we consider the Lagrangian density for the interaction of charged fermions, described by the Dirac field 'ljJ, with the radiation field AIL (15.1.2) The interaction term is the lowest nonlinear term in AIL and 'ljJ that is bilinear in 'ljJ (see remark (iv) at the end of Sect. 13.4.1) and Lorentz invariant. A physical justification for this form will be given in Sect. 15.1.2 making use of the known interaction with the electromagnetic field (5.3.40). Quantum electrodynamics (QED), which is based on the Lagrangian density (15.1.2), is a theory describing the electromagnetic interaction between electrons, positrons, and photons. It serves as an excellent example of an interacting field theory for the following reasons: (i) It contains a small expansion parameter, the Sommerfeld fine-structure constant a ~ Ij7' so that perturbation theory can be successfully applied. 322 15. Interacting Fields, Quantum Electrodynamics (ii) Quantum electrodynamics is able to explain, among other things, the Lamb shift and the anomalous magnetic moment of the electron. (iii) The theory is renormalizable. (iv) Quantum electrodynamics is a simple (abelian) gauge theory. (v) It admits a description of all essential concepts of quantum field theory (perturbation theory, S matrix, Wick's theorem, etc.). 15.1.2 Fermions in an External Field Here, we consider the simplest case of the interaction of an electron field with a known electromagnetic field AeJ.!' which varies in space and time. The Dirac equation for this case reads: (i-yJ.!0J.! - m)7/J = eyJ.! AeJ.!7/J (15.1.3) and has the Lagrangian density £ ij;("(J.!(iOJ.! - eAeJ.!) - m)7/J = £0 +£1, (15.1.4) = where £0 is the free Lagrangian density and £1 the interaction with the field £0 = ij;(i-yJ.!0J.! - m)7/J £1 = -eij;,J.!7/JAeJ.! = -ejJ.! AeJ.! . The momentum conjugate to 7/J0. is 7f0. the Hamiltonian density is given by 1i = 1io + 1il = ij;( -i-yj OJ + m)7/J + eij;,J.!7/JAeJ.! (15.1.5) = :t = i7/Jl, as in (13.3.5), so that . (15.1.6) In the above, AeJ.! was an external field. In the next section we will consider the coupling to the radiation field, which is itself a quantized field. 15.1.3 Interaction of Electrons with the Radiation Field: Quantum Electrodynamics (QED) 15.1.3.1 The Lagrangian and the Hamiltonian Densities The Hamiltonian and the Lagrangian densities of the interacting Dirac and radiation fields are obtained by replacing AeJ.! in (15.1.5) by the quantized radiation field and adding the Lagrangian density of the free radiation field (15.1.7) 15.2 The Interaction Representation, Perturbation Theory 323 This is identical to the form postulated for formal reasons in (15.1.2). It leads to the conjugate momenta to the Dirac and radiation fields: _ 8£ _. t 7r" - -.- - 8'lj;" I'lj;" _ 8£ _ . III" - - . - - -AI" , 8AI" (15.1.8) and the Hamiltonian density operator + Hbhoton + HI H = H~irac , (15.1.9) where H~irac and Hbhoton are the Hamiltonian densities of the free Dirac and radiation fields (Eq. (13.3.7) and (E.14)). Here, HI represents the interaction between these fields (15.1.10) 15.1.3.2 Equations of Motion of Interacting Dirac and Radiation Fields For the Lagrangian density (15.1.7), the equations of motion of the field operators in the Heisenberg picture read: (i~ - m)'lj; = e/j'lj; DAI" = ei/yyl"'lj; . (15.1.11a) (15.1.11b) These are nonlinear field equations which, in general, cannot be solved exactly. An exception occurs for the simplified case of one space and one time dimension: a few such (1 + 1)-dimensional field theories can be solved exactly. An interesting example is the Thirring model (15.1.12) This can also be obtained as a limiting case of Eq. (15.1.11a) with a massive radiation field, i.e., (15.1.13) in the limit of infinite M. In general, however, one is obliged to use the methods of perturbation theory. These will be treated in the next sections. 15.2 The Interaction Representation, Perturbation Theory Experimentally, one is primarily interested in scattering processes. In this section we derive the S-matrix formalism necessary for the theoretical description of such processes. We begin by reiterating a few essential points from quantum mechanics I concerning the interaction representation. These will facilitate our perturbation treatment of scattering processes. 1 See, e.g., QM I, Sects. 8.5.3 and 16.3.1. 324 15. Interacting Fields, Quantum Electrodynamics 15.2.1 The Interaction Representation (Dirac Representation) We divide the Lagrangian density and the Hamiltonian into a free and an interaction part, where Ho is time independent: £ = £0 +£1 H=Ho +H1 (15.2.1) (15.2.2) . When the interaction £1 contains no derivatives, the density corresponding to the interaction Hamiltonian HI = f d3x1il is given by (15.2.3) We shall make use of the Schrodinger representation in which the states 1'ljI, t) are time dependent and satisfy the Schrodinger equation i :t 1'ljI, t) = H 1'ljI, t) . (15.2.4) The operators are denoted by A. The fundamental operators such as the momentum, and the field operators such as 'ljI(x) , are time independent in the Schrodinger picture. (Note that the field equations (13.1.13), (13.3.1), etc. were equations of motion in the Heisenberg picture.) When external forces are present, then one might also encounter Schrodinger operators with explicit time dependence (e.g., in Sect. 4.3 on linear response theory). The definition of the interaction representation reads: 1'ljI, t) I = eiHot 1'ljI, t) , AI(t) = eiHot Ae- iHot . (15.2.5) In the interaction representation, due to Eq. (15.2.4), the states and the operators satisfy the equations of motion i :t 1'ljI, t) I ! AI(t) = H l I (t) 1'ljI, t) I = i [Ho, AI(t)] + :t (15.2.6a) AI(t) . (15.2.6b) The final term in (15.2.6b) only occurs when the Schrodinger operator A depends explicitly on time. In the following, we will make use of the abbreviated notation 1'ljI(t)) == 1'ljI,t)I (15.2.7a) and (15.2.7b) The equation of motion for 1'ljI(t)) has the form of a Schrodinger equation with time-dependent Hamiltonian HI(t). When the interaction is switched off, i.e., when HI(t) = 0, the state vector in the interaction picture is time independent. The field operators in this representation satisfy the equations of motion dcprI(X, t) _ . [H A. ( t)] dt - 1 0, 'l'rI x, (15.2.8) 15.2 The Interaction Representation, Perturbation Theory 325 i.e., the free equations of motion. The field operators in the interaction representation are thus identical to the Heisenberg operators of free fields. Since £1 contains no derivatives, the canonical conjugate fields have the same form as the free fields, e.g., {J£ {J~a {J£o {J~a in quantum electrodynamics. Hence, the equal time commutation relations of the interacting fields are the same as those for the free fields. Since the interaction representation arises from the Schrodinger representation, and hence also from the Heisenberg representation, through a unitary transformation, the interacting fields obey the same commutation relations as the free fields. Since the equations of motion in the interaction picture are identical to the free equations of motion, the operators have the same simple form, the same time dependence, and the same representation in terms of creation and annihilation operators as the free operators. The plane waves (spinor solutions, free photons, and free mesons) are still solutions of the equations of motion and lead to the same expansion of the field operators as in the free case. The Feynman propagators are again i..:1 F (x - x') etc., where the vacuum is defined here with reference to the operators ark', brk , , d>.k. The time evolution of the states is determined by the interaction Hamiltonian. Let us once more draw attention to the differences between the various representations in quantum mechanics. In the Schrodinger representation the states are time dependent. In the Heisenberg representation the state vector is time independent, whereas the operators are time dependent and satisfy the Heisenberg equation of motion. In the interaction representation the time dependence is shared between the operators and the states. The free part of the Hamiltonian determines the time dependence of the operators. The states change in time as a result of the interaction. Thus, in the interaction representation, the field operators of an interacting nonlinear field theory satisfy the free field equations: for the real Klein-Gordon field these are given by Eq. (13.1.2), for the complex Klein-Gordon field by (13.2.2), for the Dirac field by (13.3.1), and for the radiation field by (14.1.8). For the time dependence of these fields one thus has the corresponding plane-wave expansions (13.1.5), (13.2.5), (13.3.18), and (14.4.4) or (E.5) (see also (15.3.12a-c)). We also recall the relation between Schrodinger and Heisenberg operators in the interacting field theory "pHeisenb. (x, t) = eiHt"pschrod. (x)e- iHt AHeisenb. (x, t) = (15.2.9) eiHt ASchrod. (x)e- iHt . In the interaction representation, one obtains "pI(X) == eiHot"pschrod.(x)e-iHot = "p(x) AIL(X) == eiHot AILSchrod. .. (x)e- iHot = AIL(x) I (15.2.10) , 326 15. Interacting Fields, Quantum Electrodynamics where'lj;(x) (AJl(x)) is the free Dirac field (radiation field) in the Heisenberg representation, x == (x, t). Since the interaction Hamiltonian is a polynomial in the fields, e.g., in quantum electrodynamics, in the Schrodinger picture, HI = e J d3 xi[r yJl'lj;AJl , and in the interaction representation one has HJ(t) == H lI (t) == eiHotHle-iHot =e J (15.2.11) d3 xi{J(x)')'Jl'lj;(x)AJl(x) , x == (x, t). Here, the field operators are identical to the Heisenberg operators of the free field theories, as given in (13.3.18) and (14.4.4) or (E.5). One finds the time-evolution operator in the interaction picture by starting from the formal solution of the Schrodinger equation (15.2.4), I'lj;, t) = e-iH(t-to) I'lj;, to). This leads, in the interaction representation, to I'lj;(t)) = eiHote-iH(t-to) I'lj;, to) = eiHote-iH(t-to)e-iHoto I'lj;(to)) (15.2.12) == U'(t, to) I'lj;(to)) with the time-evolution in the interaction picture U'(t, to) = eiHote-iH(t-to)e-iHoto . (15.2.13) From this relation, one immediately recognizes the group property (15.2.14a) and the unitarity (15.2.14b) of the time-evolution operator. Unitarity requires the hermiticity of Hand Ho. The equation of motion for this time-evolution operator is obtained from i :t U'(t, to) = eiHot ( -Ho = + H)e-iH(t-to)e-iHoto eiHot Hle-iHoteiHote-iH(t-to)e-iHoto (or from the equation of motion (15.2.6a) for I'lj;(t))): i:tU'(t, to) = HJ(t)U'(t,to). (15.2.15) Remark. This equation of motion also holds in the case where H, and hence HI, have an explicit time dependence: Then, in Eqs. (15.2.12)-(15.2.15), one must replace e-iH(t-to) by the general Schrodinger time-evolution operator U(t, to), which satisfies the equation of motion itt U(t, to) = HU(t, to). 15.2 The Interaction Representation, Perturbation Theory 327 15.2.2 Perturbation Theory The equation of motion for the time evolution operator (15.2.15) in the interaction picture can be solved formally using the initial condition U'(to, to) = 1 in the form U'(t,to) = (15.2.16) l-ilt dhHI(h)U'(tl,tO) ' (15.2.17) to i.e., it is now given by an integral equation. The iteration of (15.2.17) yields: i.e., U'(t, to) = I: (-it (tdh n=O X Jto t 1 dt2 ... (15.2.18) Jto lt n - 1 dt n H I (h)H I (t2) ... HI(t n ) . to By making use of the time-ordering operator T, this infinite series can be written in the form (15.2.19) or, still more compactly, as U' (t, to) = Texp ( -i 1: dt' HI (t') ) . (15.2.19') One can readily convince oneself of the equivalence of expressions (15.2.18) and (15.2.19) by considering the nth-order term: In (15.2.19) the times fulfil either the inequality sequence h ~ t2 ~ ... ~ tn, or a permutation of this inequality sequence. In the former case, the contribution to (15.2.19) is In the latter case, i.e., when a permutation of the inequality sequence applies, one can rename the integration variables and, thereby, return once more to 328 15. Interacting Fields, Quantum Electrodynamics the case tl 2: t2 2: ... tn. One thus obtains the same contribution n! times. This proves the equivalence of (15.2.18) and (15.2.19). The contribution to (15.2.19) with n factors of HI is referred to as the nth-order term. The time-ordering operator in (15.2.19) and (15.2.19'), also known as Dyson's time-ordering operator or the chronological operator, signifies at this stage the time ordering of the composite operators HI(t). If, as is the case in quantum electrodynamics, the Hamiltonian contains only even powers of Fermi operators, it can be replaced by what is known as the Wick's timeordering operator, which time-orders the field operators. It is in this sense that we shall use T in the following. The time-ordered product T( . .. ) orders the factors so that later times appear to the left of earlier times. All Bose operators are treated as if they commute, and all Fermi operators as if they anticommute. We conclude this section with a remark concerning the significance of the time-evolution operator U'(t, to), which, in the interaction picture, according to Eq. (15.2.12), gives the state 11jJ(t)) from a specified state 11jJ(to)). If at time to the system is in the state Ii), then the probability of finding the system at a later time t in the state If) is given by I (II U'(t, to) Ii) 12 . (15.2.20) From this, one obtains the transition rate, i.e., the probability per unit time of a transition from an initial state Ii), to a final state If) differing from the initial state ((ilf) = 0) as, Wi-+J = _1_1 (II U'(t, to) Ii) 12 . t - to (15.2.21) 15.3 The S Matrix 15.3.1 General Formulation We now turn our attention to the description of scattering processes. The typical situation in a scattering experiment is the following: At the initial time (idealized as t = -00), we have widely separated and thus noninteracting particles. These particles approach one another and interact for a short time corresponding to the range of the forces. The particles, and possibly newly created ones, that remain after this interaction then travel away from one another and cease to interact. At a much later time (idealized as t = 00), these are observed. The scattering process is represented schematically in Fig. 15.1. The time for which the particles interact is very much shorter than the time taken for the particles to travel from the source to the point of observation (detector); hence, it is reasonable to take the final and initial times as t = ±oo, respectively. 15.3 The S Matrix 329 Fig. 15.1. Schematic representation of a general scattering process. A number of particles are incident upon one another, interact, and scattered particles leave the interaction region. The number of scattered particles can be greater or smaller than the number of incoming particles At the initial time ti = -00 of the scattering process, we have a state Ii) corresponding to free, noninteracting particles I'lj;(-oo)) = Ii) . After the scattering, the particles that remain are again well separated from one another and are described by I'lj;(oo)) = U'(oo, -00) Ii) . (15.3.1 ) The transition amplitude into a particular final state If) is given by (I1'lj;(00)) = (II U'(oo, -00) Ii) = (II S Ii) = Sfi . (15.3.2) The states Ii) and If) are eigenstates of Ho. One imagines that the interaction is switched off at the beginning and the end. Here, we have introduced the scattering matrix, or S matrix for short, by way of S = U(oo, -00), (-it s= L-,00 n=O x £: n. 1 1 00 00 dh -00 dt2'" (15.3.3) -00 dtn T (HI(tdHI (t2) ... HI (tn)) . If the Hamiltonian is expressed in terms of the Hamiltonian density, one obtains (15.3.4) Since the interaction operator is Lorentz invariant, and the time ordering does not change under orthochronous Lorentz transformations, the scattering matrix is itself invariant with respect to Lorentz transformations, i.e., it is a relativistically invariant quantity. In quantum electrodynamics, the interaction Hamiltonian density appearing in (15.3.4) is (15.3.5) 330 15. Interacting Fields, Quantum Electrodynamics The unitarity of U(t, to) (see (15.2.14b)) implies that the S matrix is also unitary sst = sts = 1 (15.3.6a) 1 (15.3.6b) or, equivalently, L SfnS:n = 8fi L S~fni = 8fi . (15.3.7a) n (15.3.7b) n To appreciate the significance of the unitarity, we expand the asymptotic state that evolves from the initial state Ii) through 17P(oo)) = (15.3.8) S Ii) in terms of a complete set of final states {If)}: 17P(oo)) = L If) UI7P(oo)) f = L If) Sfi . (15.3.9) f We now form (7P(oo)17P(oo)) = L SjiSfi = L f ISfil 2 = 1 , (15.3.10) f where we have used (15.3.7b). The unitarity of the S matrix expresses conservation of probability. If the initial state is Ii), then the probability of finding the final state If) in an experiment is given by ISfi1 2 . The unitarity of the S matrix guarantees that the sum of these probabilites over all possible final states is equal to one. Since particles may be created or annihilated, the possible final states may contain particles different to those in the initial states. The states Ii) and If) have been assumed to be eigenstates of the unperturbed Hamiltonian H o, i.e., the interaction was assumed to be switched off. In reality, the physical states of real particles differ from these free states. The interaction turns the "bare" states into "dressed" states. An electron in such a state is surrounded by a cloud of virtual photons that are continually being emitted and reabsorbed, as illustrated in Fig. 15.2. 2 2 In order that the energy spectrum of the bare (free) particle be identical to that of the physical particle, the Lagrangian density of the Dirac field is reparameterized as with the renormalized (physical) mass mR = m + 8m. The Hamiltonian density then includes an additional perturbation term -8mi{;'Ij;. In the lowest order processes treated in the next section, this plays no role. It will be analysed further when we come to the topic of radiative corrections in Sect. 15.6.1.2. 15.3 The S Matrix + + + 331 +... Fig. 15.2. The propagation of a real (physical) electron involves the free propagation and the propagation that includes the additional emission and reabsorption of virtual photons. The significance of the different lines is explained in Fig. 15.3. The calculation of the transition elements between bare states Ii) and If) can be justified by appealing to the adiabatic hypothesis. The interaction Hamiltonian HJ(t) is replaced by HJ(t)((t), where lim ((t) t-t±oo =0 and ((t) = 1 for - T <t <T , i.e., at time t = -00 one has free particles. During the time interval -00 < t < T, the interaction turns the free particles into physical particles. Thus, in the time interval [- T, T], we have real particles that experience the total interaction HJ(t). Since the particles involved in a scattering process are initially widely separated, they only interact during the time interval [-T, TJ, which is determined by the range of the interaction and the speed of the particles. The time T must, of course, be much larger than T: T » T. The assumption of the adiabatic hypothesis is that the scattering cannot depend on the description of the states long before, or long after, the interaction. At the end of the calculation one takes the limit T --+ 00. If one is calculating a process only in the lowest order perturbation theory at which it occurs, one uses the entire interaction for the transition and not to convert the bare state into a physical state. In this case, one can take the limit T --+ 00 from the outset, and use the full interaction Hamiltonian in the whole time interval. The types of transition processes are determined by the form of the interaction Hamiltonian. If the initial state contains a certain number of particles, then the effect of the term of nth order in S (Eq.(15.3.4)) is the following: The application of HJ(xn) causes some of the original particles to be annihilated and new ones to be created. The next factor llJ(xn-d leads to further annihilation and creation processes, etc. It is necessary here to integrate over the space-time position of these processes. We will elucidate this for a few examples taken from quantum electrodynamics. Here, the interaction Hamiltonian density is HJ(x) = e: ijj(x)iJ.(x)'l/J(x) with the field operators (15.3.11) 332 15. Interacting Fields, Quantum Electrodynamics 7jJ(x) = P'~2 (V~p)1/2 L (V~ ifj(x) = p,r=1,2 A"(x) = ~ ~ 3 ( (brpur(p)e-iPX+dtpwr(p)eiPX) (15.3.12a) )1/2 (btpur(p)eipx+drpwr(p)e-ipx) (15.3.12b) p 1 2Vlkl )1/2 f~(k) (a),(k)e- ikX + al(k)e ikx ). (15.3.12c) As in previous chapters, at this stage it is useful to introduce a graphical representation, shown in Fig. 15.3. A photon is represented by a wavy line, an electron by a full line, and a positron by a full line with an arrow in the opposite time direction. If the particle interacts with an external electromagnetic field, it is similarly represented to a photon line, with a cross3 photon electron positron external field Fig. 15.3. The lines of the Feynman diagrams; the time axis points upwards 15.3.2 Simple Transitions We first discuss the basic processes that are brought about by a single factor 1h(x). The three field operators in 1lJ(x) can be split into components of positive and negative frequency, yielding a total of eight terms. For example, the term 7jJ+ annihilates an electron, while 7jJ- creates a positron. The term eifj- (x)A+ (x)7jJ+(x) annihilates a photon and an electron originally present at x, and once more creates an electron at x. This process is represented by the first diagram in Fig. 15.4. One could also describe this as the absorption of a photon by an electron. If one instead takes the summand A - from the photon field, one obtains a transition in which a photon is created at the position x, in other words, a process in which an electron at x emits a photon (the first of the lower series of diagrams in Fig. 15.4). The other six elementary processes are also shown in this figure. It is not necessary to discuss each of these in detail; we shall select just one. The third diagram of the lower series stems from eifj+ A-7jJ+ and represents the annihilation of an electron-positron pair, 3 As already mentioned elsewhere, these graphical representations are more than just simple illustrations: In the form of Feynman diagrams they prove to be unambiguous representations of the analytical expressions of perturbation theory. 15.3 The S Matrix 333 The elementary processes of the QED vertex 1lJ(x) = e: + 4-)( 'lj;+ '-v-" '-v-" '-v-" '-v-" '-v-" '-v-" ann e+ cr e- ann I cr'"'( ann e- cr e+ ( 1/;+ + 1/;- )(·r + 'lj;- ) Photon absorption e Photon emission e- scattered by a photon e+ scattered by a photon Pair annihilation Pair creation Fig. 15.4. The elementary processes of the QED vertex; the time axis points up- wards i.e., the transition of an electron and a positron into a photon. The range of possible processes is determined by the form of 1-lI(X) and its powers. The points on the diagrams at which particles are incident or are emitted (i.e., are created or annihilated) are also known as vertices. Figure 15.5 shows processes of various order, all of which are possible for an initial state consisting of an incoming electron and an incoming positron. Figure 15.5a shows the noninteracting motion described by the zeroth-order term. Figure 15.5b shows the second-order interaction in which the electron emits a photon which is absorbed by the positron. This process likewise contains the emission of a photon by the positron and its absorption by the electron. This process leads to a final state that once again contains an electron and a positron, hence it describes the scattering of an electron-positron pair. The diagram is of second order with two vertices. At higher orders of perturbation theory, the electron and positron can interact to produce the fourth-order scattering process represented in Fig. 15.5c. In the diagram shown in Fig. 15.5d, the positron propagates without interaction. The electron first emits a photon and then experiences a deflection due to an external potential. The final state consists of an electron, a positron, and a photon. 334 15. Interacting Fields, Quantum Electrodynamics Initial state e - + e+ Scattering: e e e b) a) Bremsstrahlung: c) Pair annihilation: d) e) f) Fig. 15.5. Examples of reactions having as the initial state electron plus positron: a) motion of noninteracting electron and positron; b) scattering of electron and positron; c) fourth-order scattering, exchange of two photons; d) Bremsstrahlung of the electron in the presence of an external field; the positron here propagates without interaction; e) pair annihilation in the presence of an external time-varying field whose frequency is equal to the energy of the e + e - pair; f) pair annihilation with a final state containing two photons One refers here to the bremsstrahlung of the electron in the presence of the external field. In Fig. I5.5e an external field causes the annihilation of the electron-positron pair. In order that this process may really happen, the frequency of the external field must be high enough that it at least equals the energy of the electron-positron pair. The diagram given by Fig. I5.5f represents pair annihilation, the final state consisting of two photons. We should now like to establish the transition probabilities for these processes. In order to determine the energy, momentum, and angular dependence of the individual transitions, one has to calculate the matrix elements. The procedure is similar to that for calculating the correlation functions in nonrelativistic many-particle physics. One has to re-order the creation and annihilation operators, using the commutation and anticommutation relations, in such a way that all annihilation operators are on the right, and all creation operators on the left. The effect of an annihilation operator on the vacuum state to the right is to yield zero, as is the effect of a creation operator acting * 15.4 Wick's Theorem 335 to the left. In a transition from a state Ii) to a state II), there are contributions only from those summands of products in T (1i(xd ... 1i(xn)) for which the creation and annihilation operators exactly compensate one another4 . The commutation of annihilation operators to the right then yields finite contributions from commutators or anticommutators. As will emerge, these can be expressed in terms of propagators. Thus, the result for the transition amplitude has the following structure: If the diagram consists of vertices at positions Xl, ..• ,X n and of incoming and outgoing particles, then the result is a product of propagators and this product is to be integrated over all positions of the vertices Xl, .•. ,X n . For simple processes it is easy to carry out the procedure that we have sketched step by step. In so doing, we find a set of rules which enable an analytical expression to be assigned to every diagram. These are known as the Feynman rules. To derive the Feynman rules systematically, one needs Wick's theorem, which allows an arbitrary time-ordered product to be represented by a sum of normal ordered products. The lines in the Feynman diagrams that correspond to incoming and outgoing particles are known as external lines, and the others are called internal lines. The particles represented by internal lines are termed virtual particles. In the diagram of Fig. 15.5f, the internal line can be viewed either as the motion of a virtual electron from the left- to the right-hand vertex, or as the motion of a positron from the right- to the left-hand vertex. To obtain the total transition probability, one must integrate over all space-time positions of the two vertices. Both processes are described by the Feynman propagator that analytically represents this internal line (see also the discussion at the end of Sect. 13.1). *15.4 Wick's Theorem In order to calculate the transition amplitude from the state Ii) to the state II), one needs to determine the matrix element (II S Ii). If one considers a particular order of perturbation theory, one has to evaluate the matrix element of a time-ordered product of interaction Hamiltonians. Of the many terms in the perturbation expansion, contributions come only from those 4 If, for example, the initial state Ii) contains an electron with quantum numbers (p, r) and a photon with (k, ).), then it is of the form whereas the final state as (fl = If) with particles (p', r') (01 a>.., (k')br,p' . and (k', )") appears in bra form 336 15. Interacting Fields, Quantum Electrodynamics whose application to Ii) yields the state If). Hence, (apart from the possibility of individual particles moving without interaction) the corresponding perturbation-theoretical contribution to the S matrix must contain those annihilation operators that annihilate the particles in Ii) and those creation operators that create the particles in If). In addition, a general term in S will also contain further creation and annihilation operators responsible for the creation and subsequent annihilation of virtual particles. These particles are termed virtual because they are not present in the initial or final states; they are emitted and reabsorbed in intermediate processes, e.g., the photon in Fig. 15.5b. The virtual particles do not obey the energy-momentum relation, p2 = m 2 , valid for real particles, i.e., they do not lie on the mass shell. As already mentioned in the previous section, one can calculate the value of such matrix elements of S by using the commutation relations to move the annihilation operators to the right. Instead of carrying out this calculation for every single case individually, it is helpful to rewrite the time-ordered products so that they are normal ordered from the start, i.e., so that all annihilation operators are to the left of all creation operators. Wick's theorem tells us how an arbitrary time-ordered product can be represented as a sum of normal ordered products. Wick's theorem is the basis for the systematic calculation of pertubation-theoretical contributions and their representation by means of Feynman diagrams. Since the Hamiltonian density of a normal ordered product is ll(x) = e : ij;(x)iJ.(x)'IjJ(x) : , (15.4.1) the nth-order term of S has the form s(n) = (-i~ t J d 4 xl ... d 4 xn n. x T (: ij;(xdiJ.(xd'IjJ(xd (15.4.2) this type of time-ordered product of partially normal ordered factors is known as a mixed time-ordered product. In order to facilitate the formulation of Wick's theorem, we summarize a few properties of the time-ordered and normal ordered products that were introduced in Eqs. (13.1.28) and (13.4.12). For given field operators A l , A 2 , A 3 , ... , one has (see Eq. (13.1.18c)) the distributive law AlA3A4 + A2 A3A4 : AlA3A4 : + : A2 A3A4 (15.4.3) The contraction of two field operators A and B, such as 'IjJ (xd, 'IjJ t (X2), or AJ.£(X3), ... , is defined by AB '---' == T(AB)-: AB (15.4.4) * 15.4 Wick's Theorem 337 It is easy to convince oneself that such contractions are c numbers: According to the general definition, T(AB) orders the operators A and B chronologically, and, for the case of two Fermi operators, introduces a factor (-1). Since the commutator or the anticommutator of free fields is a c number, T(AB) - AB is also a c number, and the same is true of: AB : -AB and the difference (15.4.4). Since the vacuum expectation value of a normal ordered product vanishes, it follows from (15.4.4) that AB (01 T(AB) 10) . = (15.4.5) L---' With this, the most important contractions are already known as a result of the Feynman propagators evaluated in (13.1.31), (13.4.16), and (14.5.19). For the real and the complex Klein-Gordon field, for the Dirac field, and for the radiation field, respectively, we find the following: ¢(Xl)¢(X2) = iL1 F (Xl - X2) '-------.J ¢(Xd¢t(X2) = ¢t(X2)¢(Xl) = iL1 F (Xl - X2) f",(xd~;3X2) '-------.J I (15.4.6) '--'-------'--' = -f;3(X2)~",xd AI"(xdAV(X2) = iD~V(Xl = iSFa;3(Xl - X2) - X2) . I Furthermore, we also have etc. since all these pairs of operators either commute or anticommute with one another. We recall that, in the interaction representation, the fields in 1i(x) are free Heisenberg fields. According to Eq. (15.4.4), the time-ordered product of two field operators can be represented in normal ordered form as follows: T(AB) =: AB : +AB . (15.4.8) L---' We now define what is known as the generalized normal product (normal product with contractions) of the field operators A = A(xd, B = B(X2), ... , which also contains within itself contractions of these operators: : ABCDEF ... KL ... ~ I L-..J := (-1( ACDLEF ... : BK ... :, (15.4.9) L-..J I L-...J L-.J where P is the number of the individual permutations of Fermi operators that is necessary to obtain the order ACDLEF ... BK .... For example, 1/1", (xdAI" (X2)1';3(X3)'l/Jr (x4)A V(X5)~" = (-1)f;3X~,x6: (X6) : 'l/Ja(xl)AI"(X2)'l/Jr(X4)A V(X5) (15.4.10) We are now in a position to formulate Wick's theorem both for pure time-ordered products and for mixed time-ordered products. 338 15. Interacting Fields, Quantum Electrodynamics 1st Theorem: The time-ordered product of the field operators is equal to the sum of their normal products in which the operators are linked by all different possible contractions: T(AIA2 A 3 ... An) =: A 1 A 2A 3 ··· An : + : A 1 A 2A3 ··· An : + : A 1 A 2A 3 ··· An + : AIA2A3A4··· An : + ... + : + .... '--....J ~ (15.4.11 ) In the first line there are no contractions, in the second, one, in the third, two, etc. 2nd Theorem: A mixed T product of field operators is equal to the sum of their normal products in the form (15.4.11), with the difference that the sum does not include contractions between operators that occur within one and the same normal product factor. For example, we have T('l/;l : 'l/;2'l/;3'l/;4 :) 'l/;1'l/;2'l/;3'l/;4 : = + : 'l/;1'l/;2'l/;3'l/;4 : + : 'l/;1'l/;2'l/;3'l/;4 : + : 'l/;1'l/;2'l/;3'l/;4 L-..J '-----...J I I (15.4.12) The proof of Wick's theorem is not essential for its application. Hence, the remainder of this section, which presents a simple proof, could be omitted. We first prove the 1st theorem, Eq. (15.4.11), for the case in which the operators of the product AIA2 ... An are time ordered from the outset. We will show that the general case can be reduced to this special case. We now express the field operators in terms of their positive and negative frequency components. We divide this product, which occurs in time-ordered form from the outset, into a sum of products of positive and negative frequency parts. We select one such term arbitrarily; this is time ordered but, in general, not normal ordered. We then re-order its factors in the following way: The leftmost creation operator that is not in normal order is moved step-by-step to the left by permuting it successively - through commutation or anticommutation - with each of the annihilation operators that appear to the left of it. This procedure is then repeated for the next non-normal ordered creation operator and continued until all operators have become normal ordered. For each of these permutations one obtains, from (15.4.8) and the definition of the normal ordered product, At Ak" = T(At Ak") =: At Ak" : +1t 1k" = ±Ak" At + 1t 1k" ' (15.4.13) where the lower sign applies when both operators are Fermi operators. In the final result, each of the summands is in normal ordered form with a sign that is determined by the number of pairs of Fermi operators that have been 15.5 Simple Scattering Processes, Feynman Diagrams 339 permuted. These signs can be eliminated from the expression if one writes each of the summands in the time-ordered (original) sequence and subjects it to the normal-ordering operation: ... : (see, e.g., (15.4.13), where one can write ±Ak At =: At Ak :). The result now closely resembles the expression (15.4.11) except for the fact that not all contractions appear; it includes only those between the "wrongly"positioned (non-normal ordered) operators. However, since the contraction of two operators that are both time ordered and normal ordered vanishes, we can add all such contractions to the result and thus, using the distributive law, we obtain the sum of normal ordered products with all contractions. We have thus proved (15.4.11) for the time sequence tl > ... > tn. We now consider the operators AIA2 ... An and an arbitrary permutation P(A I A 2 ... An) of these operators. On account of the definition of the time-ordering and normal-ordering operations, we have (15.4.14a) and (15.4.14b) with the same power P. Hence, we have demonstrated theorem 1, Eq. (15.4.11), for arbitrary time ordering of the operators AI, ... ,An. Theorem 2 is obtained from the proof of theorem 1 as follows: The partial factors: AB ... : within the mixed time-ordered product are already normal ordered. In the procedure described above for constructing normal ordering there is no permutation, and thus no contraction, of these simultaneous operators. The contractions of these already normal ordered, simultaneous operators - which would not vanish - do not occur. This proves theorem 2. 15.5 Simple Scattering Processes, Feynman Diagrams We shall now investigate a number of simple scattering processes, for which we will calculate the matrix elements of the S matrix. In so doing, we will encounter the most important features of the Feynman rules, which have already been mentioned on several occasions. In order of increasing complexity, we will study the first-order processes of the emission of a photon by an electron and the scattering of an electron by an external potential (Mott scattering) and, as examples of second-order processes, the scattering of two electrons (M011er scattering) and the scattering of a photon from an electron (Compton scattering). 15.5.1 The First-Order Term Taking the simplest possible example, we consider the first-order contribution to the S matrix, Eq. (15.4.14) 340 15. Interacting Fields, Quantum Electrodynamics s(1) = -ie J d 4 x : 7,6(x)4(x)'Ij;(x) : (15.5.1) with the field operators from Eq. (15.3.12). The possible elementary processes that result have already been discussed in Sect. 15.3.2 and are represented in Fig. 15.4. Of the eight possible transitions, we consider here the emission of a photon 'I by an electron (Fig. 15.6), or, in other words, the transition of an electron into an electron and a photon: Fig. 15.6. The emission of a photon I from an electron e-. This process is virtual, i.e., only possible within a diagram of higher order The initial state containing an electron with momentum p (15.5.2a) goes into the final state containing an electron with momentum p' and a photon with momentum k' (15.5.2b) The spinor index r and the polarization index A are given for the creation operators, but for the sake of brevity are not indicated for the states. The first-order contribution to the scattering amplitude is given by the matrix element of (15.5.1). Contributions to (II S(1) Ii) come from 1j;(x) only through the term with brp , from 7,6(x) only through the term b~,p" and from A(x) only through a A (k'): (II S(1) Ii) = -ie X 'IlL [ x [( V~p) J (V;p') ~ (2V~k' d 4x [ I) ~ Ur,(p')eiP'X] EAIL (k')e ik ' x ] (15.5.3) ~ ur(p)e- iPX ]. The integration over x leads to the conservation of four-momentum and thus to the matrix element 15.5 Simple Scattering Processes, Feynman Diagrams (fl S(1) Ii) = _(21f)48(4l(p' + k' _ p) (~)"2 1 YEp (~)"2 1 YEp' xieur,(p'hlLfAIL(k' = p - p')ur(p) . 341 (_1_)"2 1 2Vlk'i (15.5.4) The four-dimensional <5 function imposes conservation of momentum p' = p - k' and of energy Ep-k' + Ik'i = Ep. For electrons and photons, the latter condition leads to k' . p/lk'llpl = + m 2/p2. In general, this cannot be satisfied since the two end products would always have a lower energy than the incident electron. The condition of energy-momentum conservation cannot be satisfied for real electrons and photons. Thus, this process can only occur as a component of higher-order diagrams. A preliminary comparison of Fig. 15.6 with Eq. (15.5.3) shows that the following analytical expressions can be assigned to the elements of the Feynman diagram: To the incident electron ur(p)e- ipx , to the outgoing electron ur' (p')eip'X, to the vertex point -ie-I'll, to the outgoing photon fAIL (k')e ik ' x, and, in addition, one has to integrate over the position of the interaction point x, i.e., the vertex point is associated with the integration J d 4 x. Carrying out this integration over x, one obtains from the exponential functions the conservation of four-momentum (21f)4<5(4l(p' + k' - p), and hence one obtains finally the following rules in momentum space: Assigned to the incident electron is U r (p), to the outgoing electron ur ' (p'), to the outgoing photon f~(k'), and to the vertex point -iel''''(21f)4<5(4l (p' +k'-p). Jl 15.5.2 Mott Scattering Mott scattering is the term used to describe the scattering of an electron by an external potential. In practice, this is usually the Coulomb potential of a nucleus. The external vector potential then has the form A~ (V(x), 0, 0, 0)) = (15.5.5) The initial state Ii) = bt p (15.5.6a) 10) and the final state If) = b~,p' 10) (15.5.6b) each contains a single electron. The scattering process is represented diagrammatically in Fig. 15.7. Fig. 15.7. Mott scattering: An electron e- is scattered by an external potential 342 15. Interacting Fields, Quantum Electrodynamics The S-matrix element that follows from m x V(x) ( VEp S(1) in Eq. (15.5.1) has the form )! ur(p)e-. 1PX = -ieV(p - p') ( -mVEpl (15.5.7) )!( )! -mVEp x M21fl5(pD _ plo) . Here, (15.5.8) is the spinor matrix element. In the calculation of the transition probability I (II S(1) Ii) 12 , formally the square of a 8-function appears. In order to give meaning to this quantity, one should recall that the scattering experiment is carried out over a very long, but nonetheless finite, time interval T and that 27r8(E) should be replaced by 27r8(E) -+ jT/2 dte iEt -T/2 (15.5.9) . The square of this function is also encountered in the derivation of Fermi's golden rule (1:/:dteie.)' ~ G ~T)' ffin ~ 2~T(':n = 2~Td(E). (15.5.10) Here we have used the fact, detailed in QM I, Eqs. (16.34)-(16.35), that the final expression in brackets is a representation of the 8-function: lim 8T (E) T-too . = T-too hm (sin 2 ET/2) E2T/ 2 7r = 8(E) In briefer form, this justification may also be presented as lim j T-too T/2 dt e iEt -T /2 jT/2 dt e iEt = 27rT8(E) , (15.5.10') -T /2 where the limit of the first factor is expressed by 27r8(E) , which for the second integral then yields I~2 dteD = T. The transition probability per unit time Tij is obtained by dividing by T (Eq. (15.5.7)): I (II S(1) Ii) 12 15.5 Simple Scattering Processes, Feynman Diagrams 343 (15.5.11) The differential scattering cross-section is defined by da dJl dN(Jl) NindJl' (15.5.12) where dN(Jl) is the number of particles scattered into the element of solid angle dJl , and Nin is the number of particles incident per unit area (see Fig. 15.8). The differential scattering cross-section (15.5.12) is also equal to the number of particles scattered per unit time into dJl divided by the incident current density jin and by dJl: da dJl dN(Jl)/dt jindJl (15.5.12') In addition to the transition rate ril that has already been found, we also need to know the incident flux and the number of final states in the element of solid angle dJl. We will first show that the flux of incident electrons is given by In order to do so, we need to calculate the expectation value of the current density lfJ. (15.5.13) in the initial state Ii) = le-,p) = bt p 10) We obtain I .J.t( x ) Ie,p - ) -VE - m (e _,PJ p - ( ) J.t () _ pJ.t UrP'Y UrP -VE ' (15.5.14) p where we have used the Gordon identity (Eq. (10.1.5)) ur(p),J.tur,(q) = 1 2mUr(P) [(p+qt+iaJ.t"'(p-qt]ur,(q) . The incident current density jin is thus equal to . Jin Ipl = VEp ' p (15.5.15) Fig. 15.8. Scattering of an electron with momentum p by a potential. The momentum of the scattered electron is p', the angle of deflection iJ, and the solid angle d!2 344 15. Interacting Fields, Quantum Electrodynamics which, as might be expected, is the product of the particle number density ~ and the relative velocity ~I. p To determine dN(D) per unit time, we need the number of final states in the interval d3 p' around p'. Since the volume of momentum space per momentum value is (27f)3 IV, the number of momentum states in the interval d 3 p' is Vlp'1 2 dlp'ldD Vlp'IE'dE'dD (27f )3 (27f )3 (15.5.16a) where we have used and dE' = -7~= Ip'ldlp'l Vp,2 + m 2 Ip'ldlp'l E' (15.5.16b) Inserting Eqs. (15.5.11), (15.5.15) and (15.5.16a) into the differential scattering cross-section (15.5.12')5, one obtains the cross-section per element of solid angle dD, by keeping dD fixed and integrating over the remaining variable E' (15.5.17) where the conservation of energy, expressed by o(E - E'), yields the condition Ip'l = Ipi for the momentum of the scattered particle. The Fourier transform of the Coulomb potential of a nucleus with charge Z, in Heaviside-Lorentz units,6 Ze V(x) = 47flxl reads: Ze V(p - p') = I '12 (15.5.18) p-p For the sake of simplicity, we assume that the incident electron beam is unpolarized. This corresponds to a sum over both polarization directions with the weight ~, i.e., ~ 2::r=1,2' The polarization of the scattered particles is likewise not resolved, giving a sum 2::r' over the two polarization directions of the final state. Under this condition, inserting (15.5.8) and (15.5.18) into (15.5.17), yields for the differential scattering cross-section 5 dN(Jl) -d-t- - '" ~fEdJl r.2f -- v (27r)3 f p'EdJl 6 See footnote 1 in Chap. 14. d 3 P ' r2f· 15.5 Simple Scattering Processes, Feynman Diagrams ~~ dCJ dfl ° 2 (em)2! " 1_U,.I (') 2 2" ~ ~ P , u,. ()1 P 7f ,.,,. (Ze)21 4 I Ip ~ P 1 Ip'I=lpl 345 (15.5.19) Hence, we are led to the calculation of ,., ,. ,., ,. ° (p+m) = 'a'a ~2- 4~2 = m ° '(3(3' (15.5.20) (pl+m) -2-- a(3 m (3' a ' TqO(p + mhO (pi + m) , where we have used the representations (6.3.21) and (6.3.23) of the projection operator A+, which leads to the trace of the product of, matrices. Making use of the cyclic invariance of the trace, Tr = 0, {,I', ,V} = 2gJ.Lvll, and Tr ,O,J.L,O,V = 0 for IL i= v, one obtains ,v = Trr'/jr'/p' +4m 2 =Pl'p~Tr,o"(I./ + 4m 2 + Pk~ Tr "(0 "(k "(0 "(k + 4m 2 2 = 4(p~ + pp' + m ) = 4(E; + pp' + m 2 ) . = po~ Tr 1 (15.5.21 ) From the expression for the velocity 7, oE V= - op and = Ipi = P P =Vp2 +m 2 E Ev , pp' = Ipl2 cos 19 (for Ip'l (15.5.22a) = Ipl) we gain the following relations: Ip - p '12 = 2p 2 ( 1 - cos 19) . 2 '2 19 = 4p 2 sm (15.5.22b) and E2 + P . P ,+2 m = 19 . 2 2 E2 - P2 ( 1 - cos 19 ) = 2E 2 - 2p 2 sm 2 19 sm '2 ) . = 2E 2( 1- v 2.2 (15.5.22c) Inserting (15.5.20), (15.5.21), and (15.5.22a-c) into (15.5.19), one finally obtains the differential scattering cross-section for Mott scattering 7 E and Ep are used interchangeably. 346 15. Interacting Fields, Quantum Electrodynamics da dD (15.5.23) ·2 where a is Sommerfeld's fine-structure constant 6 a = ~. In the nonrelativistic limit, (15.5.23) yields the Rutherford scattering law, see Eq. (18.37) in QM I, da dD (15.5.24) For the scattering of Klein-Gordon particles one has, instead of Eq. (15.5.23), da dD see Problem 15.2. In addition to the elements of the Feynman diagrams encountered in the preceding section, here we also have a static external field A~(x), represented as a wavy line with a cross. According to Eq. (15.5.7), in the transition amplitude this is assigned the factor A~(x), or in momentum space (15.5.25) 15.5.3 Second-Order Processes 15.5.3.1 Electron-electron scattering Our next topic is the scattering of two electrons, also known as M(Illler scattering. The corresponding Feynman diagram is shown in Fig. 15.9. This is a second-order process following from the S-matrix term: S (2) = (_i)2 2! = (ie)2 2! X J d 4 Xl 4 d X2 T (llI(XI)1LJ(X2)) Jd dX2 4 Xl 4 T (: ¢(xJ)4(xJ)7/J(xJ) (15.5.26) : ¢(x2)4(x2)7/J(x2) :) . Application of Wick's theorem leads to one term without contraction, three terms each with one contraction, three terms each with two contractions, and, finally, to one term with three contractions. The term that contains the two external incident and outgoing fermions is 15.5 Simple Scattering Processes, Feynman Diagrams 347 J = -2" J - e; d4 xI d4 x2 : ¢(xI)1(xd1/J(XI)¢(X2)1(X2)1/J(X2) : e2 d4 Xl d 4 X2 : ¢(XI}"/'1/J(XI)¢(X2)"t1/J(X2)iDFfLV (XI - X2) . , (15.5.27) where DFfLV(XI - X2) is the photon propagator (14.5.19). Depending on the initial state, this term leads to the scattering of two electrons, of two positrons, or of one electron and one positron. We shall consider the scattering of two electrons e- + e- e~ + e- , from the initial state (15.5.28a) into the final state (15.5.28b) Here, there are clearly two contributions to the matrix element of S(2). The direct scattering contribution, in which the operator ¢(XI)"(fL1/J(XI) annihilates the particle labeled 1 with spinor Url (PI) at the position Xl and creates the particle with spinor Ur~ (p~). The operator ¢(X2)"(v1/J(X2) has the same effect on the particle labeled 2. The other contribution is the exchange scattering. This is obtained when the effect of the annihilation operators remains as just described, whilst the operator ¢(XI)"(fL1/J(XI) creates the particle in the final state ur~ (p~) and the second operator creates the particle in the state ur~ (p~). These two contributions are shown diagrammatically in Fig. 15.9. Exactly the same contributions arise when one, instead, exchanges the positions Xl and X2 of the two interaction operators. Since one has to integrate over Xl and X2, one obtains twice the contribution of the two diagrams in Fig. 15.9. The factor 2 arising from the permutation of the two vertex positions cancels with the factor in S(2). This is a general property of Feynman diagrams. The factor ~n. in sen) can be omitted when one is summing only over topologically distinct diagrams. tt HX I 1Tl Xl P1Tl I I 1Tl P2T2 X2 a) P2T2 Xl P1Tl I P2T X2 b) P2 T 2 Fig. 15.9. Electron-electron scattering: (a) direct scattering, (b) exchange scattering 348 15. Interacting Fields, Quantum Electrodynamics The 8-matrix element for the direct scattering in Fig. 15.9a is (II 8(2) li)a = _e 2 J d4X l d 4x2 (V 4E PI ;4 E , E ,) P2 P, P2 ~ X e-ipIX+~2; X (Ur~ (P~hJLUrI (pd)ur~ (P~h"Ur2 (P2))iD FJLV (Xl - X2) . (15.5.29) The exchange scattering contribution (b) is obtained by exchanging the wave functions of the final states in I8(2) Ii) a' so that the wave-function part has the form U (15.5.30) The minus sign that appears here is due to the fact that, in the exchange term, one needs an odd number of anticommutations to bring the creation and annihilation operators into the same order as in the direct term. If, e-ik(Xl -X2) in (15.5.29) one inserts iDFJLII(Xl - X2) = i J d4 k -9/Lv k 2+i€ carrying out the integrations, one obtains ' then, after (15.5.31 ) where the matrix elements are given, for the graph 15.9a, by Ma = -e2u(p~hJLdiDFI - p~)U(h"P2 (15.5.32a) and for the graph 15.9b, by (15.5.32b) The 0 function in (15.5.31) expresses the conservation of the total fourmomentum of the two particles. Since, in the matrix element Ma for the direct scattering, the photon propagator, for example, has the argument k == P2 - P; = p~ - PI, the four-momentum of the particles is conserved at every vertex. Hereby, we fix the orientation of the photon momentum of the internal line to be from right to left. In principle, the orientation of the photon momentum is arbitrary since DFJLII(k) = D FJLII ( -k); however, one must select an orientation so that one can monitor the conservation of momentum at the vertices. The Feynman diagrams in momentum space are shown in Fig. 15.10. The Feynman rules can now be extended as follows: To every internal photon line with momentum argument k, with end points at the vertices 'Y JL and 'Y/, assign the propagator iDFJLII (k) = i ;l~ . 15.5 Simple Scattering Processes, Feynman Diagrams 349 , , Fig. 15.10. Feynman diagrams in momentum space for electronelectron scattering a) direct scattering, b) exchange scattering P2 b) a) We now turn to the evaluation of the matrix element (15.5.32a), which is now lengthier than for Mott scattering. Instead of going through the details of the calculation, we refer the reader to the problems and the supplementary remarks at the end of this section and discuss the final result. The relation between the differential scattering cross-section and the matrix element M in the center-of-mass system for two fermions with mass ml and m2 is given, according to Eq. (15.5.59), by da I = _1_ mlm21MI2 dfl eM (471")2 E tot (15.5.33) ' where E tot is the total energy. Inserting the results (15.5.37)-(15.5.43) into (15.5.33), one obtains for the scattering cross-section in the center-of-mass frame (Fig. 15.11) the M011er formula (1932) Q2(2E2 - m 2)2 4E2(E2 - m 2)2 da dfl 3+ ---x (-4 sin4 f} sin 2 f} 4)) (E2 - m 2)2 ( 1+--. (2E2 - m 2)2 sin4 f) (15.5.34) In the nonrelativistic limit, E 2 ::::J m 2, v 2 = (E2 - m 2)/E2, this yields: da dfl I nr = 1(1 (Q)2 m 4v 4 sin4 :Q2 1 + cos4 :Q2 1) - sin 2 :Q cos 2 :Q 2 P~ PI = (E,p) 2 ' (15.5.35) = (E,p') P2 = (E, -p) Fig. 15.11. Kinematics of the scattering of two identical particles in the center-of-mass system 350 15. Interacting Fields, Quantum Electrodynamics a formula that was originally derived by Mott (1930). It is instructive to compare this result (15.5.35) with the classical Rutherford-scattering formula (15.5.36) This classical formula contains the familiar sin -4 ~ term, but also an additional cos- 4 ~ term, since here we are considering the scattering of two (identical) electrons. If one observes the scattering at a particular angle {), then the probability of observing the electron incident from the left is proportional to sin -4 ~. The probability that the electron incident from the right is scattered into this direction is, as can be seen from symmetry considerations, proportional to sin- 4 ( 1r - {)) -2- = cos- 4 "2{) . Classically, these probabilities simply add, which leads to (15.5.36). In the quantum-mechanical result (15.5.35), however, an additional term arises due to the interference between the two electrons. In quantum mechanics it is the two amplitudes, corresponding to the Feynman diagrams 15.9a and 15.9b, that are added. The scattering cross-section is then obtained as the absolute magnitude squared. The minus sign in the interference term results from the Fermi statistics; for bosons one obtains a plus sign. In the extreme relativistic case, !f; --+ 00, from (15.5.34) we have (15.5.37) Supplement: Calculation of the differential scattering cross-section for electron--electron scattering. For the scattering cross-section, we need (15.5.38) We assume an unpolarized electron from (15.5.32a,b) with iDFJ1-I/(k) = ;J~1. beam that scatters from likewise unpolarized electrons and, furthermore, that the polarization of the scattered particles is not registered; this implies the summation -41 Lr 1 Lr2 Lr'1 Lr'2 == -41 Lr r'· For the first term in (15.5.38) we obtain 1" t 15.5 Simple Scattering Processes, Feynman Diagrams __ e2 IMal 2 = "4 L ur~ (P~hJLUr, (pdur~ (P~hJLUr2 351 (P2) r1.,T~ X 1 _PI)2]2 url(PIh"U~)2[p e4 = "4 L Url (PIh"ur~ (P~)Ur (P~hJLUr, (pd r'l,~ (15.5.39) x Ur2 (P2hlur~ (P~)Ur _e4 Tr ( 'YII P~ m PI2m+ m) + m,,'/J2 + m) x Tr ( 'Y IP~ -----'Y _ PI)2]2 2m 'YJL r 2m p~ (P2) [(p~ + 4 - (P~hJLUr2 1 ----- 2m 1 [(p~ - PI)2]2 The second term of (15.5.38) is obtained by exchanging the momenta p~ in IMaI 2 : and (15.5.40) and the third is x Re x uri [ur~ (P~hJLUr, (PIh"ur~ Tr ( (P~hJLUr2 (P~)Ur2 e4 4 (p~ x (pdur~ (P2) (P2hlur~ (P~)] 1 p2)(~ 'YI~JL - P~ - PI)2 +m PI ~'Y +m II P~ + m JL '/J2 + m) 2m 'Y 2m (15.5.41 ) In the final expression here, it is possible to omit the Re since its argument is already real. There still remains the evaluation of the traces: For Eq. (15.5.39) one needs Tr ({IP~ + mhJL(h + m)) = 4(gJLllm2 + PIJL~ + PI~JL - gJLIP~ . PI) . (15.5.42) In Eq. (15.5.41) we encounter 'YI(P~ + mhJL(PI + mh" = -2PnJL~ + 4m(pIJL + P~JL) - 2m 2'YJL (15.5.43) 352 15. Interacting Fields, Quantum Electrodynamics and Tr bv(p~ = + mhp,(p1 + m)"t(p; + mhP,(p2 + m)) Tr (( -2pn,~ + 4m(p1 + p~), - 2m2"(p,)(p; + mhIJ(h + m)) = Tr( - 2P1(4p~ . P; - 2mpD(p2 + m) + 4m(p; + m)(h + P~)(2 + m) - 2m2(-2p; +4m)(p2 +m)) = 16( -2p1·P2 p~ .p;+m21·~ +m2(p1 +p~)·(2;m-4 . (15.5.44) The formulas (15.5.38)-(15.5.44) were used in going from (15.5.31) to (15.5.34). *15.5.3.2 Scattering Cross-Section and S-Matrix Element In many applications, it is important to have a general relation between the scattering cross-section and the relevant S-matrix element. We consider the scattering of two particles with four-momenta Pi = (Ei' Pi), i = 1,2, which react to yield a final state containing n particles with momenta PI = (E'.r, PI)' f = 1, ... , n. For brevity, we suppress the polarization states. The S-matrix element for the transition from the initial state Ii) into the final state If) has the form (II S Ii) = 8fi x + (27r)48(4) (LPI - LPi) II (2:E-) 1/2 II (2:E' ) 1/2 II i ' f f (2m)1/2 M. (15.5.45) external fermions The final product TIexternal fermions(2m)1/2 stems from the normalization factor in (15.3.12a,b) and contributes a factor (2m)1/2 for every external fermion, whereby the masses can be different. The amplitude factor M = 2:~=1 M(n) is the sum over all orders of perturbation theory, where M(n) stems from the term s(n). The four-dimensional 8 function is obtained for an infinite time interval and an infinite normalization volume. As was the case for Mott scattering, it is convenient to consider a finite time interval T as well as a finite volume. One then has (15.5.46) 15.5 Simple Scattering Processes, Feynman Diagrams 353 and (15.5.47) This leads to a transition rate or, in other words, a transition probability per unit time, of 2 ISfil ----y;- Wfi = ~ V(2~)'.s>f - 2:p,) (I.J 2:E,) II x (2m) IMI2 . (I} 2:Ei) (15.5.48) external fermions Wfi is the transition rate into a particular final state f. The transition rate into a volume element in momentum space TIL d3 is obtained by multiplying (15.5.48) by the number of states in this element p; ISfil 2 T II V d pf 3 (15.5.49) (271")3 f The scattering cross-section is the ratio of the transition rate to the incident flux. In differential form this implies (15.5.50) 1 4E E 1 II (2m) IM1 2 d4>n . 2 Vrel external fermions The normalization is chosen such that the volume V contains one particle and the incident flux equals vreJ/V, with the relative velocity Vrel. In the center-of-mass frame (P2 = -PI), the relative velocity of the two incident particles is (15.5.51 ) 354 15. Interacting Fields, Quantum Electrodynamics In the laboratory frame, where particle 2 is assumed to be at rest, we have P2 = 0, and the relative velocity is (15.5.52) In the calculation of the scattering cross-sections, one encounters phase-space factors of the outgoing particles )II (2 d3)32E' Pf I dcfJ n = (27f) 4 8 (4) (~Pf ' " I - PI - P2 7f f . (15.5.53) f If one is interested in the cross-section of the transition into a certain region of phase space, one must integrate over the remaining variables. Since the total four-momentum is conserved, the momenta pi I, ... pi n cannot all be independent variables. We consider the important special case of two outgoing particles (15.5.54) The integration over pi 2 yields 8 : (' 1 dcfJ2 = - ( )2 8 EI 27f 8(E~ + + E2 E~ I - EI - E2)pidD~ 167f2 E~ E~ == where, in this equation, E~ PI == p\ yields 9 : 1 ) d 3 pi - EI - E2 ---'-E' 4EI 2 (15.5.55) Ep;=Pl +P2~' The further integration over 1 dcfJ 2 = 16 7f dD~ Ipi 12 2EI,8(~+;) I 81p~ 2 (15.5.56) . I In the center-of-mass system we have P'2 E f'2 -_ mf12 = _p'I' From the relation + p'2f' f = 1,2 (15.5.57) it follows that 8(E~ 8 9 + E~) 81pil = 1 1(~ I PI E~ ~) + E~ = 1 'I EI + E2 PI E~ (15.5.58) The phase space changes in size when going from (15.5.54) to (15.5.56), since indefinally we are calculating the cross-section per element of solid angle dn~, pendently of Ip~1 and p;. The notation de}) is retained throughout. 6(f(X)) = Lxo 1f'(~o)6X - xo), where the sum extends over all (simple) zeros of f(x). 15.5 Simple Scattering Processes, Feynman Diagrams 355 Inserting this into (15.5.56) and (15.5.58), and inserting (15.5.52) into (15.5.50), one obtains the differential scattering cross-section in the center-of-mass frame as da I (15.5.59) dD eM This is the relationship we are seeking between the differential scattering cross-section and the amplitude M. The special case of electron-electron scattering was analyzed in Sect. 15.5.3.1. 15.5.3.3 Compton Scattering Compton scattering refers to the scattering of a photon from a free electron. In practice, the electrons are frequently bound but the high energy of the photons often means that they can still be considered as effectively free 10 . In this scattering process the initial state contains an electron and a photon, as does the final state. In second-order perturbation theory, Wick's theorem yields two contributions, each with one contraction of two Fermi operators 'ljJ and ij;. The two Feynman diagrams are shown in Fig. 15.12. From these we can directly deduce a further Feynman rule. To each internal fermion line there corresponds a propagator isF (p) = p-;'+i<' The two diagrams (b) and (c) are topologically equivalent: it is sufficient to consider just one of them. e (a) (b) (c) Fig. 15.12. Compton scattering (a) A photon is absorbed and then emitted. (b) The photon first creates an e+e- pair. This diagram is topologically equivalent to (c), where first a photon is emitted, and only afterwards is the incident photon absorbed by the electron 10 The great historical significance of the Compton effect for the evolution of quantum mechanics was described in QM I, Sect. 1.2.1.3. 356 15. Interacting Fields, Quantum Electrodynamics Remark. In connection with the calculation of the photon propagator in the Coulomb gauge (Sect. 14.5), it was asserted that the photon propagator appears only in the combination jp.D';v jv. We now explain this in second-order perturbation theory, for which the contribution to the S matrix is S(2) = (~)2 = JJ JJ (~r d4xd4x'T(jP.(x)Ap.(x)jV(x')A v (x')) d4xd4x' T (jp.(x)jV(x')) T (Ap.(x)Av(x')) , since the electron and photon operators commute with one another. The contraction of the two photon fields yields: Jd4X~' T (jp.(x)jV(x')) Dpp.v(x - x') = J d4kT(jP.(k)j"(k)) Dpp.v(k) , and, on account of the continuity equation jI"(k)kp. = 0, the part of Dpp.v(k) which we call redundant in (E.26c) makes no contribution. 15.5.4 Feynman Rules of Quantum Electrodynamics In our analysis of scattering processes in Sects. 15.5.2 and 15.5.3, we were able to apply Wick's theorem to derive the most important elements of the Feynman rules which associate analytical expressions to the Feynman diagrams. We summarize these rules in the list below and in Fig. 15.13. For given initial and final states Ii) and 11), the S-matrix element has the form (II S Ii) = 8fi + [(271l 8(4)(Pf - Pi) ( ll. ~) fermIOn ( 11 j 2:lkl)] M, photon where Pi and Pf are the total momenta of the initial and final states. In order to determine M, one draws all topologically distinct diagrams up to the desired order in the interaction and sums over the amplitudes of these diagrams. The amplitude associated with a particular Feynman diagram is itself determined as follows: 1. 2. 3. 4. One assigns a factor of -ie1'1L to every vertex point. For every internal photon line one writes a factor iDFf.Ll/(k) = ij;l';;'. For every internal fermion line one writes iSF(p) = ip_;'+ic. To the external lines one assigns the following free spinors and polarization vectors: incoming electron: U r (p) outgoing electron: ur (p) incoming positron: wr (p) outgoing positron: wr(p) 15.5 Simple Scattering Processes, Feynman Diagrams 357 Feynman rules of quantum electrodynamics in momentum space External lines: I I ! f u,-(P) incident electron outgoing electron incident positron outgoing positron incident photon outgoing photon I I external field Internal lines: k • ~ p iDcV(k)=;:l~ 1/ internal photon line internal electron line I b vertex Fig. 15.13. The Feynman rules of quantum electrodynamics. The end points where the external lines and propagators can be attached to a vertex are indicated by dots 358 5. 6. 7. 8. 15. Interacting Fields, Quantum Electrodynamics incoming photon: f,AJL(k) outgoing photon: f,AJL(k) The spinor factors (-y matrices, SF propagators, four-spinors) are ordered for each fermion line such that reading them from right to left amounts to following the arrows along the fermion lines. For each closed fermion loop, multiply by a factor (-1) and take the trace over the spinor indices. At every vertex, the four-momenta of the three lines that meet at this point satisfy energy and momentum conservation. It is necessary to integrate over all free internal momenta (Le., those not fixed by four-momentum conservation): f (27rP. 9. One multiplies by a phase factor 8p = l(or - 1), depending on whether an even or odd number of transpositions is necessary to bring the fermion operators into normal order. d4 An example of a diagram containing a closed fermion loop is the selfenergy diagram of the photon in Fig. 15.21, another being the vacuum diagram of Fig. 15.14. Vacuum diagrams are diagrams without external lines. CD Fig. 15.14. The vacuum diagram of lowest order The minus sign for a closed fermion loop has the following ongm: Starting from the part of the T-product which leads to the closed loop ... ~(x2)AX ... ~(Xf )A(xf )~(Xf ... ), one has T (... ~(xdA to permute ~(xd with an odd number of fermion fields to arrive at the arrangement ... A(Xl)~2 ... ~(xf )A(xf )~(xd ... leading to the sequence of propagators 1f'(Xl)'0(X2) ... 1(J(Xf )'0(xd with a minus sign. *15.6 Radiative Corrections We will now describe a few other typical elements of Feynman diagrams, which lead in scattering processes to higher-order corrections in the charge. These corrections go by the general name of radiative corrections. If, in electron-electron scattering, for example, one includes higher order Feynman diagrams, one obtains correction terms in powers of the fine-structure constant Q. Some of these diagrams are of a completely new form, whereas *15.6 Radiative Corrections 359 others can be shown to be modfications of the electron propagator, the photon propagator, or the electron-electron-photon vertex. In this section we will investigate the latter elements of Feynman diagrams. The intention is to give the reader a general, qualitative impression of the way in which higher corrections are calculated, and of regularization and renormalization. We will not attempt to present detailed quantitative calculations. l l 15.6.1 The Self-Energy of the Electron 15.6.1.1 Self-Energy and the Dyson Equation As observed earlier, an electron interacts with its own radiation field. It can emit and reabsorb photons. These photons, which are described by perturbation-theoretical contributions of higher order, modify the propagation properties of the electron. If, for example, a fermion line within some diagram is replaced by the diagram shown in Fig. 15.15b, this means that > (b) (a) Fig. 15.15. Replacement of a fermion propagator (a) by two propagators with inclusion of the self-energy (b) the propagator as a whole, (a)+(b), becomes SF(p) -+ S~(p) = SF(P) + SF (p) E(p) SF (p) . (15.6.1) The bubble consisting of a photon and a fermion line (Fig. 15.15b) is called the self-energy E(p). The corresponding analytical expression is given in Eq. (15.6.4). Summing over processes of this type S~(p) = SF(p) + SF(p)E(p)SF(p) +SF(p)E(p)SF(p)E(p)SF(p) + ... = SF(p) + SF (p)E(p) (SF(P) + SF(p)E(p)SF(p) + ... ), (15.6.2a) one obtains the Dyson equation (15.6.2b) 11 See, e.g., J. M. Jauch and F. Rohrlich, The Theory of Photons and Electrons or J. D. Bjorken and S. D. Drell, Relativistic Quantum Mechanics, McGraw-Hill, New York, 1964, p. 153 360 15. Interacting Fields, Quantum Electrodynamics with the solution 8' ( ) _ 1 F P - (8 F (p))-1 _ E(p) (15.6.3) Hence, the self-energy E(p) and its associated self-energy diagram give, among other things, a correction to the mass and a modification of the particle's energy, the latter being the source of the name "self-energy". The diagrammatic representation of (15.6.2a) and (15.6.2b) is given in Fig. 15.16. To distinguish it from the free (or bare) propagator 8F(p), the propagator ==-+ +00 + 0 0 0 + ... =--+ Fig. 15.16. Diagrammatic representation of the Dyson equation (15.6.2a,b). The propagator S~ (p) is represented by the double line 8~(p) is called the interacting (dressed) propagator. It is represented diagrammatically by a double line. A few self-energy diagrams of higher order have already been given in Fig. 15.2. In general, a part of a diagram is called a self-energy contribution when it is linked to the rest ofthe diagram only by two 8F(p) propagators. A proper self-energy diagram (also one-particle irreducible) is one that cannot be separated into two parts by cutting a single 8 F (p) line. Otherwise, one has an improper self-energy diagram. The self-energy diagrams in Fig. 15.2 are all proper ones. The second summand in the first line of Fig. 15.16 contains a proper self-energy part; all the others are improper. The Dyson equation (15.6.2b) can be extended to arbitrarily high orders; then, E(p) in (15.6.2b) and (15.6.3) consists of the sum of all proper self-energy diagrams. The analytical expression corresponding to the lowest order self-energy diagram of Fig. 15.17, which is also contained as a part of Fig. 15.15, reads according to the Feynman rules k p-k Fig. 15.17. Lowest (proper) self-energy diagram ofthe electron * 15.6 Radiative Corrections 361 (15.6.4) This integral is ultraviolet divergent; it diverges logarithmically at the upper limit. The bare (free) propagator tJ-':'+if has a pole at the bare mass p = m, i.e., tJ-':'+if = p~;nLf has a pole at p2 = m 2 . Correspondingly, the interacting propagator that follows from (15.6.3) is' F (p) = P_ m - i E(p) + if (15.6.5) will possess a pole at a different, physical or renormalized mass (15.6.6) mR =m+8m. The mass of the electron is modified by the emission and reabsorption of virtual photons (see, e.g., the diagrams in Fig. 15.2). We rewrite (15.6.5) with the help of (15.6.6), is' F (p) = P_ mR - i E(p) + 8m + if (15.6.7) 15.6.1.2 The Physical and the Bare Mass, Mass Renormalization It will prove convenient to redefine the fermion part of the Lagrangian density, and likewise that of the Hamiltonian, £ == ;j;(if) - m)'Ij; - eijJLN = ;j;(if) - mR)'Ij; - e;j;4'1j; + 8m;j;'Ij; , (15.6.8) so that the free Lagrangian density contains the physical mass. This then takes account of the fact that the nonlinear interaction in £1 modifies the bare mass m, and that the resulting physical mass mR, which is observed in experiment, differs from m according to (15.6.6). Individual particles that are widely separated from one another and noninteracting, as is the case for scattering processes before and after the scattering event, possess the physical mass mR. According to Eq. (15.6.8), the Hamiltonian density contains, in addition to e;j;4'1j;, the further perturbation term -8m ;j;'Ij;. The 8m has to be determined such that the combined effect of the two terms in the modified interaction part, (15.6.9) produces no change in the physical electron mass. The perturbation term -8m;j;'Ij; is represented diagrammatically in Fig. 15.18. It has the form of a 362 15. Interacting Fields, Quantum Electrodynamics -Om -~O' Fig. 15.18. Feynman diagram for the mass counter term -om i[n/; vertex with two lines. Subtraction of the term -8m 1/;'I/J, has the result that the "bare" particles of the thus redefined Lagrangian density have the same mass, and, hence, the same energy spectrum, as the physical particles, namely mR. Every self-energy term of the form shown in Fig. 15.19a is accompanied by a mass counter term (b), which cancels the k-independent contribution from (a). In higher orders of e, there are further proper self-energy diagrams to be considered, and 8m contains higher-order corrections in e. For the redefined Lagrangian density (15.6.8) and the interaction Hamiltonian density (15.6.9) the propagator also has the form (15.6.7), where the self-energy E(p) • 2 -lE(p) = -e J d4 k (27r)4 k2 -i i + if ""Iv P_ ~ _ mR + if ""I v (15.6.10) differs from (15.6.4) only in the appearance of the mass mR. The mass shift 8m is obtained from the condition that the sum of the third and fourth terms in the denominator of (15.6.7) produces no change in the (physical) mass, i.e., that iS~(p) has a pole at p = mR: (15.6.11) k > P G p-k a) > p > + 0 p > p b) Fig. 15.19. The lowest self-energy contributions according to the Lagrangian density (15.6.8) or (15.6.9). (a) Self-energy as in Eq. (15.6.4) with m -+ mR; (b) mass counter term resulting from the mass correction in (15.6.9) * 15.6 Radiative Corrections 363 15.6.1.3 Regularization and Charge Renormalization Since the integrand in (15.6.10) falls off only as k- 3 , the integral is ultraviolet divergent. Thus, in order to determine the physical effects associated with E(p), one needs to carry out a regularization which makes the integral finite. One possibility is to replace the photon propagator by 1 - - - ---+ 2 k + if k2 1 - ).2 + if - 1 k2 - A2 + if . (15.6.12) Here, A is a large cut-off wave vector: for k « A the propagator is unchanged and for k » A it falls off as k- 4 , such that E(p) becomes finite. In the limit A -+ 00 one has the original QED. In addition, ). is an artificial photon mass that is introduced so as to avoid infrared divergences, and which will eventually be set to zero. With the regularization (15.6.12), E(p) becomes finite. It will be helpful to expand E(p) in powers of (p - mR), (15.6.13) From (15.6.10) one sees that the p-independent coefficients A and B diverge logarithmically in A, whereas Ef(P) is finite and independent of A. If one multiplies E(p) from the left and right by spinors for the mass mR, only the constant A remains. If one considers aaE(p) , and again multiplies from the left PI" and right by the spinors, then only _"(i-' B remains. We will need this result later in connection with the Ward identity. The result of the explicit calculation l l is: According to Eq. (15.6.11), the mass shift 8m is obtained as 8m=A= 3mRalog~ 2n (15.6.14) mR which is logarithmically divergent. The coefficient B is a A2 B = - log 4n ~m a m2 - - log -.fl. 2n ).2 (15.6.15) The explicit form of the finite function E f (p) will not be needed here. It follows that iS~(p) = i (p - mR) [1 + B - (p - mR)Ef(p)] (p - mR) (1 + B) (1 - (p - mR)Ef(p)) + O(a 2 ) iZ2 with (15.6.16) 364 15. Interacting Fields, Quantum Electrodynamics Z-I 2 == 1 + B = 1 + ~ 471" (lOg A2 _ 2 log m'h m'h) ).2 (15.6.17) The quantity Z2 is known as the wave function renormalization constant. Now, a propagator connects two vertices, each of which contributes a factor e. Hence, the factor Z2 can be split into two factors of ..;z;. and, taking into account the two fermions entering at each vertex, one can redefine the value of the charge (15.6.18) Here, e~ is the preliminary renormalized charge. In the following, we will undertake two further renormalizations. The electron propagator that remains after the renormalization has the form (15.6.19) and is finite. 15.6.1.4 Renormalization of External Electron Lines The diagram 15.20a contains a self-energy insertion in an external electron line. This, together with the mass counter term of Fig. 15.20b, leads to the following modification of the spinor of the incident electron: ur(p) -+ ur(p) -+ (1 - +p i . (i(P -mR +If P-mR i +. (p If mR)B - i(p - mR)2 Ej(P))ur mR)B)Ur(P) , (15.6.20) since the last term in the first line vanishes on account of (p - m R) Ur (p) = o. The expression in the second line is undetermined, as can be seen, either by allowing the two operators to cancel with one another, or by applying (p - mR) to ur(p). By switching the interaction on and off adiabatically, (15.6.21 ) Fig. 15.20. (a) A diagram with inclusion of the self-energy in an external fermion line. (b) Mass counter term * 15.6 Radiative Corrections 365 where limt--+±oo «(t) = 0, and «(0) = 1, Eq. (15.6.20) is replaced by a welldefined mathematical expression, with the result (15.6.22) This means that the external lines, like the internal ones, also yield a factor V1 - B in the renormalization of the charge. Thus, Eq. (15.6.18) also holds for vertices with external lines: e ---+ e~ = (1 - B)e . Apart from the factor Z~/2, which goes into the charge renormalization, there are no radiative corrections in the external electron lines. The result (15.6.22) is to be expected intuitively for the following reasons: (i) Even an external electron must have been emitted somewhere and is thus an internal electron in some larger process. It thus yields a factor of V1 - B at every vertex. (ii) The transition from S'p to S'p in Eq. (15.6.19) can be regarded as a replacement of . -1/2 1/2 the field 'lj; by a renormahzed field 'lj; R = Z2 'lj; + ... , or Z2 'lj; R = 'lj; + ... . From this, one also sees that Z2 represents the probability of finding in a physical electron state one bare electron. 15.6.2 Self-Energy of the Photon, Vacuum Polarization The lowest contribution to the photon self-energy is represented in Fig. 15.2l. This diagram makes a contribution to the photon propagator. The photon creates a virtual electron-positron pair, which subsequently recombines to yield a photon once more. Since the virtual electron-positron pair has a fluctuating dipole moment that can be polarized by an electric field, one speaks in this context of vacuum polarization. n q k ~ q+k k Fig. 15.21. Vacuum polarization: A photon decays into an electron-positron pair which recombines to a photon According to the Feynman rules, the analytical expression equivalent to Fig. 15.21 is 366 15. Interacting Fields, Quantum Electrodynamics At first sight, this expression would appear to diverge quadratically at the upper limit. However, due to the gauge invariance, the ultraviolet contributions are in fact only logarithmically divergent. Regularizing the expression by cutting off the integral at a wave vector A would violate gauge invariance. One thus regularizes (15.6.23) using the Pauli-Villars method l l , by replacing IIp.v(k, mR) by II:;v(k, mR) =.IIp.v(k, mR)-L,i GiIIp.v(k, Mi ), where the Mi are large additional fictitious fermion masses, and the coefficients satisfy L,i Gi = 1, L,., GiMl = m7:t. The final result only involves log ~ m H M2 L,i Gi log :3-. 11tH Finally, because of the vacuum polarization self-energy contributions, the photon propagator for small k takes the form (1 1 i9J.lv ( 1 - -a - - - - (k2))) k =---Z3 J.lv( ) k 2 + if 7rm'h 15 40 m'h ' • I ID (15.6.24) where Z3 M2 a == 1 - C = 1 - 37r log m 2 (15.6.25) R is the photon field renormalization constant. This factor also leads to a renormalization of the charge e'll ~ == Z3 e/2 (1 - ~37r log mM22) e 2 . (15.6.26) R The photon propagator that remains after charge renormalization, for small k, has the form ib~J.lv(k) = Z31iD~J.lv(k) = k~if :~ (1 - (1~ - 410 (~) ) ) (15.6.27) 15.6.3 Vertex Corrections We now proceed to the discussion of vertex corrections. The divergences that occur here can again be removed by renormalization. A diagram of the type shown in Fig. 15.22a contains two fermion and one photon line; it thus has the same structure as the vertex ifrYJ.l AJ.l 'l/J in Fig. 15.22b. For diagrams of this kind, one thus speaks of vertex corrections. The diagram 15.22a represents the lowest (lowest power in e) vertex correction. This diagram also yields the leading contribution to the anomalous magnetic moment of the electron. The amplitude for the diagram, without the external lines, is given by I • AJ.l(p ,p) = (-Ie) 2 x 'Yv , /. J 1" - d4 k (27r)4 k2 i /I. 'J' - mR -i + if + If• 'YJ.l I', /. - i /I. 'J' - mR (15.6.28) v + If• 'Y * 15.6 Radiative Corrections 367 p k "~'Vv ~' p'-k P- P P - p' P p-k a) b) Fig. 15.22. (a) Vertex correction, (b) vertex AJl(p',p) is logarithmically divergent and is regularized in the following by replacing the photon propagator as specified in Eq. (15.6.12). One can split AJl(p',p) into a component that diverges in the limit A -+ 00 and a component that remains finite. We first consider AJl multiplied from the left and right by two spinors corresponding to the mass mR, to yield ur,(P)AJl(P, P)ur(P). We denote the momentum of spinors such as these, which correspond to real particles, by P. Due to Lorentz invariance, this expression can only be proportional to 'Jl and to Pw With the help of the Gordon identity (10.1.5), one can replace a pJl dependence by ,Jl, so that one has (15.6.29) with a constant L that remains to be determined. For general four-vectors p,p', we separate AJl(p',p) in the following way: (15.6.30) Whereas L diverges in the limit A -+ 00, the term At (p', p) remains finite. In order to see this, we expand the fermion part of (15.6.28) in terms of the deviation of the momentum vectors p and p' from the momentum P of free physical particles used in (15.6.29): (15.6.31 ) The divergence in (15.6.28) stems from the leading term (the product of the first terms in the brackets in (15.6.31)); this yields L,Jl, whilst the remaining terms are finite. The first term in (15.6.30), together with 'Jl' leads to the replacement 'Jl -+ 'Jl(l + L) and produces a further renormalization of the charge 368 15. Interacting Fields, Quantum Electrodynamics (15.6.32) We need not pursue the calculation of L any further since, as will be generally shown, it is related to the constant B introduced in (15.6.13) and (15.6.15), and in the charge renormalization cancels with it. 15.6.4 The Ward Identity and Charge Renormalization Taken together, the various renormalization factors for the charge yield e -+ eR = viI - C (1 - B)(1 + L)e . (15.6.33) The factor viI - C comes from the vacuum polarization (Fig. 15.21), the factor 1- B from the wave function renormalization of the electron (Fig. 15.15), and the factor 1 + L from the vertex renormalization. However, in quantum electrodynamics, it turns out that the coefficients Band L are equal. To demonstrate this, we write the self-energy of the electron (15.6.10) in the form (15.6.34) and the vertex function (15.6.28) as (15.6.35) We now make use of the relation (15.6.36) which is obtained by differentiating SF(p)Si;!(p) = 1 (15.6.37) with respect to p/.L, i.e., (15.6.38) and then multiplying by SF(p) from the right. Equation (15.6.36) states that the insertion of a vertex I/.L in an internal electron line, without energy transfer, is equivalent to the differentiation of the electron propagator with respect to p/.L (Fig. 15.23). With the help of this identity, we can write the vertex function (15.6.35) in the limit of equal momenta as * 15.6 Radiative Corrections k p 369 k p-k p p p q=O a) b) Fig. 15.23. Diagrammatic representation of the Ward identity. (a) Self-energy diagram. (b) The differentiation is equivalent to inserting into the fermion line a vertex for a photon with zero momentum VYJ.L = lim AJ.L(pl,p)1 -_. 2/ pl--+p - Ie __'2/ - Ie p'=mR , p=mR d4k D (k) 8SF(p - k) v (2'n/ F Iv 8(p - k t I d4kD (k) (27r) 4 F (15.6.39) (p-k) Iv' IV 8SFupJ.L !::\ On the other hand, from the definition of Bin (15.6.13), one obtains _ _ (-8E(P)) 8pJ.L ur(p) url(p)B,J.Lur(p) = url(p) = url(p) (e 2/ d4k4DF (k)rv 8SFJp - k) IV) ur(p) (27r) pJ.L = Uri (p)L,J.Lur(p) , (15.6.40) from which it follows that B=L. (15.6.41) This relation implies (15.6.42) so that the charge renormalization simplifies to e --+ eR ~ _ = vI - Ce = 1/2 Z3 e. (15.6.43) The renormalized charge e R is equal to the experimentally measured charge The bare charge e is, according to (15.6.26), larger than e'k. e'k == The factors arising from the renormalization of the vertex and of the wave function of the fermion cancel one another. It follows from this result that the charge renormalization is independent of the type of fermion considered. ;;7' 2 370 15. Interacting Fields, Quantum Electrodynamics In particular, it is the same for electrons and muons. Hence, for the identical bare charges, the renormalized charges of these particles are also equal, as is the case for electrons and muons. Since the renormalization factors (Z factors) depend on the mass, this last statement would not hold without such cancellation. The prediction that the charge renormalization only arises from the photon field renormalization is valid at every order of perturbation theory. The relation (15.6.36) and its generalization to higher orders, together with its implication (15.6.41), are known as the Ward identity. This identity is a general consequence of gauge invariance. Expressed in terms of the Z factors, the Ward identity (15.6.41) reads Zl = Z2 . Remarks: (i) We add a remark here concerning the form of the radiative corrections for the electron-electron scattering that was treated to leading order in Sects. 15.5.3.1 and 15.5.3.2. We will confine ourselves to the direct scattering. The leading diagram is shown in Fig. 15.24a. Taking this as the starting point, one obtains diagrams that contain self-energy insertions in internal (b) and external (d) lines, and vertex corrections (c). These are taken into account by charge renormalization and by the replacements D --t iJ' and 'YI-' --t ("(I-' +A£), as was briefly sketched above. The diagram g) Fig. 15.24. Radiative corrections to the direct electronelectron scattering up to fourth order in e. (a) Second order; (b) correction due to vacuum polarization; (c) vertex correction; (d) self-energy insertion in an external line; (f) and (g) two further diagrams * 15.6 Radiative Corrections 371 (e) stems from the mass counter term -omij)'lj;. In addition to these diagrams there are two further diagrams (f,g) that make finite contributions in second order. (ii) Quantum electrodynamics in four space-time dimensions is renormalizable since, at every order of perturbation theory, all divergences can be removed by means of a finite number of reparameterizations (renormalization constants om, Zl, Z2, and Z3). 15.6.5 Anomalous Magnetic Moment of the Electron An interesting consequence of the radiative corrections is their effect on the magnetic moment of the electron. This we elucidate by considering the scattering in an external electromagnetic potential A~. In the interaction (15.6.9) the field operator All is thus replaced by All + A~. The process to first order is shown in Fig. 15.25a. The corresponding analytical expression is p' a) p b) c) d) e) f) g) Fig. 15.25. Radiative corrrections of second order for the QED vertex with two fermions and an external potential A~ 372 15. Interacting Fields, Quantum Electrodynamics where we have used the Gordon identity (10.1.5). In anticipation of the charge renormalization (see below), we have already inserted the renormalized charge here. The second term in the square brackets is the transition amplitude for the scattering of a spin-~ particle with the magnetic moment 2'::::R = e _....3L. ,where eo is the elementary charge 12 ; i.e., the gyromagnetic ratio is 2 mR 9 = 2. The processes of higher order are shown in Fig. 15.25b-g. The selfenergy insertions (b,c) and the contribution C from vacuum polarization (d) and L coming from the vertex correction (g) lead to charge renormalization, i.e., in (15.6.44) one has, instead of e, the physical charge eR. Furthermore, the diagrams (d) and (g) yield finite corrections. For the spin-dependent scattering, only the vertex correction A£(pl,p) is important. For (15.6.30)11 the calculation yields 3) f I _ a q2 ( mR A/L(p ,p) - 'Y/L 37rm h logT - a 8 + 87rmR [~,'Y/L] (15.6.45) with q = p' - p. By adding the last term of this equation to (15.6.44), one obtains -ieRur,(p')('Y/L + 2ia a2/L vq v )ur(pA~q 7r (15.6.46) mR (p+pl) a ia qV] = -ieRur,(p') [ 2mR J.L + (1 + 27r) 2: R ur(p)A~q. has the form -a/LVov A~(x) In coordinate space, the term ia/Lvqv A~ - ~a /LvF/Lv. In order to be able to give a physical interpretation of the result (15.6.46), we consider an effective interaction Hamiltonian which, in firstorder perturbation theory, yields exactly (15.6.46): ll eff == eR = eR J J d3 x {~(xh/L'IjJX)A d 3 X{ + 2~R (~x) 2: 2~R (o/L'IjJ(x)) - + ( 1+ 2:) 4~R ~(X)a/Lv'ljJF;:V} (O/L~x) ~(x)a/Lv'IjJoV A~(x)} 'IjJ(x)) A~(x) . (15.6.47) Here, we have again used the Gordon identity. The first term after the second equals sign represents a convective current. The second term, in the case of 12 See footnote 1 in Chap. 14 Problems 373 a constant magnetic field, can be interpreted as a magnetic dipole energy. Since this can, through the substitution p12 = B3, p23 = B1, p31 = B2, 0"12 = E 3 , etc., be brought into the form -B eR (1 + -2na) 2 Jd x'l/J(x)-'l/J(x) .E ) (-2mR 2 3 - == -BJL . (15.6.48) For slow electrons, the upper components of the spinors are significantly larger than the lower ones. In this nonrelativistic limit, the magnetic moment of a single electron is, according to (15.6.48), effectively given by ~ 2mR (1+~)2 2n 2' (15.6.49) where u are the 2 x 2 Pauli matrices. The contribution to (15.6.49) proportional to the fine-structure constant is referred to as the anomalous magnetic moment of the electron. It should be stressed, however, that (15.6.47) does not represent a fundamental interaction: It merely serves to describe the second-order radiative correction within first-order perturbation theory. From (15.6.49) one obtains the modification of the g factor ~ = 0.00116 . 2 2n When one includes corrections of order a 2 and a 3 , which arise from higherorder diagrams, one finds the value g- 2 = g-2 -2- = 0.0011596524(±4), which is in impressive agreement with the experimental value of 0.00115965241(±20) . The increase in the magnetic moment of the electron can be understood qualitatively as follows. The electron is continually emitting and reabsorbing photons and is thus surrounded by a cloud of photons. Thus, a certain amount of the electron's energy, and therefore mass, resides with these photons. Hence, the charge-to-mass ratio of the electron is effectively increased and this reveals itself in a measurement of the magnetic moment. In the diagram 15.25g, the electron emits a photon before interacting with the external magnetic field. The correction is proportional to the emission probability and, thus, to the fine-structure constant a. Problems 15.1 Confirm the expression (15.2.17) for the propagator cjJ(Xt)cjJ(X2) , instead of starting from (15.2.16), by evaluating (15.2.15) directly. '------' 374 15. Interacting Fields, Quantum Electrodynamics 15.2 The interaction of the complex Klein-Gordon field with the radiation field reads, according to Eq. (F.7), to first order in AI'(x) HI(X) = jl'(x)AI' (x) , where jl' = -ie : f)f)¢t cp - f)f)¢ cpt : is the current density. Xf-L Xp, Calculate the differential scattering cross-section for the scattering from a nucleus with charge Z. Establish the result da dD (aZ)2 4E2v 4sin4% . 15.3 Show that for fermions where jl'(x) is the current-density operator, le-, p) = bb,r 10), and Ep = Vp2 + m2 . 15.4 Verify Eq. (15.5.39). 15.5 Verify Eqs. (15.5.42) and (15.5.43). 15.6 a) With the help of the Feynman rules, give the analytical expression for the transition amplitude corresponding to the Feynman diagrams of Compton scattering, Fig. 15.12a,b. b) Derive these expressions by making use of Wick's theorem. Bibliography for Part III A.I. Achieser and W.B. Berestezki, Quantum Electrodynamics, Consultants Bureau Inc., New York, 1957 I. Aitchison and A. Hey, Gauge Theories in Particle Physics, Adam Hilger, Bristol, 1982 J.D. Bjorken and S.D. Drell, Relativistic Quantum Mechanics, McGraw-Hill, New York,1964 J.D. Bjorken and S.D. Drell, Relativistic Quantum Fields, McGraw-Hill, New York, 1965 N.N. Bogoliubov and D.V. Shirkov, Quantum Fields, The Benjamin/Cummings Publishing Company, Inc., London, 1983; and Introduction to the Theory of Quantized Fields, 3rd edition, John Wiley & Sons, New York, 1980 S.J. Chang, Introduction to Quantum Field Theory, Lecture Notes in Physics Vol. 29, World Scientific, Singapore, 1990 K. Huang, Quarks, Leptons and Gauge Fields, World Scientific, Singapore, 1982 C. Itzykson and J.-B.Zuber, Quantum Field Theory, McGraw Hill, New York, 1980 J.M. Jauch and F. Rohrlich, The Theory of Photons and Electrons, 2nd ed., Springer, New York, 1976 G. Kallen, Elementary Particle Physics, Addison Wesley, Reading, 1964 G. Kane, Modem Elementary Particle Physics, Addison Wesley, Redwood City, 1987 F. Mandl and G. Shaw, Quantum Field Theory, John Wiley & Sons, Chichester, 1984 O. Nachtmann, Elementary Particle Physics, Springer, Heidelberg, 1990 376 Bibliography for Part III Yu.V. Novozhilov, Introduction to Elementary Particle Theory, Pergamon Press, Oxford, 1975 D.H. Perkins, Introduction to High Energy Physics, Addison Wesley, Reading, 1987 s.s. Schweber, An Introduction to Relativistic Quantum Field Theory, Harper & Row, New York, 1961. J.C. Taylor, Gauge Theories of Weak Interactions, Cambridge Univ. Press, Cambridge, 1976 S. Weinberg, The Quantum Theory of Fields, Cambridge University Press, Cambridge, 1995 Appendix A Alternative Derivation of the Dirac Equation Here, we shall give an alternative derivation of the Dirac equation. In so doing, we will also deduce the Pauli equation as well as a decomposition of the Dirac equation that is related to the Weyl equations for massless spin-~ particles. Our starting point is the nonrelativistic kinetic energy H=~-t_l(V)2 2m 2m . 1 (A.l) Provided there is no external magnetic field, instead of this Hamiltonian, one can use the completely equivalent form H 1 = 2m (0" p)(O" p) (A.2) as can be established from the identity (0' . a) (0' . b) = a· b + iO' . (a x b) . If one starts from (A.l) when introducing the coupling to the magnetic field, it is, in addition, necessary to add the coupling of the electron spin to the magnetic field "by hand". Alternatively, one can start with (A.2) and write the Hamiltonian with magnetic field as H (p - ~ A) 0' . (p - ~ A) = 2~ (p - ~A f + 2~ 0" [(p - ~A) = _1_ (p _ ~ A) ~O' B . 2m c 2mc = 2~ 0' . 2 _ x (p - ~A)] (A.3) . Here, we have made use of the rearrangements that lead from (5.3.29) to (5.3.29'). In this way, one obtains the Pauli equation with the correct Lande factor g = 2. We now wish to establish the relativistic generalization of this equation. To this end, we start with the relativistic energy-momentum relation 378 Appendix (A.4) which we rewrite as (~ - u . p) (~ +u . p) (A.5) = (me)2 . According to the correspondence principle ( E --+ in quantum-mechanical relation is !Jt' p --+ - in V), the (A.6) where </J is a two-component wave function (spinor). This equation was originally put forward by van der Waerden. In order to obtain a differential equation of first order in time, we introduce two two-component spinors </J(L) = -</J and </J(R) = -~ me (in~ oXo - inu· v) </J(L) . The last equation, defining </J(R) , together with the remaining differential equation from (A.6), yields: ( in~ ( in~ oXo oXo - itiu· + inu . V) v) </J(L) = -me</J(R) (A.7) </J(R) = -me</J(L) . The notation </J(L) and </J(R) refers to the fact that, in the limit m --+ 0, these functions represent left- and right-handed polarized states (i.e., spins antiparallel and parallel to the momentum). In order to make the connection to the Dirac equation, we write uV == (JiOi and form the difference and the sum of the two equations (A.7) in o~ (</J(R) - </J(L)) + itiaioi (</J(R) + </J(L)) _ me (</J(R) - </J(L)) = 0 (A.8) - in o~ (</J(R) _ me (</J(R) + </J(L)) + </J(L)) - itiaioi (</J(R) - </J(L)) = O. Combining the two-component spinors into the bispinor 'Ij; = ( </J(R) _ </J(L)) </J(R) + </J(L) (A.9a) B Dirac Matrices 379 yields (ifi"(O O~ + ifi"(ioi - me) 'ljJ = °, (A.9b) with "(0=(1° 0) (A.9c) -1 We thus obtain the standard representation of the Dirac equation. B Dirac Matrices B.1 Standard Representation "(5=(01) 10 Chirality operator : "(5 (,,(5)2 = 1 b 5 ,"(JL} = ° # = a· b - iaJLbvO'JLv O'JLV = -O'VJL = 4 , "(JL"(v"(JL = "(JL"(JL "(JL"(v"(P"(JL = ,1ft == "(JLaJL _2"(v 4gVP , "(JL,,,("(p"(u"(JL = _2"(u"(p"(v B.2 Chiral Representation o ( °-1) °' "( = f3 = -1 O'Oi i = "2 ["(0, "(i] = . -Wi a = (0"° -0"0) ' 'I = (0 -0" °0") ' 1 ° 0) =i (O'i -O'i 380 Appendix B.3 Majorana Representations or 1'0 = 0 (Y2) ( (Y2 0 ' 1'1 = 1. (0(Y1 (Y1) 0 ' 1'2 = 1. (ll0 0) _ II ' 1'3 = 1. (0(Y3 (Y3) 0 C Projection Operators for the Spin C.I Definition We define here the spin projection operator and summarize its properties. Since this projection operator contains the Dirac matrix 1'5 (Eq. 6.2.48), we give the following useful representation of 1'5 1'5 Here, . = 11'01' EJ.tvPO" 1 2 I' 1'3 =- i 41EJ.tvPO"I'J.tl'vI'PI'0" =- i 41EJ.tvpO"I'J.tl'v,pI'0" (C.1) is the totally antisymmetric tensor of fourth rank: 1 for even permutations of 0123 EJ.tvPO" = { -1 for odd permutations of 0123 (C.2) o otherwise. The spin projection operator is defined by 1 P(n) = "2(ll + 1'5rj,) (C.3) Here, rj, = l'J.tnJ.t, and nJ.t is a space-like unit vector satisfying n 2 = nJ.tnJ.t = -1 and nJ.tkJ.t = O. In the rest frame, these two vectors are denoted by ilP and kJ.t and have the form n = (0,0) and k = (m,O). C.2 Rest Frame For the special case where n z direction, one obtains == n(3) == (0,0,0,1) is a unit vector in the positive (C.4) since 1'5 ( -1'3) = - ( 1L0 1L) 0 ( 0 (Y30 ) _(y3 = ( (Y3 0 0) _(y3 . The effect of the pro- jection operator P(n(3)) on the spinors of particles at rest (Eq. (6.3.3) or (6.3.11a,b) for k = 0) is thus given by C Projection Operators for the Spin 381 (C.5) Equation (C.5) implies that, in the rest frame, P(n) projects onto eigenstates of ~.En, with the eigenvalue +~ for positive energy states and the eigenvalue - ~ for negative energy states. In Problem 6.15 the following properties of P(n) and of the projection operators A±(k) acting on spinors of positive and negative energy have already been demonstrated: [A±(k), P(n)] =0 TrA±(k)P(±n) = 1 . C.3 General Significance of the Projection Operator P{n) We will now investigate the effect of P(n) for a general space-like unit vector n, which thus obeys n 2 = -1 and n· k = O. Useful quantities for this purpose are the vector (C.7a) and the scalar product W ·n = -~f ll 4 J.LlIpa nJ.Lk (Jpa , (C.7b) which can also be written as (C.7c) The equivalence of these two expressions can be seen most easily by transforming into a frame of reference in which k is purely time-like (k = (kO, 0, 0, 0» and hence n, on account of n . k = 0, purely space-like (n = (O,nl,O,O». In this rest frame, the right-hand side of (C.7b) becomes 1 1 kO· p a 1 kO pa 1 1')' "( -4f10pan (J = -4f10pan = . 2 "(3 + f1032n 1kO.1"(3 "( 2) -41(f1023 n 1kO1"( = __i n 1kO"(2"(3 2 382 Appendix and for the right-hand side of (C.7c) we have l ' -2'"Y5# = _~'"Y0l23(-n)ko° = -~nlk°'"Y23, . thus demonstrating that they are identical. In the rest frame the vector (C.7a) has the spatial components (C.8) where we have put k O = m. Assuming that n is directed along the z axis, i.e., n = n(3) == (0,0,0,1), it follows from (C.8) that m 3 (C.9) W·n=-17. 2 The plane waves in the rest frame are eigenvectors of _ ~173ul(m,k ~ul(m,k = 0) = ~173u2(m, k = 0) = W;(3) =~ 173 = 0) -~U2(m, k = 0) (C.lO) After carrying out a Lorentz transformation from (m,k = 0) to (kO,k), we have W·n 1 - ----:;;;- = 2m '"Y5 # , where n is the transform of n(3)' The equations (C.lO) then transform into eigenvalue equations for ur(k) and vr(k) W·n - ----:;;;-ur(k) 1 = 2m '"Y5#ur (k) = 1 = ±2Ur(k) W'n - ----:;;;-vr(k) 1 for 1 2'"Y51ftUr (k) r = { 21 1 1 (C.lI) = 2m '"Y5#vr (k) = -2'"Y51ft Vr (k) = ±2Vr(k) for r = {1 2' where, after the first and the second equals signs, we have made use of (C.7c) and of ~ur(k) = mur(k) and ~vr(k) = -mvr(k), respectively. Finally, after the third equals sign we obtain the right-hand side of (C.10). The action of C Projection Operators for the Spin 383 "(51ft, on the ur(k) and vr(k) is apparent from (C.lI) and it is likewise evident that P(n) 1 = "2(1l + "(51ft,) (C.12a) is a projection operator onto ul(k) and v2(k) and that P( -n) 1 = -(1l - "(51ft,) (C.12b) 2 is a projection operator onto u2(k) and vl(k). Let n be an arbitrary space-like vector, with n·k = and let n be the corresponding vector in the rest frame. Then, P(n) projects onto spinors u(k, n) that are polarized along +n in the rest frame, and onto those v(k, n) that are polarized along -n in the rest frame. We have the eigenvalue equations ° E·nu(k,n)=u(k,n) E . n v(k, n) = -v(k, n) (C.13) . The vectors k and n are related to their counterparts k and n in the rest frame by a Lorentz transformation A: kl1- = AI1-vkv with kv = (m, 0, 0, 0) and nl1- = AI1-vnv with nV = (0, n). The inverse relation reads nV = A;nl1-. As is usual in the present context, we have used the notation u(k, n) and v(k, n) for the spinors. These are related to the ur(k) and vr(k) used previously by ul(k) vl(k) = u(k, n), u2(k) = u(k, -n) = v(k, -n), v2(k) = v(k,n) , (C.14) where n = An(3) with n(3) = (0,0,0,1). We now consider a unit vector nk, whose spatial part is parallel to k: (C.15) Trivially, this satisfies k n% = -m 2 2 ~ k2 m2 = -1 and nk . k = Iklko _ ko k 2 m mlkl = °. We now show that the combined effect of the projection operator P(nk) and the projection operators A±(k) on spinors with positive and negative energy can be represented by (C.16) To prove this relation, one starts from the definitions 384 Appendix and rearranges as follows: ,.J 'Y5'l"k ±~+m _ 2m ,.J - 'Y5'l"k ±~+m 2m 2m _ (~ - ,.J 2 'Y5'l"k ± ,.J L) ±~+m 'Y5'l"k 2m 2m . This yields: ~ ,.J 2 'Y5'l"k ±~ +m 2m _ L ,.J - ±'Y5'l"k 2m ±~ +m 2m which gives us the intermediate result (C.17) We then proceed by writing 'Y5rfok~ = 'Y5(nk' k-intO"/-,vkV) '-v-' =0 . (0 j j k =-1'Y5nkO"O·k +nO"'ok J = m . h51kT0"0jkJ J 0), (Ikl j kO k j 0) =1'Y50"0' - k - - - k J m m Ikl mo" = 'Y51kT'Y 'YJkJ . Here, we have used nk . k = 0 and also the fact that the purely spatial components make no contribution, due to the antisymmetry of O"ij. By considering, as an example, the j = 3 component of 'Y5'Y°'Y j : the assertion (C.16) is confirmed. Equation (C.16) reveals the following property of the projection operator P(nk): The operator P(nk) projects states with positive energy onto states with positive helicity, and states with negative energy onto states with negative helicity. Analogously, we have thus P( -nk) projects spinors with positive energy onto spinors with negative helicity and spinors with negative energy onto spinors with positive helicity. D The Path-Integral Representation of Quantum Mechanics 385 D The Path-Integral Representation of Quantum Mechanics We start from the Schrodinger equation in! I~, t) = H I~, (D.l) t) with the Hamiltonian H = 1 2mp2 + V, (D.2) and let the eigenstates of H be In). Assuming that limx--+±oo V(x) = 00, we know that the eigenvalues of H are discrete. In the coordinate representation, the eigenstates of H are the wave functions ~n(x) = (xln), where Ix) is the position eigenstate with position x. The discussion that follows will be based on the Schrodinger representation. If, at time a the particle is in the position state Iy), then at time t its state is e- iHt / n Iy). The probability amplitude that at time t the particle is located at x is given by G(y, alx, t) = (xl e- itH/ n Iy) . (D.3) We call G(y, alx, t) the Green's function. It satisfies the initial condition G(y, alx, a) = 8(y - x). Inserting the closure relation :n. = Ln In) (nl in (D.3), G(y, alx, t) = L (xln) (nl e- itH/ n 1m) (mly) , n,m we obtain the coordinate representation of the Green's function G(y, alx, t) = L e-itEn/~(x)y. (D.4) n By dividing the time interval [a, tj into N parts (Fig. D.l), whereby increasing N yields ever smaller time differences ..1t = we may express the Green's function as follows: 11, G(y, alx, t) = = (xl e-iH.tlt/n ... e-iH.tlt/n Iy) J J dZ1 ... dZN_l(zNle-iH.tlt/nlzN_l) ... (zlle-iH.tlt/nlzo) , (D.5) 1-------,----.-----" - - - - - ,.-------,----.----1 y = Zo Zl ZN-1ZN=X Fig. D.l. Discretization of the time interval [0, t], with the ators introduced in (D.5) (zo = y, ZN = x) Zi of the identity oper- 386 Appendix where we have introduced the identity operators 11. have = JdZi IZi) (zil. We then (D.6) We now determine the necessary matrix elements (~I e-i~ l' Ie) (~Ik) Jdk = 211" = JJ dk 211" e (kle) ik(t;_",) -i (k,,)2 ~ dk e _;Llt"(k~)2+i' = (~21t e- i _ e Llt 2= " (D.7) = 2Llt"/2= 2= 2""3TIi: 211" = ( --im - -) 1/2 211"/t6.t ;= e 2iiFt ("_"')2 ~ ~ . In the first step the completeness relation for the momentum eigenfunctions was inserted twice. From (D.6) and (D.7) it follows that (~I e-iHLlt/1i = exp W) [_. 6.t (V~) 1 + V(e) Ii _ 2 m(~ e)2)] (211"fi6.t 2(6.t)2 -im ) ~ (D.8) and finally, for the Green's function, G(y, olx, t) = J J dz 1 ••• x exp [ + In the limit N -+ G(y, olx, t) = 00 ~ dZ N - i6.t n log ~ 1 ~ 2~t {m(zn - zn_d 2 V(zn) + V(zn-d } 2(6.t)2 2 l· this yields the Feynman path-integral representation 1 J i t V[z] exp h, Jo dt' {mz(t/)2 2 - V(z(t')) } , (D.9) where V[z] 1 = J~o . )lY.. N - 1 -1m ( 211"liLlt dZ n 2 g (D.1O) R. P. Feynman and A. R. Hibbs, Quantum Mechanics and Path Integrals, McGra.w-Hill, New York, 1965; G. Parisi, Statistical Field Theory, AddisonWesley, 1988, p.234 E Covariant Quantization of the Electromagnetic Field 387 and The probability amplitude for the transition from y to x after a time t is given as the sum of the amplitudes of all possible trajectories from y to x, each being given the weight exp J~dt' L(z, z) and where * L(z, z) = mz~t)2 - V(z(t)) is the classical Lagrangian. The phase of the probability amplitude is just the classical action. In the limit Ii --+ 0, the main contribution to the functional integral comes from the neighborhood of the trajectory whose phase is stationary. This is just the classical trajectory. E Covariant Quantization of the Electromagnetic Field, the Gupta-Bleuler Method E.1 Quantization and the Feynman Propagator In the main text, we treated the radiation field in the Coulomb gauge. This has the advantage that only the two transverse photons occur. To determine the propagator, however, one has to combine the photon contributions with the Coulomb interaction in order to obtain the final covariant expression. In this appendix, we describe an alternative and explicitly covariant quantization of the radiation field by means of the Gupta-Bleuler method 2 . In the covariant theory, one begins with (E.l) The components of the momentum conjugate to AIL are (E.2) From the Lagrangian density (E.l), we obtain the field equations (E.3) 2 Detailed presentations of the Gupta-Bleuler method can be found in S.N. Gupta, Quantum Electrodynamics Gordon and Breach, New York, 1977; C. Itzykson and J.-B. Zuber, Quantum Field Theory McGraw Hill, New York, 1980; F. Mandl and G. Shaw, Quantum Field Theory, J. Wiley, Chichester 1984; J.M. Jauch and F. Rohrlich, The Theory of Photons and Electrons, 2nd ed., Springer, New York, 1976, Sect.6.3. 388 Appendix These are only equivalent to the Maxwell equations when the four-potential AI' (x) satisfies the gauge condition (E.4) The most general solution of the free field equations (jl' linear superposition3 = 0) is given by the (E.5) The four polarization vectors obey the orthogonality and completeness relations Er(k)Es(k) L (rE~k) == = Erl'(k)~ -(reSrs, r, s = 0,1,2,3 (E.6a) (E.6b) = _gl'V , r where (E.6c) (0 =-1 On occasion it is useful to employ the special polarization vectors E~(k) = E~(k) = (O,€k,r) where €k,l and and to k, and €k,3 == (1,0,0,0) nl' €k,2 = kjlkl (E.7a) (E.7b) r = 1,2,3, are unit vectors that are orthogonal, both to one another (E.7c) . This implies n· €k,r = €k,r€k,s r = 1,2 0, = eSrs , r,s = 1,2,3. (E.7d) (E.7e) The longitudinal vector can also be expressed in the form (E.7f) 3 To distinguish them from the polarization vectors €k,A and creation and annihilation operators atA and akA (,\ = 1,2) of the main text, where the Coulomb gauge was employed, we denote the polarization vectors in the covariant representation by €~(k), and the corresponding creation and annihilation operators by at(k) and ar(k). E Covariant Quantization of the Electromagnetic Field 389 The four vectors describe f~ transverse polarization f~ longitudinal polarization f~ scalar or time-like polarization. fi, The covariant simultaneous canonical commutation relations for the radiation field read: [AJl(x,t), AV(x', t)] =0, [AJl(x,t),AV(x',t)] [AJl(x,t),AV(x',t)] =0 (E.8) = -ig JlV 8(x-x'). The commutation relations are the same as those for the massless KleinGordon field, but with the additional factor _gJlv. The zero component has the opposite sign to the spatial components. One can thus obtain the propagators directly from those for the KleinGordon equation: [AJl(x), AV(x')] DJlV(x - x') = iDJlV(x - x') = igJlV (E.9a) J (OIT(AJl(x)AV(x')) 10) = 4 d k 8(k2)f(ko)e- ikx (27r )3 iD~V(x (E.9b) - x') DJlV( _ ') __ JlV F x x - 9 J d4k k2 (E.IOa) e -ikx (E. lOb) + if By inverting (E.5) and using (E.8), one obtains the commutation relations for the creation and annihilation operators [ar(k), a!(k')] = (r8rs 8kk , = -1 [ar(k),as(k')] = [at(k),a!(k')] = O. , (0 , (1 = (2 = (3 = 1, (E.11) E.2 The Physical Significance of Longitudinal and Scalar Photons For the components 1,2,3 (i.e., the two transverse, and the longitudinal, photons) one has, according to Eq. (E.ll), the usual commutation relations, whereas for the scalar photon (r = 0) the roles of the creation and annihilation operators seem to be reversed. The vacuum state 10) is defined by for all k and r = 0, 1,2,3 , i.e., (E.I2) 390 Appendix for all x. One-photon states have the form Iqs) = a!(q) 10) . (E.13) The Hamiltonian is obtained from (E.1) as (E.14) Inserting (E.2) and the expansion (E.5) into this Hamiltonian yields H = L (E.15) Ikl (r at(k)ar(k) . r,k One may be concerned that the energy might not be positive definite, because of (0 = -1. However, because of the commutation relation (E.1l) the energy is indeed positive definite (E.16) r,k = Iql a!(q) 10) s = 0,1,2,3 Correspondingly, one defines the occupation-number operator (E.17) For the norm of the states, one finds (qslqs) = (01 a (q)a!(q) 10) = (8 (010) = (8 8 . (E.18) In the Gupta-Bleuler theory, the norm of a state with a scalar photon is negative. More generally, every state with an odd number of scalar photons has a negative norm. However, the Lorentz condition ensures that, essentially, the scalar photons are eliminated from all physical effects. In combination with the longitudinal photons, they merely lead to the Coulomb interaction between charged particles. For the theory to be really equivalent to the Maxwell equations, we still need to satisfy the Lorentz condition (E.4). In the quantized theory, however, it is not possible to impose the Lorentz condition as an operator identity. If one were to attempt this, Eq. (E.9a) would imply that (E.19) E Covariant Quantization of the Electromagnetic Field 391 must vanish. However, from (E.1Ob) we know that this is not the case. Gupta and Bleuler replaced the Lorentz condition by a condition 4 on the states (E.20a) This also gives (E.20b) and thus (E.21) It is thereby guaranteed that the Maxwell equations are always satisfied in the classical limit. The subsidiary condition (E.20a) affects only the longitudinal- and scalarphoton states since the polarization vectors of the transverse photons are orthogonal to k. From (E.20a), (E.5), and (E.6), it follows for all k that (E.22) Equation (E.22) amounts to a restriction on the allowed combinations of excitations of scalar and longitudinal photons. If lIP") satisfies the condition (E.22), the expectation value of the term with the corresponding wave vector in the Hamiltonian is (wi a~(k)3 - ab(k)ao(k) = (wi a~(k)3 = (wi a~(k) Iw) - ab(k)ao(k) - ab(k)(a3(k) - ao(k)) - ab(k))a3(k) Iw) (E.23) Iw) = 0 . Thus, with (E.15), we have (wi H Iw) = (wi L k 4 L Ikl a~(k)r Iw) , (E.24) r=1,2 As already stated prior to Eq.(E.19), the Lorentz condition cannot be imposed as an operator condition, and cannot even be imposed as a condition on the states in the form (E.20c) For the vacuum state, Eq. (E.20c) would yield 8It AIt(x) Iwo) = 8It A It -(x) Iwo) = O. Multiplication of the middle expression by A+(y) yields At(y)8V A;:;-(x) 1%) = a~v (At (y)A;:;-(x)) Iwo) a~v ([At(y),A;:;-(x)] + A;:;-(x)At(y)) 1%) --!--i9ItvD+(y - x) Iwo) =I- 0 , which constitutes a contradiction. Thus the L~;entz condition can only be imposed in the weaker form (E.20a). 392 Appendix so that only the two transverse photons contribute to the expectation value of the Hamiltonian. From the structure of the remaining observables P, J, etc., one sees that this is also the case for the expectation values of these observabies. Thus for free fields, in observable quantities only transverse photons occur, as is the case for the Coulomb gauge. The excitation of scalar and longitudinal photons obeying the subsidiary condition (E.20a) leads, in the absence of charges, to no observable consequences. One can show that the excitation of such photons leads merely to a transformation to another gauge that also satisfies the Lorentz condition. It is thus simplest to take as the vacuum state the state containing no photons. When charges are present, the longitudinal and scalar photons provide the Coulomb interaction between the charges and thus appear as virtual particles in intermediate states. However, the initial and final states still contain only transverse photons. E.3 The Feynman Photon Propagator We now turn to a more detailed analysis of the photon propagator. For this we utilize the equation (E.6b) r and insert the specific choice (E. 7a-c) for the polarization vector tetrad into the Fourier transform of (E.wb): D~V(k) = k2 ~ iE L~2 E~(k) (E.25) + (kf.L - (k· n)nf.L) (kV - (k· n)nV) _ nf.Lnv} . (kn)2 - k 2 The first term on the right-hand side represents the exchange of transverse photons D~rans(k) = k 2 ~ iE L r=1,2 E~(k) . (E.26a) We divide the remainder of the expression, i.e., the second and third terms, into two parts: Df.LV F,Coul (k) = _1_ k 2 + iE {(kn) 2n f.L n V _ nf.Lnv} (kn)2 _ k2 k2 nf.Ln v - k 2 + iE (kn)2 - k 2 nf.LnV (kn)2 - k 2 (E.26b) E Covariant Quantization of the Electromagnetic Field 393 and (E.26c) (E.26b') This part of the propagator represents the instantaneous Coulomb interaction. The longitudinal and scalar photons thus yield the instantaneous Coulomb interaction between charged particles. In the Coulomb gauge only transverse photons occurred. The scalar potential was not a dynamical degree of freedom and was determined through Eq. (14.2.2) by the charge density of the particles (the charge density of the Dirac field). In the covariant quantization, the longitudinal and scalar (time-like) components were also quantized. The Coulomb interaction now no longer occurs explicitly in the theory, but is contained as the exchange of scalar and longitudinal photons in the propagator of the theory (in going from (E.25) to (E.26b) it is not only the third term of (E.25) that contributes, but also a part of the second term). The remaining term D~"red makes no physical contribution and is thus redundant, as can be seen fro'm the structure of perturbation theory (see the Remark in Sect. 15.5.3.3), J J =J d4 x' jl-'(x)D~" d4 x - X')j2,,(X') (E.27) d4 kjll-'(k)D';(k)j2,,(k) . Since the current density is conserved, (E.28) the term D~"red' tribution. comprising terms proportional to kl-' or k", makes no con' E.4 Conserved Quantities From the free Lagrangian density corrresponding to (E.1), (E.29) 394 Appendix according to (12.4.1), one obtains for the energy-momentum tensor (E.30a) and hence the energy and momentum densities TOO = T Ok = _~(AV Av + okAVOkAV) 2 -Avok A V . (E.30b) (E.30c) Furthermore, from (12.4.21), one obtains the angular-momentum tensor (E.31a) having the spin contribution (E.31b) from which one finally establishes the spin three-vector S = A(x) x A(x) . (E.31c) The vector product of the polarization vectors of the transverse photons El(k) x E2(k) equals k/lkl, and hence the value of the spin, is 1 with only two possible orientations, parallel or antiparallel to the wave vector. In this context it is instructive to make the transition from the two creation and annihilation operators (k) and a~(k) (or adk) and a2(k)) to the creation and annihilation operators for helicity eigenstates. at F Coupling of Charged Scalar Mesons to the Electromagnetic Field The Lagrangian density for the complex Klein-Gordon field is, according to (13.2.1), (F.1a) In order to obtain the coupling to the radiation field, one has to make the replacement 01-' -+ 01-' + ieAI-'. The resulting covariant Lagrangian density, including the Lagrangian density of the electromagnetic field (F.1b) reads: £=-~(OVAI')v (~: -ieAI-'c/)) (:~ +ieAI-'¢) _m 2 ¢t¢. (F.2) F Coupling of Charged Scalar Mesons 395 The equations of motion for the vector potential are obtained from o o£ o£ ----=oA =-ox v oA~v I-' oAI-' . (F.3) By differentiating with respect to q}, one obtains the Klein-Gordon equation in the presence of an electromagnetic field. Defining the electromagnetic current density (F.4) one obtains j I-' = -ie ((o¢t _ieA ¢t) ¢_¢t (~+ oxl-' I-' oxl-' ieA I-' ¢)) , (F.5) which, by virtue of the equations of motion, is conserved. The Lagrangian density (F.6) can be separated into the Lagrangian density of the free Klein-Gordon field £KG, that of the free radiation field £rad, and an interaction Lagrangian density £1, (F.6) where (F.7) The occurrence of the term e2 AI-'AI-'¢t ¢ is characteristic for the Klein-Gordon field and corresponds, in the nonrelativistic limit, to the A 2 term in the Schrodinger equation. From (F.7) one obtains for the interaction Hamiltonian density which enters the S matrix (15.3.4)5 for charged particles 5 P.T. Matthews, Phys. Rev. 76, 684L (1949); 76, 1489 (1949); S.S. Schweber, An Introduction to Relativistic Quantum Field Theory, Harper & Row, New York, 1961, p.482; C. Itzykson and J.-B. Zuber, Quantum Field Theory, McGraw Hill, New York, 1980, p.285 Index acausal behavior 265 action 261,262 active transformation 150, 155-156, 209-211 adiabatic hypothesis 331,364 adjoint field operator 291 analyticity of XAB(Z) 89 angular momentum 156-159, 272, 280 - of a field 272 - of the Dirac field 289 - of the radiation field 394 - of the scalar field 281 angular momentum operator 272,304 angular momentum tensor - of the electromagnetic field 320,394 annihilation operator 11, 14, 16,26, 252,303,336,338 anomalous magnetic moment of the electron 371-373 anti-Stokes lines 85, 97 anticommutation relations 288, 295 anticommutation rules for fermions 19,292 anticommutator 17, 301-303 antineutrino 243 antiparticle 205,216,287 autocorrelation 80 axial vector 145 axioms of quantum mechanics 115 Baker-Hausdorff identity 29, 184 bare states 331 baryon number 287 bilinear form 142 binding energy 165 bispinor 123 Bogoliubov approximation 63 Bogoliubov theory 62 Bogoliubov transformation 63,72,74 Bohr magneton 128 Bohr radius 43, 192 boost 150, 275 boost vector 272, 275 Bose commutation relations 12,314 Bose field 55-72,314 Bose fluid 60 Bose gas 60 Bose operators 14, 280, 328 Bose-Einstein condensation 60,62, 68 bosons 5,10,21,280,299,314 boundary conditions - periodic 25, 250 Bravais lattice 92 bremsstrahlung 334 Brillouin zone 251 broken symmetry 68 bulk modulus 44 canonical commutation relations 266 canonical ensemble 83 canonical quantization 266,277,311 Cauchy principal value 89 causality 88 charge 285-287,293-296 - bare 369 - renormalized 363-368 charge conjugation 214-217,232,303, 304 charge conjugation operation 290, 303-305 charge density 310,319 charge operator 281,294 charge renormalization 363, 368-372 chiral representation of the Dirac matrices 125,379 chirality operator 240, 379 classical electrodynamics 307-312 classical limit 91, 104 Clebsch-Gordan coefficients 169 coherent (incoherent) dynamical structure function 81 coherent scattering cross-section 80 398 Index coherent states 276 commutation relations 21,277-281, 303 canonical 266 of the Dirac field operators 296 of the field operators 28 commutator 18, see commutation relation - of free bosons 296 completely antisymmetric states 8 completely symmetric states 8, 11 completeness relation 11,17,251 compressibility 66 compressibility sum rule 107,108 Compton scattering 339,355-356 Compton wavelength 130, 166, 189, 192 condensate 62 conjugate fields 285, 288 conservation laws 212-213,266 conserved quantities 274,289-290, 393 constant of the motion 274 contact potential 31,65 continuity equation 24,199,266,286, 293,308 - of the Dirac equation 122 - of the Klein-Gordon equation 119 continuous symmetry group 274 continuum limit 255-258 contraction 336 contravariant indices 117, 131 coordinate representation 21 correction, relativistic 165 correlation function 46,83,91,334 - classical limit 104 - symmetry properties 98-106 correspondence principle 116, 378 Coulomb interaction 33,46,319 Coulomb potential 46-49,161-179 - scattering in see Mott scattering Coulomb repulsion 41,50 covariance 265 - relativistic 282 covariant indices 117, 131 covariant quantization 387 creation operator 11, 14, 16,26,252, 336,338 cross-section, differential 78 current density 23, 304, 305, 310, 343 - electrical 308 - under time reversal 231 current-density operator 286, 305 d'Alembert equation 231,313 - inhomogeneous 317 d'Alembert operator 312 - definition 132 damped harmonic oscillator 98 Darwin term 188 Debye-Waller factor 109 degeneracy 164,178,229 degenerate electron gas 34 o-function potential 68 density matrix 80 density of solutions of the free Dirac equation 150 density operator 27 density response function 100,107 density wave 66 density-density correlation 100 density-density correlation function 39,85,96 density-density susceptibility 106 diffusion 96 diffusion equation 96 diffusive dynamics 97 Dirac equation 120-130,287,322, 377-379 continuity equation 122 - for a Coulomb potential 168-179 in chiral representation 241 in covariant form 123-125 Lorentz covariance 135 - massless 239 nonrelativistic limit 126-128 - quadratic form 152-153 - solutions of the free equation 125, 146 - time-dependent 195 - with electromagnetic field 168-179 Dirac field 305,321, 322, 337 - quantized 287-296 Dirac field operators 295,304 Dirac Hamiltonian 120 Dirac hole theory see Hole theory Dirac matrices - chiral representation 240,379 - form 145-146 - fundamental theorem 146, 153 - Majorana representation 216, 244, 380 - properties 123,145-146 - standard representation 379 Dirac representation 323-328 dispersion relations 90,101,252 dissipation 91 Index dissipative response 90 divergent zero-point energy 316 dynamical susceptibility 85,91,93,98, 106 Dyson equation 359 effective mass 69 effective target area 78 eigenstate 115 Einstein approximation 54 elastic scattering 81, 107 electrodynamics, classical 307-312 electromagnetic field see radiation field Dirac equation 125-130, 168-180, 183-187 Klein-Gordon equation 162-168 electromagnetic vector potential 301 electron anomalous magnetic moment 371-373 - bare mass 361 - charge 126,369 - magnetic moment 128,371-373 - renormalized mass 362 electron gas 41-49 - ground state energy 44 electron-electron interaction 42 electron-electron scattering see M¢ller scattering electron-hole pair 34 electron-positron current density 316 electron-positron pair 332 - virtual 365 emission, of a photon 195,340 energy absorption 92 energy levels 164 - relativistic, of the hydrogen atom 177 - of the Dirac equation for a Coulomb potential 175 - of the Klein-Gordon equation in Coulomb potential 164 energy uncertainty 207 energy, negative 214 energy-momentum conservation 341 energy-momentum four-vector 268 energy-momentum tensor 266, 289 - for the Dirac-field 289 - for the radiation field 320 energy-momentum vector 280 equation of motion - for field operators 23 - for the density operator 24 399 Euler-Lagrange equations 263, 288, 311 - of field theory 263 exchange hole 38 exchange of mesons 285 exchange term 44 excitation energy 76 expansion of the field operator 285 expectation value of an observable 115 external lines 356 f -sum rule 106-108 factorization approximation 47 Fermi energy 34 Fermi operators 19,328 Fermi sphere 33 Fermi wave number 33 Fermi's golden rule 77 fermion line 356 fermion propagators 301 fermions 5,16,21,288,299 Feynman diagrams 284,336,347 - external lines 356 Feynman path-integral representation 386 Feynman propagator 283,285,318, 325 - for fermions 302-303 - for mesons 284 - for photons 320, 392 Feynman rules 339,348,355-358 Feynman slash 124 field equations 23,285,287-289 - classical 287 - free 325 - nonlinear 323 - quantum-mechanical 279 field operator 20,258, 259, 295, 324, 325,331,332 - adjoint 291 field tensor, electromagnetic 309 field theories, free 277 field theory - classical 261-265 - nonlinear 321,325 fields - free 277 - conjugate 288 - electric 307 - free 303 - free, electromagnetic 312-320 - interacting 321-373 400 Index - relativistic 249-275 fine structure 178 fine-structure constant, Sommerfeld's 162,307,321 fluctuation-dissipation theorem 91, 95 fluctuations 57 Fock space 11,17,259,295 Foldy-Wouthuysen transformation 181-187 four-current-density 161,286,293,308 four-dimensional space-time continuum 261 four-momentum 117 four-momentum operator 303 four-spinor 123 four-velocity 117 free bosons 55 functional derivative 51 g factor 127, 128, 187,373,377 Galilei transformation 70 ,matrices see Dirac matrices gauge 316 - axial 309 - Coulomb 309-310 - Lorentz 309 - time 309 - transverse 190 gauge invariance 68, 272, 285 - of the Lagrangian density 305 gauge theory, abelian 322 gauge transformations - of the first kind 272,286,301 - of the second kind 273,309 generator 273-275 - of rotations 157,213 - of symmetry transformations 274 - of translations 303, 304 golden rule 92 Gordon identity 197,207,367,372 grand canonical ensemble 83 grand canonical partition function 83 graphs 284 Green's function 385 - advanced 297 - Coulomb 310 - retarded 297 ground state 12, 279 - of the Bose gas 62 - of a Dirac particle in a Coulomb potential 178 - of superfluid helium 71 - of the Fermi gas 33 - of the field 259 - of the linear chain 254 ground state energy 41 group velocity 197,201 Gupta-Bleuler method 387-394 gyromagnetic ratio see g factor Hamilton's principle 262 Hamiltonian 22,115,277-281,289, 321 - nonlocal 183 - of a many-particle system 20 - of the Dirac equation 120 - of the scalar field 279 - rotationally invariant 158 - with central potential 159 Hamiltonian density 264, 319, 322-323 - of the free Dirac field 288 - of the free radiation field 313 hard-core potential 61 harmonic approximation 72 harmonic crystal 92-96 harmonic oscillator 29,30 Hartree-Fock approximation 42 Hartree-Fock energy levels 48 Hartree-Fock equations 49-52 He-II phase 69 Heaviside-Lorentz units see LorentzHeaviside units Heisenberg equation - nonrelatvistic 190 Heisenberg equation of motion 23 Heisenberg ferromagnet 68 Heisenberg model 72 Heisenberg operators 83, 325 Heisenberg representation 23, 250, 325,326 Heisenberg state 83 helicity 236-238, 305, 384 helicity eigenstates 238,313,394 helium - excitations 69 - phase diagram 60 - superfluidity 69 hole 34 Hole theory 204-207,214,216,290 Holstein-Primakoff transformation 72 Hubbard model 31 hypercharge 278, 287 hyperfine interaction 189 hyperfine structure 178 Index identical particles 3 incoherent scattering cross-section 80 inelastic scattering 76, 77 inelastic scattering cross-section 76 inertial frames 132 infrared divergence 192 interacting fields 321-373 interaction Hamiltonian 325, 331 interaction Hamiltonian density 329 interaction representation 86,323-328 interaction term 319,321 interaction, electromagnetic 321 interference 57,200 interference terms 80 intrinsic angular momentum 272 invariance 212-213 - relativistic 265 invariant subspaces 7 inversion symmetry 100 irreducible representation 241 isospin 287 isothermal compressibility 41 isothermal sound velocity 108 KO meson 278, 287 kinetic energy 21 Klein paradox 202-204 Klein-Gordon equation 116-120 - continuity equation 119 - free solutions 120 in Coulomb potential 162-168 - one-dimensional 256 with electromagnetic field 161-168 Klein-Gordon field 258-260, 337 - complex 260,278,285-287,303,394 - real 277-285 Klein-Gordon propagator 317 Kramers degeneracy 229 Kramers theorem 228 Kramers-Kronig relations 90 Kubo relaxation function 105 Lagrangian 261,318,321-323 - nonlinear 321-322 Lagrangian density 261,277-281,285, 318,321-323,394 - of quantum electrodynamics 321, 322 - of radiation field and charged scalar mesons 394 - of the ¢4 theory 321 - of the Dirac field 288 - of the free real Klein-Gordon field 277 401 - of the radiation field 311-312, 387 Lamb shift 178, 189-193,322 Lande factor see 9 factor lattice dynamics 81,92 lattice vibrations 249-260 left-handed states 237 Lennard-Jones potential 61 light cone 298 Lindemann criterion 45 linear chain 249-255 linear response 87 linear response function 88 linear susceptibility 75 locality 265,299,300 Lorentz condition 221 Lorentz covariance 283 - of the Dirac equation 135 Lorentz group 133 Lorentz spinor, four-component 136 Lorentz transformation 131-134, 297 along the Xl direction 139 - infinitesimal 136-141, 156 - inhomogeneous 265 - linearity 132 - orthochronous 133,144,298 - proper 133 - rotation 138 - spatial reflection 141 - time reflection type 133 Lorentz-Heaviside units 307,344 magnetic moment - of the electron 373 magnetization 68 magnons 72 Majorana representation of the Dirac matrices 125,216,244, 380 many-body see many-particle many-particle operator 14,19 many-particle state 3, 13 many-particle theory, nonrelativistic 3-72,292 mass - bare 193,361-362 - physical 193,361-362 - renormalized see mass, physical mass density 66 mass increase, relativistic 166 mass shift 362 massless fermions see neutrino matrix elements 334 Maxwell equations 307-308 measurement 115 402 Index meson propagator 284 mesons 162,167,284 - electrically neutral 259,278 - free 325 - scalar 394 metric tensor 131 microcausality 299 minimal coupling 129 Minkowski diagram 298 Moller formula 349 Moller scattering 339, 346-352 momentum 289,291,322 momentum conjugate 311 momentum density 289 momentum eigenfunctions 25 momentum field 264 momentum operator 129,280,295 - normal ordered 320 - of the Dirac field 304 - of the Klein-Gordon field 279 - of the radiation field 316,387 momentum representation 25 momentum transfer 78 motion reversal 217-236, see time reversal motion-reversed state 218 Mott scattering 339,341-346 multiphonon state 254 neutrino 239-243 neutrons - scattering 76, 79 - scattering cross-section 80 - wavelength 76 Noether's theorem 268-270,320 noninteracting electron gas 38 nonlocality 265 nonrelativistic limit 176, 181 nonrelativistic many-particle theory 3-72 normal coordinates 93,250 normal momenta 250 normal ordered products 280 normal ordering 280,314,316 - for fermions 293 normal product, generalized 337 normal-ordering operator 339 nuclear radius, finite 166, 167, 179 nucleon 284 observables 115 occupation numbers 10, 17 occupation-number operator 254,279 13,19, one-phonon scattering 109 operator - antilinear 223 - antiunitary 217,223 - chronological 328 - d'Alembert 312 - even 181 - odd 181 orbital angular momentum density 320 orbital angular momentum of the field 281,290 orbital angular momentum quantum number 164 orthogonality relation 11,251,291 - for solutions of the free Dirac equation 198 - of the solutions of the free Dirac equation 149 orthonormality of momentum eigenfunctions 25 oscillators, coupled 249-260 pair annihilation 334 pair correlations 59 pair creation 206 pair distribution function 36,39,55 paraparticles 6 parastatistics 6 parasymmetric states 6 parity 141,213 - intrinsic 141 parity transformation 213, 232 particle density 22 particle interpretation 253 particle-number density 27 particle-number operator 13,279,281 passive transformation 137, 155-156, 209-211 path-integral representation 385 - Feynman 386 Pauli equation 127, 187,237 Pauli matrices 123, 290, 304 Pauli principle 36, 205 Pauli spinor 127, 169,304 Pauli's fundamental theorem see Dirac matrices, fundamental theorem Pauli-Villars method 366 peT theorem 235 periodic boundary conditions 25 permutation group 4 permutation operator 3 permutations 4,127,347 Index perturbation expansion 335 perturbation Hamiltonian 283 perturbation theory 188,281,283, 322,327-328 rj>4 theory 321 phonon annihilation operator 253 phonon correlation function 93 phonon creation operator 253 phonon damping 96 phonon dispersion relation 108 phonon frequencies 93 phonon resonances 96 phonon scattering 81 phonons 66,69,93-96,108,253 - acoustic 252 - optical 252 photon correlations 59 photon field see radiation field photon line 356 photon propagator 316-320,392 photon self-energy 365 photons 314 free 325 - longitudinal 389 - transverse 389 7r mesons 287 7r 0 meson 281 7r meson 162, 167 Planck's radiation law 314 Poincare group 133, 152 Poincare transformation 132 point mechanics 261 Poisson equation 310,312 polarization vector 191,313 - of the photon field 317 position eigenstates 21 positron 205, 290, 304 potential electromagnetic 319 - rotationally invariant 213 - spherically symmetric 162 potential step 202,204 pressure 44, 66 principal quantum number 164, 175 principle of least action 262 principle of relativity 132, 135 probability amplitude 387 probability distribution 207 projection operators 151,216 - for the spin 380 propagator 281-287,301-303 - and spin statistics theorem 296-301 - covariant 319 - free 360 - interacting 360 pseudopotential 79 pseudoscalar 144,153 pseudovector 144,153 purely space-like vectors 403 298 QED see quantum electrodynamics quanta, of the radiation field 314 quantization 290-293 canonical 266,285 - of the Dirac field 206,287-296 - of the radiation field 307-320 quantization rule, canonical 277 quantum crystals 61 quantum electrodynamics 193, 321-373 quantum field theory 193,207,216 - relativistic 287 quantum fields, relativistic 249-275 quantum fluctuations 45 quantum fluid 60 quantum number, radial 164,174 quasi particles 65, 253 Racah time reflection 235-236 radiation field 307-322,332,387 - quantized 193,307-320 radiative corrections 193,358-373 radiative transitions 195 Raman scattering 98 random phase approximation 44 reflection, spatial 213 regularization 363,366 relativistic corrections 176, 181, 187-189 relativistic mass correction 188 renormalizable theory 322 renormalization see charge renormalization renormalization constant see wave function renormalization constant renormalization factors 368 representations of the permutation group 5 resolvent 189 response 75 rest mass 118 rest-state solutions of the Dirac equation 147 retardation 297 right-handed states 237 Ritz variational principle 51 404 Index rotation 138, 152, 156-159, 212, 271 - infinitesimal 157 rotation matrix 139 rotational invariance 102,104 roton minimum 66, 108 rotons 69 RPA 44 Rutherford scattering law 346 Rydberg energy 128 Rydberg formula, nonrelativistic 165 S matrix 322, 328-332, 336, 339 S-matrix element 335,352,356 for 'Y emission 340 - for Mott scattering 342 - for M011er scattering 348 scalar 144 scattering 75, 328, 333, 339 - of two nucleons 284 scattering amplitude 340 scattering cross-section 354 - differential 343, 344 M011er scattering 349 - - for Mott scattering 345 -- in the center-of-mass frame 355 - relation to the S-matrix element 352 scattering experiments 76 scattering length 68,74 scattering matrix see S matrix Schr6dinger equation 82,115,324 Schr6dinger operator 83, 324, 325 Schr6dinger representation 324-326 second quantization 22, 23 second-order phase transitions 68 self-energy 359 - of the electron 359 - of the photon 365 self-energy diagram 360 self-energy insertions 372 signature of an operator 103, 226 single-particle correlation function 37 single-particle operator 14, 20 single-particle potential 22 single-particle state 281,303 Sommerfeld's fine-structure constant 162,307,321 sound velocity 66 space-like vectors 297,298 spatial reflection 141,213 special theory of relativity 117 spectral representation 84, 90 spherical well potential 68, 73 spin 27,272 - of the Klein-Gordon field 278 spin density 28,320 spin density operator 28 spin projection operator 380 spin statistics theorem 296-303 spin-~ fermions 28,33,299 spin--Drbit coupling 178, 188 spin-dependent pair distribution function 38 spin-statistic theorem 281 spinor 123,290,378,383 - adjoint 144 - free 196 - hermitian adjoint 123 spinor field 289,305 spinor index 340 spinor solutions 325 standard representation of the Dirac matrices 379 state vector 115 states - bare 330 - coherent 276 - dressed 330 static form factor 107,108 static structure factor 40 static susceptibility 91,98 stationary solutions 162 stiffness constant 252, 255 Stokes lines 85,97 strangeness 287 stress tensor Tij 271 string, vibrating 255-258 sum rules 106-108 summation convention 11 7 superfluid state 60 superfluidity 69 susceptibility 100 - dynamical 85-88 symmetric operator 4 symmetry 209-243 - discrete 213 symmetry breaking 68 symmetry properties 98 symmetry relations 98 symmetry transformation 214,274 temporal translational invariance tensor 153 - antisymmetric 144 thermal average 81 Thirring model 323 84 Index time reversal 217-236 time-evolution operator 86 - in the interaction picture 326 - Schrodinger 326 time-like vectors 298 time-ordered product 283, 336, 337 - for Fermi operators 302 - for the Klein-Gordon field 283 time-ordering operator 5 - Dyson's 327 - Wick's 328 time-reversal invariance 102 - in classical mechanics 218 - in nonrelativistic quantum mechanics 221 - of the Dirac equation 229-235 time-reversal operation 102 time-reversal operator, in linear state space 225-229 time-reversal transformation 102, 233 total angular momentum 157 total current 196 total number of particles 13 total-particle-number operator 22 totally antisymmetric states 5 totally symmetric states 5 transformation - active 150, 155-156, 209-211 - antiunitary 274 - infinitesimal 152, 268 - of the Dirac equation 210 - of vector fields 209 - passive 137, 155-156,209-211 - unitary 274,325 transition amplitude 329 transition probability 78, 334 transition rate 328 transitions, simple 332-339 translation 213, 270 translation operator 303 translational invariance 94,102,190, 213 - of the correlation function 104 405 transpositions 4 two-fluid model 69 two-particle interaction 22 two-particle operator 15,20 two-particle state 281 ultraviolet divergence 74,192,361 uncertainty relation 207 vacuum 281 vacuum expectation value 283,337 vacuum polarization 365,372 vacuum state 12, 17,279,303,389 variation 263, 269 vector potential, electromagnetic 301 vertex 333, 335 vertex corrections 366,372 vertex point 356 violation of parity conservation 234 virtual particles 336 von Neumann equation 86 Ward identity 363,368-371 wave function renormalization constant 364 wave packets 195,198 wave, plane 202 weak interaction 234 weakly interacting Bose gas 62-68 - condensate 62 - excitations 66 - ground-state energy 68,74 Weyl equations 242,377 Wick's theorem 322, 335-339, 356 Wigner crystal 45, 54 world line 11 7 Wu experiment 234 Z factors 370 zero-point energy 254,280 - divergent 314 zero-point terms 293 Zitterbewegung 199-201 - amplitude of 199 Location: https://www.springer .de/phy s I You are one click away from a world of physics information! Come and visit Springer's Physics On ine ibrary Books • Search the Springer website catalogue • Subscribe to our free alerting service for new books • Look through the book series profiles You want to order? Email to:[email protected] Journals • Get abstracts. ToC's free of charge to everyone • Use our powerful search engine SpringerLink Search • Subscribe to our free alerting service Springerlink Alert • Read full-text articles (available only to subscribers of the paper version of a journal) You want to subscribe? Email to:[email protected] Electronic Media You have a question on an electronic product? • Get more information on our software and CD-ROMs Email to:[email protected] : ............... Bookmark now: nger.de/phys/ Springer. (ustomff Service H.....ntr. 7 • 0-69126 H.id.I .... rg. G.rmany Tel: +4910) 6221 )45·0· fax: +49(0) 6221345-4229 dlkp 006437 sflc tc Printing: Saladruck Berlin Binding: Sturtz AG, Wurzburg Springer