Maxwell's Equations and Gauge Theory: Electromagnetism as a Principal Bundle
A compass needle aligns with a magnetic field threading through empty air. A lightning bolt drives a current through the atmosphere via the same electric force that pulls charged particles in a wire. Radio waves cross a continent at the speed of light, which turns out to be the same constant that emerges from the equations governing both electricity and magnetism. All of this follows from four equations that James Clerk Maxwell assembled in 1865.
The classical formulation in vector calculus is powerful and concrete, but it hides something. The equations have a symmetry—gauge invariance—that suggests the fields E and B are not the most fundamental objects. Behind them sits the vector potential A, and behind A sits the geometry of a principal fibre bundle over spacetime. The same geometric structure that appears in electromagnetism reappears, with a non-abelian gauge group, in the quantum chromodynamics of quarks and the electroweak theory of the Standard Model. In 2000, the Clay Mathematics Institute designated the question of whether the non-abelian (Yang–Mills) version of this structure always has a positive mass gap as one of the seven Millennium Prize Problems, with a million-dollar prize still unclaimed.
This post develops four parallel languages for Maxwell's equations: the classical vector calculus form, the coordinate-free language of differential forms on Minkowski spacetime, the spacetime algebra (geometric algebra) encoding all four equations in a single line, and the principal fibre bundle construction that reveals gauge theory as differential geometry. At each level the physical content is identical; what changes is how much of the underlying structure becomes visible.
1. The Classical Laws
The four Maxwell equations rest on four independent experimental observations, each discovered in a 46-year window between 1785 and 1831. Coulomb measured the force between static charges. Biot and Savart quantified the force between current-carrying wires. Faraday discovered that a moving magnet induces a voltage in a nearby coil. Ampere related closed line integrals of to enclosed currents. In 1865, Maxwell added one term to Ampere's law to repair an inconsistency with charge conservation. As a consequence, he found that light is an electromagnetic wave.
Before stating the laws, two pieces of notation need grounding.
Charge is an intrinsic scalar property of matter, analogous to mass but with one fundamental difference: it can be positive, negative, or zero. Two particles with the same sign of charge repel each other; two with opposite signs attract. Coulomb quantified this force in 1785, and that measurement is the foundational observation that justifies introducing charge as a concept at all. Classical theory offers no deeper explanation for why matter carries charge; it is a label that organizes the forces we observe. Field theory adds structural content to that label: charge is the source term that couples matter to the electromagnetic field. Without charge, Gauss's law reduces to the homogeneous equation , and the field has no reason to be nonzero anywhere. Charge is what forces the field to respond to the presence of matter.
Charge density extends this to continuous distributions. When charge is spread through a volume (as in a plasma or the conduction band of a metal), we cannot assign a charge to each geometric point; instead we ask how much charge occupies an infinitesimally small region near . Let be a ball of radius centered at . Then
where is the total charge inside that ball at time and . This is the same construction as mass density in continuum mechanics: charge per unit volume, measured in coulombs per cubic meter. The total charge in any region is recovered by integration:
Point charges and the Dirac measure. An electron carries all of its charge at a single spatial point, to every precision ever measured. A density function for this situation must be zero everywhere except at the electron's location, yet integrate over all of space to the total charge . No function in the ordinary (Lebesgue) sense can achieve this: a function that is zero almost everywhere has integral zero, regardless of the value at any isolated point.
The -dimensional Dirac delta resolves this by working in the broader class of distributions: continuous linear functionals on smooth test functions, rather than pointwise-valued maps. The three-dimensional Dirac delta centered at the origin is defined by its action on any smooth compactly supported test function :
It is the evaluation functional at the origin. A concrete way to approach it: is the distributional limit of the family of normalized Gaussians
Drag the slider right to push . The Gaussian concentrates to an infinitely tall spike while its area stays fixed at 1. The Dirac delta is what the family converges to in the distributional sense.
as . For any fixed smooth , one has as . The Dirac delta is what that Gaussian family converges to in the distributional sense: a spike of unit mass that has collapsed to a single point. The -dimensional version is the same construction in , satisfying .
The charge density of a point charge at the origin is then , a distributional statement with full physical weight: substituting into Gauss's law and integrating over any sphere enclosing the origin recovers Coulomb's inverse-square field exactly. A point particle moving along a worldline generalizes this to , with current density . In field theory, the interaction between a charged particle and the electromagnetic potential is the spacetime integral . For a point particle, the delta function collapses this volume integral to a line integral along the worldline. That line integral is precisely the geometric object whose ambiguity encodes gauge freedom and whose measurability produces the Aharonov–Bohm phase in Section 8.
In a conductor or plasma, is instead a smooth function spread continuously through space. Both cases, the point-supported delta and the smooth continuous distribution, are valid instances of the same distributional framework; only the degree of spatial concentration differs.
On quantum mechanics. The treatment above assigns charge density via Dirac deltas for point particles or smooth functions for continuous distributions, and in both cases the electron is treated as if it occupies a definite location or a definite extended region. Quantum mechanics alters this. An electron is not at a definite point; it occupies a quantum state described by a wavefunction , and is the probability density for finding the electron near . The charge density it contributes is accordingly (negative because the electron carries charge ). This is not charge spread spatially in the classical sense; it is the expectation value of the charge density operator in the state . In an atom, is an orbital, and is a smooth function spread over the orbital volume that sources the electric field through Gauss's law exactly as a classical continuous distribution does. Section 10 develops this connection: how quantum charge densities enter Maxwell's equations, how the resulting electrostatic potential feeds back into the Schrödinger equation, and how the coupled system is solved computationally.
The closed surface integral adds up the outward flux of a vector field through every infinitesimal patch of a closed surface (one with no boundary, like a sphere or a box). It measures how much of is leaving versus entering the enclosed region. The divergence theorem relates this to a volume integral: , which is what converts the integral form of each law into the differential (pointwise) form below.
Each law is stated as a proposition and derived from the corresponding experiment below.
Let be the charge density. The electric field satisfies
Derivation. Coulomb's law gives the field of a point charge at the origin as . The flux through a sphere of radius is
By the divergence theorem, . Replacing by a continuous distribution and equating the integrands gives pointwise.
For any magnetic field produced by currents,
Derivation. The Biot–Savart law gives , which can be written as [eq:b-potential] with . Then by the identity , which holds identically for any smooth vector field.
For any stationary closed curve bounding surface ,
Derivation. Faraday observed in 1831 that a changing magnetic flux through a loop drives an EMF:
For a stationary loop, . By Stokes' theorem, . Since this holds for every , the integrands must match: .
For current density and electric field ,
Derivation. Ampere's part. The Biot–Savart law for a long straight wire gives at distance . Integrating around a circular Amperian loop: . By Stokes, this becomes in the static case.
Maxwell's correction. Taking the divergence: , but charge conservation requires , which is nonzero during transients. By Gauss's law, . Adding the displacement current to the right side restores consistency: by continuity.
In vacuum (, ), the fields and satisfy the wave equation with propagation speed
equal to the measured speed of light.
Proof. Apply to [eq:faraday-eq] (Faraday's law): using [eq:ampere-eq] (vacuum Ampere–Maxwell law). The identity (with in vacuum) gives . Maxwell identified m/s with the known speed of light and concluded that light is an electromagnetic wave. The same derivation for gives an identical equation.
For the rest of this post, when it reduces clutter without ambiguity, we set and (natural units). The speed-of-light factors can be reinstated by dimensional analysis: every time appears where a spatial derivative would sit, supply a factor of .
2. The Vector Potential and Gauge Freedom
The equation says that the magnetic field has no divergence. A standard result from vector calculus (the Poincaré lemma in disguise) says that any divergence-free vector field on a simply connected domain can be written as the curl of another vector field. This defines the magnetic vector potential.
Because , we write
for a vector field called the magnetic vector potential. Substituting into Faraday's law gives , so the combination is curl-free and can be written as the gradient of a scalar:
The scalar is the electric scalar potential.
The fields E and B are determined by the pair , but not uniquely: different potentials can produce the same fields.
For any smooth scalar function , the transformation
leaves the physical fields E and B unchanged: and .
Proof. Direct substitution: since any gradient is curl-free. For E: since partial derivatives commute.
The freedom in choosing is called gauge freedom. Two gauge choices that often appear in practice:
The Lorenz gauge imposes the covariant condition , which reduces the potential equations to decoupled wave equations: and , where is the d'Alembertian.
The Coulomb gauge imposes , which makes the scalar potential satisfy Poisson's equation instantaneously, without retardation. This gauge is often used in quantum mechanics and in the canonical quantization of the electromagnetic field.
The potentials were originally treated as mathematical auxiliaries with no independent physical meaning. Section 8 shows this view is wrong: the Aharonov–Bohm effect demonstrates that the vector potential has measurable physical consequences in regions where .
3. Differential Forms and the Faraday Tensor
Vector calculus works beautifully in flat three-dimensional space, but it cannot be easily extended to curved spacetime or to general dimensions. Differential forms provide the coordinate-free language that makes this extension natural. The key payoff: all four Maxwell equations reduce to two compact statements.
A -form on a smooth manifold is a field of antisymmetric multilinear maps on the tangent space, eating tangent vectors and returning a number. These spaces are organized into the de Rham complex: a sequence of vector spaces connected by the exterior derivative , which maps -forms to -forms, subject to the central identity . This single algebraic fact encodes conservation laws, Bianchi identities, and the existence of gauge potentials all at once.
On a smooth -manifold , the de Rham complex is the sequence
where is the space of smooth -forms on and each arrow is the exterior derivative. The defining property says: applying the exterior derivative twice gives zero, so every exact form (one of the form ) is automatically closed (satisfies ).
In three-dimensional Euclidean space , the sequence specializes to
and the identity recovers two classical identities simultaneously: (the curl of any gradient is zero) and (the divergence of any curl is zero). These are not independent facts; they are both instances of read at different grades of the complex.
A -form is closed if , and exact if for some -form . Every exact form is closed, but whether every closed form is exact depends on the global topology of . The -th de Rham cohomology group measures the gap:
A nonzero class in corresponds to a closed form that cannot globally be written as an exact form, signaling a -dimensional topological hole in .
The Poincaré lemma provides the converse on contractible domains: on any contractible open set , every closed form is exact. In three dimensions, this says that any curl-free vector field on a simply connected region is a gradient, and any divergence-free field is a curl. On non-simply-connected domains (such as ), closed forms need not be exact, and this failure is precisely what allows the Aharonov–Bohm effect (Section 8).
The de Rham complex is the continuum structure that discrete exterior calculus (DEC) faithfully discretizes: smooth manifolds become simplicial complexes, smooth -forms become -cochains, and the exterior derivative becomes a combinatorial coboundary operator that inherits exactly from the boundary identity . Section 10 develops this construction and uses it to formulate Maxwell's equations on a discrete mesh, where and gauge invariance hold exactly at the discrete level rather than approximately. For Maxwell's equations, the relevant complex runs over four-dimensional Minkowski spacetime up to .
Minkowski spacetime carries coordinates with metric signature . The metric tensor is . Lowering and raising indices uses and its inverse .
The Levi-Civita symbol with defines the Hodge star: for a -form with components , the Hodge dual is the -form with components proportional to .
The electromagnetic field is encoded in the Faraday 2-form , with components
In terms of the electric and magnetic field components:
Explicitly in coordinates:
The current density defines a 1-form (or its Hodge dual 3-form, depending on convention).
The four Maxwell equations are equivalent to
The first equation, , encodes Faraday's law and the no-monopole condition . The second, , encodes Gauss's law and Ampere's law.
Proof. We work in coordinates with . We use the convention in which the component form of is , (consistent with the STA bivector in Definition 4.2). In this convention the field 2-form reads
Part 1: . Because is a 3-form in four dimensions it has exactly four independent components. The general formula for the antisymmetrised derivative of a 2-form is
The component — the only purely spatial triple. Only the magnetic 2-forms contribute (the electric terms involve and vanish on a purely spatial triple):
Setting this to zero gives , the no-monopole condition.
The component. Only , , and contribute, since these are the only field components whose basis 2-forms share two of the three indices :
Setting this to zero gives , the -component of Faraday's law . The remaining two components follow by cycling :
which are the - and -components of .
Part 2: . The Hodge star in acts on basis 2-forms by with . Applying this to , using to verify signs:
The Hodge star has swapped the electric and magnetic sectors. Now applying to :
The component. Only the spatial 2-forms in contribute a purely spatial 3-form:
The source 3-form (the Hodge dual of the current 1-form ) has in this convention. Setting gives , Gauss's law.
The component. The magnetic 2-forms , and the spatial form contribute:
Setting equal to gives , the -component of Ampere's law . The and components give the remaining components by the same calculation.
All eight equations — four from [eq:maxwell-forms-eq] — are precisely Maxwell's four equations in classical form.
Since , the first equation in [eq:maxwell-forms-eq] says F is a closed 2-form. On a contractible domain, the Poincaré lemma guarantees a 1-form A such that . This is the gauge potential in differential form language. The components of A are exactly from Section 2. The gauge transformation leaves unchanged since .
The forms language also makes the charge conservation law automatic: applying to and using gives , which in coordinates is , the continuity equation.
4. Spacetime Algebra:
The differential forms approach breaks Maxwell's equations into two statements. Geometric algebra (Clifford algebra applied to spacetime) collapses them into one. The gain is more than notational: the algebraic structure makes the spinor formulation of relativistic quantum mechanics and the Yang–Mills generalization (Section 7) more transparent.
The spacetime algebra (STA) is the real Clifford algebra generated by four basis vectors satisfying
so that and for . The 16-dimensional algebra has a basis organized by grade: grade 0 (scalars, 1 element), grade 1 (vectors, 4 elements), grade 2 (bivectors, 6 elements), grade 3 (trivectors, 4 elements), grade 4 (pseudoscalar, 1 element). The pseudoscalar is with .
The spacetime gradient is , where .
In the STA, the electromagnetic field is a grade-2 multivector (bivector):
where and are relative bivectors (bivectors with one timelike leg), and is the pseudoscalar. In natural units :
All four Maxwell equations are equivalent to the single multivector equation
where is the 4-current. The vector part encodes Gauss and Ampere. The trivector part (when is equated to the grade-1 object J) encodes Faraday and the no-monopole condition.
Proof. Write with , so and . The Clifford product of a grade-1 vector with a grade-2 bivector decomposes by grade:
where (the dot part) decreases grade by 1 and (the wedge part) increases grade by 1. Since is a bivector, is the sum of a grade-1 vector and a grade-3 trivector.
Grade-1 part: inhomogeneous equations. Expanding in the canonical blade basis (computed explicitly via the Clifford multiplication rules for and ):
The coefficient is ; equating to gives Gauss's law. The coefficient is ; equating to gives Ampere's law .
Grade-3 part: homogeneous equations. Expanding in the trivector basis:
The coefficients are the components of ; each set to zero gives a component of Faraday's law. The purely spatial trivector coefficient is
which set to zero gives . Since is a grade-1 object, equating forces the grade-3 part to vanish identically, recovering all four homogeneous equations simultaneously. The grade-1 equation then yields the four inhomogeneous equations of [eq:maxwell-forms-eq].
For a non-abelian gauge field (Section 7), the STA equation becomes
where the bracket is the commutator in the Lie algebra, and is the non-abelian curvature. The additional commutator term reflects the self-interaction of non-abelian gauge bosons: gluons in QCD, for instance, carry color charge and interact with each other, unlike photons. The STA formulation makes visible that this self-interaction is a purely algebraic consequence of non-commutativity.
5. Principal Fibre Bundles
The gauge potential A of electromagnetism is a 1-form on spacetime with values in a Lie algebra. This structure has a precise geometric home: it is a connection on a principal fibre bundle. Understanding that home requires building up the definition carefully, but the physical picture is always a family of gauge-equivalent copies of the structure group, one sitting over each point of spacetime.
Think of a helix wrapping around a cylinder. The cylinder is the base space (the spacetime you navigate), the circle at each height is the fibre (the gauge group), and the helix tells you how to move consistently from one fibre to the next. That consistent prescription is a connection.
A principal -bundle over a smooth manifold is a smooth manifold together with:
- a smooth surjection (the bundle projection),
- a smooth free right action , written , that preserves fibres: for all , ,
- local triviality: each has an open neighborhood and a diffeomorphism intertwining the right action with right multiplication in G.
The fibre over is . A local section is a smooth map with .
The 3-sphere is the set of pairs with . The unit circle acts on by . The quotient is the 2-sphere , and the map given by the Hopf map
is a principal -bundle. This bundle is nontrivial: there is no global section, meaning no smooth way to pick one representative from each fibre over all of . The obstruction is topological and is measured by the first Chern class .
The gauge group of electromagnetism is . Over Minkowski spacetime , the electromagnetic field lives on a principal -bundle . Since is contractible, this bundle is trivial and a global section exists: this is why we can define A as a globally defined 1-form. Over topologically nontrivial spacetimes (for instance, outside an infinitely long solenoid, where the effective base space is with fundamental group ), the bundle can be genuinely nontrivial. That nontriviality is the Aharonov–Bohm effect.
6. Connections and Curvature
A connection on a principal bundle answers the question: given a path in the base manifold, how do we lift it to a path in the total space in a consistent, -equivariant way? The answer defines parallel transport, and the failure of parallel transport to close around loops is curvature.
Concretely, a connection splits the tangent space of at each point into a vertical part (tangent to the fibre, "staying at the same base point") and a horizontal part (transverse to the fibre, "moving in the base direction"). The choice of horizontal subspace, required to be G-equivariant and smooth, is the connection.
A connection on a principal -bundle is a -valued 1-form satisfying:
- for all , where is the fundamental vector field generated by X,
- (G-equivariance under right multiplication).
Given a local section , the local gauge potential is the pullback
This is exactly the gauge field A of physics.
If is a different local section (a gauge transformation by a smooth map ), then the local potentials are related by
For the abelian case with , this reduces to
which, after identifying , recovers [eq:gauge-transform-eq] from Section 2.
The curvature of a connection is the -valued 2-form
where the bracket is the Lie bracket in . Locally, via a section , the curvature pulls back to
For the abelian case , the Lie algebra is commutative, so and
This is [eq:maxwell-forms-eq], the Faraday 2-form of Section 3. The four Maxwell equations follow by imposing the Yang–Mills equations (Section 7) on this abelian bundle.
The Bianchi identity (for the abelian case) is not a physical law but a geometric identity. It says , which follows from . In the non-abelian case the identity becomes the covariant Bianchi identity , reflecting the structure equations of the Lie algebra.
7. Yang–Mills Theory
In 1954, Chen-Ning Yang and Robert Mills asked: what happens if the gauge group is non-abelian? The same geometric structure (principal bundle, connection, curvature) applies, but the non-commutativity of the group means the gauge bosons themselves carry charge and interact. In electromagnetism, photons are electrically neutral and do not interact with each other. In a non-abelian gauge theory, the gauge bosons have charges under their own gauge group and self-interact.
For a principal -bundle with connection A and curvature , the Yang–Mills action is
where is the Killing form on and is the Hodge star of the spacetime metric. For with , this is proportional to , the electromagnetic field energy.
The Euler-Lagrange equations of under variation of A are
where is the covariant exterior derivative. Together with the Bianchi identity , these are the Yang–Mills equations. For they reduce to [eq:maxwell-forms-eq] (Maxwell's equations).
The Standard Model of particle physics uses the gauge group :
- The factor gives the hypercharge gauge field, which mixes with the gauge boson to produce the photon (electromagnetism) and the Z boson after spontaneous symmetry breaking.
- The factor governs the weak nuclear force, with gauge bosons and .
- The factor governs quantum chromodynamics (QCD), with 8 gluon gauge bosons carrying color charge. Because is non-abelian, gluons interact with each other, producing confinement: quarks cannot be extracted individually from hadrons.
Each of these three factors is a Yang–Mills theory on the same principal bundle framework.
Pure Yang–Mills theory on is expected on physical and numerical grounds to have a mass gap: the lowest energy excitation above the vacuum has energy at least . This would explain why the weak and strong forces are short-range (massive mediators) despite the gauge bosons being classically massless. A rigorous mathematical proof (constructing a quantum Yang–Mills theory on and proving a positive mass gap) has not been found. The Clay Mathematics Institute lists this as one of the seven Millennium Prize Problems, each carrying a $1,000,000 prize.[3]
8. The Aharonov–Bohm Effect
Consider a solenoid: a tightly wound coil of wire carrying current. Inside the solenoid there is a strong magnetic field. Outside, by symmetry and Ampere's law, the field is nearly zero. Classical electromagnetism predicts that electrons traveling only in the field-free region outside the solenoid should be unaffected by whatever happens inside.
In 1959, Yakir Aharonov and David Bohm predicted that this classical intuition is wrong. Even though outside, the vector potential is nonzero (it must be, since is nonzero inside). And the vector potential has a directly measurable effect on quantum mechanics: it shifts the phase of the electron wavefunction along any path. If two electron paths encircle the solenoid from opposite sides and recombine, the phase difference between the paths depends on the total enclosed magnetic flux.
An electron wavefunction acquires a phase
along a path in spacetime, where is the electron charge and A is the electromagnetic gauge potential. For two paths (upper) and (lower) that together encircle the solenoid, the relative phase difference is
where is the total magnetic flux through the solenoid, is the magnetic flux quantum, and is any surface bounded by . The second equality uses Stokes' theorem. The effect is nonzero whenever , regardless of whether B is zero on both paths.
The prediction was confirmed experimentally by Chambers (1960) and definitively by Tonomura and collaborators (1986) using electron holography with a toroidal magnet fully enclosed in a superconducting shield, ensuring everywhere on the electron paths to very high precision.
Drag the flux slider to shift the interference fringe pattern. Even though outside the solenoid, the phase difference [eq:ab-phase-eq] grows linearly with flux.
In the language of fibre bundles, the AB phase is the holonomy of the connection A around the closed loop formed by the two paths. Holonomy measures how much parallel transport around a loop fails to return to the starting point in the fibre. For an abelian bundle, holonomy is a phase in .
The AB phase around a loop is
This can be nonzero even when the curvature along the entire path (since outside the solenoid). The discrepancy is possible because the loop is not contractible in the base space: has fundamental group . The winding number of the loop around the excluded origin counts how many times the path encircles the solenoid. A flat bundle (zero curvature) can still have nontrivial holonomy on a topologically nontrivial base.
This shows that A is not just a gauge artifact. The physical content of electromagnetism is not fully captured by the local fields E and B; the full content is the connection on the principal bundle, including its holonomy properties over topologically nontrivial paths.
9. Dirac Monopoles and Chern Classes
No magnetic monopole has ever been observed. But Dirac showed in 1931 that if a single monopole exists anywhere in the universe, electric charge must be quantized in discrete units: every particle's charge is an integer multiple of the electron charge. This is a striking example of topology constraining physics.
The argument uses the same two-patch construction that classifies line bundles over spheres, now called the Wu–Yang construction.
A magnetic monopole of strength at the origin creates a radial magnetic field
satisfying . On any sphere of radius surrounding the origin,
The problem: we need to write for a potential A defined on the sphere, but Stokes' theorem would then give (a sphere has no boundary), contradicting the nonzero flux. A globally defined A on cannot exist when the flux is nonzero.
Split into two overlapping patches: a northern hemisphere (including the north pole) and a southern hemisphere (including the south pole), overlapping on an equatorial strip. Define gauge potentials on and on that correctly reproduce the monopole field on each patch. On the equatorial overlap, they must differ by a gauge transformation:
For the wavefunction of an electron (charge ) to be single-valued as it traverses the equator once, the gauge transformation parameter must satisfy
Combined with , this gives the Dirac quantization condition:
If any magnetic charge exists, electric charge must be a rational multiple of , explaining charge quantization universally.
The two-patch construction defines a principal -bundle over , classified by the transition function on the equatorial overlap. Since the equator is a circle , the transition function is a map classified by its winding number .
This integer is the first Chern class of the bundle. In terms of the curvature:
For the trivial bundle (no monopole), . For the Hopf bundle of Section 5, . The bundle of a Dirac monopole with Dirac charge has .
The integrality of is a theorem, not an assumption. For any principal -bundle over any compact 2-manifold, the integral of the curvature form over the manifold is an integer multiple of . This is a special case of the Chern-Weil theorem, which relates the topology of a bundle (characteristic classes, computed from curvature forms) to the topology of the base manifold.
The physical content: charges are quantized because the topology of gauge bundles is quantized. The same integer that counts the winding of the transition function also counts the units of magnetic charge. Topology constrains physics.
10. Orbitals, Maxwell's Equations, and Computational Electrostatics
The four languages developed in the preceding sections describe the electromagnetic field given its sources. In classical physics, those sources are point charges and continuous charge distributions, handled via Dirac deltas and smooth densities as in Section 1. In quantum mechanics, even a single particle has an extended charge distribution: the source term in Maxwell's equations is determined by the particle's quantum state. This section develops the semiclassical coupling, in which the charge distribution is quantum-mechanical but the electromagnetic field is treated classically.
For an electron in a normalized quantum state , the semiclassical charge density and current density sourcing Maxwell's equations are
where is the electron charge, is the electron mass, and is the magnetic vector potential. The two terms in are the paramagnetic current (driven by the phase gradient of ) and the diamagnetic current (driven by the vector potential itself). The continuity equation is satisfied identically as a consequence of the time-dependent Schrödinger equation, confirming that quantum mechanics is internally consistent with charge conservation.
For hydrogen, the Schrödinger equation with Coulomb potential has normalized eigenstates , where is the Bohr radius. The charge densities of the lowest orbitals are:
Both are smooth, extended functions. The 1s density is spherically symmetric with characteristic scale ; the density has the dumbbell profile of the orbital's angular part. From far away (), both densities integrate to over all space, so the electron looks like a point charge at the nucleus, consistent with Coulomb's law.
The electrostatic potential sourced by an electron in state satisfies Gauss's law in the form of Poisson's equation:
The solution is the Hartree potential,
This is Coulomb's law with the point charge replaced by the quantum charge density. For the 1s orbital, the integral evaluates in closed form via the expansion of in Legendre polynomials, giving a potential that behaves as for and smoothly interpolates to a finite value at the nucleus.
Derivation. The charge density is the source in Gauss's law [eq:gauss-e]. In the electrostatic case, , so , giving . The integral formula is the Green's function solution of Poisson's equation on with source .
Self-consistent fields. In a many-electron atom or molecule with electrons occupying orbitals , the total charge density is and the Hartree potential sourced by all electrons is
Each orbital satisfies a Schrödinger equation with this potential:
The orbitals determine , which sources through Poisson's equation, which in turn enters the Schrödinger equation for the orbitals. This circularity is the self-consistent field (SCF) problem: the solution must be found iteratively. Adding exchange corrections (the antisymmetry of the many-electron wavefunction) gives the Hartree–Fock equations; replacing the many-body wavefunction with the electron density as the primary variable gives density functional theory (DFT), which is exact in principle via the Hohenberg–Kohn theorem and tractable in practice via approximate exchange-correlation functionals.
Minimal coupling and the vector potential. So far only the scalar potential has appeared. The full coupling between an electron and the electromagnetic field is captured by minimal coupling: replace the canonical momentum with the kinetic momentum everywhere in the Hamiltonian:
Expanding and using the Coulomb gauge (so that and commute):
The term is the paramagnetic coupling, responsible for orbital magnetic moments and NMR chemical shifts; the term is the diamagnetic coupling. The Aharonov–Bohm phase of Section 8 is the unitary transformation implementing minimal coupling for a wavepacket traveling along : substituting into the free Schrödinger equation produces exactly the minimally coupled Hamiltonian above. Gauge invariance of the Schrödinger equation under together with matches the gauge transformation of Section 2 exactly.
Computational pipeline. Solving the coupled Schrödinger–Poisson system for a molecule or solid proceeds iteratively:
- Initialize trial orbitals (typically superposed atomic orbitals).
- Build the electron density .
- Solve Poisson's equation for the Hartree potential. In plane-wave codes, this is a pointwise division in Fourier space: , so . In real-space codes it is solved by multigrid or finite element methods on a numerical grid.
- Build the Kohn–Sham or Hartree–Fock Hamiltonian with updated .
- Diagonalize: solve .
- Check convergence: if , accept; otherwise return to step 2.
Step 3 is the point where Maxwell's equations enter the quantum calculation: the charge density generated by quantum mechanics is fed into the classical electrostatic equation, and the resulting potential modifies the quantum dynamics. The loop between step 2 and step 5 is a discrete version of the self-consistent field condition, and its convergence is guaranteed (under mild conditions) by the Banach fixed-point theorem applied to the density mixing map.
Discrete exterior calculus and cochain complexes. The computational methods above (finite element grids, plane-wave Fourier solves) are instances of a common algebraic framework: discrete exterior calculus (DEC). The central idea is to replace the smooth de Rham complex of Section 3 with an exact combinatorial analogue defined on a mesh.
Let be a simplicial complex (a mesh of nodes, edges, triangles, and tetrahedra) triangulating a domain in . A -cochain is a linear map from the -cells of to : it assigns a real number to each edge (), each face (), each volume (), or each node (). The space of -cochains is .
The coboundary operator is defined by duality with the boundary operator :
for any -cell , where is its oriented boundary (a signed sum of -cells). Since (the boundary of a boundary is empty), we get automatically. This gives the discrete cochain complex:
In coordinates, is the node-to-edge incidence matrix, is the edge-to-face incidence matrix, and is the face-to-volume incidence matrix. These are sparse integer matrices encoding the combinatorial topology of the mesh.
The de Rham complex of Section 3 is the continuum limit of this construction: as the mesh is refined, smooth -forms integrate over -cells to give cochains, and the coboundary operator converges to the exterior derivative. Whitney forms (piecewise linear -forms associated to each -simplex) are the interpolating basis functions that carry this correspondence to the finite element setting; their span over a simplicial complex forms a subcomplex of the de Rham complex, which is the foundation of finite element exterior calculus (FEEC).
The Hodge star of Section 3 maps -forms to -forms using the metric. Discretely, it maps primal -cochains to dual -cochains, where the dual mesh is constructed by placing a dual node at the circumcenter of each primal simplex and connecting dual nodes across shared faces.
The discrete Hodge star is a diagonal matrix whose entries are ratios of dual and primal volumes:
where is the volume of a primal -cell and is the volume of its dual -cell. The discrete Laplacian on -cochains is then
a sparse symmetric positive semi-definite matrix. On 0-cochains, is the discrete scalar Laplacian: the equation is the cochain form of Poisson's equation from step 3 of the SCF pipeline.
Place the electromagnetic degrees of freedom on the primal and dual meshes as follows. The scalar potential lives on primal 0-cochains; the vector potential on primal 1-cochains; the magnetic field on primal 2-cochains; and the electric field on dual 1-cochains. With this placement, [eq:maxwell-forms-eq] from Section 3 become exact algebraic identities:
Both hold at the level of integer arithmetic on the mesh because . The source equations (Gauss and Ampere–Maxwell) become
These are the DEC discretization of from Section 3.
The Yee finite-difference time-domain (FDTD) grid, the standard computational tool for solving Maxwell's equations in 3D, is a special case of this structure: it is the DEC complex on a Cartesian hexahedral mesh, where E-field components sit on primal edges, B-field components sit on primal faces, and the staggering in space and time ensures that is satisfied exactly at every grid point and every time step.
Three structural properties follow from the cochain complex that are difficult to achieve by ad hoc discretizations. First, is preserved exactly by the time-stepping scheme, not just approximately, because it is an algebraic consequence of . Second, the discrete gauge freedom of Section 2 is preserved: replacing leaves unchanged because . Third, the discrete Hodge decomposition gives an exact splitting of any 1-cochain into a gradient part, a curl part, and a harmonic part, enabling gauge-fixed solvers (discrete Lorenz gauge or Coulomb gauge) that are guaranteed to be consistent.
For the SCF pipeline, DEC replaces the ad hoc Poisson solve in step 3 with a principled algebraic system: the charge density is a 3-cochain (a number per mesh volume), the potential is a 0-cochain (a number per node), and the discrete Poisson operator is the sparse matrix assembled once from the mesh geometry. Solving by conjugate gradient (exploiting the symmetry and sparsity of ) is the core linear algebra step shared by real-space DFT codes such as GPAW and Octopus.
11. Synthesis
We have presented the same physical theory in four languages. Here is what each one shows.
| Language | Equation | Reveals |
|---|---|---|
| Vector calculus | etc. | Forces on charges, wave propagation, practical computation |
| Differential forms | Coordinate freedom, relativistic covariance, Stokes' theorem, charge conservation | |
| Spacetime algebra | Single compact equation, spinor structure, algebraic unification with gravity | |
| Principal bundles | Gauge invariance as geometry, topological invariants, non-abelian extension |
The fibre bundle picture is the most fundamental. The electric and magnetic fields E and B are not the primary objects; they are components of the curvature 2-form of a connection on a principal bundle. The connection is itself not quite primary: it depends on a choice of local section (gauge choice), and only gauge-invariant quantities (such as the curvature and holonomies along loops) are physical. The primary object is the bundle itself together with the gauge-equivalence class of connections.
Three consequences follow immediately from this geometric viewpoint:
Gauge invariance is geometry. The freedom to choose a gauge is the freedom to choose a local section of the bundle. Changing the section transforms A by and leaves the curvature invariant. This is the geometric origin of gauge invariance, which from the vector calculus viewpoint appears as an arbitrary-seeming symmetry of the potentials.
The Aharonov–Bohm effect is holonomy. The phase acquired by a charged particle traveling around a loop is the holonomy of the connection around that loop. This is nonzero even when the curvature is zero ( on the path), because the bundle can be topologically nontrivial over the base space with the solenoid excluded. The vector potential A is not redundant: it carries the holonomy information that F does not.
Charge quantization is topology. The first Chern class of a bundle is an integer. If magnetic monopoles exist, the Dirac quantization condition forces electric charge to be an integer multiple of a fundamental unit. The integrality is not a dynamical result but a consequence of the fact that bundles are classified by integers.
The same framework, extended to non-abelian gauge groups and , gives the full Standard Model of particle physics. The weak force is a Yang–Mills theory for ; the strong force is Yang–Mills for ; both share the principal bundle geometry developed here. The unsolved mass gap problem for Yang–Mills is a question about the quantum theory of connections: when the classical theory is quantized, do its energy excitations have a positive lower bound? The classical geometry is understood. What remains is the quantum.
References
- Y. Aharonov and D. Bohm, "Significance of electromagnetic potentials in the quantum theory," Phys. Rev. 115 (1959), 485–491. DOI
- A. Tonomura et al., "Evidence for Aharonov–Bohm effect with magnetic field completely shielded from electron wave," Phys. Rev. Lett. 56 (1986), 792–795. DOI
- A. Jaffe and E. Witten, "Quantum Yang–Mills Theory," in The Millennium Prize Problems, Clay Mathematics Institute, 2006. PDF
- P. A. M. Dirac, "Quantised singularities in the electromagnetic field," Proc. R. Soc. London A 133 (1931), 60–72. DOI
- T. T. Wu and C. N. Yang, "Concept of nonintegrable phase factors and global formulation of gauge fields," Phys. Rev. D 12 (1975), 3845–3857. DOI
- J. C. Maxwell, "A dynamical theory of the electromagnetic field," Phil. Trans. R. Soc. London 155 (1865), 459–512. DOI