Why Was Michelson Surprised?

Why Was Michelson Surprised?

Maxwell's theory of electromagnetism, first published in 1864, suggests that light propagates as a wave with a fixed characteristic speed. It was commonly assumed that this speed must be evaluated with respect to the rest frame of a substantial medium, sometimes called the luminiferous aether. In the same year Maxwell considered various ways in which it might be possible to determine the earth’s state of motion relative to the aether. Some of his proposals involved terrestrial measurements of the permittivity of the vacuum, and so on, but none of these could be expected to show any appreciable variation. He conceived and actually performed experiments involving optical refraction, but the results seemed to be null, i.e., to indicate no appreciable “aether drift”. He wrote a paper on this, but when informed by George Stokes that the same experiments had been performed by Arago many years earlier, and that the results had been explained by Fresnel (by invoking partial dragging), Maxwell decided not to publish the paper. Subsequently, in an article on “Aether” for the 9th edition of the Encyclopedia Britannica, Maxwell again discussed the possibility of detecting the Earth’s motion through the ether.

The only practicable method of determining directly the relative velocity of the aether with respect to the solar system is to compare the values of the velocity of light deduced from the observation of the eclipses of Jupiter’s satellites when Jupiter is seen from the earth at nearly opposite points of the ecliptic.

He went on to describe Arago’s refraction experiment, which involved viewing a ray of starlight through an achromatic prism at various angles relative to the earth’s motion. Maxwell then described his own version of this experiment, which made use of a spectroscope, and which he said was communicated “to Dr Huggins and included by him in his [1868] paper on some of the stars and nebulae”.

The experiment was tried at different times of the year, but only negative results were obtained. We cannot however conclude absolutely from this experiment that the aether near the surface of the earth is carried along with the earth in its orbit, for it has been shown by Professor Stokes that according to Fresnel’s hypothesis the relative velocity of the Aether within the prism would be to that of the aether outside inversely as the square of the index of refraction, and that in this case the deviation would not be sensibly altered on account of the motion of the prism through the aether.

He concluded by saying “the whole question of the luminiferous medium near the earth, and its connexion with gross matter, is very far as yet from being settled by experiment”. Later Maxwell wrote to the American astronomer David Peck Todd to ask if current observational accuracy of the moons of Jupiter was sufficient for the test he had proposed, but Todd informed him that it was not. Maxwell also included in his letter to Todd a detailed description of a terrestrial method for measuring the local speed of the aether by reflecting two light beams at right angles to each other, and he noted that this would require precision on the order of (v/c)², which he (Maxwell) thought must surely be unachievable. After Maxwell’s death a short time later, Todd published the letter in the journal Nature in 1879.

The 28-year-old Albert Michelson (1852-1931) read Maxwell's letter in Nature, and was intrigued by the idea of detecting the Earth's motion through the (supposed) ether by measuring variations in the speed of light directly, in the manner that Maxwell had described in the letter, but which he (Maxwell) had assumed was completely impractical. Michelson was an expert in interferometry techniques, and took up the challenge to achieve the level of precision necessary to carry out the test and measure the speed of the aether.

The basic idea was simple. The time required for light to make a round trip back and forth along a measuring bar of length L, assuming the bar is moving axially with speed v relative to the ether, is

so the first-order effect cancels out, leaving only a second order effect in v/c. Michelson realized that interferometry techniques were precise enough to detect variations of this magnitude, and he fully expected to observe such variations when he pointed the device in various directions. Hence he was surprised to find that the experiment, performed first in 1881 in Potsdam, Germany, and then again in 1887 at Case Reserve University with his co-worker Edward Morley, revealed no such variations, i.e., the speed of light was apparently the same in all direction with respect to the inertial coordinates of his laboratory, regardless of the direction of light travel.

Michelson's null result also came as a surprise to Hendrik Lorentz (1853-1928), one of the leading theorists in the field of electrodynamics at the time. Lorentz was initially unable to account for the apparent absence of second-order variations in the speed of light, but he eventually developed a theory incorporating both time dilation and length contraction (first suggested by George Fitzgerald in 1892) to match the experimental results. (The unsuccessful efforts to detect the Earth's motion through the purported ether also provided some of the impetus for Einstein's special theory relativity in 1905, which entails the same time dilation and length contraction effects contained in Lorentz's theory, albeit with a different interpretation.)

Lorentz had been laboring to develop a constructive theory, by determining the detailed physical processes that give material objects their shapes and sizes, all assuming a Galilean background of space and time, i.e., a spacetime consisting of the Cartesian product of a three-dimensional Euclidean space manifold with a one-dimensional time manifold. At that time (around 1900), the only known force with any appreciable strength on a small scale was the electromagnetic force, which Lorentz believed was propagated at the invariant speed c with respect to an absolutely motionless background medium (the luminiferous ether). From this standpoint it was not unreasonable to hypothesize that the structure of all (non-ether) material entities was established and enforced by signals that propagate at the absolute speed c with respect to the ether. For example, an elementary particle at rest in the ether would be expected to have a spherical shape, based on the idea that a wave emanating from the geometric center of the particle would expand spherically until reaching the radius of the particle, where we can imagine that it is reflected, and the reflected wave then contracts spherically back to a point (like a spatial filter) and re-expands on the next cycle. This is illustrated by the left-hand cycle below. (Only two spatial dimensions are shown; in full 4-dimensional spacetime each shell is a sphere).

If the particle is moving relative to the putative ether, then obviously the absolute shape must change from a sphere to an ellipsoid, as illustrated by the right-hand figure above. Of course, the spatial size of the particle with respect to the ether rest frame coordinates is just the intersection of a horizontal time slice with the shaft swept out by these shells. It's also clear that, for any given characteristic particle, since there is no motion relative to the ether in the transverse direction, the size in the transverse direction is unaffected by the motion. Thus the widths of the shells in the "y" direction in the above figure are equal.

On this basis, what can we infer about the dimensions of moving objects? The figure below shows side and top views of one cycle of a stationary and a moving particle (with motions referenced to the rest frame of the putative ether).

It's understood that these represent the same characteristic particle, so the transverse size is the same. The right-hand particle is moving with a speed v in the positive x direction. In each case the geometric center of the particle is moving from point A to point B. The characteristic radius of the particle shell has been taken as unity.

In order to make the transverse sizes of the shells equal, the enclosed areas of the cross-sectional side views must be equal. Thus, light emanating from point A of the moving particle extends a distance 1/λ to the left and a distance λ to the right, where λ is a constant function of v. Specifically, we must have

The leading edge of the shaft swept out by the moving shell crosses the x axis at a distance λ(1−v) from the center point A, which implies that the object's instantaneous spatial extent from the center to the leading edge is only

Likewise it's easy to see that the elapsed time (according to the putative ether rest frame coordinates) for one cycle of the moving particle, i.e., from point A to point B, is simply

compared with an elapsed time of 2 for the same particle at rest.

Hence we unavoidably arrive at Fitzgerald's length contraction and Lorentz's time dilation for objects in motion with respect to the x,y,t coordinates, provided only that all characteristic spatial and temporal intervals associated with physical entities are enforced by signals that propagate at the fixed speed c = 1 with respect to these coordinates.

Lorentz's writings on this subject include references to what he called the "theorem of corresponding states". He noticed that there is a system of space and time coordinates, x′,y′,t′, in terms of which the right-hand particle in the above figure is at rest, and the physical description of the moving particle corresponds precisely to the description of the left-hand particle with respect to its rest frame coordinates (i.e., the ether frame). The transformation between these two coordinate systems is easily shown to be

Noting that the speed c = 1 is preserved under this transformation, we see that the x′,y′,t′ coordinate system could just as well be regarded as the rest frame of the ether. Indeed, if we just algebraically invert this transformation we find

In other words, the relationship between these two systems is reciprocal and symmetrical, and they are moving with speed −v and +v relative to each other. Thus if all physically meaningful substantial intervals of time and space are ultimately enforced by signals propagating at a fixed speed c with respect to one particular "absolute" system of space and time coordinates, then it automatically follows that there exist infinitely many systems of space and time coordinates, related according to the above transformation, all of which have the same fixed speed c, and each of which is equally suitable to be considered the "absolute" system of reference. Henri Poincare (1854-1912) noted that the so-called Lorentz transformations constitute a group that maintains the invariance of all quantities of the form (dt)² − (dx)² − (dy)² − (dz)², just as the group of translations and rotations in Euclidean space maintains the invariance of all quantities of the form (dx)² + (dy)² + (dz)². Just as there is no intrinsically distinguished position or orientation in Euclidean space, there is no intrinsically distinguished rest frame in Minkowskian spacetime, so this naturally leads to Einstein's relativistic view of spacetime.

It's interesting that, despite the unavoidability of these conclusions from the basic premises that were held by many of the physicists from Maxwell to Lorentz (i.e., the electromagnetic world view), few if any researchers anticipated Michelson's null result. There are, I think, two reasons for this. First, Maxwell's equations are linear, so it was always clear that they could never yield stable particles. Hence there must be other forces at work, fundamentally different from electromagnetism, in order to achieve stable particles. This realization obviously undermines the cogency of Lorentz's molecular force hypothesis. (Indeed, it was this very realization that prompted Einstein to seek a more general phenomenological principle, not directly tied to Maxwell's equations, but compatible with them.) Second, and perhaps more importantly, Lorentz's theorem of corresponding states rested on another hypothesis, namely, the hypothesis that the inertial masses of material objects transform (longitudinally and transversely) in accord with the same expressions as do electromagnetic forces. There is no a priori reason - aside from the principle of relativity - to suppose that this hypothesis is true. In fact, the combination of this inertial hypothesis with the molecular force hypothesis is tantamount to the assumption of relativity, i.e., the assumption that electromagnetism and mechanics are covariant with respect to the same family of velocity transformations. The reason people were surprised by this is that it entailed a modification of either Maxwell's equations or Newton's laws, both of which seemed satisfactory within their respective areas of applicability. (See Section 3.6 of Reflections on Relativity for a more detailed discussion.)

Incidentally, the projections of the points A and B in the above drawing are the foci of the projected ellipse. Also, the semi-latus rectum of this ellipse is the factor (1−v²)^1/2, and the eccentricity of the ellipse is v. To see this, we need only take the locus of points comprising the shell of the particle with respect to the co-moving coordinates

and apply the (inverted) Lorentz transformation, which gives

Since sin(θ)² + cos(θ)² = 1, the second two equations give the projected locus

which is the equation of an ellipse with the stated semi-latus rectum and eccentricity, with one focus at (0, 0) and the other at (2v/(1−v²)^1/2, 0). The polar equation of this ellipse is

The angle ϕ in this equation is different from the angle θ in the equations for x and y, because θ is evaluated with respect to the co-moving x′,y′,t′ coordinates, whereas ϕ is evaluated with respect to the x,y,t coordinates. Since (x)² + (y)² = (r)², these angles are related by the equation

Expanding the squares and simplifying, this gives

This is equivalent to the well-known aberration formula, which can be seen by expanding the left hand side and solving for cos(θ), which gives

Return to MathPages Main Menu