1. Introduction
In 1907, Einstein introduced the equivalence principle [
1]. He used it to ‘extrapolate’ the effects of special relativity to systems at rest in a gravitational field via their alleged equivalence to uniformly accelerated systems. In that paper, Einstein first derived the gravitational redshift, the gravitational time dilation, and other effects of gravity on electromagnetic processes, like the variable velocity of light and the gravitational light deflection.
His first attempt to extend special relativity to gravitation was, according to Einstein himself, not particularly satisfying, and he returned to the topic in 1911, providing a much simpler derivation of the gravitational time dilation, redshift, and light deflection. Let us briefly review this second derivation of the gravitational redshift [
2].
Consider two material systems,
and
, at rest in a local, uniform gravitational field
(
Figure 1).
and
are separated by a distance
d. Consider further a reference frame
. System
is a free-falling (gravitation-free) system located near
with an initial instantaneous velocity relative to
equal to zero.
Suppose further that a ray of light of frequency
is emitted by
towards
when the relative velocity of the free-falling frame
with respect to
and
is still equal to zero. The ray of light reaches
after a time nearly equal to
. According to the principle of equivalence, this situation is physically equivalent to one in which
is at rest, and
and
accelerate with acceleration
and initial velocity equal to zero. When the ray of light arrives at
, the velocity of
relative to the stationary frame
is equal to
. Therefore,
in the view of any observer in frame , the ray of light received at
has a frequency
as follows
where the second term is the Doppler formula for
.
For
, Einstein substituted the gravitational potential
of
, that of
is taken as zero, and assumed that the relation (
1), deduced for a homogeneous gravitational field, would also hold for other forms of field. Then, Einstein arrived at the well-known (approximated) formula for the gravitational redshift (in this example, it is actually a blueshift)
From this formula, Einstein also derived the gravitational time dilation formula. Suppose that, during the time interval
(as measured by a clock at rest at
),
emits
n waves. Then, from the definition of frequency, we have
. Let
receive these same
n waves during the time interval
(as measured by a clock at rest at
). Then, again, according to the definition of frequency, we have
. Hence, equation (
2) leads to the gravitational time dilation formula
In section 2 of the 1911 paper, Einstein showed that, in general, energy is affected by a gravitational field and that, like the inertial mass, the gravitational mass of a body increases by
when the body absorbs an amount of energy equal to
E. In that derivation, the setup is the same as in
Figure 1. Einstein used the approximated relativistic energy transformation law
[
3] and, again, the equivalence principle. Moreover, by devising a clever thought experiment, he proved the following:
[...] hence, energy must possess a gravitational mass which is equal to its inertial mass. If a mass is suspended from a spring balance in the system [the system moving with acceleration ], the balance will indicate the apparent weight because of the inertia of . If the quantity of energy E is transferred to , the spring balance will indicate , in accordance with the principle of the inertia of energy. According to our basic assumption [the principle of equivalence], exactly the same thing must happen if the experiment is repeated in the system K, i.e., in the gravitational field. [emphasis in the original]
It cannot escape us that the discovery that energy is affected by a gravitational field, together with the quantum formula for the energy of a photon of frequency
,
, and some algebra, again gives the relation (
2) for the gravitational frequency shift. However, although Einstein’s discovery of mass and energy dependence on gravitation is a crucial assumption, and his idealized experiment inspired the subsequent thought experiments analyzed in this paper, this last derivation does not strictly count as a derivation of the gravitational frequency shift from energy conservation. In fact, it is still a derivation from special relativity and the principle of equivalence. Instead, the typical (archetypal) derivations from energy conservation can be found, for instance, in the books by Born [
4], Feynman, Leighton, and Sands [
5], Weinberg [
6], Misner, Thorne, and Wheeler [
7], Rindler [
8], and Schutz [
9] (and many minor physics textbooks).
In the following section, we first explicitly list all the assumptions necessary for the derivation from energy conservation and outline the typical thought experiment widely used in literature, which extends those assumptions to the photon. We briefly discuss the criticism of these kinds of derivation advanced, among others, by Weinberg [
6] and Okun, Selivanov, and Telegdi [
10]. We agree with their reservations in applying the mentioned assumptions to the photon. However, we recall that a derivation from energy conservation that does not require these assumptions for the photon is possible. As representatives of such derivations, we analyze in more detail the thought experiments presented in [
5,
6,
7]. We show that, when revised and corrected, they, too, appear unfortunately troublesome for the very existence of the gravitational frequency shift.
That early attempts for a clear derivation of the gravitational redshift were fraught with errors and ambiguities is not new, see Scott [
12], but here we can say more. In
Section 3, we prove that, in fact, energy conservation does not imply gravitational frequency shift. We do that with a simple thought experiment that does not need to assume any gravitational mass or gravitational potential energy of the photon. In that proof, we do not even need to assume that the gravitational potential energy of a body contributes to the total mass of the body. In the Appendix, we also present a more direct thought experiment that uses the conservation of linear momentum and arrives at the same conclusion.
In the last section, we briefly recap the significance of our results.
2. Gravitational Frequency Shift and the Conservation of Energy
The derivation of the gravitational frequency shift from energy conservation is generally seen as a confirmation of that phenomenon alternative to (and independent of) the classical derivation from special relativity and the principle of equivalence, and it can be found in many textbooks on general relativity.
Let us first list all the premises and commonly held beliefs explicitly or tacitly assumed in the derivation from energy conservation. They are crucial for the acceptance of its physical validity:
- 1)
Not only can mass be converted into energy, but
every kind of energy has mass as well (or can always be converted into mass) via the mass-energy equivalence formula
, where
is the rest mass [
1,
2];
- 2)
Inertial mass is equivalent to gravitational mass;
- 3)
The energy of a light photon with frequency is , where h is the Planck’s constant;
- 4)
The principle of conservation of energy.
An example of derivation is the following. An ‘infinitesimal’ version of it can be found, for instance, in the book by Rindler [
8]. A receiver
R is placed straight above an emitter of photons
E at a distance
d. Both are stationary in a uniform gravitational field
g. The emitter
E emits a photon of frequency
, and energy
, towards
R. Photons do not have rest mass, but for the sake of derivation, it is assumed that the emitted photon has an ‘effective’ gravitational mass
m equal to its inertial mass obtained from the mass-energy equivalence,
(assumptions 1, 2, and 3). Since the emitted photon needs to climb a height
d in the uniform gravitational field, its energy
at the receiver
R is lower than
E. Due to the conservation of energy (assumption 4), we necessarily have that
where the potential energy
is the energy ‘spent’ by the photon climbing the distance
d.
Equation (
4) can be rewritten as follows,
which is the sought-out gravitational frequency shift formula (
1) (if the positions of
E and
R are reversed, the minus sign becomes a plus sign in the equation).
As far as this author knows, that type of derivation received few criticisms, with some notable exceptions like Weinberg, who affirms that the concept of gravitational potential energy for a photon is without foundation [
6]. Like Weinberg, Okun, Selivanov, and Telegdi argue that any explanation of the gravitational frequency shift in terms of gravitational mass and gravitational potential energy of the photon is incorrect and misleading [
10].
We fully agree with these authors since the photon has no rest mass, and the appeal to its gravitational mass and potential energy is a loose and not legitimate argument. However, it is possible to come up with a gravitational frequency shift derivation from energy conservation that does not appeal to those concepts like, for instance, the derivations in Feynman, Leighton, and Sands [
5], Weinberg [
6], Misner, Thorne, and Wheeler [
7], Schutz [
9], Koks [
11], and Earman and Glymour [
13].
As a representative of such derivations, consider first the thought experiment by Misner, Thorne, and Wheeler. They recount Einstein’s 1911 realization of the interaction between light and gravity as follows (the speed of light is set as ):
That a photon must be affected by a gravitational field Einstein (1911) showed from the law of conservation of energy, applied in the context of Newtonian gravitation theory. Let a particle of rest mass m start from rest in a gravitational field g at point and fall freely for a distance h to point . It gains kinetic energy . Its total energy, including rest mass, become
Now, let the particle undergo an annihilation at , converting its total rest mass plus kinetic energy into a photon of the same energy. Let this photon travel upward in the gravitational field to . If it does not interact with gravity, it will have its original energy on arrival at . At this point it could be converted by a suitable apparatus into another particle of rest mass m (which could then repeat the whole process) plus an excess energy that costs nothing to produce. To avoid this contradiction of the principal [sic] of conservation of energy, which can also be stated in purely classical terms, Einstein saw that the photon must suffer a red shift.
In this derivation, nowhere reference is made to the gravitational mass or the gravitational potential energy of the photon. Energy has a mass only after absorption by a non-relativistic and macroscopic material body (the apparatus that converts it into a particle in the last part of the process). That is allowed by the widely-held interpretation of the mass-energy equivalence.
Unfortunately, even Misner, Thorne, and Wheeler’s argument is problematic [
16]. If a particle of rest mass
m starts from rest in a gravitational field
g at point
and falls freely for a distance
h to point
, that particle possesses also an energy equal to
already at point
. It is called gravitational potential energy. Therefore, owing to mass-energy equivalence (assumption 1; see also Appendix A), at point
, that particle already has a total mass/energy equal
1 to
. Now, if the energy of the photon produced in the particle annihilation at point
and traveling upward does not have its original value on arrival at
(i.e.,
), the mass of the particle created by the suitable apparatus at the end of the process would not have the same mass as the original particle (again,
), and the total energy/mass would not be conserved. When Misner, Thorne, and Wheeler say that the particle “gains kinetic energy
” on arrival at point
, and “its total energy, including rest mass, becomes
”, they seem to forget that the particle already has gravitational potential energy
, and total energy
, just before starting to fall. That is demanded by the principle of conservation of energy. The same analysis with a few adjustments also applies to the derivations in Schutz [
9] and Koks [
11] with the same conclusion.
Even if Misner, Thorne, and Wheeler do not explicitly mention the Planck-Einstein formula , the fact that the energy can be converted into a single photon or a finite (and definite) number of photons is a tacit but important further assumption. For if it were possible to convert energy into light in a ‘continuous’ way, the conservation of energy could still be re-established: in principle, if the emitter at point continuously emitted a higher frequency radiation (higher intrinsic energy) for an interval and the receiver at point continuously received a lower frequency radiation (lower intrinsic energy) for a suitably longer interval , the total amount of energy could still be conserved (and the gravitational time dilation would necessarily get back into the game). However, the quantization of energy in light transmission has solid theoretical and experimental corroboration.
Weinberg presented a derivation from energy conservation slightly different from that given by Misner, Thorne, and Wheeler but equally problematic. It could be of some interest to go into detail. Weinberg writes (again, the speed of light is set as
) [
6]:
Incidentally, the gravitational red shift of light rising from a lower to a higher gravitational potential can to some extent be understood as a consequence of quantum theory, energy conservation, and the “weak” Principle of Equivalence. When a photon is produced at point 1 by some heavy nonrelativistic apparatus, an observer in a locally inertial coordinate system moving with the apparatus will see its internal energy and hence its inertial mass change by an amount related to the photon frequency
he observes, that is, by
where
erg sec is Planck’s constant. Suppose that the photon is then absorbed at point 2 by a second heavy apparatus; an observer in a freely falling system will see the apparatus change in inertial mass by an amount related to the photon frequency
he observes, that is, by
However, the total internal plus gravitational potential energy of the two pieces of apparatus must be the same before and after these events, so
and therefore
in agreement with our previous result. (Also, it makes no difference whether the photon frequencies are measured in locally inertial systems, because the gravitational field in any other frame will affect the rate of the observer’s standard clock in the same way as it affects the
’s.)
Leaving aside the reference to the free-falling observer who will necessarily see a Doppler shift due to the motion relative to the stationary emitting apparatus, a thing that, in the humble opinion of this author, unnecessarily complicates the picture, Weinberg’s derivation seems to violate the conservation of energy just from the beginning. First, he states that, upon the photon emission, the apparatus will change its internal energy by an amount . But, then, he says that the variation of the total energy of the apparatus to consider upon emission is . Namely, the apparatus emits energy equal to , but its total energy variation is . That already represents a violation of energy conservation. If we reestablish the conservation of energy (), no gravitational frequency shift is implied.
A similar issue in the application of energy conservation also affects the derivation in Feynman, Leighton, and Sands [
5]. They write:
We know that the gravitational force on an object is proportional to its mass M, which is related to its total internal energy E by . For instance, the masses of nuclei determined from the energies of nuclear reactions which transmute one nucleus into another agree with the masses obtained from atomic weights.
Now think of an atom which has a lowest energy state of total energy
and a higher energy state
, and which can go from the state
to the state
by emitting light. The frequency
of the light will be given by
Now suppose we have such an atom in the state
sitting on the floor, and we carry it from the floor to the height
H. To do that we must do some work in carrying the mass
up against the gravitational force. The amount of work done is
Then we let the atom emit a photon and go into the lower energy state
. Afterward we carry the atom back to the floor. On the return trip the mass is
; we get back the energy
so we have done a net amount of work equal to
When the atom emitted the photon it gave up the energy
. Now suppose that the photon happened to go down to the floor and be absorbed. How much energy would it deliver there? You might at first think that it would deliver just the energy
. But that can’t be right if energy is conserved, as you can see from the following argument. We started with the energy
at the floor. When we finish, the energy at the floor level is the energy
of the atom in its lower state plus the energy
received from the photon. In the meantime we have had to supply the additional energy
of Eq. (42.10). If energy is conserved, the energy we end up with at the floor must be greater than we started with by just the work we have done. Namely, we must have that
or
It must be that the photon does not arrive at the floor with just the energy
it started with, but with a
little more energy. Otherwise some energy would have been lost. If we substitute in Eq. (42.11) the
we got in Eq. (42.10) we get that the photon arrives at the floor with the energy
But a photon of energy
has the frequency
. Calling the frequency of the emitted photon
—which is by Eq. (42.7) equal to
—our result in Eq. (42.12) gives again the relation of (42.5) between the frequency of the photon when it is absorbed on the floor and the frequency with which it was emitted.
The weak link of the above inference chain appears to be the assumption (42.9). The total energy of the atom sitting on the floor is . After being carried to the height H, its total energy becomes (its rest energy plus the work done on the atom). With the emission of a photon of energy , the total energy becomes . According to the conservation of energy, that total energy must be conserved after the atom is carried back to the floor. Now, if we subtract the new rest energy of the atom from this total energy, we get back the correct energy, , and the net amount of work we have done is . Therefore, according to equation (42.11), the photon must arrive at the floor with just the energy it started with at the height H.
3. Proof That Energy Conservation Does Not Imply Gravitational Redshift
Here, we prove that the conservation of energy does not imply gravitational frequency shift. We do that with a simple thought experiment that does not need to assume any gravitational mass or gravitational potential energy of the photon. In this proof, we do not even need to assume that the gravitational potential energy of a body contributes to the total mass of the body (assumption 1), as we have done in the revision of some previous demonstrations. Incidentally, it can be shown that that must always be the case (see Appendix A).
Consider a body of mass
m stationary at point
and a macroscopic apparatus stationary at point
at a height
h above point
in a gravitational field
g (
Figure 2). Let the apparatus perform mechanical work on body
m, raising it to point
. The work done by the apparatus is equal to
, which is also equal to the gravitational potential energy of the body
m relative to point
. Now, if the mass is lowered back to point
and its potential energy conventionally (and entirely) converted into electrical energy and then into a single photon of energy
(ultimately emitted by a beacon), the energy of the photon must always be the same while climbing up the gravitational field back to point
. The photon energy at point
must still be equal to
. That is demanded by the conservation of energy. Through photon absorption, the apparatus must regain the same energy expended at the beginning of the cycle on
m. Therefore, owing to the Planck-Einstein formula, the photon frequency must be the same at points
and
.
To emphasize the above conclusion, consider the cycle in reverse. The first step now consists of the crane emitting a photon of energy (frequency ) suitably lower than . The original energy is such that when the photon arrives at the beacon, it becomes equal to () owing to the standard gravitational redshift (blueshift in this case). In this way, is what is exactly needed to raise the mass m to the crane at the height h. Then, the mass is released back to the initial position, and the energy coming from that release () goes into the crane reservoir. At the end of the cycle, the crane will gain positive energy () out of nowhere.