Bell’s theorem [
1] is an impossibility argument (or “proof”) that claims that no locally causal and realistic hidden variable theory envisaged by Einstein that could “complete” quantum theory can reproduce all of the predictions of quantum theory. But some such claims of impossibility in physics are known to harbor unjustified assumptions. In this paper, I show that Bell’s theorem against locally causal hidden variable theories is no exception. It is no different, in this respect, from von Neumann’s theorem against all hidden variable theories [
2], or the Coleman-Mandula theorem overlooking the possibilities of supersymmetry [
3]. The implicit and unjustified assumptions underlying the latter two theorems seemed so innocuous to many that they escaped notice for decades. By contrast, Bell’s theorem has faced skepticism and challenges by many from its very inception (cf. footnote 1 in [
4]), including by me [
4,
5,
6,
7,
8,
9,
10,
11,
12,
13,
14,
15], because it depends on a number of questionable implicit and explicit physical assumptions that are not difficult to recognize [
9,
15]. In what follows, I bring out one such assumption and demonstrate that Bell’s theorem is based on a circular argument [
8]. It unjustifiably assumes the additivity of expectation values for dispersion-free states of hidden variable theories for non-commuting observables involved in the Bell-test experiments [
16], which is tautologous to assuming the bounds of
on the Bell-CHSH sum of expectation values. It thus assumes in a different guise what it sets out to prove. As a result, what is ruled out by Bell-test experiments is not local realism but the additivity of expectation values, which does not hold for non-commuting observables in dispersion-free states of hidden variable theories to begin with.
1.1. Heuristics for completing quantum mechanics
The goal of any hidden variable theory [
2,
17,
18] is to reproduce the statistical predictions encoded in the quantum states
of physical systems using hypothetical dispersion-free states
that have no inherent statistical character, where the Hilbert space
is extended by the space
of hidden variables
, which are hypothesized to “complete” the states of the physical systems as envisaged by Einstein [
19]. If the values of
can be specified in advance, then the results of any measurements on a given physical system are uniquely determined.
To appreciate this, recall that expectation value of the square of any self-adjoint operator
in a normalized quantum mechanical state
and the square of the expectation value of
will not be equal to each other in general:
This gives rise to inherent statistical uncertainty in the value of
, indicating that the state
is not dispersion-free:
By contrast, in a normalized dispersion-free state
of hidden variable theories formalized by von Neumann [
2], the expectation value of
,
by hypothesis, is equal to one of its eigenvalues
, determined by the hidden variables
,
so that a measurement of
in the state
would yield the result
with certainty. How this can be accomplished in a dynamical theory of measurement process remains an open question [
17]. But accepting the hypothesis (
3) implies
Consequently, unlike in a quantum sate
, in a dispersion-free state
observables
have no inherent uncertainty:
The expectation value of
in the quantum state
can then be recovered by integrating over the hidden variables
:
where
denotes the normalized probability distribution over the space
of thus hypothesized hidden variables.
As it stands, this prescription amounts to assignment of unique eigenvalues
to all observables
simultaneously, regardless of whether they are actually measured. In other words, according to (
6) every physical quantity of a given system represented by
would possess a unique preexisting value, irrespective of any measurements being performed. In Section 2 of [
17], Bell works out an instructive example to illustrate how this works for a system of two-dimensional Hilbert space. The prescription (
6) fails, however, for Hilbert spaces of dimensions greater than two, because in higher dimensions degeneracies prevent simultaneous assignments of unique eigenvalues to all observables in dispersion-free states
dictated by the ansatz (
3), giving contradictory values for the same physical quantities. This was proved independently by Bell [
17], Kochen and Specker [
20], and Belinfante [
21], as a corollary to Gleason’s theorem [
22,
23].
These proofs – known as the Kochen-Specker theorem – do not exclude contextual hidden variable theories in which the complete state
of a system assigns unique values to physical quantities only
relative to experimental contexts [
18,
23]. If we denote the observables as
with
c being the environmental contexts of their measurements, then thenon-contextual prescription (
6) can be easily modified to accommodate contextual hidden variable theories as follows:
Each observable
is still assigned a unique eigenvalue
, but now determined cooperatively by the complete state
of the system and the state
c of its environmental contexts. Consequently, even though some of its features are no longer intrinsic to the system, contextual hidden variable theories do not have the inherent statistical character of quantum mechanics, because outcome of an experiment is a cooperative effect just as it is in classical physics [
23]. Therefore, such theories interpret quantum entanglement at the level of the complete state
only epistemically.
For our purposes here, it is also important to recall that in the Hilbert space formulation of quantum mechanics [
2] the correspondence between observables and Hermitian operators is one-to-one. Moreover, a sum
of several observables such as
is also an observable representing a physical quantity, and consequently the sum of the expectation values of
is the expectation value of the summed operator
,
regardless of whether the observables are simultaneously measurable or mutually commutative [
17]. The question then is, since within any contextual hidden variable theory characterized by (
7) all of the observables
and their sum
are assigned unique eigenvalues
and
, respectively, would these eigenvalues satisfy the equality
in dispersion-free states
of physical systems in analogy with the linear quantum mechanical relation (
8) above? The answer is: Not in general, because the eigenvalue
of the summed operator
is not equal to the sum
of eigenvalues
for given
, unless the constituent observables
are mutually commutative. As Bell points out in Section 3 of [
17], the linear relation (
8) is an unusual property of quantum mechanical states
. There is no reason to demand it
individually of the dispersion-free states
, whose function is to reproduce themeasurable features of quantum systems only when averaged over, as in (
7). I will come back to this point in
Section 1.5.
1.2. Special case of the singlet state and EPR-Bohm observables
Now, the proof of Bell’s famous theorem [
1] is based on Bohm’s spin version of the EPR’s thought experiment [
24], which involves an entangled pair of spin-
particles emerging from a source and moving freely in opposite directions, with particles 1 and 2 subject, respectively, to spin measurements along independently chosen unit directions
and
by Alice and Bob, who are stationed at a spacelike separated distance from each other (see
Figure 1). If initially the pair has vanishing total spin, then the quantum mechanical state of the system is described by the entangled singlet state
where
is an arbitrary unit vector in
and
defines quantum mechanical eigenstates in which the two fermions have spins “up” or “down” in the units of
, with
being the Pauli spin “vector”
. Once the state (
10) is prepared, the observable
of interest is
whose possible eigenvalues are
where
and
are the results of spin measurements made jointly by Alice and Bob along their randomly chosen detector directions
and
. In the singlet state (
10) the joint observable (
12) predicts sinusoidal correlations
between the values of the spins observed about the freely chosen contexts
and
[
5].
For
locally contextual hidden variable theories there is a further requirement that the results of local measurements must be describable by functions that respect local causality, as first envisaged by Einstein [
19] and later formulated mathematically by Bell [
1]. It can be satisfied by requiring that the eigenvalue
of the observable
in (
12) representing the joint result
is factorizable as
, or in Bell’s notation as
with the factorized functions
and
satisfying the following condition of local causality:
Apart from the hidden variables
, the result
of Alice depends
only on the measurement context
, chosen freely by Alice, regardless of Bob’s actions. And, likewise, apart from the hidden variables
, the result
of Bob depends
only on the measurement context
, chosen freely by Bob, regardless of Alice’s actions. In particular, the function
does not depend on
or
and the function
does not depend on
or
. Moreover, the hidden variables
do not depend on either
,
,
, or
[
10].
The expectation value
of the joint results in the dispersion-free state
should then satisfy the condition
where the hidden variables
originate from a source located in the overlap of the backward light-cones of Alice and Bob, and the normalized probability distribution
is assumed to remain statistically independent of the contexts
and
so that
, which is a reasonable assumption. In fact, relaxing this assumption to allow
to depend on
and
introduces a form of non-locality, as explained by Clauser and Horne in footnote 13 of [
25]. Then, since
and
, their product
, setting the following bounds on
:
These bounds are respected not only by local hidden variable theories but also by quantum mechanics and experiments.
1.3. Mathematical core of Bell’s theorem
By contrast, at the heart of Bell’s theorem is a derivation of the bounds of
on a combination of the expectation values
of local results
and
, recorded at remote observation stations by Alice and Bob, from four different sub-experiments involving measurements of non-commuting observables such as
and
[
1,
16]:
Alice can freely choose a detector direction
or
, and likewise Bob can freely choose a detector direction
or
, to detect, at a space-like distance from each other, the spins of fermions they receive from the common source. Then, from (
16), we can immediately read off the upper and lower bounds on the combination (
17) of expectation values:
The next step in Bell’s derivation of the bounds
instead of
is the assumption of additivity of expectation values:
We will have much to discuss about this step, but if we accept the last equality, then the bounds of
on Bell-CHSH combination (
17) of expectation values is not difficult to work out by rewriting the integrand on its right-hand side as
Since
, if
, then
, and vice versa. Consequently, since
, the integrand (
20) is bounded by
and the absolute value of the last integral in (
19) does not exceed2:
Therefore, the equality (
19) implies that the absolute value of the combination of expectation values is bounded by 2:
But since the bounds on (
17) predicted by quantum mechanics and observed in experiments are
, Bell concludes that no local and realistic theory envisaged by Einstein can reproduce the statistical predictions of quantum mechanics. In particular, contextual hidden variable theories specified by (
7) that respect the factorizability (
14) are not viable.
Now, it is not difficult to demonstrate the
converse of the above derivation in which the additivity of expectation values (
19) is derived by assuming the stringent bounds of
on the sum (
17). Employing (
15), (
17) can be writtenas
Since each product
in the above integrals is equal to
, each of the four integrals is bounded by
:
Thus the sum of four integrals in (
23) is bounded by
, not
. However, we started with (
22), which contends that the sum of integrals in (
23) is bounded by
. But the only way to reduce the bounds on (
23) from
to
without violating the rules of anti-derivatives is by equating the sum of integrals in (
23) to the following integral of the sum,
which, as we saw above in (
21), is bounded by
. We have thus derived the additivity of expectation values (
19) by imposing (
22) as our starting assumption. Thus, given the previous derivation that led us to (
22) by assuming (
19) and the current derivation that led us to (
19) by assuming (
22), we have proved that the assumption (
19) of the additivity ofexpectation values is tautologous to assuming the bounds of
on Bell-CHSH combination (
17) of expectation values.
In many derivations of (
22) in the literature, factorized probabilities of observing binary measurement results are employed rather than measurement results themselves I have used in (
14) in my derivation following Bell [
1,
16]. But employing probabilities would only manage to obfuscate the logical flaw in Bell’s argument I intend to bring out here.
1.4. Additivity of expectation values is respected by quantum states
The key step that led us to the bounds of
on (
17) that are more restrictive than
is the assumption (
19) of the additivity of expectation values, which (as noted after (
9) and will be further explained in
Section 1.5) is
valid only for commuting observables [
15]. This assumption, however, is usually not viewed as an assumption at all. It is usually viewed as a benign mathematical step, necessitated by Einstein’s requirement of realism [
19]. But as I will demonstratein
Section 1.5, far from being required by realism, the right-hand side of (
19), in fact,
contradicts realism, which requires that
every observable of a physical system is assigned a unique eigenvalue, quantifying one of its preexisting properties.
Moreover, realism has already been adequately accommodated by the very definition of the local functions
and
and their counterfactual juxtaposition on the left-hand side of (
19), as contextually existing properties of the system. Evidently, while a result in only one of the four expectation values corresponding to a sub-experiment that appear on the left-hand side of (
19) can be realized in a given run of a Bell-test experiment, the remaining three results appearing on that side are realizable at least counterfactually, thus fulfilling the requirement of realism [
8]. Therefore, the requirement of realism does not necessitate the left-hand side of (
19) to be equated with its right-hand side in the derivation of (
22). Realism requires definite results
to exist as eigenvalues only counterfactually,
not allfour at once, as they are written on the right-hand side of (
19). What is more, as we will soon see, realism implicit in the prescription (
7) requires the quantity (
20) to be a
correct eigenvalue of the summed operator (
33), but it is not.
On the other hand, given the assumption
of statistical independence and the addition property of anti-derivatives, mathematically the equality (
19) follows at once. The binary properties of the functions
and
then immediately lead to the bounds of
on the Bell-CHSH sum (
17). But, as we saw above, assuming the bounds of
on (
17) leads, conversely, to the assumption (
19) of the additivity of expectation values. Thus, assuming the additivity of expectation values (
19) is mathematically equivalent to assuming the bounds of
on the sum (
17). In other words, Bell’s argument presented in
Section 1.3 assumes its conclusion (
22) in the guise of assumption (
19).
Sometimes assumption (
19) is justified on statistical grounds. It is argued that the four sub-experiments appearing on the left-hand side of (
19) with different experimental settings
,
,
etc. can be performed independently of each other, on possibly different occasions, and then the resulting averages are added together at a later time for statistical analysis. If the number of experimental runs for each pair of settings is sufficiently large, then, theoretically, the sum of the four averages appearing on the left-hand side of (
19) are found not to exceed the bounds of
, thus justifying the equality (
19). This can be easily verified in numerical simulations (see Ref. [27] cited in [
12]). However, this heuristic argument is not an analytical proof of the bounds. What it implicitly neglects to take into account by explicitly assuming that the four sub-experiments can be performed independently, is that the sub-experiments involve mutually exclusive pairs of settings such as
and
in physical space, and thus involve non-commuting observables that cannot be measured simultaneously [
8]. Unless the statistical analysis takes this physical fact into account, it cannot be claimed to have any relevance for the Bell-test experiments. For ignoring this physical fact amounts to incorrectly assuming that the spin observables
,
etc. are mutually commuting, and thus simultaneously measurable, for which assumption (
19) is indeed valid, as demonstrated below in
Section 1.5 (see the discussion around (
39)). On the other hand, when the non-commutativity of the observables involved in the sub-experiments is taken into account in numerical simulations, the bounds on (
17) turn out to be
, as shown in [
9,
10] and Ref. [27] cited in [
12]. In other words, such a statistical argument is simply assumption (
19) in disguise.
Another important point to recognize here is that the above derivation of the stringent bounds of
on (
17) for a locally causal dispersion-free counterpart
of the quantum mechanical singlet state (
10) must comply with the heuristics of the contextual hidden variable theories we discussed in
Section 1.1. If it does not, then the bounds of
cannot be claimed to have any relevance for the viability of local hidden variable theories [
23]. Therefore, as discussed in
Section 1.1, in a contextual hidden variable theory all of the observables
of any physical system,
including their sum
(which also represents a physical quantity in the Hilbert space formulation of quantum mechanics [
2] whether or not it is observed), must be assigned unique eigenvalues
and
, respectively, in the dispersion-free states
of the system, regardless of whether these observables are simultaneously measurable.
Now, within quantum mechanics, expectation values do add in analogy with the equality (
19) assumed by Bell for local hidden variable theories [
2,
17]. In quantum mechanics, the statistical predictions of which any hidden variable theory is obliged to reproduce, the joint results
observed by Alice and Bob would be eigenvalues of the operators
, and the linearity in the rules of Hilbert space quantum mechanics ensures that these operators satisfy the additivity of expectation values. Thus, for any quantum state
, the following equality holds:
Comparing (
19) and (
26), the equality between the two sides of (
19) seems reasonable, even physically. Furthermore, since the condition (
15) for any hidden variable theory obliges us to set the four terms on the left-hand side of (
26) as
it may seem reasonable that, given the quantum mechanical equality (
26), any hidden variable theory should satisfy
adhering to the prescription (
7), which would then justify equality (
19). Since hidden variable theories are required to satisfy the prescription (
7), should not they also reproduce Equation (
31)? The answer to this is not straightforward.
1.5. Additivity of expectation values does not hold for dispersion-free states
The problem with Equation (
31) is that, while the joint results
,
etc. appearing on the left-hand side of Equation (
19) are possible eigenvalues of the products of spin operators
,
etc., their summation
appearing as the integrand on the right-hand side of Equations (
31) or (
19) is
not an eigenvalue of the summed operator
because the spin operators
and
,
etc., and therefore
,
etc., do not commute with each other:
Consequently, Equation (
31) would hold within any hidden variable theory
only if the operators
,
etc. were commuting operators. This is well known from the famous criticisms of von Neumann’s theorem against hidden variable theories (see,
e.g., [
8] and references therein). While the equality (
19) of the sum of expectation values with the expectation value of the sum is respected in quantum mechanics, it does not hold for hidden variable theories [
17].
In [
17], Bell illustrates this problem using spin components of a spin-
particle. Suppose we make a measurement of the component
of the spin with a Stern-Gerlach magnet suitably oriented in
. That would yield an eigenvalue
of
as a result. However, if we wish to measure the component
of the spin, then that would require a different orientation of the magnet in
, and would give a different eigenvalue,
of
, as a result. Moreover, a measurement of the sum of the
x- and
y-components of the spin,
, would again require a very different orientation of the magnet in
. Therefore, the result obtained as an eigenvalue of the summed operators
will not be the sum
of an eigenvalue of the operator
added linearly to an eigenvalue of the operator
. As Bell points out in [
17], the additivity of expectation values
is a rather unusual property of the quantum states
. It does not hold for the dispersion-free states
of hidden variable theories because the eigenvalues of non-commuting observables such as
and
do not add linearly, as we noted at the end of
Section 1.1. Consequently, the additivity relation (
19) that holds for quantum states would not hold for the dispersion-free states.
This problem, however, suggests its own resolution. We can work out the correct eigenvalue
of the summed operator (
33), at least formally, as I have worked out in
Appendix A below. The correct version of Equation (
31) is then
where
is the correct eigenvalue of the summed operator (
33), with its non-commuting part separated out as the operator
where the vector
The details of how this separation is accomplished using (
34) can be found in
Appendix A below. From (
36), it is now easy to appreciate that the additivity of expectation values (
19) assumed by Bell can hold only if the expectation value
of the non-commuting part within the eigenvalue
of the summed operator (
33) is zero. But that is possible only if the operators
,
etc. constituting the sum (
33) commute with eachother. In general, if the operators
,
etc. in (
33) do not commute with each other, then we would have
But the operators
,
etc. indeed do not commute with each other, because the pairs of directions
,
etc. in (
33) are mutually exclusive directions in
. Therefore, the additivity of expectation values assumed at step (
19) in the derivation of (
22) is unjustifiable. Far from being necessitated by realism, it actually contradicts realism.
Since three of the four results appearing in the expression (
32) can be realized only counterfactually, their summation in (
32) cannot be realized
even counterfactually [
8]. Thus, in addition to not being a correct eigenvalue of the summed operator (
33) as required by the prescription (
7) for hidden variable theories, the quantity appearing in (
32) is, in fact, an entirely fictitious quantity, with no counterpart in any possible world, apart from in the trivial case when all observables are commutative. By contrast, the correct eigenvalue (
36) of the summed operator (
33) can be realized at least counterfactually because it is a genuine eigenvalue of that operator, thereby satisfying the requirement of realism correctly, in accordance with the prescription (
7) for hidden variable theories. Using (
36), all five of the observables appearing on both sides of the quantum mechanical Equation (
26) can be assigned unique and correct eigenvalues [
8].
Once this oversight is ameliorated, it is not difficult to show that the conclusion of Bell’s theorem no longer follows. For then, using the correct eigenvalue (
36) of (
33) instead of (
32) on the right-hand side of (
19), we have the equation
instead of (
19), which implements local realism correctly on both of its sides, as required by the prescription (
7) we discussed in
Section 1.1. This Equation (
40) is thus the correct dispersion-free counterpart of the equivalence (
26) for the quantum mechanical expectation values [
8]. It can reduce to Bell’s assumption (
19) only when the expectation value
of the non-commuting part within the eigenvalue
of the summed operator (
33) happens to be vanishing, and thus expresses the correct relationship among the expectation values for the singlet state (
10) in the local hidden variable framework considered by Bell [
1]. Recall again from the end of
Section 1.1 that the quantum mechanical relation (
26) is an unusual property of the quantum states
. As Bell stressed in [
17], “[t]here is no reason to demand it individually of the hypothetical dispersion free states, whose function it is to reproduce the
measurable peculiarities of quantum mechanics
when averaged over.” Moreover, in Section V of [
8] I have demonstrated that the bounds on the right-hand side of (
40) are
instead of
. An alternative derivation of these bounds follows from the magnitude
of the vector defined in (
38), which, as proved in
Appendix B below, is bounded by 2, and thereforethe eigenvalue
of the operator (
37) obtained as its expectation value
is bounded by
, giving
Substituting these into (
36), together with the bounds of
we worked out before on the commuting part (
32), gives
which is constrained to be real despite the square root in the expression (
36) because the operator (
33) is Hermitian. Consequently, we obtain the following Tsirel’son’s bounds in the dispersion-free state, on the right-hand side of (
40):
Given the correct relation (
40) between expectation values instead of the flawed assumption (
19), we thus arrive at
Since the bounds of
we have derived on the Bell-CHSH sum of expectation values are the same as those predicted by quantum mechanics and observed in the Bell-test experiments, the conclusion of Bell’s theorem is mitigated. What is ruled out by these experiments is not local realism but the assumption of the additivity of expectation values, which does not hold for non-commuting observables in dispersion-free states of any hidden variable theories to begin with.
1.6. Conclusion: Bell’s theorem assumes its conclusion (petitio principii)
Let me reiterate the main points discussed above. Together, they demonstrate that Bell’s theorem begs the question.
(1) The first point is that the derivation in
Section 1.3 of the bounds of
on (
17) for the dispersion-free counterpart
of the singlet state (
10) must comply with the heuristics of the contextual hidden variable theories discussed in
Section 1.1. Otherwise, the stringent bounds of
cannot be claimed to have any relevance for hidden variable theories. This requires compliance with the prescription (
7) that equates the quantum mechanical expectation values with theirhidden variable counterparts for
all observables, including any sums of observables, pertaining to the singlet system.
(2) The most charitable view of the equality (
19) is that it is an
assumption, over and above those of locality, realism, and all other auxiliary assumptions required for deriving the inequalities (
22), because it is valid only for commuting observables. Far from being required by realism, it contradicts realism, because it fails to assign the correct eigenvalue (
36) to the summed observable (
33) as its realistic counterpart, as required by the prescription (
7). Realism requiresthat all observables, including their sums, must be assigned unique eigenvalues, regardless of whether they are observed.
(3) Expectation values in dispersion-free states of hidden variable theories do not add linearly for observables that are not simultaneously measurable. And yet, Bell assumed linear additivity (
19) within a local hidden variable model. Conversely, in the light of the heuristics of contextual hidden variable theories we discussed in
Section 1.1, assuming (
19) is equivalent to assuming that the spin observables
,
etc. commute with each other, but they do not.
(4) When the correct eigenvalue (
36) is assigned to the summed operator (
33) replacing the incorrect step (
19), the bounds on Bell-CHSH sum (
17) work out to be
instead of
, thus mitigating the conclusion of Bell’s theorem.
(5) As we proved in
Section 1.3, the assumption (
19) of the additivity of expectation values is equivalent to assuming the strong bounds of
on Bell-CHSH sum (
17) of expectation values. In other words, (
19) and (
22) are tautologous.
The first four points above invalidate assumption (
19), and thus inequalities (
22) on physical grounds, and the lastone demonstrates that Bell’s theorem assumes its conclusion in a different guise, and is thus invalid on logical grounds.
In this paper I have focused on a formal and logical critique of Bell’s theorem. Elsewhere [
9,
13,
15], I have developeda comprehensive local-realistic framework for understanding quantum correlations in terms of the geometry of the spatial part of one of the well-known solutions of Einstein’s field equations of general relativity — namely, that of a quaternionic 3-sphere — taken as a physical space within which we are confined to perform Bell-test experiments. Thisframework is based on Clifford algebra and thus explicitly takes the non-commutativity of observables into account. It thus shows, constructively, that contextually local hidden variable theories are not ruled out by Bell-test experiments. Since, as we discussed in
Section 1.2, the formal proof of Bell’s theorem is based on the entangled singlet state (
10), in [
4,
5,
7,
10,
11,
12,
14] I have reproduced the correlations predicted by (
10) as a special case within the local-realistic framework proposed in [
9,
13,
15]. I especially recommend the calculations presented in [
7] and [
14], which also discuss a macroscopic experiment that would be able to falsify the 3-sphere hypothesis I have proposed in these publications.