Preprint
Article

Physics as an Optimization Problem over All Measurements

Altmetrics

Downloads

381

Views

145

Comments

0

This version is not peer-reviewed

Submitted:

21 June 2024

Posted:

25 June 2024

Read the latest preprint version here

Alerts
Abstract
We propose a novel approach to quantum theory construction that involves solving a maximization problem on the Shannon entropy of all possible measurements of a system, relative to its initial preparation. This maximization problem is additionally constrained by a phase condition that vanishes under measurements. Specifically, enforcing a vanishing U(1)-valued phase constraint leads to standard quantum mechanics, while a vanishing Spin^c(3,1)-valued phase constraint extends the theory to relativistic quantum mechanics and to quantum gravity. The latter scenario is found to incorporate the SU(3)xSU(2)xU(1) symmetries of the Standard Model as invariants of the probability measure itself, and to construct the metric tensor as an operator via a double-copy mechanism applied to Dirac currents. Significantly, this solution is consistent exclusively with a 3+1-dimensional spacetime configuration---all other dimensional settings are shown to lead to fundamental obstructions. This framework seamlessly integrates fundamental concepts from quantum mechanics, relativistic quantum mechanics, quantum gravity, the dimensional specificity of spacetime, and particle physics symmetries as the solution to a simple entropy maximization problem.
Keywords: 
Subject: Physical Sciences  -   Quantum Science and Technology

1. Introduction

The canonical formalism of quantum mechanics (QM) is based on five principal axioms [1,2]:
QM Axiom 1 of 5 
State Space: Each physical system corresponds to a complex Hilbert space, with the system’s state represented by a ray in this space.
QM Axiom 2 of 5 
Observables: Physical observables correspond to Hermitian operators within the Hilbert space.
QM Axiom 3 of 5 
Dynamics: The time evolution of a quantum system is dictated by the Schrödinger equation, where the Hamiltonian operator signifies the system’s total energy.
QM Axiom 4 of 5 
Measurement: The act of measuring an observable results in the system’s transition to an eigenstate of the associated operator, with the measurement value being one of the eigenvalues.
QM Axiom 5 of 5 
Probability Interpretation: The likelihood of a specific measurement outcome is determined by the squared magnitude of the state vector’s projection onto the relevant eigenstate.
Contrastingly, statistical mechanics (SM), the other statistical pillar of physics, derives its probability measure through entropy maximization, constrained by the following expression:
SM Constraint 1 of 1 
Average Energy Constraint: The average of energy measurements of a system at thermodynamic equilibrium converge to a specific value ( E ¯ ):
E ¯ = i ρ i E i
To maximize entropy while satisfying this constraint, the theory uses a Lagrange multiplier approach.
Definition 1
(Fundamental Lagrange Multiplier Equation of SM).
L = k B i ρ i ln ρ i Boltzmann entropy + λ 1 i ρ i N o r m a l i z a t i o n   C o n s t r a i n t + β E ¯ i ρ i E i A v e r a g e   E n e r g y   C o n s t r a i n t
where λ and β are the Lagrange multipliers.
Theorem 1
(Gibbs Measure). The solution to the Lagrange multiplier equation of SM is the Gibbs measure.
ρ i = 1 i exp ( β E i ) Microcanonical Ensemble exp ( β E i )
Proof. 
This is an well-known result by E. T. Jaynes [3,4]. As a convenience, we replicate the proof in Annex Appendix A. □
As evident from E. T. Jaynes’ methodological innovation, SM relies on a single constraint related to the nature of the measurements under consideration, which allows the formulation of an optimization problem sufficient to derive the relevant probability measure. This is an exceptionally parsimonious formulation of a physical theory.
We propose a generalization of E. T. Jaynes’ approach to the realms of Quantum Mechanics (QM), relativistic Quantum Mechanics (RQM) and Quantum Gravity (QG). For each domain, we will introduce a single constraint related to measurements, formulate a corresponding entropy maximization problem, and present a main theorem that encapsulates the theory. This formulation reduces fundamental physics to its most parsimonious expression, deriving the core theories as optimal solutions to a well-defined entropy maximization problem.

1.1. Quantum Mechanics

To reformulate QM as the solution to an entropy maximization problem, we propose the following constraint:
QM Constraint 1 of 1 
Vanishing Complex-Phase: Quantum measurements admit a vanishing complex phase. The constraint is:
0 = tr i ρ i 0 E i E i 0
Here, the matrix representation engenders the complex phase, and the trace will cause it to vanish under measurement.
It associates to the follow equation:
Definition 2
(Fundamental Lagrange Multiplier Equation of QM).
L = i ρ i ln ρ i p i R e l a t i v e   S h a n n o n   E n t r o p y + λ 1 i ρ i N o r m a l i z a t i o n   C o n s t r a i n t + τ tr i ρ i 0 E i E i 0 V a n i s h i n g   C o m p l e x P h a s e
where λ and τ are the Lagrange multipliers.
The relative Shannon entropy [5,6] is utilized because we are solving for the least biased theory that connects an initial preparation p to its final measurement ρ .
Theorem 2.
The least biased probability measure that connects an initial preparation p to its final measurement ρ, under the constraint of the vanishing complex-phase, is:
ρ i = 1 i p i exp ( i t E i / ) Unitarily Invariant Ensemble exp ( i t E i / ) Born Rule p i Initial Preparation
where we have defined τ = t / (analogous to β = 1 / ( k B T ) in SM).
The proof of this theorem will be presented in the results section. We will show that this solution entails the five axioms of QM, which are now promoted to theorems, yielding a parsimonious formulation of QM.

1.2. Relativistic Quantum Mechanics

Before we can discuss RQM, we first need to introduce a notation. Let u = a + x + f + v + b , where a is a scalar, x is a vector, f is a bivector, v is a pseudo-vector and b is a pseudo-scalar, be a multivector of the geometric algebra GA ( 3 , 1 ) , and let M be its matrix representation. Then, the fundamental constraint is:
QG Constraint 1 of 1 
Vanishing Relativistic Phase: Our formulation of RQM is based around a vanishing phase spanning the Spin c ( 3 , 1 ) group. The constraint is:
0 = tr 1 2 i ρ i M i
where M i is the matrix representation of the multivector u i = f i + b i of GA ( 3 , 1 ) . Using the real Majorana representation of the gamma matrices, the representation is as follows:
u M = f 02 b f 13 f 01 + f 12 f 03 + f 23 b + f 13 f 02 f 03 + f 23 f 01 f 12 f 01 f 12 f 03 f 23 f 02 b f 13 f 03 f 23 f 01 + f 12 b + f 13 f 02
Similarly to the QM case, here the matrix representation engenders a Spin c ( 3 , 1 ) -phase and the trace will cause it to vanish under measurement.
The Lagrange multiplier equation is as follows:
Definition 3
(Fundamental Lagrange Multiplier Equation of RQM).
L = i ρ i ln ρ i p i R e l a t i v e   S h a n n o n   E n t r o p y + λ 1 i ρ i N o r m a l i z a t i o n   C o n s t r a i n t + ζ tr 1 2 i ρ i M i V a n i s h i n g   S p i n c ( 3 , 1 )   P h a s e
where λ and ζ are the Lagrange multipliers.
Theorem 3.
The least biased probability measure that connects an initial preparation p i to its final measurement ρ i , under the constraint of the vanishing relativistic phase, is:
ρ ( q ) = 1 i p i det exp ( ζ 1 2 M i ) Spin c ( 3 , 1 ) Invariant Ensemble det exp ( ζ 1 2 M i ) Spin c ( 3 , 1 ) Born Rule p i Initial Preparation
The proof of this theorem is presented in the results section.
In the results section, we aim to demonstrate that this solution establishes the foundation for a quantum mechanical theory of observables transforming in 3+1-dimensional spacetime. In this framework, ζ acts as a one-parameter generator for boosts, rotations, and phase transformations.
Incorporating additional structures, such as the Dirac equation, serves as a constraint that narrows the space of possible wavefunctions to those that satisfy it. This constitutes a specific realization of a RQM theory within this general framework.

1.3. Quantum Gravity

The maximization problem employed for RQM, as described in Section 1.2, will also be found to encompass QG. While RQM was situated within a subset of the solution space, fully leveraging the generality of the solution is necessary for QG. Specifically, the solution will be found to incorporate a double-copy mechanism applied to the Dirac current, such that two Dirac currents are multiplied together to yield a metric tensor expectation value. In this context, a single copy of the Dirac current pertains to RQM, whereas the double-copy leads to QG. The double-copy of the Dirac current facilitates the construction of the metric tensor as an observable from basis vectors, thereby establishing a connection between the probability measure and the geometrical structure of spacetime.

1.4. Dimensional Obstructions

We conclude the results section with a series of theorems demonstrating that, except for SM (no vanishing phase) and QM (vanishing U ( 1 ) phase), the entropy maximization technique yields a solution only in 3+1-dimensional spacetime (vanishing Spin c ( 3 , 1 ) phase). In all other dimensional configurations, various obstructions are encountered. These findings suggest an intriguing connection between the entropy maximization approach and the specific dimensionality of our universe, the implications of which are discussed.

2. Results

2.1. Quantum Mechanics

In statistical mechanics, the founding observation is that energy measurements of a thermally equilibrated system tend towards an average value. Comparatively, in QM, the founding observation involves the interplay between the systematic elimination of complex phases in measurement outcomes and the presence of interference effects in repeated measurement outcomes. To represent this observation, we introduce the Vanishing U ( 1 ) -Phase Anti-Constraint:
0 = tr i ρ i 0 E i E i 0
where E i are scalar-valued. The usage of the matrix generates a U ( 1 ) phase, and the trace causes it to vanish under measurements.
At first glance, this expression may seem to reduce to a tautology equating zero with zero, suggesting it imposes no restriction on energy measurements. However, this appearance is deceptive. Unlike a conventional constraint that limits the solution space, this expression serves as a formal device to expand it, allowing for the incorporation of complex phases into the probability measure. The expression’s role in broadening, rather than restricting, the solution space leads to its designation as an ”anti-constraint.”
In general, usage of anti-constraints expand classical probability measures into larger domains, such as quantum probabilities.
Its significance will become evident upon the completion of the optimization problem. For the moment, this expression can be conceptualized as an ansatz that, when incorporated as an anti-constraint within an entropy-maximization problem, resolves into the axioms of quantum mechanics.
Our next procedural step involves solving the corresponding Lagrange multiplier equation, mirroring the methodology employed in statistical mechanics by E. T. Jaynes. We utilize the relative Shannon entropy because we wish to solve for the least biased probability measure that connects an initial preparation p i to its final measurement ρ i . For that, we deploy the following Lagrange multiplier equation:
L = i ρ i ln ρ i p i Relative   Shannon   Entropy + λ 1 i ρ i Normalization   Constraint + τ tr i ρ i 0 E i E i 0 Vanishing   U ( 1 ) - Phase
where λ and τ are the Lagrange multipliers.
We solve the maximization problem as follows:
0 = L [ ρ 1 , , ρ n ] ρ i
= ln ρ i p i p i λ τ tr 0 E i E i 0
= ln ρ i p i + p i + λ τ tr 0 E i E i 0
ln ρ i p i = p i λ τ tr 0 E i E i 0
ρ i = p i exp ( p i λ ) exp τ tr 0 E i E i 0
= 1 Z ( τ ) p i exp τ tr 0 E i E i 0
The partition function is obtained as follows:
1 = i p i exp ( p i λ ) exp τ tr 0 E i E i 0
exp ( p i λ ) 1 = i p i exp τ tr 0 E i E i 0
Z ( τ ) : = i p i exp τ tr 0 E i E i 0
Finally, the least biased probability measure that connects an initial preparation p i to its final measurement ρ i , under the constraint of the vanishing U(1) phase, is:
ρ i = 1 i p i exp τ tr 0 E i E i 0 exp τ tr 0 E i E i 0 p i
Though initially unfamiliar, this form effectively establishes a comprehensive formulation of quantum mechanics, as we will demonstrate.
Upon examination, we find that phase elimination is manifestly evident in the probability measure: since the trace evaluates to zero, the probability measure simplifies to classical probabilities, aligning precisely with the Born rule’s exclusion of complex phases:
ρ i = p i i p i
However, the significance of this phase elimination extends beyond this mere simplicity. As we will soon see, the partition function Z gains unitary invariance, allowing for the emergence of interference patterns and other quantum characteristics under appropriate basis changes.
We will begin by aligning our results with the conventional quantum mechanical notation. As such, we transform the representation of complex numbers from a b b a to a + i b . For instance, the exponential of a complex matrix is:
exp a b b a = r cos ( b ) sin ( b ) sin ( b ) cos ( b ) , where r = exp a
Then, we associate the exponential trace to the complex norm using exp tr M det exp M :
exp tr a b b a = det exp a b b a = r 2 det cos ( b ) sin ( b ) sin ( b ) cos ( b ) , where r = exp a
= r 2 ( cos 2 ( b ) + sin 2 ( b ) )
= r ( cos ( b ) + i sin ( b ) )
= r exp ( i b )
Finally, substituting τ = t / analogously to β = 1 / ( k B T ) , and applying the complex-norm representation to both the numerator and to the denominator, consolidates the Born rule, normalization, and initial prepration into :
ρ i = 1 i p i exp ( i t E i / ) Unitarily Invariant Partition Function exp ( i t E i / ) Born Rule p i Initial Preparation
We are now in a position to explore the solution space.
The wavefunction is delineated by decomposing the complex norm into a complex number and its conjugate. It is then visualized as a vector within a complex n-dimensional Hilbert space. The partition function acts as the inner product. This relationship is articulated as follows:
i p i exp ( i t E i / ) = Z = ψ | ψ
where
ψ 1 ( t ) ψ n ( t ) = exp ( i t E 1 / ) exp ( i t E n / ) ψ 1 ( 0 ) ψ n ( 0 )
We clarify that p i represents the probability associated with the initial preparation of the wavefunction, where p i = ψ i ( 0 ) | ψ i ( 0 ) .
We also note that Z is invariant under unitary transformations.
Let us now investigate how the axioms of quantum mechanics are recovered from this result:
  • The entropy maximization procedure inherently normalizes the vectors | ψ with 1 / Z = 1 / ψ | ψ . This normalization links | ψ to a unit vector in Hilbert space. Furthermore, as physical states associate to the probability measure, and the probability is defined up to a phase, we conclude that physical states map to Rays within Hilbert space. This demonstrates QM Axiom 1 of 5.
  • In Z, an observable must satisfy:
    O ¯ = i p i O i exp ( i t E i / )
    Since Z = ψ | ψ , then any self-adjoint operator satisfying the condition O ψ | ϕ = ψ | O ϕ will equate the above equation, simply because O = ψ | O | ψ . This demonstrates QM Axiom 2 of 5.
  • Upon transforming Equation 31 out of its eigenbasis through unitary operations, we find that the energy, E i , typically transforms in the manner of a Hamiltonian operator:
    | ψ ( t ) = exp ( i t H / ) | ψ ( 0 )
    The system’s dynamics emerge from differentiating the solution with respect to the Lagrange multiplier. This is manifested as:
    t | ψ ( t ) = t ( exp ( i t H / ) | ψ ( 0 ) )
    = i H / exp ( i t H / ) | ψ ( 0 )
    = i H / | ψ ( t )
    H | ψ ( t ) = i t | ψ ( t )
    which is the Schrödinger equation. This demonstrates QM Axiom 3 of 5.
  • From Equation 31 it follows that the possible microstates E i of the system correspond to specific eigenvalues of H . An observation can thus be conceptualized as sampling from ρ , with the measured state being the occupied microstate i. Consequently, when a measurement occurs, the system invariably emerges in one of these microstates, which directly corresponds to an eigenstate of H . Measured in the eigenbasis, the probability measure is:
    ρ i ( t ) = 1 ψ | ψ ( ψ i ( t ) ) ψ i ( t ) .
    In scenarios where the probability measure ρ i ( τ ) is expressed in a basis other than its eigenbasis, the probability P ( λ i ) of obtaining the eigenvalue λ i is given as a projection on a eigenstate:
    P ( λ i ) = | λ i | ψ | 2
    Here, | λ i | ψ | 2 signifies the squared magnitude of the amplitude of the state | ψ when projected onto the eigenstate | λ i . As this argument hold for any observables, this demonstrates QM Axiom 4 of 5.
  • Finally, since the probability measure (Equation 29) replicates the Born rule, QM Axiom 5 of 5. is also demonstrated.
Revisiting quantum mechanics with this perspective offers a coherent and unified narrative. Specifically, the vanishing U(1) phase constraint (Equation 11) is sufficient to entail the foundations of quantum mechanics (Axiom 1, 2, 3, 4 and 5) through the principle of entropy maximization. Equation 11 becomes the formulation’s new singular foundation, and Axioms 1, 2, 3, 4, and 5 are now promoted to theorems.

2.2. RQM in 2D

In this section, we investigate RQM in 2D. Although all dimensional configurations except 3+1D contain obstructions, which will be discussed later in this section, the 2D case provides a valuable starting point before addressing the more complex 3+1D case. In RQM 2D, the fundamental Lagrange Multiplier Equation is:
L = i ρ i ln ρ i p i Relative   Shannon   Entropy + λ 1 i ρ i Normalization   Constraint + θ tr 1 2 i ρ i M i Vanishing   Relativistic   Phase
where λ and θ are the Lagrange multipliers, and where M i is the matrix representation of a multivector b i of GA ( 2 ) , where b i is a pseudo-scalar. In general a multivector u = a + x + b of GA ( 2 ) , where a is a scalar, x is a vector and b a pseudo-scalar, is represented as follows:
a + x y b y + b a x a + x σ x + y σ y + b σ x σ y
The basis elements are defined as:
σ x = 1 0 0 1 , σ y = 0 1 1 0 , σ x σ y = 0 1 1 0
If we take a 0 , x 0 then M reduces as follows:
u = a + x + b | a 0 , x 0 = b M = 0 b b 0
The Lagrange multiplier equation can be solved as follows:
0 = L [ ρ 1 , , ρ n ] ρ i
= ln ρ i p i p i λ θ tr 1 2 0 b i b i 0
= ln ρ i p i + p i + λ + θ tr 1 2 0 b i b i 0
ln ρ i p i = p i λ θ tr 1 2 0 b i b i 0
ρ i = p i exp ( p i λ ) exp θ tr 1 2 0 b i b i 0
= 1 Z ( θ ) p i exp θ tr 1 2 0 b i b i 0
The partition function Z ( θ ) , serving as a normalization constant, is determined as follows:
1 = i p i exp ( p i λ ) exp θ tr 1 2 0 b i b i 0
exp ( p i λ ) 1 = i p i exp θ tr 1 2 0 b i b i 0
Z ( θ ) : = i p i exp θ tr 1 2 0 b i b i 0
Consequently, the least biased probability measure that connects an initial preparation p i to a final measurement ρ i , under the constraint of the vanishing relativistic phase in 2D is:
ρ i = 1 i p i det exp 1 2 θ 0 b i b i 0 Spin ( 2 ) Invariant Ensemble det exp 1 2 θ 0 b i b i 0 Spin ( 2 ) Born Rule p i Initial Preparation
where det exp M = exp tr M .
In 2D, the Lagrange multiplier θ correspond to an angle of rotation, and in 1+1D it would correspond to the rapidity ζ :
2 D : exp θ 0 1 1 0 = cos θ sin θ sin θ cos θ θ is the angle of rotation
1 + 1 D : exp ζ 0 1 1 0 = cosh ζ sinh ζ sinh ζ cosh ζ ζ is the rapidity
The 2D solution may appear equivalent to the QM case because they are related by an isomorphism Spin ( 2 ) SO ( 2 ) U ( 1 ) and under the replacement θ τ . However, an isomorphism does not mean identical, and in Spin(2) we gain extra structures related to a relativistic description, which are not available in the QM case.
To investigate the solution in more detail, we introduce the multivector conjugate, also known as the Clifford conjugate, which generalizes the concept of complex conjugation to multivectors.
Definition 4.
(Multivector conjugate (a.k.a Clifford conjugate)). Let u = a + x + b be a multi-vector of the geometric algebra over the reals in two dimensions GA ( 2 ) . The multivector conjugate is defined as:
u = a x b
The determinant of the matrix representation of a multivector can be expressed as a self-product:
Theorem 4
(Determinant as a Multivector Self-Product).
u u = det M
Proof. 
Let u = a + x σ x + y σ y + b σ x σ y , and let M be its matrix representation a + x y b y + b a x . Then:
1 : u u
= ( a + x σ x + y σ y + b σ x σ y ) ( a + x σ x + y σ y + b σ x σ y )
= ( a x σ x y σ y b σ x σ y ) ( a + x σ x + y σ y + b σ x σ y )
= a 2 x 2 y 2 + b 2
2 : det M
= det a + x y b y + b a x
= ( a + x ) ( a x ) ( y b ) ( y + b )
= a 2 x 2 y 2 + b 2
 □
Building upon the concept of the multivector conjugate, we introduce the multivector conjugate transpose, which serves as an extension of the Hermitian conjugate to the domain of multivectors.
Definition 5
(Multivector Conjugate Transpose). Let V ( GA ( 2 ) ) n :
V = a 1 + x 1 + b 1 a n + x n + b n
The multivector conjugate transpose of V is defined as first taking the transpose and then the element-wise multivector conjugate:
V = a 1 x 1 b 1 a n x n b n
Definition 6
(Bilinear Form). Let V and W be two vectors valued in GA ( 2 ) . We introduce the following bilinear form:
V | W = ( a 1 x 1 b 1 ) ( a 1 + x 1 + b 1 ) + ( a n x n b n ) ( a n + x n + b n )
Theorem 5
(Inner Product). Restricted to the even sub-algebra of GA ( 2 ) , the bilinear form is an inner product.
Proof. 
V | W x 0 = ( a 1 b 1 ) ( a 1 + b 1 ) + ( a n b n ) ( a n + b n )
This is isomorphic to the inner product of a complex Hilbert space, with the identification i σ x σ y . □
Definition 7.
(Spin(2)-valued Wavefunction).
ψ = e 1 2 ( a 1 + b 1 ) e 1 2 ( a n + b n ) = ρ 1 R 1 ρ 2 R 2
where ρ i = e 1 2 a i representing the square root of the probability and R i = e 1 2 b i representing a rotor in 2D (or boost in 1+1D).
The partition function of the probability measure can be expressed using the bilinear form applied to the Spin(2)-valued Wavefunction:
Theorem 6
(Partition Function). Z = ψ | ψ
Proof. 
ψ | ψ = i ψ i ψ i = i ρ i R i R i = i ρ i = Z
 □
Definition 8.
(Spin(2)-valued Evolution Operator).
T = e 1 2 θ b 1 e 1 2 θ b n
Theorem 7.
The partition function is invariant with respect to the Spin(2)-valued evolution operator.
Proof. 
We note that:
T v | T v = v | v = v T T v T T = I
then, since e 1 2 θ b 1 e 1 2 θ b n e 1 2 θ b 1 e 1 2 θ b n = I , the relation T T = I is satisfied. □
We note that the even sub-algebra of GA ( 2 ) , being closed under addition and multiplication and constituting an inner product through its bilinear form, allows for the construction of a Hilbert space. In this context, the Hilbert space is Spin(2)-valued. The primary distinction between a wavefunction in a complex Hilbert space and one in a Spin(2)-valued Hilbert space lies in the subject matter of the theory. Specifically, in the latter, the construction governs the change in orientation experienced by an observer, which in turn dictates the measurement basis used in the experiment, consistently with the rotational symmetry of the system.
The dynamics of observer orientation transformations are described by the Schrödinger equation, which is derived by taking the derivative of the wavefunction with respect to the Lagrange multiplier, θ :
Definition 9.
(Spin(2)-valued Schrödinger Equation).
d d θ ψ 1 ( θ ) ψ n ( θ ) = 1 2 b 1 1 2 b n ψ 1 ( θ ) ψ n ( θ )
Here, θ represents a global one-parameter evolution parameter akin to time, which is able to transform the wavefunction under the Spin(2), locally across the states of the Hilbert space. This is an extremely general equation that captures all transformations that can be done consistently with the symmetries of the wavefunction.
Definition 10
(David Hestenes’ Formulation). In 3+1D, the David Hestenes’ formulation [7] of the wavefunction is ψ = ρ R e i b / 2 , where R = e f / 2 is a Lorentz boost or rotation and where e i b / 2 is a phase. In 2D, as the algebra only admits a bivector, his formulation would reduce to ψ = ρ R , which is the form we have recovered.
The definition of the Dirac current applicable to our wavefunction follows the formulation of David Hestenes:
Definition 11
(Dirac Current). Given the basis σ x and σ y , the Dirac current for the 2D theory is defined as:
J 1 ψ σ x ψ = ρ R σ x R SO ( 2 ) = ρ σ ˜ x
J 2 ψ σ y ψ = ρ R σ y R SO ( 2 ) = ρ σ ˜ y
where σ ˜ x and σ ˜ y are a SO(2) rotated basis vectors.

Obstructions

As stated, all dimensional configurations except 3+1D contain obstructions. Specifically, in 1+1D and 2D, we identify two obstructions:
  • In 1+1D: The 1+1D theory results in a split-complex quantum theory due to the bilinear form ( a b e 0 e 1 ) ( a + b e 0 e 1 ) , which yields negative probabilities: a 2 b 2 R for certain wavefunction states, in contrast to the non-negative probabilities a 2 + b 2 R 0 obtained in the Euclidean 2D case. (This is why we had to use 2D instead of 1+1D in this two-dimensional introduction…)
  • In 1+1D and in 2D: The basis vectors ( σ x and σ y in 2D, and e 0 and e 1 in 1+1D) are not self-adjoint. Although useable in the context defining the Dirac current, their non-self-adjointness prevents the construction of the metric tensor as an observable. The benefits of having the basis vectors self-adjoint will become obvious in the 3+1D case, where we will be able to construct the metric tensor from basis measurements. Specifically, in 2D:
    ( e μ u ) u u e μ u
    because ( e μ u ) u = u e μ u = u ( e μ ) u .
In the following section, we will explore the obstruction-free 3+1D case.

2.3. RQM in 3+1D

In this section, we extend the concepts and techniques developed for multivector amplitudes in 2D to the more physically relevant case of 3+1D dimensions. The Lagrange multiplier equation is as follows:
L = i ρ i ln ρ i p i Relative   Shannon   Entropy + λ 1 i ρ i Normalization   Constraint + ζ tr 1 2 i ρ i M i Vanishing   Spin c ( 3 , 1 ) Phase
The solution (proof in Annex Appendix B) is obtained using the same step-by-step process as the 2D case, and yields:
ρ i = 1 i p i det exp ( ζ 1 2 M i ) Spin c ( 3 , 1 ) Invariant Ensemble det exp ( ζ 1 2 M i ) Spin c ( 3 , 1 ) Born Rule p i Initial Preparation
where ζ is a "twisted-phase" rapidity. (If the invariance group was Spin(3,1) instead of Spin c (3,1), obtainable by posing b 0 , then it would simply be the rapidity).

2.3.1. Preliminaries

Our initial goal will be to express the partition function as a self-product of elements of the vector space. As such, we begin by defining a general multivector in the geometric algebra GA ( 3 , 1 ) .
Definition 12
(Multivector). Let u be a multivector of GA ( 3 , 1 ) . Its general form is:
u = a
+ t γ 0 + x γ 1 + y γ 2 + z γ 3
+ f 01 γ 0 γ 1 + f 02 γ 0 γ 2 + f 03 γ 0 γ 3 + f 12 γ 1 γ 2 + f 13 γ 1 γ 3 + f 23 γ 2 γ 3
+ p γ 1 γ 2 γ 3 + q γ 0 γ 2 γ 3 + v γ 0 γ 1 γ 3 + w γ 0 γ 1 γ 2
+ b γ 0 γ 1 γ 2 γ 3
where γ 0 , γ 1 , γ 2 , γ 3 are the basis vectors in the real Majorana representation.
A more compact notation for u is
u = a + x + f + v + b
where a is a scalar, x a vector, f a bivector, v is pseudo-vector and b a pseudo-scalar.
This general multivector can be represented by a 4 × 4 real matrix using the real Majorana representation:
Definition 13
(Matrix Representation of u ).
M = a + f 02 q z b f 13 + w x f 01 + f 12 p + v f 03 + f 23 + t + y b + f 13 + w x a + f 02 + q + z f 03 + f 23 t y f 01 f 12 p + v f 01 f 12 + p + v f 03 f 23 + t y a f 02 + q z b f 13 w x f 03 f 23 t + y f 01 + f 12 + p + v b + f 13 w x a f 02 q + z
To manipulate and analyze multivectors in GA ( 3 , 1 ) , we introduce several important operations, such as the multivector conjugate, the 3,4 blade conjugate, and the multivector self-product.
Definition 14.
(Multivector Conjugate (in 4D)).
u = a x f + v + b
Definition 15
(3,4 Blade Conjugate). The 3,4 blade conjugate of u is
u 3 , 4 = a + x + f v b
The results of Lundholm [8], demonstrates that the multivector norms in the following definition, are the unique forms which carries the properties of the determinants such as N ( u v ) = N ( u ) N ( v ) to the domain of multivectors:
Definition 16.
The self-products associated with low-dimensional geometric algebras are:
GA ( 0 , 1 ) : φ φ
GA ( 2 , 0 ) : φ φ
GA ( 3 , 0 ) : φ φ 3 φ φ
GA ( 3 , 1 ) : φ φ 3 , 4 φ φ
GA ( 4 , 1 ) : ( φ φ 3 , 4 φ φ ) ( φ φ 3 , 4 φ φ )
We can now express the determinant of the matrix representation of a multivector via the self-product φ φ 3 , 4 φ φ . Again, this choice is not arbitrary, but the unique choice with allows us to represent the determinant of the matrix representation of a multivector within GA ( 3 , 1 ) :
Theorem 8
(Determinant as a Multivector Self-Product).
u u 3 , 4 u u = det M
Proof. 
Please find a computer assisted proof of this equality in Annex Appendix C. □
Definition 17
( GA ( 3 , 1 ) -valued Vector).
V = u 1 u n = a 1 + x 1 + f 1 + v 1 + b 1 a n + x n + f n + v n + b n
These constructions allow us to express the partition function in terms of the multivector self-product:
Definition 18
(Double-Copy Product). Instead of an inner product, we obtain what we call a double-copy product:
V | V | V | V = i ψ i ψ i copy 1 3 , 4 ψ i ψ i copy 2
= u 1 u n u 1 0 0 u n copy 1 3 , 4 u 1 0 0 u n u 1 u n copy 2
Theorem 9
(Partition Function). Z = V | V | V | V
Proof. 
V | V | V | V
= u 1 u n u 1 0 0 u n 3 , 4 u 1 0 0 u n u 1 u n
= u 1 u 1 u n u n 3 , 4 u 1 u 1 u n u n
= u 1 u 1 3 , 4 u 1 u 1 + + u n u n 3 , 4 u n u n
= i = 1 n det M u i
= Z
 □
Desirable properties for the double-copy product are introduced by reducing multivectors to its subgroups. First, non-negativity:
Theorem 10
(Non-negativity). The double-copy product, applied to the even sub-algebra of GA ( 3 , 1 ) is always non-negative.
Proof. 
Let V = a 1 + f 1 + b 1 a n + f n + b n . Then,
V | V | V | V
= ( a 1 + f 1 + b 1 ) ( a 1 + f 1 + b 1 ) 3 , 4 ( a 1 + f 1 + b 1 ) ( a 1 + f 1 + b 1 )
= ( a 1 f 1 + b 1 ) ( a 1 + f 1 + b 1 ) 3 , 4 ( a 1 f 1 + b 1 ) ( a 1 + f 1 + b 1 )
= a 1 2 + a 1 f 1 + a 1 b 1 f 1 a 1 f 1 2 f 1 b 1 + b 1 a 1 + b 1 f 1 + b 1 2 3 , 4
= a 1 2 f 1 2 + b 1 2 3 , 4
We note 1) b 2 = ( b I ) 2 = b 2 and 2) f 2 = E 1 2 E 2 2 E 3 2 + B 1 2 + B 2 2 + B 3 2 + 4 e 0 e 1 e 2 e 3 ( E 1 B 1 + E 2 B 2 + E 3 B 3 )
= a 1 2 b 1 2 + E 1 2 + E 2 2 + E 3 2 B 1 2 B 2 2 B 3 2 4 e 0 e 1 e 2 e 3 ( E 1 B 1 + E 2 B 2 + E 3 B 3 ) 3 , 4
We note that the terms are now complex numbers, which we rewrite as ( z ) = a 1 2 b 1 2 + E 1 2 + E 2 2 + E 3 2 B 1 2 B 2 2 B 3 2 and ( z ) = 4 ( E 1 B 1 + E 2 B 2 + E 3 B 3 )
= z 1 z 2 3 , 4 z n z n
= z 1 z 2 z n z n
= z 1 z 1 + + z n z n
which is always non-negative. □
Then, positive-definiteness of the double-copy product is obtained by creating an equivalence class between the zero vector and any non-zero vector of length zero, and taking the zero vector as the representative of the class. To realize the equivalence class, we define the Spin c ( 3 , 1 ) -valued wavefunction, which is valued in the even sub-algebra of GA ( 3 , 1 ) , as follows:
Definition 19
( Spin c ( 3 , 1 ) -valued Wavefunction).
ψ = e 1 2 ( a 1 + f 1 + b 1 ) e 1 2 ( a n + f n + b n ) = ρ 1 R 1 B 1 ρ n R n B n
where R i = e 1 2 f i is a rotor, B i = e 1 2 b i is a phase, and where ρ i = e 1 2 a i 0 .
Any even multivectors of GA(3,1) admits a unique exponential representation, except when ρ i = 0 in which it is surjective. Consequently, in this representation the double-copy product yields 0 only for the zero vector, rendering the double-copy product positive-definite.
Now, let us turn our attention to the evolution operator, which leaves the partition function invariant:
Definition 20
( Spin c ( 3 , 1 ) Evolution Operator).
T = e 1 2 ζ ( f 1 + b 1 ) e 1 2 ζ ( f n + b n )
In turn, this leads to a Schrödinger equation obtained by taking the derivative of the wavefunction with respect to the Lagrange multiplier ζ :
Definition 21
( Spin c ( 3 , 1 ) -valued Schrödinger equation).
d d ζ ψ 1 ( ζ ) ψ n ( ζ ) = 1 2 ( f 1 + b 1 ) 1 2 ( f n + b n ) ψ 1 ( ζ ) ψ n ( ζ )
In this case ζ represents a one-parameter evolution parameter akin to time, which is able to transform the measurement basis under action of the Spin c ( 3 , 1 ) group. This is an extremely general equation that captures all transformations that can be done consistently with the symmetries of the wavefunction.

2.3.2. RQM

Definition 22
(David Hestenes’ Formulation). The Spin c ( 3 , 1 ) -valued wavefunction we have recovered is formulated identically to David Hestenes’ [7] formulation of the wavefunction within GA(3,1).
ψ = e 1 2 ( a + f + b ) = ρ R e i b / 2
where e 1 2 a = ρ , e 1 2 f = R and e 1 2 b = e i b / 2 .
Before we continue the RQM investigation, let us note that the double-copy product contains two copies of a bilinear form ψ ψ :
ψ ψ copy 1 3 , 4 ψ ψ copy 2
In the present section, we will investigate the properties of each copy individually, leaving the properties specific to the double-copy for the section on quantum gravity.
Taking a single copy, the Dirac current is obtained directly from the gamma matrices, as follows:
Definition 23
(Dirac Current). The definition of the Dirac current is the same as Hestenes’:
J ψ γ μ ψ = ρ R B γ μ B R = ρ R γ μ B 1 B R = ρ R γ μ R SO ( 3 , 1 ) = ρ γ ˜ μ
where γ ˜ μ is a SO(3,1) rotated basis vector.

2.3.3. Standard Model Gauge Symmetries

We will now demonstrate that the copied bilinear form is automatically invariant with respect to the U ( 1 ) , SU ( 2 ) , and SU ( 3 ) symmetries and the Spin ( 3 , 1 ) and unitary U U = I symmetries which play a fundamental role in the standard model of particle physics.
Theorem 11.
(U(1) Invariance). Let e 1 2 b be a general element of U(1). Then, the equality
ψ γ 0 ψ 3 , 4 ψ γ 0 ψ = ( e 1 2 b ψ ) γ 0 e 1 2 b ψ copy 1 3 , 4 ( e 1 2 b ψ ) γ 0 e 1 2 b ψ copy 2
is satisfied, yielding a U(1) symmetry for each copied bilinear form.
Proof. 
Equation 119 is invariant if this expression is satisfied:
e 1 2 b γ 0 e 1 2 b = γ 0
This is always satisfied simply because e 1 2 b γ 0 e 1 2 b = γ 0 e 1 2 b e 1 2 b = γ 0
Theorem 12.
(SU(2) Invariance). Let e 1 2 f be a general element of Spin(3,1). Then, the equality:
ψ γ 0 ψ 3 , 4 ψ γ 0 ψ = ( e 1 2 f ψ ) γ 0 e 1 2 f ψ copy 1 3 , 4 ( e 1 2 f ψ ) γ 0 e 1 2 f ψ copy 2
is satisfied for if f = θ 1 γ 0 γ 1 + θ 2 γ 0 γ 2 + θ 3 γ 0 γ 3 (which generates SU(2)), yielding a SU(2) symmetry for each copied bilinear form.
Proof. 
Equation 121 is invariant if this expression is satisfied [9]:
e 1 2 f γ 0 e 1 2 f = γ 0
We now note that moving the left-most term to the right of the gamma matrix yields:
e θ 1 γ 0 γ 1 θ 2 γ 0 γ 2 θ 3 γ 0 γ 3 B 1 γ 2 γ 3 B 2 γ 1 γ 3 B 3 γ 1 γ 2 γ 0 e 1 2 f
= γ 0 e θ 1 γ 0 γ 1 θ 2 γ 0 γ 2 θ 3 γ 0 γ 3 + B 1 γ 2 γ 3 + B 2 γ 1 γ 3 + B 3 γ 1 γ 2 e 1 2 f
Therefore, the product e 1 2 f γ 0 e 1 2 f reduces to γ 0 if and only if B 1 = B 2 = B 3 = 0 , leaving f = θ 1 γ 0 γ 1 + θ 2 γ 0 γ 2 + θ 3 γ 0 γ 3 :
Finally, we note that e θ 1 γ 0 γ 1 + θ 2 γ 0 γ 2 + θ 3 γ 0 γ 3 generates SU ( 2 ) . □
Theorem 13.
(SU(3) invariance). Let f = E 1 γ 0 γ 1 + E 2 γ 0 γ 2 + E 3 γ 0 γ 3 + B 1 γ 2 γ 3 + B 2 γ 1 γ 3 + B 3 γ 1 γ 2 be a general bivector. Then, the equality:
ψ γ 0 ψ 3 , 4 ψ γ 0 ψ = ( f ψ ) γ 0 f ψ copy 1 3 , 4 ( f ψ ) γ 0 f ψ copy 2
is satisfied if ( E 1 2 + B 1 2 ) + ( E 2 2 + B 2 2 ) + ( E 3 2 + B 3 2 ) = 1 , where E 1 γ 0 γ 1 + B 1 γ 0 γ 1 I E 1 + i B 1 , E 2 γ 0 γ 2 + B 2 γ 0 γ 2 I E 2 + i B 2 and E 3 γ 0 γ 3 + B 3 γ 0 γ 3 I E 3 + i B 3 which are the defining conditions for the SU(3) symmetry group. This yields a SU(3) symmetry for each copy of the bilinear form.
Proof. 
Equation 125 is invariant if this expression is satisfied [9,10]:
f γ 0 f = γ 0
which we can rewrite as follows:
( E 1 γ 0 γ 1 + E 2 γ 0 γ 2 + E 3 γ 0 γ 3 + B 1 γ 2 γ 3 + B 2 γ 1 γ 3 + B 3 γ 1 γ 2 ) γ 0 f
The first three terms anticommute with γ 0 , while the last three commute with γ 0 :
= γ 0 ( E 1 γ 0 γ 1 + E 2 γ 0 γ 2 + E 3 γ 0 γ 3 B 1 γ 2 γ 3 B 2 ( q ) γ 1 γ 3 B 3 ( q ) γ 1 γ 2 ) f ( q )
This can be written as:
γ 0 ( E B ) ( E + B )
= γ 0 ( E 2 + E B B E B 2 )
where E = E 1 γ 0 γ 1 + E 2 γ 0 γ 2 + E 3 γ 0 γ 3 and B = B 1 γ 2 γ 3 + B 2 γ 1 γ 3 + B 3 γ 1 γ 2 .
Thus, for f γ 0 f = γ 0 , we require: 1) E 2 B 2 = 1 and 2) E B = B E . The second requirement means that E and B must commute (and thus be isomorphic to three complex numbers), and the first implies:
E 2 B 2 = ( E 1 2 + B 1 2 ) + ( E 2 2 + B 2 2 ) + ( E 3 2 + B 3 2 ) = 1
which are the defining conditions for the SU ( 3 ) symmetry group. □
Theorem 14.
(Spin(3,1) invariance). Let e 1 2 f be a general element of Spin(3,1). Then, the equality:
ψ ψ 3 , 4 ψ ψ = ( e 1 2 f ψ ) e 1 2 f ψ copy 1 3 , 4 ( e 1 2 f ψ ) e 1 2 f ψ copy 2
is always satisfied, yielding a Spin(3,1) symmetry for each copied bilinear form.
Proof. 
Equation 132 is invariant if this expression is satisfied
e 1 2 f e 1 2 f = 1
which is always the case. □
Theorem 15
(Unitary invariance). Let U be n × n unitary matrices. Then unitary invariance:
ψ | γ μ ψ | ψ | γ ν ψ = U ψ | γ μ U ψ | U ψ | γ ν U ψ U U = I
is individually satisfied for each copied bilinear form.
Proof. 
Equation 134 is satisfied if U γ μ U = γ μ . Since U is valued in complex numbers, then U = U T , and since γ μ γ 0 γ 1 γ 2 γ 3 = γ 0 γ 1 γ 2 γ 3 γ μ , it follows that:
γ μ U U = γ μ
which is satisfied when U U = I . □
The invariances SU(3), SU(2) and U(1) discussed above can be promoted to local symmetries using the usual gauge symmetry construction techniques, along with the Dirac equation or field Lagrangian.
In conventional QM, the Born rule naturally leads to a U(1)-valued gauge theory due to the following symmetry:
( e i θ ( x ) ψ ( x ) ) e i θ ( x ) ψ ( x ) = ψ ( x ) ψ ( x )
However, the SU(3) and SU(2) symmetries do not emerge from the probability measure in the same way and must instead be introduced manually, justified by experimental considerations. This raises the question: why these specific symmetries and not others? In contrast, within the double-copy product framework, all three symmetry groups–U(1), SU(2), and SU(3)–as well as the Spin(3,1) and unitary symmetries, follow naturally from the invariance of the probability measure, in the same way that U(1) symmetry follows from the Born rule. This suggests that the double-copy product framework may provide a more unified and consistent approach to deriving the fundamental symmetries of nature.

2.3.4. Quantum Gravity

In the previous section, we developed a quantum theory valued in Spin c (3,1), which served as the arena for RQM and the standard model gauge symmetries. The goal of this section is to extend the methodology to arbitrary basis vectors, in which the metric tensor exists as an observable. To formulate the theory, we will exploit the features of the double-copy product, which will enable us to express the metric tensor as an observable. To a degree, this formulation is reminiscent of the Bern-Carrasco-Johansson (BCJ) [11] double-copy gauge theory of perturbatively expanded quantum gravity, but our double-copy is formulated at the level of the Dirac current directly. By applying the double-copy product to the Dirac current, we aim to establish a connection between the wavefunction and the geometrical structure of spacetime, providing a novel approach to quantize gravity.
We recall the definition of the metric tensor in terms of basis vectors of geometric algebra, as follows:
g μ ν = 1 2 ( e μ e ν + e ν e μ )
Then, we note that the double-copy product acts on a pair of basis element e μ and e ν , as follows:
1 2 ψ e μ ψ copy 1 3 , 4 ψ e ν ψ copy 1 + ψ e ν φ copy 1 3 , 4 ψ e μ ψ copy 1
= 1 2 R ˜ ρ e i b / 2 e i b / 2 Born rule copy 1 e μ R R ˜ ρ e i b / 2 e i b / 2 Born rule copy 2 e ν R + R ˜ ρ e i b / 2 e i b / 2 Born rule copy 2 e ν R R ˜ ρ e i b / 2 e i b / 2 Born rule copy 1 e μ R
= 1 2 ρ 2 R ˜ e μ R R ˜ e ν R + R ˜ e ν R R ˜ e ν R
= ρ 2 1 2 ( e ˜ μ e ˜ ν + e ˜ ν e ˜ μ ) metric tensor
where e ˜ μ and e ˜ ν are SO(3,1) rotated basis vectors.
As one can swap e μ and e ν and obtain the same metric tensor, the double-copy product guarantees that g μ ν is symmetric.
Furthermore, since e μ = e μ , we get:
( e μ ψ ) ψ 3 , 4 ( e ν ψ ) ψ
= ψ ( 1 ) e μ ψ 3 , 4 ψ ( 1 ) e ν ψ
= ψ e μ ψ 3 , 4 ψ e ν ψ
which allows us to conclude that e μ and e ν are self-adjoint within the double-copy product, entailing the interpretation of g μ ν as an observable.
In the double-copy product, the metric tensor is a double copy of Dirac currents (we recall that the Dirac current is defined as ρ γ ˜ μ = ψ γ μ ψ ). As such, we intuit that it encodes the probabilistic structure of a general relativistic quantum theory in the form of a metric tensor analogously to how the Dirac current encodes the probabilistic structure of a special relativistic quantum theory in the form of a vector.
Let us now investigate the dynamics. We recall that the evolution operator (Definition 20) is:
T = e 1 2 ζ ( f 1 + b 1 ) e 1 2 ζ ( f n + b n )
Acting on the wavefunction, the effect of this operator cascades down to the basis vectors via the double-copy product:
ψ T e μ T ψ copy 1 3 , 4 ψ T e ν T ψ copy 2
which realizes an SO ( 3 , 1 ) transformation of the metric tensor via action of the exponential of a bivector, and a double-copy unitary invariant transformation via action of the exponential of a pseudo-scalar:
ψ e 1 2 ζ f e μ e 1 2 ζ f SO ( 3 , 1 ) evolution e 1 2 ζ b e 1 2 ζ b unitary evolution ψ copy 1 3 , 4 ψ e 1 2 ζ f e μ e 1 2 ζ f SO ( 3 , 1 ) evolution e 1 2 ζ b e 1 2 ζ b unitary evolution ψ copy 2
In summary, this initial investigation has identified a scenario in which the wavefunction measures the metric tensor by acting on the basis vectors. In this context, the evolution operator, governed by the Schrödinger equation, dynamically realizes SO(3,1) transformations on the metric tensor. Furthermore, the amplitudes associated with possible metric tensors are given by a double-copy of unitary quantum theories operating on the basis vectors. This formulation respects both the SO(3,1) symmetry, which is essential for describing the structure of spacetime, and the unitary symmetry, which is a fundamental aspect of quantum mechanics. We will now investigate this formulation in the context of gravity.

2.3.5. Linearized Gravity (Sketch)

It is well known that the Einstein-Hilbert action:
S [ g μ ν ] = c 4 16 π G R | g | d 4 x
under the assumption of a small perturbation g μ ν = η μ ν + h μ ν , and working in de Donder gauge α h α μ 1 2 μ h = 0 , where h = η μ ν h μ ν , can be reduced to its linearized form which is:
S E H ( 1 ) [ h μ ν ] = d 4 x 1 2 μ h ρ σ μ h ρ σ 1 4 μ h μ h
Furthermore, varying this action with respect to h μ ν and applying the transverse-traceless gauge, yields the wave equation as the equation of motion:
δ S δ h μ ν = 0 h μ ν = 0
We now wish to express the wave equation for h μ ν in terms of basis vectors h μ and h ν , such that h μ ν = 1 2 ( h μ h ν + h ν h μ ) . The expression of the wave equation becomes:
1 2 ( h μ h ν + h ν h μ ) = 0
We identify the solution for h μ and h ν by an ansatz:
h μ ( x , t ) = Re d 3 k A + ( k ) e μ + ( k ) + A × ( k ) e μ × ( k ) e i ( k · x ω k t )
h ν ( x , t ) = Re d 3 k A + ( k ) e ν + ( k ) + A × ( k ) e ν × ( k ) e i ( k · x ω k t )
The × and + symbol designed two polarizations.
Finally, we promote h μ ( x , t ) and h ν ( x , t ) to operators:
h ^ μ ( x , t ) = Re d 3 k A ^ + ( k ) e μ + ( k ) + A ^ × ( k ) e μ × ( k ) e i ( k · x ω k t )
h ^ ν ( x , t ) = Re d 3 k A ^ + ( k ) e ν + ( k ) + A ^ × ( k ) e ν × ( k ) e i ( k · x ω k t )
In the conventional perturbative approach to quantum gravity, the metric tensor h ^ μ ν is quantized, and its expectation value is calculated using the Born rule as follows: h ^ μ ν = ψ | h ^ μ ν | ψ . In contrast, our approach quantizes the two basis vectors h ^ μ and h ^ ν separately and utilizes a double-copy product to obtain the expectation value of the metric tensor as h ^ μ ν = 1 2 ( ψ | h ^ μ ψ | ψ | h ^ ν ψ + ψ | h ^ ν ψ | ψ | h ^ μ ψ ) . In the latter case, the operators realize a double copy of Dirac currents, each copy requiring an individual application of the Born rule. The resulting probability calculation differs from the conventional approach, as it involves the product of two separate Born rule applications. Remarkably, our probability measure appears unable to assign an expectation value to a quantized metric tensor unless it is done via a double-copy mechanism, making the double-copy mechanism fundamental to our construction.
The double-copy mechanism, identified by Bern, Carrasco, and Johansson (BCJ) [11], has been shown to simplify the calculations of scattering amplitudes for gravitons in many cases. However, it remains an open question whether our double-copy mechanism, formulated at the level of the Dirac current, carries similar advantages.

2.4. Dimensional Obstructions

In this section, we explore the dimensional obstructions that arise when attempting to resolve the entropy maximization problem for other dimensional configurations. We found that all dimensional configurations except those we have explored here (e.g. GA ( 0 ) R , GA ( 0 , 1 ) C and GA ( 3 , 1 ) ) are obstructed:
Dimensions Obstruction
GA ( 0 ) R Unobstructed statistical mechanics
GA ( 0 , 1 ) C Unobstructed quantum mechanics
GA ( 1 , 0 ) Negative probabilities in the RQM
GA ( 2 , 0 ) No metric measurement
GA ( 1 , 1 ) Negative probabilities in the RQM
GA ( 0 , 2 ) Not isomorphic to a real matrix algebra
GA ( 3 , 0 ) Not isomorphic to a real matrix algebra
GA ( 2 , 1 ) Not isomorphic to a real matrix algebra
GA ( 1 , 2 ) Not isomorphic to a real matrix algebra
GA ( 0 , 3 ) Not isomorphic to a real matrix algebra
GA ( 4 , 0 ) Not isomorphic to a real matrix algebra
GA ( 3 , 1 ) Unobstructed quantum gravity SU ( 3 ) × SU ( 2 ) × U ( 1 )
GA ( 2 , 2 ) Negative probabilities in the RQM
GA ( 1 , 3 ) Not isomorphic to a real matrix algebra
GA ( 0 , 4 ) Not isomorphic to a real matrix algebra
GA ( 5 , 0 ) Not isomorphic to a real matrix algebra
GA ( 6 , 0 ) No probability measure as a self - product
Let us now demonstrate the obstructions mentioned above.
Theorem 16
(Not isomorphic to a real matrix algebra). The determinant of the matrix representation of the geometric algebras in this category is either complex-valued or quaternion-valued, making them unsuitable as a probability.
Proof. 
These geometric algebras are classified as follows:
GA ( 0 , 2 ) H
GA ( 3 , 0 ) M 2 ( C )
GA ( 2 , 1 ) M 2 2 ( R )
GA ( 1 , 2 ) M 2 ( C )
GA ( 0 , 3 ) H 2
GA ( 4 , 0 ) M 2 ( H )
GA ( 1 , 3 ) M 2 ( H )
GA ( 0 , 4 ) M 2 ( H )
GA ( 5 , 0 ) M 2 2 ( H )
The determinant of these objects is valued in C or in H , where C are the complex numbers, and where H are the quaternions. □
Theorem 17
(Negative Probabilities in the RQM). The even sub-algebra, which associates to the RQM part of the theory, of these dimensional configurations allows for negative probabilities, making them unsuitable as a RQM.
Proof. 
This category contains three dimensional configurations:
GA ( 1 , 0 ) :
Let ψ = a + b e 1 , then:
( a + b e 1 ) ( a + b e 1 ) = ( a b e 1 ) ( a + b e 1 ) = a 2 b 2 e 1 e 1 = a 2 b 2
which is valued in R .
GA ( 1 , 1 ) :
Let ψ = a + b e 0 e 1 , then:
( a + b e 0 e 1 ) ( a + b e 0 e 1 ) = ( a b e 0 e 1 ) ( a + b e 0 e 1 ) = a 2 b 2 e 0 e 1 e 0 e 1 = a 2 b 2
which is valued in R .
GA ( 2 , 2 ) :
Let ψ = a + b e 0 e e 1 e 2 , where e 0 2 = 1 , e 2 = 1 , e 1 2 = 1 , e 2 2 = 1 , then:
( a + b ) ( a + b ) 3 , 4 ( a + b ) ( a + b )
= a 2 + 2 a b + b 2 3 , 4 ( a 2 + 2 a b + b 2 )
We note that b 2 = b 2 e 0 e e 1 e 2 e 0 e e 1 e 2 = b 2 , therefore:
= ( a 2 + b 2 2 a b ) ( a 2 + b 2 + 2 a b )
= ( a 2 + b 2 ) 2 4 a 2 b 2
= ( a 2 + b 2 ) 2 4 a 2 b 2
which is valued in R .
In all of these cases the RQM probability can be negative. □
We repeat the following self-products [8] (Definition 16), which will help us demonstrate the next theorem:
GA ( 0 , 1 ) : φ φ
GA ( 2 , 0 ) : φ φ
GA ( 3 , 0 ) : φ φ 3 φ φ
GA ( 3 , 1 ) : φ φ 3 , 4 φ φ
GA ( 4 , 1 ) : ( φ φ 3 , 4 φ φ ) ( φ φ 3 , 4 φ φ )
Theorem 18
(No Metric Measurements). This obstruction applies to GA ( 2 , 0 ) . A probability measure of at least four self-products are required for the theory to be observationally complete with respect to its geometry.
Proof. 
A metric measurement requires a probability measure of 4 self products because the metric tensor is defined using 2 self-products of the gamma matrices:
g μ ν = 1 2 ( e μ e ν + e ν e μ )
Each pair of wavefunction products fixes one basis elements. Thus, two pairs of wavefunction products are required to fix the geometry from the wavefunction. As probability measures of four self-products begin to appear in 3D, then the GA ( 2 , 0 ) cannot produce a metric measurement as a quantum observable, thus its geometry is not observationally complete with respect to its geometry. □
Conjecture 1.
(No probability measures as a self-product (in 6D)). The multivector representation of the norm in 6D cannot satisfy any observables.
Argument 
In six dimensions and above, the self-product patterns found in Definition 16 collapse. The research by Acus et al. [12] in 6D geometric algebra demonstrates that the determinant, so far defined through a self-products of the multivector, fails to extend into 6D. The crux of the difficulty is evident in the reduced case of a 6D multivector containing only scalar and grade-4 elements:
s ( B ) = b 1 B f 5 ( f 4 ( B ) f 3 ( f 2 ( B ) f 1 ( B ) ) ) + b 2 B g 5 ( g 4 ( B ) g 3 ( g 2 ( B ) g 1 ( B ) ) )
This equation is not a multivector self-product but a linear sum of two multivector self-products [12].
The full expression is given in the form of a system of 4 equations, which is too long to list in its entirety. A small characteristic part is shown:
a 0 4 2 a 0 2 a 47 2 + b 2 a 0 2 a 47 2 p 412 p 422 + 72 monomials = 0
b 1 a 0 3 a 52 + 2 b 2 a 0 a 47 2 a 52 p 412 p 422 p 432 p 442 p 452 + 72 monomials = 0
74 monomials = 0
74 monomials = 0
From Equation 196, it is possible to see that no observable O can satisfy this equation because the linear combination does not allow one to factor it out of the equation.
b 1 O B f 5 ( f 4 ( B ) f 3 ( f 2 ( B ) f 1 ( B ) ) ) + b 2 B g 5 ( g 4 ( B ) g 3 ( g 2 ( B ) g 1 ( B ) ) ) = b 1 B f 5 ( f 4 ( B ) f 3 ( f 2 ( B ) f 1 ( B ) ) ) + b 2 O B g 5 ( g 4 ( B ) g 3 ( g 2 ( B ) g 1 ( B ) ) )
Any equality of the above type between b 1 O and b 2 O is frustrated by the factors b 1 and b 2 , forcing O = 1 as the only satisfying observable. Since the obstruction occurs within grade-4, which is part of the even sub-algebra it is questionable that a satisfactory theory (with non-trivial observables) be constructible in 6D, suing this method. □
This conjecture proposes that the multivector representation of the determinant in 6D does not allow for the construction of non-trivial observables, which is a crucial requirement for a relevant quantum formalism. The linear combination of multivector self-products in the 6D expression prevents the factorization of observables, limiting their role to the identity operator.
Conjecture 2.
(No probability measures as a self-product (above 6D)). The norms beyond 6D are progressively more complex than the 6D case, which is already obstructed.
These theorems and conjectures provide additional insights into the unique role of the unobstructed 3+1D signature in our proposal.
It is also interesting that our proposal is able to rule out GA ( 1 , 3 ) even if in relativity, the signature of the metric ( + , , , ) versus ( , , , + ) does not influence the physics. However, in geometric algebra, GA ( 1 , 3 ) represents 1 space dimension and 3 time dimensions. Therefore, it is not the signature itself that is ruled out but rather the specific arrangement of 3 time and 1 space dimensions, as this configuration yields quaternion-valued "probabilities" (i.e. GA ( 1 , 3 ) M 2 ( H ) and det M 2 ( H ) H ).
Consequently, 3+1D is the only dimensional configuration (other than the "non-geometric" configurations of GA ( 0 ) R and GA ( 0 , 1 ) C ) in which a ’least biased’ solution to the problem of maximizing the Shannon entropy of quantum measurements relative to an initial preparation, exists. This is an extremely constraining result regarding the possible spacetime configurations of the universe, and our ability (or inability) to construct a least biased theory to investigate it.

3. Discussion

The principle of maximum entropy [3] states that the probability measure that best represents the current state of knowledge about a system is the one with the largest entropy, constrained by prior data. In QM, an experiment typically consists of three stages: an initial preparation, followed by some transformations, and concluding with a final measurement of the system, which yields the result of the experiment. Consistent with the maximum entropy principle, our aim is to derive the ’least biased’ theory that connects the initial preparation p to its final measurement ρ . By formulating the theory as a solution to a maximization problem, rather than merely by axiomatic stipulation, we ensure (by mathematical proof) that the resulting framework is as unbiased as possible given the available information.
Using this methodology, fundamental physics can be formulated as the general solution to a maximization problem involving the Shannon entropy of all possible measurements of an arbitrary system relative to its initial preparation, subject to a vanishing phase constraint. The structure of the inferred theory is thus determined by the nature and generality of the employed constraint. In this paper, we have investigated three specific entropy maximization problems, each characterized by a different constraint and corresponding to a distinct level of description in physics:
Constraint Vanishing Phase Inferred Theory Wavefunction E ¯ = i ρ i E i none SM N . A . 0 = tr i ρ i 0 E i E i 0 U ( 1 ) QM C n 0 = 1 2 tr i ρ i M i Spin c ( 3 , 1 ) RQM / QG ( R × Spin c ( 3 , 1 ) ) n
where n represents the size of the ensemble.
Despite the differences in constraints, the three theories here-so formulated share a common logical genesis, adhere to the same principle of maximum entropy, and qualify as the least biased theory for their given constraint.

Interpretation

The Born rule is the least biased probability measure operating on a complex Hilbert space, as established in Theorem 2. However, when the framework is extended to include geometric considerations, the Born rule is no longer the least biased measure. Instead, as demonstrated in Theorem 3, the double-copy product emerges as the least biased probability measure in the 3+1D setting. Interestingly, no solutions exist for other dimensional configurations, as demonstrated in Section 2.4.

4. Conclusion

This paper presents a novel approach to quantum theory construction by solving a maximization problem on the Shannon entropy of all possible measurements of a system relative to its initial preparation, under the constraint of a vanishing phase. By selecting the appropriate group of the vanishing phase, the solution resolves to quantum mechanics, relativistic quantum mechanics, or a candidate for a theory of quantum gravity. The resulting measure is invariant under a wide range of geometric transformations, including those generated by the gauge groups of the Standard Model, and leads to the metric tensor as an operator involving a double copy of Dirac currents, without additional assumptions. Again without additional assumptions, the theory automatically fails in dimensional configurations other than 3+1D. This result aligns with the observed dimensionality and gauge symmetries of the universe, suggesting a possible explanation for its specific structure. This approach may offer a promising avenue for the unification of fundamental physical theories and may provide new insights into the underlying principles governing the structure of our universe.

Data Availability Statement

No datasets were generated or analyzed during the current study. During the preparation of this manuscript, we utilized a Large Language Model (LLM), for assistance with spelling and grammar corrections, as well as for minor improvements to the text to enhance clarity and readability. This AI tool did not contribute to the conceptual development of the work, data analysis, interpretation of results, or the decision-making process in the research. Its use was limited to language editing and minor textual enhancements to ensure the manuscript met the required linguistic standards.

Conflicts of Interest

The author declares that he has no competing financial or non-financial interests that are directly or indirectly related to the work submitted for publication.

Appendix A. SM

Here, we solve the Lagrange multiplier equation of SM.
L = k B i ρ i ln ρ i Boltzmann   Entropy + λ 1 i ρ i Normalization   Constraint + β E ¯ i ρ i E i Average Energy Constraint
We solve the maximization problem as follows:
0 = L ( ρ i , , ρ n ) ρ i
= ln ρ i 1 λ β E i
= ln ρ i + 1 + λ + β E i
ln ρ i = 1 λ β E i
ρ i = exp ( 1 λ ) exp β E i
= 1 Z ( τ ) exp β E i
The partition function, is obtained as follows:
1 = i exp ( 1 λ ) exp β E i
exp ( 1 λ ) 1 = i exp β E i
Z ( τ ) : = i exp β E i
Finally, the probability measure is:
ρ i = 1 i exp β E i exp β E i

Appendix B. RQM in 3+1D

L = i ρ i ln ρ i p i Relative   Shannon   Entropy + λ 1 i ρ i Normalization   Constraint + ζ tr 1 2 i ρ i M i Vanishing   Relativistic - Phase   Anti - Constraint
The solution is obtained using the same step-by-step process as the 2D case, and yields:
ρ i = 1 i p i det exp ( ζ 1 2 M i ) Spin c ( 3 , 1 ) Invariant Ensemble det exp ( ζ 1 2 M i ) Spin c ( 3 , 1 ) Born Rule p i Initial Preparation
Proof. 
The Lagrange multiplier equation can be solved as follows:
0 = L ( ρ 1 , , ρ n ) ρ i
= ln ρ i p i p i λ ζ tr 1 2 M i
= ln ρ i p i + p i + λ + ζ tr 1 2 M i
ln ρ i p i = p i λ ζ tr 1 2 M i
ρ i = p i exp ( p i λ ) exp ζ tr 1 2 M i
= 1 Z ( ζ ) p i exp ζ tr 1 2 M i
The partition function Z ( ζ ) , serving as a normalization constant, is determined as follows:
1 = i p i exp ( p i λ ) exp ζ tr 1 2 M i
exp ( p i λ ) 1 = i p i exp ζ tr 1 2 M i
Z ( ζ ) : = i p i exp ζ tr 1 2 M i
 □

Appendix C. SageMath program showing ⌊u ‡ u⌋ 3,4 u ‡ u=detM u

[H] from sage.algebras.clifford_algebra import CliffordAlgebra
from sage.quadratic_forms.quadratic_form import QuadraticForm
from sage.symbolic.ring import SR
from sage.matrix.constructor import Matrix
 
# Define the quadratic form for GA(3,1) over the Symbolic Ring
Q = QuadraticForm(SR, 4, [-1, 0, 0, 0, 1, 0, 0, 1, 0, 1])
 
# Initialize the GA(3,1) algebra over the Symbolic Ring
algebra = CliffordAlgebra(Q)
 
# Define the basis vectors
e0, e1, e2, e3 = algebra.gens()
 
# Define the scalar variables for each basis element
a = var(’a’)
t, x, y, z = var(’t x y z’)
f01, f02, f03, f12, f23, f13 = var(’f01 f02 f03 f12 f23 f13’)
v, w, q, p = var(’v w q p’)
b = var(’b’)
 
# Create a general multivector
udegree0=a
udegree1=t*e0+x*e1+y*e2+z*e3
udegree2=f01*e0*e1+f02*e0*e2+f03*e0*e3+f12*e1*e2+f13*e1*e3+f23*e2*e3
udegree3=v*e0*e1*e2+w*e0*e1*e3+q*e0*e2*e3+p*e1*e2*e3
udegree4=b*e0*e1*e2*e3
u=udegree0+udegree1+udegree2+udegree3+udegree4
 
u2 = u.clifford_conjugate()*u
 
u2degree0 = sum(x for x in u2.terms() if x.degree() == 0)
u2degree1 = sum(x for x in u2.terms() if x.degree() == 1)
u2degree2 = sum(x for x in u2.terms() if x.degree() == 2)
u2degree3 = sum(x for x in u2.terms() if x.degree() == 3)
u2degree4 = sum(x for x in u2.terms() if x.degree() == 4)
u2conj34 = u2degree0+u2degree1+u2degree2-u2degree3-u2degree4
 
Preprints 110056 i001
mdegree0 = a
mdegree1 = t*y0+x*y1+y*y2+z*y3
mdegree2 = f01*y0*y1+f02*y0*y2+f03*y0*y3+f12*y1*y2+f13*y1*y3+f23*y2*y3
mdegree3 = v*y0*y1*y2+w*y0*y1*y3+q*y0*y2*y3+p*y1*y2*y3
mdegree4 = b*y0*y1*y2*y3
m=mdegree0+mdegree1+mdegree2+mdegree3+mdegree4
 
print(u2conj34*u2 == m.det())
 
The program outputs
 
True
showing, by computer assisted symbolic manipulations, that the determinant of the real Majorana representation of a multivector u is equal to the double-copy form: det M u = u u 3 , 4 u u .

References

  1. Dirac, P.A.M. The principles of quantum mechanics; Number 27, Oxford university press, 1981.
  2. Von Neumann, J. Mathematical foundations of quantum mechanics: New edition; Vol. 53, Princeton university press, 2018.
  3. Jaynes, E.T. Information theory and statistical mechanics. Physical review 1957, 106, 620. [Google Scholar] [CrossRef]
  4. Jaynes, E.T. Information theory and statistical mechanics. II. Physical review 1957, 108, 171. [Google Scholar] [CrossRef]
  5. Kullback, S.; Leibler, R.A. On information and sufficiency. The annals of mathematical statistics 1951, 22, 79–86. [Google Scholar] [CrossRef]
  6. Shannon, C.E. A mathematical theory of communication. Bell system technical journal 1948, 27, 379–423. [Google Scholar] [CrossRef]
  7. Hestenes, D. Spacetime physics with geometric algebra. American Journal of Physics 2003, 71, 691–714. [Google Scholar] [CrossRef]
  8. Lundholm, D. Geometric (Clifford) algebra and its applications. arXiv preprint math/0605280 2006.
  9. Hestenes, D. Space-time structure of weak and electromagnetic interactions. Foundations of Physics 1982, 12, 153–168. [Google Scholar] [CrossRef]
  10. Lasenby, A. Some recent results for SU(3) and Octonions within the Geometric Algebra approach to the fundamental forces of nature. 2022. arXiv:2202.06733 2022.
  11. Bern, Z.; Carrasco, J.J.M.; Johansson, H. Perturbative quantum gravity as a double copy of gauge theory. Physical Review Letters 2010, 105, 061602. [Google Scholar] [CrossRef] [PubMed]
  12. Acus, A.; Dargys, A. Inverse of multivector: Beyond p+ q= 5 threshold. 2017. arXiv:1712.05204 2017.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated