Deriving Fundamental Physics via Entropy Maximization under a Universal Measurement Constraint

Alexandre Harvey-Tremblay

doi:10.20944/preprints202404.1009.v7

Submitted: 29 October 2024

Posted: 30 October 2024

Abstract

We present a novel derivation of fundamental physical laws by solving a maximization problem on the Shannon entropy of all possible measurements relative to a system's initial state, subject to a universal measurement constraint. Its most sophisticated solution involves 3+1-dimensional measurements that include both bivectors and complex numbers. In this case, the solution describes a relativistic quantum theory that naturally yields the metric tensor of general relativity through a double-copy mechanism applied to the Dirac current and that inherently incorporates the SU(3)xSU(2)xU(1) gauge symmetries of the Standard Model. These findings reveal deep connections between probability theory and the mathematical structures underlying fundamental physics, offering new insights into the emergence of 3+1 spacetime dimensions, gravity, and symmetry structures in our universe.

Keywords: foundations of quantum physics

Subject: Physical Sciences - Quantum Science and Technology

1. Introduction

Statistical mechanics (SM), in the formulation developed by E.T. Jaynes [1,2], is founded on an entropy optimization principle. Specifically, the Boltzmann entropy is maximized under the constraint of a fixed average energy $\bar{E}$:

\begin{matrix} \bar{E} = \sum_{i} ρ_{i} E_{i} \end{matrix}

(1)

The Lagrange multiplier equation defining the optimization problem is:

\begin{matrix} L = - k_{B} \sum_{i} ρ_{i} ln ρ_{i} + λ (1 - \sum_{i} ρ_{i}) + β (\bar{E} - \sum_{i} ρ_{i} E_{i}), \end{matrix}

(2)

where $λ$ and $β$ are Lagrange multipliers enforcing the normalization and average energy constraints. Solving this optimization problem yields the Gibbs measure:

\begin{matrix} ρ_{i} = \frac{1}{Z} exp (- β E_{i}), \end{matrix}

(3)

where $Z = \sum_{i} exp (- β E_{i})$ is the partition function.
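As a minimal numerical sketch of Equations 1 to 3, the snippet below computes the Gibbs measure for a few energy levels and finds the multiplier β that reproduces a prescribed average energy. The energy levels, the target average, and the helper names (`gibbs`, `mean_energy`, `solve_beta`) are illustrative assumptions, not values or routines taken from the paper.

```python
import math

def gibbs(energies, beta):
    """Return the Gibbs probabilities exp(-beta*E_i)/Z and the partition function Z."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)
    return [w / z for w in weights], z

def mean_energy(energies, beta):
    """Average energy sum_i rho_i E_i under the Gibbs measure."""
    probs, _ = gibbs(energies, beta)
    return sum(p * e for p, e in zip(probs, energies))

def solve_beta(energies, target, lo=-50.0, hi=50.0, tol=1e-12):
    """Bisection on beta; the mean energy decreases monotonically as beta grows."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mean_energy(energies, mid) > target:
            lo = mid  # mean still too high, so beta must be larger
        else:
            hi = mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)

energies = [0.0, 1.0, 2.0]  # hypothetical energy levels
target = 0.6                # hypothetical fixed average energy
beta = solve_beta(energies, target)
probs, z = gibbs(energies, beta)
```

Bisection is used here only because the mean energy is monotone in β; any one-dimensional root finder would serve equally well.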

Formulating statistical mechanics as the solution to an empirically constrained optimization problem provides a parsimonious and unified framework in which fundamental principles are derived from observable quantities. In contrast, other physical theories often begin with fundamental principles or axioms, from which observable quantities are introduced to describe different phenomena. By basing its foundation on empirical measurement constraints, statistical mechanics achieves a coherent and comprehensive understanding through a single optimization principle, highlighting a reversal of the typical approach found in other theories.

For comparison, quantum mechanics (QM) is founded on five principles [3,4]:

State Space: Every physical system is associated with a complex Hilbert space, and its state is represented by a ray (an equivalence class of vectors differing by a non-zero scalar multiple) in this space.
Observables: Physical observables correspond to Hermitian (self-adjoint) operators acting on the Hilbert space.
Dynamics: The time evolution of a quantum system is governed by the Schrödinger equation, where the Hamiltonian operator represents the system’s total energy.
Measurement: Measuring an observable projects the system into an eigenstate of the corresponding operator, yielding one of its eigenvalues as the measurement result.
Probability Interpretation: The probability of obtaining a specific measurement outcome is given by the squared magnitude of the projection of the state vector onto the relevant eigenstate (Born rule).

Special Relativity (SR) is based on two axioms:

The principle of relativity: The laws of physics are the same in all inertial reference frames.
The constancy of the speed of light: The speed of light in a vacuum is constant and is independent of the motion of the light source or observer.

General Relativity (GR) and the Standard Model are based on the following key principles, respectively:

Dimensionality of Spacetime: The universe exhibits three spatial dimensions and one temporal dimension, forming a 3+1-dimensional spacetime continuum. This spacetime exhibits the symmetries of the Poincaré group, which includes translations, rotations, and Lorentz boosts [5].
Einstein Field Equations: The relationship between the geometry of spacetime and the energy-momentum content within it is given by the Einstein field equations [6,7]:

$\begin{matrix} G_{μ ν} + Λ g_{μ ν} = κ T_{μ ν} \end{matrix}$

(4)
Gauge Symmetry of the Standard Model: The fundamental interactions (excluding gravity) are described by the gauge symmetry group SU(3)×SU(2)×U(1) [8,9].

Finally, we consider the following hypothesis:

Quantum Gravity: One approach to quantum gravity (amongst many) involves linearizing the Einstein Field Equations around flat Minkowski spacetime, expressing the metric as $g_{μ ν} = η_{μ ν} + h_{μ ν}$ . The perturbation $h_{μ ν}$ is promoted to a quantum operator ${\hat{h}}_{μ ν}$ , allowing quantum field theory techniques to be applied to gravity [10]. A recent development, the double copy mechanism, reveals a deep connection between gauge theories and gravity by suggesting that gravitational amplitudes can be obtained from gauge theory amplitudes, effectively relating gravity to the "square" of gauge theories [11].

Although there are of course other principles, these principles provide essential context for our discussion and are integral to the development of our theoretical framework.

Given this contrast between statistical mechanics and other theories, it becomes desirable to seek a similar unifying framework across physics. Despite the effectiveness of the numerous axioms underlying quantum mechanics, general relativity, the Standard Model, and other foundational principles, the diversity of these axioms points to the potential for a more unified approach. Inspired by the methodology of statistical mechanics—where fundamental laws emerge from observable constraints—we aim to identify a universal constraint that, when utilized in an optimization problem, leads to a theory encompassing these fundamental axioms. Such an approach could provide a more coherent and parsimonious foundation, connecting diverse phenomena through a common principle and potentially offering deeper insights into the underlying structure of the universe.

Specifically, we report the discovery of the following constraint, which we call the Universal Measurement Constraint, that makes this unification possible:

Axiom 1 (Universal Measurement Constraint).

\begin{matrix} \bar{M} = \sum_{i} ρ_{i} M_{i}, \end{matrix}

(5)

where $M_{i}$ are $n \times n$ matrices, and $\bar{M}$ is their average.

We then construct the following optimization problem, aiming to maximize the relative Shannon entropy [12,13] between the initial preparation probabilities $p_{i}$ and the final measurement probabilities $ρ_{i}$:

Definition 1

(Universal Measurement Optimization Problem).

\begin{matrix} L = - \sum_{i} ρ_{i} ln \frac{ρ_{i}}{p_{i}} + λ (1 - \sum_{i} ρ_{i}) + τ tr (\bar{M} - \sum_{i} ρ_{i} M_{i}), \end{matrix}

(6)

where $λ$ and $τ$ are Lagrange multipliers enforcing the normalization and universal measurement constraints, respectively. The inclusion of the trace operator (tr) is made by ansatz to ensure the correct recovery of quantum mechanical probabilities, as detailed in Section 2.1.

Axiom 1 and Definition 1 constitute the sum total of our proposal. In the rest of this manuscript, we intend to demonstrate that Axiom 1 and Definition 1 are sufficient to uniquely recover Statistical Mechanics (SM), the five axioms of Quantum Mechanics (QM), the two axioms of Special Relativity (SR), the Einstein Field Equations (EFE), the gauge symmetry of the Standard Model, the dimensionality of spacetime, and quantum gravity.

Let us begin by solving the optimization problem:

Theorem 1.

The solution to the entropy maximization problem is:

\begin{matrix} ρ_{i} = \frac{p_{i} det exp (- τ M_{i})}{\sum_{j} p_{j} det exp (- τ M_{j})} . \end{matrix}

(7)

Proof.

We solve the maximization problem by setting the derivative of the Lagrangian with respect to $ρ_{i}$ to zero:

\begin{matrix} \frac{\partial L}{\partial ρ_{i}} & = - ln \frac{ρ_{i}}{p_{i}} - 1 - λ - τ tr M_{i} = 0 . \end{matrix}

(8)

\begin{matrix} \Rightarrow ln \frac{ρ_{i}}{p_{i}} & = - 1 - λ - τ tr M_{i} . \end{matrix}

(9)

\begin{matrix} \Rightarrow ρ_{i} & = p_{i} exp (- 1 - λ) exp (- τ tr M_{i}) . \end{matrix}

(10)

Normalizing the probabilities using $\sum_{i} ρ_{i} = 1$, we find:

\begin{matrix} 1 & = \sum_{i} ρ_{i} = exp (- 1 - λ) \sum_{i} p_{i} exp (- τ tr M_{i}), \end{matrix}

(11)

\begin{matrix} \Rightarrow exp (1 + λ) & = \sum_{j} p_{j} exp (- τ tr M_{j}) . \end{matrix}

(12)

Substituting back, we obtain:

\begin{matrix} ρ_{i} & = \frac{p_{i} exp (- τ tr M_{i})}{\sum_{j} p_{j} exp (- τ tr M_{j})} . \end{matrix}

(13)

Finally, using the identity $det exp (M) = exp tr M$ for square matrices $M$, we get:

\begin{matrix} ρ_{i} = \frac{p_{i} det exp (- τ M_{i})}{\sum_{j} p_{j} det exp (- τ M_{j})} . \end{matrix}

(14)

□
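The last step of the proof can be checked numerically. The sketch below, restricted to 2×2 matrices for brevity, evaluates the measure once through the determinant form of Equation 14 and once through the trace form of Equation 13 and compares them. The priors, the matrices, the value of τ, and the helper names are arbitrary illustrative choices; the matrix exponential is a plain truncated Taylor series, adequate only for small matrices of moderate norm.

```python
import math

def mat_mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def mat_exp(m, terms=40):
    """Truncated Taylor series exp(m) = sum_k m^k / k! for a 2x2 matrix."""
    result = [[1.0, 0.0], [0.0, 1.0]]
    power = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(1, terms):
        power = mat_mul(power, m)
        power = [[v / k for v in row] for row in power]  # now holds m^k / k!
        result = [[result[i][j] + power[i][j] for j in range(2)] for i in range(2)]
    return result

def det2(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def measure(priors, matrices, tau):
    """rho_i = p_i det exp(-tau M_i) / sum_j p_j det exp(-tau M_j)."""
    weights = []
    for p, m in zip(priors, matrices):
        scaled = [[-tau * v for v in row] for row in m]
        weights.append(p * det2(mat_exp(scaled)))
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical priors, matrices, and multiplier.
priors = [0.5, 0.3, 0.2]
matrices = [
    [[0.2, -1.0], [1.0, 0.1]],
    [[0.5, 0.3], [0.0, -0.1]],
    [[-0.4, 2.0], [0.7, 0.9]],
]
tau = 0.8

rho = measure(priors, matrices, tau)

# Trace form: rho_i proportional to p_i exp(-tau tr M_i).
raw = [p * math.exp(-tau * (m[0][0] + m[1][1])) for p, m in zip(priors, matrices)]
raw_total = sum(raw)
trace_form = [w / raw_total for w in raw]
```

Both lists agree to floating-point accuracy, which is the content of the identity det exp(M) = exp tr M used in the final step.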

This solution encapsulates fundamental physics as follows:

Statistical Mechanics:

To recover statistical mechanics from Equation 14, we consider the case where the matrices

M_{i}

are

1 \times 1

, i.e., scalars. Specifically, we set:

\begin{matrix} \bar{M} = \sum_{i} ρ_{i} M_{i}, with M_{i} = [\begin{matrix} E_{i} \end{matrix}], \end{matrix}

(15)

and take

p_{i} \to 1

. Multiplying by the Boltzmann constant

k_{B}

, Equation 14 reduces to the Gibbs distribution:

\begin{matrix} ρ_{i} = \frac{1}{Z} exp (- τ E_{i}), \end{matrix}

(16)

where

τ

corresponds to

β

in traditional statistical mechanics. This demonstrates that statistical mechanics is the scalar case of our general solution.

Quantum Mechanics:

By choosing

M_{i}

to generate the U(1) group, we derive the fundamental equations of quantum mechanics from entropy maximization. Specifically, we set:

\begin{matrix} \bar{M} = \sum_{i} ρ_{i} M_{i}, with M_{i} = [\begin{matrix} 0 & - E_{i} \\ E_{i} & 0 \end{matrix}], \end{matrix}

(17)

where

E_{i}

are energy levels. In the results section, we will detail how this choice leads to a probability measure that includes a unitarily invariant ensemble and the Born rule, satisfying all five axioms of QM.

Fundamental Physics:

Extending our approach, we choose

M_{i}

to be

4 \times 4

matrices representing the generators of the Spin^c(3,1) group. Specifically, we consider multivectors of the form

u = f + b

, where

f

is a bivector and

b

is a pseudoscalar of the 3+1D geometric algebra

GA (3, 1)

. The matrix representation of

M_{i}

is:

\begin{matrix} M_{i} = [\begin{matrix} f_{02} & b - f_{13} & - f_{01} + f_{12} & f_{03} + f_{23} \\ - b + f_{13} & f_{02} & f_{03} + f_{23} & f_{01} - f_{12} \\ - f_{01} - f_{12} & f_{03} - f_{23} & - f_{02} & - b - f_{13} \\ f_{03} - f_{23} & f_{01} + f_{12} & b + f_{13} & - f_{02} \end{matrix}], \end{matrix}

(18)

where

f_{01}, f_{02}, f_{03}, f_{12}, f_{13}, f_{23}

, and b correspond to the generators of the Spin^c(3,1) group, which includes both Lorentz transformations and U(1) phase rotations. This choice leads to a relativistic quantum probability measure:

\begin{matrix} ρ_{i} = \frac{p_{i} det exp (- \frac{1}{2} ζ M_{i})}{\sum_{j} p_{j} det exp (- \frac{1}{2} ζ M_{j})}, \end{matrix}

(19)

where

ζ

emerges as a parameter generating boosts, rotations, and phase transformations.

In the results section, we show that the resulting Dirac current is automatically invariant under the gauge symmetries of the Standard Model, specifically SU(3) × SU(2) × U(1). Furthermore, we show that the metric tensor of general relativity emerges via a double-copy mechanism applied to the Dirac current.

Dimensional Obstructions:

Our general solution yields valid probability measures only in specific cases. Beyond the instances of statistical mechanics (the scalar case) and quantum mechanics (the U(1) generating case), the entropy maximization technique yields a consistent solution only in 2 dimensions (the Spin(2) generating case) and

3 + 1

dimensions (the Spin^c(3,1) generating case).

In other configurations, various obstructions arise—such as the absence of a real matrix algebra isomorphism or the emergence of negative probabilities—thereby violating the axioms of probability theory. The following table summarizes the cases and their obstructions:

\begin{matrix} Dimensions & Obstruction & \\ GA (0) & Unobstructed \Rightarrow statistical mechanics & (20) \\ GA (0, 1) & Unobstructed \Rightarrow quantum mechanics & (21) \\ GA (1, 0) & Negative probabilities & (22) \\ GA (2, 0) & Unobstructed \Rightarrow Spin (2) & (23) \\ GA (1, 1) & Negative probabilities & (24) \\ GA (0, 2) & Not isomorphic to a real matrix algebra & (25) \\ GA (3, 0) & Not isomorphic to a real matrix algebra & (26) \\ GA (2, 1) & Not isomorphic to a real matrix algebra & (27) \\ GA (1, 2) & Not isomorphic to a real matrix algebra & (28) \\ GA (0, 3) & Not isomorphic to a real matrix algebra & (29) \\ GA (4, 0) & Not isomorphic to a real matrix algebra & (30) \\ GA (3, 1) & Unobstructed \Rightarrow quantum gravity \land SU (3) \times SU (2) \times U (1) & (31) \\ GA (2, 2) & Negative probabilities & (32) \\ GA (1, 3) & Not isomorphic to a real matrix algebra & (33) \\ GA (0, 4) & Not isomorphic to a real matrix algebra & (34) \\ GA (5, 0) & Not isomorphic to a real matrix algebra & (35) \\ ⋮ & ⋮ & \\ GA (6, 0) & No probability measure as a self-product & (36) \\ ⋮ & ⋮ & \\ \infty & & (37) \end{matrix}

We will first investigate the unobstructed cases and then demonstrate these obstructions in Section 2.4. These obstructions are desirable because they allow the theory to automatically limit itself to 3+1D, thus providing a built-in mechanism for the observed dimensionality of our universe.

2. Results

2.1. Quantum Mechanics

In statistical mechanics (SM), the central observation is that energy measurements of a thermally equilibrated system tend to cluster around a fixed average value. In contrast, quantum mechanics (QM) is characterized by the presence of interference effects in measurement outcomes. To capture these features within an entropy maximization framework, we introduce the following special case of the linear constraint:

Definition 2 (U(1) Generating Constraint).

We restrict the universal measurement constraint to the generator of the U(1) group. Specifically, we set:

\begin{matrix} \bar{M} = \sum_{i} ρ_{i} M_{i} with M_{i} = [\begin{matrix} 0 & - E_{i} \\ E_{i} & 0 \end{matrix}] \end{matrix}

(38)

Here,

E_{i}

are scalar values (e.g., energy levels),

ρ_{i}

are the probabilities of outcomes, and the matrix generates the U(1) group.

The general solution of the maximization problem

\begin{matrix} ρ_{i} = \frac{1}{\sum_{j} p_{j} det exp (- τ M_{j})} det exp (- τ M_{i}) p_{i} \end{matrix}

(39)

likewise reduces as follows:

\begin{matrix} ρ_{i} = \frac{1}{\sum_{j} p_{j} det exp (- τ [\begin{matrix} 0 & - E_{j} \\ E_{j} & 0 \end{matrix}])} det exp (- τ [\begin{matrix} 0 & - E_{i} \\ E_{i} & 0 \end{matrix}]) p_{i} \end{matrix}

(40)

Though initially unfamiliar, this form effectively establishes a comprehensive formulation of quantum mechanics, as we will demonstrate.

To align our results with conventional quantum mechanical notation, we translate the matrices to complex numbers. Specifically, we consider that:

\begin{matrix} [\begin{matrix} a & - b \\ b & a \end{matrix}] \leftrightarrow a + i b . \end{matrix}

(41)

Then, we note the following equivalence with the complex norm, where $∥ z ∥ = z^{*} z$ denotes the squared modulus of $z$:

\begin{matrix} det exp [\begin{matrix} a & - b \\ b & a \end{matrix}] & = r^{2} det [\begin{matrix} cos (b) & - sin (b) \\ sin (b) & cos (b) \end{matrix}], where r = exp a \end{matrix}

(42)

\begin{matrix} = r^{2} ({cos}^{2} (b) + {sin}^{2} (b)) \end{matrix}

(43)

\begin{matrix} = ∥ r (cos (b) + i sin (b)) ∥ \end{matrix}

(44)

\begin{matrix} = ∥ r exp (i b) ∥ \end{matrix}

(45)
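The correspondence in Equations 41 to 45 can be checked numerically. The sketch below maps a complex number to its 2×2 real matrix, confirms that matrix products mirror complex products, and compares det exp of the matrix with the squared modulus of the complex exponential. The sample values and helper names are assumptions made for illustration, and the matrix exponential is a truncated Taylor series.

```python
import cmath
import math

def mat_mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def mat_exp(m, terms=40):
    """Truncated Taylor series for the exponential of a 2x2 matrix."""
    result = [[1.0, 0.0], [0.0, 1.0]]
    power = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(1, terms):
        power = mat_mul(power, m)
        power = [[v / k for v in row] for row in power]  # m^k / k!
        result = [[result[i][j] + power[i][j] for j in range(2)] for i in range(2)]
    return result

def det2(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def to_matrix(z):
    """Map a + ib to the real matrix [[a, -b], [b, a]] (Equation 41)."""
    return [[z.real, -z.imag], [z.imag, z.real]]

# The map respects multiplication: matrix(z1) matrix(z2) = matrix(z1 z2).
z1, z2 = complex(0.3, -1.2), complex(-0.7, 0.4)
lhs_product = mat_mul(to_matrix(z1), to_matrix(z2))
rhs_product = to_matrix(z1 * z2)
product_matches = all(
    abs(lhs_product[i][j] - rhs_product[i][j]) < 1e-12
    for i in range(2) for j in range(2)
)

# det exp of the matrix equals the squared modulus of exp(a + ib), i.e. exp(2a).
a, b = 0.25, 1.1
det_of_exp = det2(mat_exp(to_matrix(complex(a, b))))
squared_modulus = abs(cmath.exp(complex(a, b))) ** 2
```

Note that the result depends only on the real part a, since the rotation factor has unit determinant.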

Finally, substituting

τ = t / ℏ

analogously to

β = 1 / (k_{B} T)

, and applying the complex-norm representation to both the numerator and the denominator, consolidates the Born rule, normalization, and initial preparation into:

\begin{matrix} ρ_{i} & = \underset{Unitarily Invariant Partition Function}{\underset{︸}{\frac{1}{\sum_{j} p_{j} ∥ exp (- i t E_{j} / ℏ) ∥}}} \underset{Born Rule}{\underset{︸}{∥ exp (- i t E_{i} / ℏ) ∥}} \underset{Initial Preparation}{\underset{︸}{p_{i}}} \end{matrix}

(46)

The wavefunction emerges by decomposing the complex norm into a complex number and its conjugate. It is then visualized as a vector within a complex n-dimensional Hilbert space. The partition function acts as the inner product. This relationship is articulated as follows:

\begin{matrix} \sum_{i} p_{i} ∥ exp (- i t E_{i} / ℏ) ∥ = Z = 〈 ψ | ψ 〉 \end{matrix}

(47)

where

We clarify that

p_{i}

represents the probability associated with the initial preparation of the wavefunction, where

p_{i} = 〈 ψ_{i} (0) | ψ_{i} (0) 〉

.

We also note that Z is invariant under unitary transformations.

Let us now investigate how the axioms of quantum mechanics are recovered from this result:

The entropy maximization procedure inherently normalizes the vectors $| ψ 〉$ with $1 / \sqrt{Z} = 1 / \sqrt{〈 ψ | ψ 〉}$ . This normalization links $| ψ 〉$ to a unit vector in Hilbert space. Furthermore, since physical states are associated with the probability measure, and the probability is defined only up to a phase, we conclude that physical states map to rays within Hilbert space. This demonstrates a.
In Z, an observable must satisfy:

$\begin{matrix} \bar{O} = \sum_{i} p_{i} O_{i} ∥ exp (- i t E_{i} / ℏ) ∥ \end{matrix}$

(49)

Since $Z = 〈 ψ | ψ 〉$ , any self-adjoint operator satisfying the condition $〈 O ψ | ϕ 〉 = 〈 ψ | O ϕ 〉$ will satisfy the above equation, simply because $〈 O 〉 = 〈 ψ | O | ψ 〉$ . This demonstrates b.
Upon transforming Equation 48 out of its eigenbasis through unitary operations, we find that the energy, $E_{i}$ , typically transforms in the manner of a Hamiltonian operator:

$\begin{matrix} | ψ (t) 〉 = exp (- i t H / ℏ) | ψ (0) 〉 \end{matrix}$

(50)

The system’s dynamics emerge from differentiating the solution with respect to the Lagrange multiplier. This is manifested as:

$\begin{matrix} \frac{\partial}{\partial t} | ψ (t) 〉 & = \frac{\partial}{\partial t} (exp (- i t H / ℏ) | ψ (0) 〉) \end{matrix}$

(51)

$\begin{matrix} = - i H / ℏ exp (- i t H / ℏ) | ψ (0) 〉 \end{matrix}$

(52)

$\begin{matrix} = - i H / ℏ | ψ (t) 〉 \end{matrix}$

(53)

$\begin{matrix} \Rightarrow H | ψ (t) 〉 & = i ℏ \frac{\partial}{\partial t} | ψ (t) 〉 \end{matrix}$

(54)

which is the Schrödinger equation. This demonstrates c.
From Equation 48 it follows that the possible microstates $E_{i}$ of the system correspond to specific eigenvalues of $H$ . An observation can thus be conceptualized as sampling from $ρ$ , with the measured state being the occupied microstate i. Consequently, when a measurement occurs, the system invariably emerges in one of these microstates, which directly corresponds to an eigenstate of $H$ . Measured in the eigenbasis, the probability measure is:

$\begin{matrix} ρ_{i} (t) = \frac{1}{〈 ψ | ψ 〉} {(ψ_{i} (t))}^{†} ψ_{i} (t) . \end{matrix}$

(55)

In scenarios where the probability measure $ρ_{i} (t)$ is expressed in a basis other than its eigenbasis, the probability $P (λ_{i})$ of obtaining the eigenvalue $λ_{i}$ is given as a projection onto an eigenstate:

$\begin{matrix} P (λ_{i}) = {| 〈 λ_{i} | ψ 〉 |}^{2} \end{matrix}$

(56)

Here, $| 〈 λ_{i} | ψ 〉 |^{2}$ signifies the squared magnitude of the amplitude of the state $| ψ 〉$ when projected onto the eigenstate $| λ_{i} 〉$ . As this argument holds for any observable, this demonstrates d.
Finally, since the probability measure (Equation 46) replicates the Born rule, e is also demonstrated.
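The dynamics step in the list above (Equations 50 to 54) can be illustrated in the energy eigenbasis, where the Hamiltonian acts on each component by multiplication with its energy. The sketch below checks by central finite differences that each component of exp(-itE/ℏ)ψ(0) satisfies the Schrödinger equation, and that the norm is preserved. Natural units (ℏ = 1), the energies, and the initial amplitudes are assumptions chosen for illustration.

```python
import cmath

HBAR = 1.0  # natural units, an assumption of this sketch

def psi(t, energy, psi0):
    """Single eigenbasis component: exp(-i t E / hbar) psi(0)."""
    return cmath.exp(-1j * t * energy / HBAR) * psi0

def schrodinger_sides(t, energy, psi0, dt=1e-6):
    """Return (H psi, i hbar d/dt psi), with the derivative by central difference."""
    dpsi = (psi(t + dt, energy, psi0) - psi(t - dt, energy, psi0)) / (2 * dt)
    return energy * psi(t, energy, psi0), 1j * HBAR * dpsi

energies = [0.5, 1.3]          # hypothetical eigenvalues of H
initial = [0.6 + 0.0j, 0.8j]   # hypothetical initial amplitudes, unit norm
t = 2.0

# Largest mismatch between the two sides of the Schrödinger equation.
residuals = []
for e, p0 in zip(energies, initial):
    h_psi, i_hbar_dpsi = schrodinger_sides(t, e, p0)
    residuals.append(abs(h_psi - i_hbar_dpsi))

norm_initial = sum(abs(p0) ** 2 for p0 in initial)
norm_evolved = sum(abs(psi(t, e, p0)) ** 2 for e, p0 in zip(energies, initial))
```

The residuals are limited only by the finite-difference step, and the evolved norm equals the initial one because each component acquires a pure phase.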

Revisiting quantum mechanics with this perspective offers a coherent and unified narrative. Specifically, the U(1) generating constraint is sufficient to entail the foundations of quantum mechanics (Axiom 1, 2, 3, 4 and 5) through the principle of entropy maximization. The following Lagrange multiplier equation

\begin{matrix} L = - \sum_{i} ρ_{i} ln \frac{ρ_{i}}{p_{i}} + λ (1 - \sum_{i} ρ_{i}) + τ tr ([\begin{matrix} 0 & - \bar{E} \\ \bar{E} & 0 \end{matrix}] - \sum_{i} ρ_{i} [\begin{matrix} 0 & - E_{i} \\ E_{i} & 0 \end{matrix}]) \end{matrix}

(57)

becomes the formulation’s new singular foundation, and QM Axioms 1, 2, 3, 4, and 5 are now promoted to theorems.

2.2. RQM in 2D

In this section, we investigate a toy model in 2D, which provides a valuable starting point before addressing the more complex 3+1D case. In RQM 2D, the fundamental Lagrange multiplier equation is:

\begin{matrix} L = - \sum_{i} ρ_{i} ln \frac{ρ_{i}}{p_{i}} + λ (1 - \sum_{i} ρ_{i}) + θ tr (\bar{M} - \frac{1}{2} \sum_{i} ρ_{i} M_{i}) \end{matrix}

(58)

where

λ

and

θ

are the Lagrange multipliers, and where

M_{i}

is the

2 \times 2

matrix representation of the multivectors of

GA (2)

.

In general a multivector

u = a + x + b

of

GA (2)

, where a is a scalar,

x

is a vector and

b

a pseudo-scalar, is represented as follows:

\begin{matrix} [\begin{matrix} a + x & y - b \\ y + b & a - x \end{matrix}] ≅ a + x σ_{x} + y σ_{y} + b σ_{x} \land σ_{y} \end{matrix}

(59)

This holds for any

2 \times 2

matrix and any multivectors of

GA (2)

.

The basis elements are defined as:

\begin{matrix} σ_{x} = [\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}], σ_{y} = [\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}], σ_{x} \land σ_{y} = [\begin{matrix} 0 & - 1 \\ 1 & 0 \end{matrix}] \end{matrix}

(60)

To investigate this case in more detail, we introduce the multivector conjugate, also known as the Clifford conjugate, which generalizes the concept of complex conjugation to multivectors.

Definition 3

(Multivector conjugate). Let

u = a + x + b

be a multi-vector of the geometric algebra over the reals in two dimensions

GA (2)

. The multivector conjugate is defined as:

\begin{matrix} u^{‡} = a - x - b \end{matrix}

(61)

The determinant of the matrix representation of a multivector can be expressed as a self-product:

Theorem 2

(Determinant as a Multivector Self-Product).

\begin{matrix} u^{‡} u = det M \end{matrix}

(62)

Proof.

Let

u = a + x σ_{x} + y σ_{y} + b σ_{x} \land σ_{y}

, and let

M

be its matrix representation

[\begin{matrix} a + x & y - b \\ y + b & a - x \end{matrix}]

. Then:

\begin{matrix} 1 : & u^{‡} u \end{matrix}

(63)

\begin{matrix} = {(a + x σ_{x} + y σ_{y} + b σ_{x} \land σ_{y})}^{‡} (a + x σ_{x} + y σ_{y} + b σ_{x} \land σ_{y}) \end{matrix}

(64)

\begin{matrix} = (a - x σ_{x} - y σ_{y} - b σ_{x} \land σ_{y}) (a + x σ_{x} + y σ_{y} + b σ_{x} \land σ_{y}) \end{matrix}

(65)

\begin{matrix} = a^{2} - x^{2} - y^{2} + b^{2} \end{matrix}

(66)

\begin{matrix} 2 : & det M \end{matrix}

(67)

\begin{matrix} = det [\begin{matrix} a + x & y - b \\ y + b & a - x \end{matrix}] \end{matrix}

(68)

\begin{matrix} = (a + x) (a - x) - (y - b) (y + b) \end{matrix}

(69)

\begin{matrix} = a^{2} - x^{2} - y^{2} + b^{2} \end{matrix}

(70)

□
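Theorem 2 can also be verified numerically through the matrix representation of Equation 59. The sketch below builds the matrices of a multivector and of its conjugate, multiplies them, and compares the result with the determinant and with the closed form of Equations 66 and 70. The sample coefficients and the helper names are illustrative assumptions.

```python
def to_matrix(a, x, y, b):
    """2x2 real matrix of u = a + x*sx + y*sy + b*(sx^sy), as in Equation 59."""
    return [[a + x, y - b], [y + b, a - x]]

def conjugate(a, x, y, b):
    """Multivector (Clifford) conjugate: flip the vector and pseudoscalar parts."""
    return (a, -x, -y, -b)

def mat_mul(m, n):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(m[i][k] * n[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def det2(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

u = (0.7, -1.2, 0.4, 2.5)  # hypothetical coefficients (a, x, y, b)

# Matrix of the conjugate times matrix of u: a scalar multiple of the identity.
product = mat_mul(to_matrix(*conjugate(*u)), to_matrix(*u))

a, x, y, b = u
closed_form = a * a - x * x - y * y + b * b
determinant = det2(to_matrix(*u))
```

The off-diagonal entries of `product` vanish and both diagonal entries equal the determinant, so the self-product is a pure scalar.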

Building upon the concept of the multivector conjugate, we introduce the multivector conjugate transpose, which serves as an extension of the Hermitian conjugate to the domain of multivectors.

Definition 4

(Multivector Conjugate Transpose). Let

|V〉 〉 \in {(GA (2))}^{n}

:

\begin{matrix} |V〉 〉 = [\begin{matrix} a_{1} + x_{1} + b_{1} \\ ⋮ \\ a_{n} + x_{n} + b_{n} \end{matrix}] \end{matrix}

(71)

The multivector conjugate transpose of

|V〉 〉

is defined as first taking the transpose and then the element-wise multivector conjugate:

\begin{matrix} 〈 〈V| = [\begin{matrix} a_{1} - x_{1} - b_{1} & \dots & a_{n} - x_{n} - b_{n} \end{matrix}] \end{matrix}

(72)

Definition 5

(Bilinear Form). Let

|V〉 〉

and

|W〉 〉

be two vectors valued in

GA (2)

. We introduce the following bilinear form:

\begin{matrix} 〈 〈V | W〉 〉 = (a_{1} - x_{1} - b_{1}) (a_{1} + x_{1} + b_{1}) + \dots + (a_{n} - x_{n} - b_{n}) (a_{n} + x_{n} + b_{n}) \end{matrix}

(73)

Theorem 3

(Inner Product). Restricted to the even sub-algebra of

GA (2)

, the bilinear form is an inner product.

Proof.

\begin{matrix} {〈 〈V | W〉 〉}_{x \to 0} & = (a_{1} - b_{1}) (a_{1} + b_{1}) + \dots + (a_{n} - b_{n}) (a_{n} + b_{n}) \end{matrix}

(74)

This is isomorphic to the inner product of a complex Hilbert space, with the identification

i ≅ σ_{x} \land σ_{y}

. □

Let us now solve the optimization problem for the even multivectors of

GA (2, 0)

, whose inner product is positive-definite.

We take

a \to 0, x \to 0

then

M

reduces as follows:

\begin{matrix} {u = a + x + b |}_{a \to 0, x \to 0} = b \Rightarrow M = [\begin{matrix} 0 & - b \\ b & 0 \end{matrix}] \end{matrix}

(75)

The Lagrange multiplier equation can be solved as follows:

\begin{matrix} 0 & = \frac{\partial L [ρ_{1}, \dots, ρ_{n}]}{\partial ρ_{i}} \end{matrix}

(76)

\begin{matrix} = - ln \frac{ρ_{i}}{p_{i}} - 1 - λ - θ tr \frac{1}{2} [\begin{matrix} 0 & - b_{i} \\ b_{i} & 0 \end{matrix}] \end{matrix}

(77)

\begin{matrix} = ln \frac{ρ_{i}}{p_{i}} + 1 + λ + θ tr \frac{1}{2} [\begin{matrix} 0 & - b_{i} \\ b_{i} & 0 \end{matrix}] \end{matrix}

(78)

\begin{matrix} \Rightarrow ln \frac{ρ_{i}}{p_{i}} & = - 1 - λ - θ tr \frac{1}{2} [\begin{matrix} 0 & - b_{i} \\ b_{i} & 0 \end{matrix}] \end{matrix}

(79)

\begin{matrix} \Rightarrow ρ_{i} & = p_{i} exp (- 1 - λ) exp (- θ tr \frac{1}{2} [\begin{matrix} 0 & - b_{i} \\ b_{i} & 0 \end{matrix}]) \end{matrix}

(80)

\begin{matrix} = \frac{1}{Z (θ)} p_{i} exp (- θ tr \frac{1}{2} [\begin{matrix} 0 & - b_{i} \\ b_{i} & 0 \end{matrix}]) \end{matrix}

(81)

The partition function $Z (θ)$, serving as a normalization constant, is determined as follows:

\begin{matrix} 1 & = \sum_{i} p_{i} exp (- 1 - λ) exp (- θ tr \frac{1}{2} [\begin{matrix} 0 & - b_{i} \\ b_{i} & 0 \end{matrix}]) \end{matrix}

(82)

\begin{matrix} \Rightarrow {(exp (- 1 - λ))}^{- 1} & = \sum_{i} p_{i} exp (- θ tr \frac{1}{2} [\begin{matrix} 0 & - b_{i} \\ b_{i} & 0 \end{matrix}]) \end{matrix}

(83)

\begin{matrix} Z (θ) & : = \sum_{i} p_{i} exp (- θ tr \frac{1}{2} [\begin{matrix} 0 & - b_{i} \\ b_{i} & 0 \end{matrix}]) \end{matrix}

(84)

Consequently, the least biased probability measure that connects an initial preparation

p_{i}

to a final measurement

ρ_{i}

, under the 2D linear constraint, is:

\begin{matrix} ρ_{i} = \underset{Spin (2) Invariant Ensemble}{\underset{︸}{\frac{1}{\sum_{j} p_{j} det exp (- \frac{1}{2} θ [\begin{matrix} 0 & - b_{j} \\ b_{j} & 0 \end{matrix}])}}} \underset{Spin (2) Born Rule}{\underset{︸}{det exp (- \frac{1}{2} θ [\begin{matrix} 0 & - b_{i} \\ b_{i} & 0 \end{matrix}])}} \underset{Initial Preparation}{\underset{︸}{p_{i}}} \end{matrix}

(85)

In 2D, the Lagrange multiplier

θ

corresponds to an angle of rotation. For comparison, in 1+1D it would correspond to the rapidity

ζ

:

\begin{matrix} 2 D : & exp θ [\begin{matrix} 0 & - 1 \\ 1 & 0 \end{matrix}] = [\begin{matrix} cos θ & - sin θ \\ sin θ & cos θ \end{matrix}] & θ is the angle of rotation \end{matrix}

(86)

\begin{matrix} 1 + 1 D : & exp ζ [\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}] = [\begin{matrix} cosh ζ & sinh ζ \\ sinh ζ & cosh ζ \end{matrix}] & ζ is the rapidity \end{matrix}

(87)
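The two closed forms in Equations 86 and 87 can be confirmed by exponentiating the generators directly. The sketch below uses a truncated Taylor series for the 2×2 matrix exponential and compares it with the trigonometric and hyperbolic expressions. The sample angle, the sample rapidity, and the helper names are illustrative assumptions.

```python
import math

def mat_mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def mat_exp(m, terms=40):
    """Truncated Taylor series for the exponential of a 2x2 matrix."""
    result = [[1.0, 0.0], [0.0, 1.0]]
    power = [[1.0, 0.0], [0.0, 1.0]]
    for k in range(1, terms):
        power = mat_mul(power, m)
        power = [[v / k for v in row] for row in power]  # m^k / k!
        result = [[result[i][j] + power[i][j] for j in range(2)] for i in range(2)]
    return result

def max_abs_diff(a, b):
    """Largest entrywise difference between two 2x2 matrices."""
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))

theta = 0.7  # hypothetical rotation angle
zeta = 0.4   # hypothetical rapidity

# 2D: the antisymmetric generator exponentiates to a rotation.
rotation = mat_exp([[0.0, -theta], [theta, 0.0]])
rotation_expected = [[math.cos(theta), -math.sin(theta)],
                     [math.sin(theta), math.cos(theta)]]

# 1+1D: the symmetric generator exponentiates to a boost.
boost = mat_exp([[0.0, zeta], [zeta, 0.0]])
boost_expected = [[math.cosh(zeta), math.sinh(zeta)],
                  [math.sinh(zeta), math.cosh(zeta)]]

rotation_error = max_abs_diff(rotation, rotation_expected)
boost_error = max_abs_diff(boost, boost_expected)
```

The only difference between the two cases is the sign of the square of the generator: it squares to minus the identity in 2D and to plus the identity in 1+1D.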

Definition 6 (Spin(2)-valued Wavefunction).

\begin{matrix} |ψ〉 〉 = [\begin{matrix} e^{\frac{1}{2} (a_{1} + b_{1})} \\ ⋮ \\ e^{\frac{1}{2} (a_{n} + b_{n})} \end{matrix}] = [\begin{matrix} \sqrt{ρ_{1}} R_{1} \\ ⋮ \\ \sqrt{ρ_{n}} R_{n} \end{matrix}] \end{matrix}

(88)

where

\sqrt{ρ_{i}} = e^{\frac{1}{2} a_{i}}

representing the square root of the probability and

R_{i} = e^{\frac{1}{2} b_{i}}

representing a rotor in 2D (or boost in 1+1D).

The partition function of the probability measure can be expressed using the bilinear form applied to the Spin(2)-valued Wavefunction:

Theorem 4

(Partition Function).

Z = 〈 〈ψ | ψ〉 〉

Proof.

\begin{matrix} 〈 〈ψ | ψ〉 〉 = \sum_{i} ψ_{i}^{‡} ψ_{i} = \sum_{i} ρ_{i} R_{i}^{‡} R_{i} = \sum_{i} ρ_{i} = Z \end{matrix}

(89)

□

Definition 7 (Spin(2)-valued Evolution Operator).

\begin{matrix} T = [\begin{matrix} e^{- \frac{1}{2} θ b_{1}} \\ ⋱ \\ e^{- \frac{1}{2} θ b_{n}} \end{matrix}] \end{matrix}

(90)

Theorem 5.

The partition function is invariant with respect to the Spin(2)-valued evolution operator.

Proof.

We note that:

\begin{matrix} 〈 〈T v | T v〉 〉 = 〈 〈v | v〉 〉 = v^{‡} T^{‡} T v \Rightarrow T^{‡} T = I \end{matrix}

(91)

then, since

[\begin{matrix} e^{\frac{1}{2} θ b_{1}} \\ ⋱ \\ e^{\frac{1}{2} θ b_{n}} \end{matrix}] [\begin{matrix} e^{- \frac{1}{2} θ b_{1}} \\ ⋱ \\ e^{- \frac{1}{2} θ b_{n}} \end{matrix}] = I

, the relation

T^{‡} T = I

is satisfied. □
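Theorem 5 can be illustrated with ordinary complex numbers, using the identification of the unit bivector with the imaginary unit stated in the proof of Theorem 3. Under that assumption the multivector conjugate of an even element becomes complex conjugation, and each diagonal entry of the evolution operator becomes a unit-modulus phase. The amplitudes, bivector coefficients, and angle below are illustrative values only.

```python
import cmath

def partition(psi):
    """Bilinear form <<psi|psi>> = sum_i conj(psi_i) * psi_i on the even sub-algebra."""
    return sum((z.conjugate() * z).real for z in psi)

def evolve(psi, b, theta):
    """Apply the diagonal operator T with entries exp(-theta * b_i / 2), bivector -> 1j."""
    return [cmath.exp(-0.5j * theta * bi) * z for z, bi in zip(psi, b)]

psi = [0.3 + 0.4j, -0.5 + 0.1j, 0.2 - 0.6j]  # hypothetical even-multivector amplitudes
b = [1.0, -2.5, 0.75]                        # hypothetical real bivector coefficients
theta = 1.3                                  # hypothetical rotation angle

z_before = partition(psi)
z_after = partition(evolve(psi, b, theta))
```

The two values coincide because each entry of T times its conjugate equals one, which is the relation T‡T = I used in the proof.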

We note that the even sub-algebra of

GA (2)

, being closed under addition and multiplication and constituting an inner product through its bilinear form, allows for the construction of a Hilbert space. In this context, the Hilbert space is Spin(2)-valued. The primary distinction between a wavefunction in a complex Hilbert space and one in a Spin(2)-valued Hilbert space lies in the subject matter of the theory. Specifically, in the latter, the construction governs the change in orientation experienced by an observer, which in turn dictates the measurement basis used in the experiment, consistently with the rotational symmetry and freedom of the system.

The dynamics of observer orientation transformations are described by a variant of the Schrödinger equation, which is derived by taking the derivative of the wavefunction with respect to the Lagrange multiplier,

θ

:

Definition 8 (Spin(2)-valued Schrödinger Equation).

\begin{matrix} \frac{d}{d θ} [\begin{matrix} ψ_{1} (θ) \\ ⋮ \\ ψ_{n} (θ) \end{matrix}] = [\begin{matrix} - \frac{1}{2} b_{1} \\ ⋱ \\ - \frac{1}{2} b_{n} \end{matrix}] [\begin{matrix} ψ_{1} (θ) \\ ⋮ \\ ψ_{n} (θ) \end{matrix}] \end{matrix}

(92)

Here,

θ

represents a global one-parameter evolution parameter akin to time, which transforms the wavefunction under the Spin(2) group, locally across the states of the Hilbert space. This is a very general equation that captures all transformations that can be performed consistently with the symmetries of the wavefunction for the Spin(2) group.

Definition 9

(David Hestenes’ Formulation). In 3+1D, David Hestenes’ formulation [14] of the wavefunction is

ψ = \sqrt{ρ} R e^{i b / 2}

, where

R = e^{f / 2}

is a Lorentz boost or rotation and where

e^{i b / 2}

is a phase. In 2D, as the algebra only admits a bivector, his formulation would reduce to

ψ = \sqrt{ρ} R

, which is the form we have recovered.

The definition of the Dirac current applicable to our wavefunction follows the formulation of David Hestenes:

Definition 10

(Dirac Current). Given the basis

σ_{x}

and

σ_{y}

, the Dirac current for the 2D theory is defined as:

\begin{matrix} J_{x} \equiv ψ^{‡} σ_{x} ψ = ρ \underset{SO (2)}{\underset{︸}{R^{‡} σ_{x} R}} = ρ {\tilde{σ}}_{x} \end{matrix}

(93)

\begin{matrix} J_{y} \equiv ψ^{‡} σ_{y} ψ = ρ \underset{SO (2)}{\underset{︸}{R^{‡} σ_{y} R}} = ρ {\tilde{σ}}_{y} \end{matrix}

(94)

where

{\tilde{σ}}_{x}

and

{\tilde{σ}}_{y}

are SO(2)-rotated basis vectors.

2.2.1. 1+1D Obstruction

As stated in the introduction, of the dimensional cases, only 2D and 3+1D are free of obstructions. For instance, the 1+1D theory results in a split-complex quantum theory due to the bilinear form

(a - b e_{0} \land e_{1}) (a + b e_{0} \land e_{1})

, which yields negative probabilities:

a^{2} - b^{2} \in R

for certain wavefunction states, in contrast to the non-negative probabilities

a^{2} + b^{2} \in R^{\geq 0}

obtained in the Euclidean 2D case. This is why we had to use 2D instead of 1+1D in this two-dimensional introduction. In the following section, we will investigate the 3+1D case, then we will show why all other dimensional cases are obstructed.
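The contrast between the two bilinear forms can be made concrete with a short sketch. It only evaluates the two closed-form expressions quoted above, a² - b² for the 1+1D bivector (which squares to +1) and a² + b² for the 2D bivector (which squares to -1). The function names and sample amplitudes are illustrative assumptions.

```python
def split_complex_norm(a, b):
    """(a - b j)(a + b j) with j*j = +1, as for e0^e1 in 1+1D: a^2 - b^2."""
    return a * a - b * b

def complex_norm(a, b):
    """(a - b i)(a + b i) with i*i = -1, as for the 2D bivector: a^2 + b^2."""
    return a * a + b * b

samples = [(1.0, 0.5), (1.0, 1.0), (0.5, 2.0)]   # sample (a, b) amplitudes
for a, b in samples:
    print(a, b, split_complex_norm(a, b), complex_norm(a, b))
```

Whenever |b| > |a|, the 1+1D form is negative, whereas the 2D form is non-negative for every choice of a and b.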

2.3. RQM in 3+1D

In this section, we extend the concepts and techniques developed for multivector amplitudes in 2D to the more physically relevant case of 3+1 dimensions. The Lagrange multiplier equation is as follows:

\begin{matrix} L = - \sum_{i} ρ_{i} ln \frac{ρ_{i}}{p_{i}} + λ (1 - \sum_{i} ρ_{i}) + ζ tr (\bar{M} - \frac{1}{2} \sum_{i} ρ_{i} M_{i}) \end{matrix}

(95)

The solution (proof in Appendix B) is obtained using the same step-by-step process as the 2D case, and yields:

\begin{matrix} ρ_{i} = \underset{{Spin}^{c} (3, 1) Invariant Ensemble}{\underset{︸}{\frac{1}{\sum_{i} p_{i} det exp (- ζ \frac{1}{2} M_{i})}}} \underset{{Spin}^{c} (3, 1) Born Rule}{\underset{︸}{det exp (- ζ \frac{1}{2} M_{i})}} \underset{Initial Preparation}{\underset{︸}{p_{i}}} \end{matrix}

(96)

where

ζ

is a "twisted-phase" rapidity. (If the invariance group were Spin(3,1) instead of Spin^c(3,1), which is obtained by setting

b \to 0

, then it would simply be the rapidity.)
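To make the structure of Equation (96) concrete, the following sketch evaluates the weights for a small ensemble. It relies on the standard identity det exp(X) = exp(tr X), so that only the trace of each matrix M_i is needed. The trace values, the priors, and the function name are hypothetical inputs chosen for illustration.

```python
import math

def born_weights(traces, priors, zeta):
    """rho_i = p_i * det exp(-zeta * M_i / 2) / Z, using det exp(X) = exp(tr X)."""
    weights = [p * math.exp(-0.5 * zeta * t) for t, p in zip(traces, priors)]
    z = sum(weights)   # normalizing sum over the ensemble
    return [w / z for w in weights]

traces = [0.0, 1.0, 4.0]   # hypothetical values of tr(M_i)
priors = [0.5, 0.3, 0.2]   # hypothetical initial preparation p_i
rho = born_weights(traces, priors, zeta=0.7)
print(rho)
```

With ζ = 0 the weights reduce to the initial preparation p_i, which is the expected behavior of a relative-entropy solution when the constraint is switched off.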

2.3.1. Preliminaries

Our initial goal will be to express the partition function as a self-product of elements of the vector space. As such, we begin by defining a general multivector in the geometric algebra

GA (3, 1)

.

Definition 11

(Multivector). Let

u

be a multivector of

GA (3, 1)

. Its general form is:

\begin{matrix} u & = a \end{matrix}

(97)

\begin{matrix} + t γ_{0} + x γ_{1} + y γ_{2} + z γ_{3} \end{matrix}

(98)

\begin{matrix} + f_{01} γ_{0} \land γ_{1} + f_{02} γ_{0} \land γ_{2} + f_{03} γ_{0} \land γ_{3} + f_{12} γ_{1} \land γ_{2} + f_{13} γ_{1} \land γ_{3} + f_{23} γ_{2} \land γ_{3} \end{matrix}

(99)

\begin{matrix} + p γ_{1} \land γ_{2} \land γ_{3} + q γ_{0} \land γ_{2} \land γ_{3} + v γ_{0} \land γ_{1} \land γ_{3} + w γ_{0} \land γ_{1} \land γ_{2} \end{matrix}

(100)

\begin{matrix} + b γ_{0} \land γ_{1} \land γ_{2} \land γ_{3} \end{matrix}

(101)

where

γ_{0}, γ_{1}, γ_{2}, γ_{3}

are the basis vectors in the real Majorana representation.

A more compact notation for

u

is

\begin{matrix} u = a + x + f + v + b \end{matrix}

(102)

where a is a scalar,

x

a vector,

f

a bivector,

v

a pseudo-vector, and

b

a pseudo-scalar.

This general multivector can be represented by a

4 \times 4

real matrix using the real Majorana representation:

Definition 12

(Matrix Representation of

u

).

\begin{matrix} M = [\begin{matrix} a + f_{02} - q - z & b - f_{13} + w - x & - f_{01} + f_{12} - p + v & f_{03} + f_{23} + t + y \\ - b + f_{13} + w - x & a + f_{02} + q + z & f_{03} + f_{23} - t - y & f_{01} - f_{12} - p + v \\ - f_{01} - f_{12} + p + v & f_{03} - f_{23} + t - y & a - f_{02} + q - z & - b - f_{13} - w - x \\ f_{03} - f_{23} - t + y & f_{01} + f_{12} + p + v & b + f_{13} - w - x & a - f_{02} - q + z \end{matrix}] \end{matrix}

(103)

To manipulate and analyze multivectors in

GA (3, 1)

, we introduce several important operations, such as the multivector conjugate, the 3,4 blade conjugate, and the multivector self-product.

Definition 13 (Multivector Conjugate (in 4D)).

\begin{matrix} u^{‡} = a - x - f + v + b \end{matrix}

(104)

Definition 14

(3,4 Blade Conjugate). The 3,4 blade conjugate of

u

is

\begin{matrix} {⌊ u ⌋}_{3, 4} = a + x + f - v - b \end{matrix}

(105)

The results of Lundholm [15] demonstrate that the multivector norms in the following definition are the unique forms that carry the properties of the determinant, such as

N (u v) = N (u) N (v)

to the domain of multivectors:

Definition 15.

The self-products associated with low-dimensional geometric algebras are:

\begin{matrix} GA (0, 1) : & φ^{†} φ \end{matrix}

(106)

\begin{matrix} GA (2, 0) : & φ^{‡} φ \end{matrix}

(107)

\begin{matrix} GA (3, 0) : & {⌊ φ^{‡} φ ⌋}_{3} φ^{‡} φ \end{matrix}

(108)

\begin{matrix} GA (3, 1) : & {⌊ φ^{‡} φ ⌋}_{3, 4} φ^{‡} φ \end{matrix}

(109)

\begin{matrix} GA (4, 1) : & {({⌊ φ^{‡} φ ⌋}_{3, 4} φ^{‡} φ)}^{†} ({⌊ φ^{‡} φ ⌋}_{3, 4} φ^{‡} φ) \end{matrix}

(110)

We can now express the determinant of the matrix representation of a multivector via the self-product

{⌊ φ^{‡} φ ⌋}_{3, 4} φ^{‡} φ

. Again, this choice is not arbitrary: it is the unique choice that allows us to represent the determinant of the matrix representation of a multivector within

GA (3, 1)

:

Theorem 6

(Determinant as a Multivector Self-Product).

\begin{matrix} {⌊ u^{‡} u ⌋}_{3, 4} u^{‡} u = det M \end{matrix}

(111)

Proof.

A computer-assisted proof of this equality is given in Appendix C. □
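Independently of the appendix, two properties that the self-product must have if it equals a determinant can be checked with a small script: it must be a pure scalar, and it must be multiplicative. The sketch below is a minimal stand-alone geometric-algebra product and is not the author's proof. It assumes the signature γ_0² = -1, γ_1² = γ_2² = γ_3² = +1, encodes basis blades as bitmasks, and uses integer coefficients so that the checks are exact. It does not compare against the matrix of Definition 12.

```python
import random

SIG = (-1, 1, 1, 1)  # assumed signature: gamma_0^2 = -1, gamma_1..3^2 = +1

def blade_product(a, b):
    """Multiply two basis blades encoded as bitmasks; return (sign, blade)."""
    sign, t = 1, a >> 1
    while t:                                  # parity of swaps needed to sort the factors
        if bin(t & b).count("1") % 2:
            sign = -sign
        t >>= 1
    for i in range(4):                        # repeated basis vectors square to SIG[i]
        if ((a & b) >> i) & 1:
            sign *= SIG[i]
    return sign, a ^ b

def gp(u, v):
    """Geometric product of multivectors stored as {blade_bitmask: coefficient}."""
    out = {}
    for a, x in u.items():
        for b, y in v.items():
            s, c = blade_product(a, b)
            out[c] = out.get(c, 0) + s * x * y
    return out

def grade(blade):
    return bin(blade).count("1")

def conj(u):
    """Multivector conjugate (Definition 13): negates grades 1 and 2."""
    return {k: (-c if grade(k) in (1, 2) else c) for k, c in u.items()}

def flip34(u):
    """3,4 blade conjugate (Definition 14): negates grades 3 and 4."""
    return {k: (-c if grade(k) in (3, 4) else c) for k, c in u.items()}

def self_product(u):
    """The GA(3,1) self-product of Theorem 6, returned as a full multivector."""
    w = gp(conj(u), u)
    return gp(flip34(w), w)

def random_multivector(rng):
    return {k: rng.randint(-3, 3) for k in range(16)}

rng = random.Random(0)
u, v = random_multivector(rng), random_multivector(rng)
n_u, n_v, n_uv = self_product(u), self_product(v), self_product(gp(u, v))
print(n_u.get(0, 0), n_v.get(0, 0), n_uv.get(0, 0))
```

For any input, every non-scalar coefficient of the self-product should vanish, and the scalar part of the self-product of uv should equal the product of the scalar parts for u and v.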

Definition 16

(

GA (3, 1)

-valued Vector).

\begin{matrix} |V〉 〉 = [\begin{matrix} u_{1} \\ ⋮ \\ u_{n} \end{matrix}] = [\begin{matrix} a_{1} + x_{1} + f_{1} + v_{1} + b_{1} \\ ⋮ \\ a_{n} + x_{n} + f_{n} + v_{n} + b_{n} \end{matrix}] \end{matrix}

(112)

These constructions allow us to express the partition function in terms of the multivector self-product:

Definition 17

(Double-Copy Product). Instead of an inner product, we obtain what we call a double-copy product:

\begin{matrix} 〈 〈V | V | V | V〉 〉 & = \sum_{i} {⌊ \underset{copy 1}{\underset{︸}{u_{i}^{‡} u_{i}}} ⌋}_{3, 4} \underset{copy 2}{\underset{︸}{u_{i}^{‡} u_{i}}} \end{matrix}

(113)

\begin{matrix} = {⌊ \underset{copy 1}{\underset{︸}{[\begin{matrix} u_{1}^{‡} & \dots & u_{n}^{‡} \end{matrix}] [\begin{matrix} u_{1} & \dots & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & \dots & u_{n} \end{matrix}]}} ⌋}_{3, 4} \underset{copy 2}{\underset{︸}{[\begin{matrix} u_{1}^{‡} & \dots & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & \dots & u_{n}^{‡} \end{matrix}] [\begin{matrix} u_{1} \\ ⋮ \\ u_{n} \end{matrix}]}} \end{matrix}

(114)

Theorem 7

(Partition Function).

Z = 〈 〈V | V | V | V〉 〉

Proof.

\begin{matrix} 〈 〈V | V | V | V〉 〉 \end{matrix}

(115)

\begin{matrix} = {⌊ [\begin{matrix} u_{1}^{‡} & \dots & u_{n}^{‡} \end{matrix}] [\begin{matrix} u_{1} & \dots & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & \dots & u_{n} \end{matrix}] ⌋}_{3, 4} [\begin{matrix} u_{1}^{‡} & \dots & 0 \\ ⋮ & ⋱ & ⋮ \\ 0 & \dots & u_{n}^{‡} \end{matrix}] [\begin{matrix} u_{1} \\ ⋮ \\ u_{n} \end{matrix}] \end{matrix}

(116)

\begin{matrix} = {⌊ [\begin{matrix} u_{1}^{‡} u_{1} & \dots & u_{n}^{‡} u_{n} \end{matrix}] ⌋}_{3, 4} [\begin{matrix} u_{1}^{‡} u_{1} \\ ⋮ \\ u_{n}^{‡} u_{n} \end{matrix}] \end{matrix}

(117)

\begin{matrix} = {⌊ u_{1}^{‡} u_{1} ⌋}_{3, 4} u_{1}^{‡} u_{1} + \dots + {⌊ u_{n}^{‡} u_{n} ⌋}_{3, 4} u_{n}^{‡} u_{n} \end{matrix}

(118)

\begin{matrix} = \sum_{i = 1}^{n} det M_{u_{i}} \end{matrix}

(119)

\begin{matrix} = Z \end{matrix}

(120)

□

Desirable properties for the double-copy product are introduced by addressing the issue of non-positivity. First, we establish non-negativity:

Theorem 8

(Non-negativity). The double-copy product, applied to the even subalgebra of

GA (3, 1)

, is always non-negative.

Proof.

Let

|V〉 〉 = [\begin{matrix} a_{1} + f_{1} + b_{1} \\ ⋮ \\ a_{n} + f_{n} + b_{n} \end{matrix}]

. Then,

\begin{matrix} 〈 〈V | V | V | V〉 〉 \end{matrix}

(121)

\begin{matrix} = {⌊ [\begin{matrix} {(a_{1} + f_{1} + b_{1})}^{‡} (a_{1} + f_{1} + b_{1}) & \dots \end{matrix}] ⌋}_{3, 4} [\begin{matrix} {(a_{1} + f_{1} + b_{1})}^{‡} (a_{1} + f_{1} + b_{1}) \\ ⋮ \end{matrix}] \end{matrix}

(122)

\begin{matrix} = {⌊ [\begin{matrix} (a_{1} - f_{1} + b_{1}) (a_{1} + f_{1} + b_{1}) & \dots \end{matrix}] ⌋}_{3, 4} [\begin{matrix} (a_{1} - f_{1} + b_{1}) (a_{1} + f_{1} + b_{1}) \\ ⋮ \end{matrix}] \end{matrix}

(123)

\begin{matrix} = {⌊ [\begin{matrix} a_{1}^{2} + a_{1} f_{1} + a_{1} b_{1} - f_{1} a_{1} - f_{1}^{2} - f_{1} b_{1} + b_{1} a_{1} + b_{1} f_{1} + b_{1}^{2} & \dots \end{matrix}] ⌋}_{3, 4} \dots \end{matrix}

(124)

\begin{matrix} = {⌊ [\begin{matrix} a_{1}^{2} - f_{1}^{2} + b_{1}^{2} & \dots \end{matrix}] ⌋}_{3, 4} \dots \end{matrix}

(125)

We note that 1) the pseudo-scalar term b I squares to a negative scalar,

{(b I)}^{2} = - b^{2}

and 2)

f^{2} = - E_{1}^{2} - E_{2}^{2} - E_{3}^{2} + B_{1}^{2} + B_{2}^{2} + B_{3}^{2} + 4 e_{0} e_{1} e_{2} e_{3} (E_{1} B_{1} + E_{2} B_{2} + E_{3} B_{3})

\begin{matrix} = {⌊ [\begin{matrix} a_{1}^{2} - b_{1}^{2} + E_{1}^{2} + E_{2}^{2} + E_{3}^{2} - B_{1}^{2} - B_{2}^{2} - B_{3}^{2} - 4 e_{0} e_{1} e_{2} e_{3} (E_{1} B_{1} + E_{2} B_{2} + E_{3} B_{3}) & \dots \end{matrix}] ⌋}_{3, 4} \dots \end{matrix}

(126)

We note that the terms are now complex numbers, which we rewrite as

ℜ (z) = a_{1}^{2} - b_{1}^{2} + E_{1}^{2} + E_{2}^{2} + E_{3}^{2} - B_{1}^{2} - B_{2}^{2} - B_{3}^{2}

and

ℑ (z) = - 4 (E_{1} B_{1} + E_{2} B_{2} + E_{3} B_{3})

\begin{matrix} = {⌊ [\begin{matrix} z_{1} & \dots & z_{n} \end{matrix}] ⌋}_{3, 4} [\begin{matrix} z_{1} \\ ⋮ \\ z_{n} \end{matrix}] \end{matrix}

(127)

\begin{matrix} = [\begin{matrix} z_{1}^{†} & \dots & z_{n}^{†} \end{matrix}] [\begin{matrix} z_{1} \\ ⋮ \\ z_{n} \end{matrix}] \end{matrix}

(128)

\begin{matrix} = z_{1}^{†} z_{1} + \dots + z_{n}^{†} z_{n} \end{matrix}

(129)

which is always non-negative. □

To achieve positive-definiteness of the double-copy product, we must address the issue of non-zero vectors that have zero norm due to the indefinite metric signature of spacetime in

GA (3, 1)

. In such algebras, null vectors (vectors with zero norm) can be non-zero, which prevents the inner product from being strictly positive-definite.

To resolve this, we introduce an equivalence relation that identifies all non-zero vectors of zero norm with the zero vector. Formally, we define the equivalence relation:

Definition 18

(Equivalence Relation for Null Vectors). For vectors

v, w

in the even subalgebra of

GA (3, 1)

, we say:

v \sim w if and only if v - w = 0 or ∥ v - w ∥ = 0 .

This means that in our quotient space

V / \sim

, the only vector of zero norm is the zero vector itself. Consequently, vectors that were previously non-zero but had zero norm are now identified with the zero vector, ensuring that the inner product is positive-definite on the quotient space.

To implement this equivalence relation and to represent physical states appropriately, we define the

{Spin}^{c} (3, 1)

-valued wavefunction, which takes values in the even subalgebra of

GA (3, 1)

:

Definition 19

(

{Spin}^{c} (3, 1)

-Valued Wavefunction).

|ψ〉 〉 = [\begin{matrix} e^{\frac{1}{2} (a_{1} + f_{1} + b_{1})} \\ ⋮ \\ e^{\frac{1}{2} (a_{n} + f_{n} + b_{n})} \end{matrix}] = [\begin{matrix} \sqrt{ρ_{1}} R_{1} B_{1} \\ ⋮ \\ \sqrt{ρ_{n}} R_{n} B_{n} \end{matrix}],

where:

$\sqrt{ρ_{i}} = e^{\frac{1}{2} a_{i}} > 0$ is a strictly positive scalar factor, ensuring non-negativity.
$R_{i} = e^{\frac{1}{2} f_{i}}$ is a rotor representing Lorentz transformations (rotations and boosts in spacetime).
$B_{i} = e^{\frac{1}{2} b_{i}}$ is a complex phase factor: the pseudo-scalar term is $b_{i} I$ with $I^{2} = - 1$ , so that $e^{\frac{1}{2} b_{i} I} = cos (\frac{b_{i}}{2}) + I sin (\frac{b_{i}}{2})$ .

In this representation:

The exponential map $e^{\frac{1}{2} (a_{i} + f_{i} + b_{i})}$ maps elements of the algebra to the connected component of the identity in the spin group ${Spin}^{c} (3, 1)$ , except at the zero vector, where the map is not injective.
The wavefunction $|ψ〉 〉$ captures both the amplitude (through $\sqrt{ρ_{i}}$ ) and the phase (through $R_{i}$ and $B_{i}$ ) of the quantum state.

By considering the quotient space under the equivalence relation ∼, the double-copy product

〈 〈ψ | ψ | ψ | ψ〉 〉

becomes positive-definite. This is because:

The double-copy product yields zero if and only if $|ψ〉 〉$ is the zero vector in the quotient space.
All non-zero vectors $|ψ〉 〉$ in the quotient space have a strictly positive norm.
The identification of null vectors with the zero vector removes the degeneracy caused by the indefinite metric signature.

Now, let us turn our attention to the evolution operator, which leaves the partition function invariant:

Definition 20

(

{Spin}^{c} (3, 1)

Evolution Operator).

\begin{matrix} T = [\begin{matrix} e^{- \frac{1}{2} ζ (f_{1} + b_{1})} \\ ⋱ \\ e^{- \frac{1}{2} ζ (f_{n} + b_{n})} \end{matrix}] \end{matrix}

(130)

In turn, this leads to a variant of the Schrödinger equation obtained by taking the derivative of the wavefunction with respect to the Lagrange multiplier

ζ

:

Definition 21

(

{Spin}^{c} (3, 1)

-valued Schrödinger equation).

\begin{matrix} \frac{d}{d ζ} [\begin{matrix} ψ_{1} (ζ) \\ ⋮ \\ ψ_{n} (ζ) \end{matrix}] = [\begin{matrix} - \frac{1}{2} (f_{1} + b_{1}) \\ ⋱ \\ - \frac{1}{2} (f_{n} + b_{n}) \end{matrix}] [\begin{matrix} ψ_{1} (ζ) \\ ⋮ \\ ψ_{n} (ζ) \end{matrix}] \end{matrix}

(131)

In this case

ζ

represents an evolution parameter akin to time, which transforms the measurement basis under the action of the

{Spin}^{c} (3, 1)

group. This is an extremely general equation that captures all transformations that can be done consistently with the symmetries of the wavefunction.

Theorem 9

(Spin^c(3,1) invariance). Let

e^{\frac{1}{2} f} e^{\frac{1}{2} b}

be a general element of Spin^c(3,1). Then, the equality:

\begin{matrix} {⌊ ψ^{‡} ψ ⌋}_{3, 4} ψ^{‡} ψ = {⌊ {(e^{\frac{1}{2} f} e^{\frac{1}{2} b} ψ)}^{‡} e^{\frac{1}{2} f} e^{\frac{1}{2} b} ψ ⌋}_{3, 4} {(e^{\frac{1}{2} f} e^{\frac{1}{2} b} ψ)}^{‡} e^{\frac{1}{2} f} e^{\frac{1}{2} b} ψ \end{matrix}

(132)

is always satisfied.

Proof.

\begin{matrix} {⌊ {(e^{\frac{1}{2} f} e^{\frac{1}{2} b} ψ)}^{‡} e^{\frac{1}{2} f} e^{\frac{1}{2} b} ψ ⌋}_{3, 4} {(e^{\frac{1}{2} f} e^{\frac{1}{2} b} ψ)}^{‡} e^{\frac{1}{2} f} e^{\frac{1}{2} b} ψ \end{matrix}

(133)

\begin{matrix} = {⌊ ψ^{‡} e^{- \frac{1}{2} f} e^{\frac{1}{2} b} e^{\frac{1}{2} f} e^{\frac{1}{2} b} ψ ⌋}_{3, 4} ψ^{‡} e^{- \frac{1}{2} f} e^{\frac{1}{2} b} e^{\frac{1}{2} f} e^{\frac{1}{2} b} ψ \end{matrix}

(134)

\begin{matrix} = {⌊ ψ^{‡} e^{b} ψ ⌋}_{3, 4} ψ^{‡} e^{b} ψ \end{matrix}

(135)

\begin{matrix} = {⌊ ψ^{‡} ψ ⌋}_{3, 4} e^{- b} e^{b} ψ^{‡} ψ \end{matrix}

(136)

\begin{matrix} = {⌊ ψ^{‡} ψ ⌋}_{3, 4} ψ^{‡} ψ \end{matrix}

(137)

□

2.3.2. RQM

Definition 22

(David Hestenes’ Wavefunction). The

{Spin}^{c} (3, 1)

-valued wavefunction we have recovered is formulated identically to David Hestenes’[14] formulation of the wavefunction within GA(3,1).

\begin{matrix} ψ = e^{\frac{1}{2} (a + f + b)} = \sqrt{ρ} R e^{- i b / 2} \end{matrix}

(138)

where

e^{\frac{1}{2} a} = \sqrt{ρ}

,

e^{\frac{1}{2} f} = R

and

e^{\frac{1}{2} b} = e^{- i b / 2}

.

Before we continue the RQM investigation, let us note that the double-copy product contains two copies of a bilinear form

ψ^{‡} ψ

:

\begin{matrix} {⌊ \underset{copy 1}{\underset{︸}{ψ^{‡} ψ}} ⌋}_{3, 4} \underset{copy 2}{\underset{︸}{ψ^{‡} ψ}} \end{matrix}

(139)

In the present section, we will investigate the properties of each copy individually, leaving the properties specific to the double-copy for the section on quantum gravity.

Taking a single copy, the Dirac current is obtained directly from the gamma matrices, as follows:

Definition 23

(Dirac Current). The definition of the Dirac current is the same as Hestenes’:

\begin{matrix} J & \equiv ψ^{‡} γ_{μ} ψ = ρ R^{‡} B^{‡} γ_{μ} B R = ρ R^{‡} γ_{μ} B^{- 1} B R = ρ \underset{SO (3, 1)}{\underset{︸}{R^{‡} γ_{μ} R}} = ρ {\tilde{γ}}_{μ} \end{matrix}

(140)

where

{\tilde{γ}}_{μ}

is an SO(3,1)-rotated basis vector.

2.3.3. Standard Model Gauge Symmetries

We will now demonstrate that the double-copy product is automatically invariant under transformations corresponding to the

U (1)

,

SU (2)

, and

SU (3)

symmetries, as well as under unitary transformations satisfying

U^{†} U = I

, all of which play fundamental roles in the Standard Model of particle physics. These symmetries constitute the set of transformations that leave the Dirac current invariant, i.e.,

{(T ψ)}^{‡} γ_{0} T ψ = ψ^{‡} γ_{0} ψ

with T valued in

GA (3, 1)

.

Theorem 10 (U(1) Invariance).

Let

e^{\frac{1}{2} b}

be a general element of U(1). Then, the equality

\begin{matrix} {⌊ ψ^{‡} γ_{0} ψ ⌋}_{3, 4} ψ^{‡} γ_{0} ψ = {⌊ \underset{copy 1}{\underset{︸}{{(e^{\frac{1}{2} b} ψ)}^{‡} γ_{0} e^{\frac{1}{2} b} ψ}} ⌋}_{3, 4} \underset{copy 2}{\underset{︸}{{(e^{\frac{1}{2} b} ψ)}^{‡} γ_{0} e^{\frac{1}{2} b} ψ}} \end{matrix}

(141)

is satisfied, yielding a U(1) symmetry for each copied bilinear form.

Proof.

Equation 141 is invariant if this expression is satisfied:

\begin{matrix} e^{\frac{1}{2} b} γ_{0} e^{\frac{1}{2} b} = γ_{0} \end{matrix}

(142)

This is always satisfied simply because

e^{\frac{1}{2} b} γ_{0} e^{\frac{1}{2} b} = γ_{0} e^{- \frac{1}{2} b} e^{\frac{1}{2} b} = γ_{0}

□

Theorem 11 (SU(2) Invariance).

Let

e^{\frac{1}{2} f}

be a general element of Spin(3,1). Then, the equality:

\begin{matrix} {⌊ ψ^{‡} γ_{0} ψ ⌋}_{3, 4} ψ^{‡} γ_{0} ψ = {⌊ \underset{copy 1}{\underset{︸}{{(e^{\frac{1}{2} f} ψ)}^{‡} γ_{0} e^{\frac{1}{2} f} ψ}} ⌋}_{3, 4} \underset{copy 2}{\underset{︸}{{(e^{\frac{1}{2} f} ψ)}^{‡} γ_{0} e^{\frac{1}{2} f} ψ}} \end{matrix}

(143)

is satisfied if

f = θ_{1} γ_{2} γ_{3} + θ_{2} γ_{1} γ_{3} + θ_{3} γ_{1} γ_{2}

(which generates SU(2)), yielding an SU(2) symmetry for each copied bilinear form.

Proof.

Equation 143 is invariant if this expression is satisfied[16]:

\begin{matrix} e^{- \frac{1}{2} f} γ_{0} e^{\frac{1}{2} f} = γ_{0} \end{matrix}

(144)

We now note that moving the left-most term to the right of the gamma matrix yields:

\begin{matrix} e^{- E_{1} γ_{0} γ_{1} - E_{2} γ_{0} γ_{2} - E_{3} γ_{0} γ_{3} - θ_{1} γ_{2} γ_{3} - θ_{2} γ_{1} γ_{3} - θ_{3} γ_{1} γ_{2}} γ_{0} e^{\frac{1}{2} f} \end{matrix}

(145)

\begin{matrix} = γ_{0} e^{E_{1} γ_{0} γ_{1} + E_{2} γ_{0} γ_{2} + E_{3} γ_{0} γ_{3} - θ_{1} γ_{2} γ_{3} - θ_{2} γ_{1} γ_{3} - θ_{3} γ_{1} γ_{2}} e^{\frac{1}{2} f} \end{matrix}

(146)

Therefore, the product

e^{- \frac{1}{2} f} γ_{0} e^{\frac{1}{2} f}

reduces to

γ_{0}

if and only if

E_{1} = E_{2} = E_{3} = 0

, leaving

f = θ_{1} γ_{2} γ_{3} + θ_{2} γ_{1} γ_{3} + θ_{3} γ_{1} γ_{2}

.

Finally, we note that

e^{θ_{1} γ_{2} γ_{3} + θ_{2} γ_{1} γ_{3} + θ_{3} γ_{1} γ_{2}}

generates

SU (2)

. □
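The commutation facts used in the two proofs above can be checked numerically. The sketch below is a minimal illustration under stated assumptions, not part of the original derivation: it assumes the signature γ_0² = -1, γ_1² = γ_2² = γ_3² = +1, encodes basis blades as bitmasks, and exponentiates a single basis blade at a time. The helper names and the angle 0.4 are illustrative.

```python
import math

SIG = (-1, 1, 1, 1)  # assumed signature: gamma_0^2 = -1, gamma_1..3^2 = +1

def blade_product(a, b):
    """Multiply two basis blades encoded as bitmasks; return (sign, blade)."""
    sign, t = 1, a >> 1
    while t:                                  # parity of swaps needed to sort the factors
        if bin(t & b).count("1") % 2:
            sign = -sign
        t >>= 1
    for i in range(4):                        # repeated basis vectors square to SIG[i]
        if ((a & b) >> i) & 1:
            sign *= SIG[i]
    return sign, a ^ b

def gp(u, v):
    """Geometric product of multivectors stored as {blade_bitmask: coefficient}."""
    out = {}
    for a, x in u.items():
        for b, y in v.items():
            s, c = blade_product(a, b)
            out[c] = out.get(c, 0.0) + s * x * y
    return out

def exp_blade(blade, t):
    """exp(t * B) for a single basis blade B, using B*B = +1 or -1."""
    square, _ = blade_product(blade, blade)
    if square < 0:
        return {0: math.cos(t), blade: math.sin(t)}
    return {0: math.cosh(t), blade: math.sinh(t)}

def sandwich(left, right):
    """Return left * gamma_0 * right."""
    return gp(gp(left, {0b0001: 1.0}), right)

def is_gamma0(u, tol=1e-12):
    rest = [c for k, c in u.items() if k != 0b0001]
    return abs(u.get(0b0001, 0.0) - 1.0) < tol and all(abs(c) < tol for c in rest)

I4, G12, G01 = 0b1111, 0b0110, 0b0011   # pseudo-scalar, gamma_1 gamma_2, gamma_0 gamma_1

phase = sandwich(exp_blade(I4, 0.4), exp_blade(I4, 0.4))        # form of Equation (142)
rotation = sandwich(exp_blade(G12, -0.4), exp_blade(G12, 0.4))  # Equation (144), spatial bivector
boost = sandwich(exp_blade(G01, -0.4), exp_blade(G01, 0.4))     # Equation (144), boost bivector
print(is_gamma0(phase), is_gamma0(rotation), is_gamma0(boost))
```

The pseudo-scalar phase and the spatial rotation both return γ_0, while the boost returns a combination of γ_0 and γ_1. This matches the conclusion that the boost coefficients must vanish for Equation (144) to hold.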

Theorem 12 (SU(3)).

The generators of SU(3) in GA(3,1) are given by Anthony Lasenby in [17] and are as follows:

\begin{matrix} {\hat{E}}_{i j} & = {\hat{e}}_{i} {\hat{e}}_{j} - {\hat{f}}_{i} {\hat{f}}_{j} & where i < j \end{matrix}

(147)

\begin{matrix} {\hat{F}}_{i j} & = {\hat{e}}_{i} {\hat{f}}_{j} + {\hat{e}}_{j} {\hat{f}}_{i} & where i < j \end{matrix}

(148)

\begin{matrix} {\hat{J}}_{i} & = {\hat{e}}_{i} {\hat{f}}_{i} & where i = 1, 2, 3 \end{matrix}

(149)

where

\begin{matrix} {\hat{e}}_{i} & = multiplication on the left by σ_{i}, so that {\hat{e}}_{i} (F) = σ_{i} F \end{matrix}

(150)

\begin{matrix} {\hat{f}}_{i} & = multiplication on the right by I σ_{i}, so that {\hat{f}}_{i} (F) = F I σ_{i} \end{matrix}

(151)

This defines the 9 generators of U(3).

With the additional restriction on

\hat{J}

\begin{matrix} α_{1} {\hat{J}}_{1} + α_{2} {\hat{J}}_{2} + α_{3} {\hat{J}}_{3}, with α_{1} + α_{2} + α_{3} = 0 \end{matrix}

(152)

the number of generators is reduced to 8, consistent with SU(3).

We now must show that the following equation is satisfied for all 8 generators:

\begin{matrix} {⌊ ψ^{‡} γ_{0} ψ ⌋}_{3, 4} ψ^{‡} γ_{0} ψ = {⌊ \underset{copy 1}{\underset{︸}{{(e^{θ^{i} λ_{i}} ψ)}^{‡} γ_{0} e^{θ^{i} λ_{i}} ψ}} ⌋}_{3, 4} \underset{copy 2}{\underset{︸}{{(e^{θ^{i} λ_{i}} ψ)}^{‡} γ_{0} e^{θ^{i} λ_{i}} ψ}} \end{matrix}

(153)

Proof.

First, we note the following action:

\begin{matrix} - f γ_{0} f = γ_{0} \end{matrix}

(154)

which we can rewrite as follows:

\begin{matrix} - (E_{1} γ_{0} γ_{1} + E_{2} γ_{0} γ_{2} + E_{3} γ_{0} γ_{3} + B_{1} γ_{2} γ_{3} + B_{2} γ_{1} γ_{3} + B_{3} γ_{1} γ_{2}) γ_{0} f \end{matrix}

(155)

The first three terms anticommute with

γ_{0}

, while the last three commute with

γ_{0}

:

\begin{matrix} = γ_{0} (E_{1} γ_{0} γ_{1} + E_{2} γ_{0} γ_{2} + E_{3} γ_{0} γ_{3} - B_{1} γ_{2} γ_{3} - B_{2} γ_{1} γ_{3} - B_{3} γ_{1} γ_{2}) f \end{matrix}

(156)

This can be written as:

\begin{matrix} γ_{0} (E - B) (E + B) \end{matrix}

(157)

\begin{matrix} = γ_{0} (E^{2} + E B - B E - B^{2}) \end{matrix}

(158)

where

E = E_{1} γ_{0} γ_{1} + E_{2} γ_{0} γ_{2} + E_{3} γ_{0} γ_{3}

and

B = B_{1} γ_{2} γ_{3} + B_{2} γ_{1} γ_{3} + B_{3} γ_{1} γ_{2}

.

Thus, for

- f γ_{0} f = γ_{0}

, we require: 1)

E^{2} - B^{2} = 1

and 2)

E B = B E

. The first requirement expands as follows:

\begin{matrix} E^{2} - B^{2} = (E_{1}^{2} + B_{1}^{2}) + (E_{2}^{2} + B_{2}^{2}) + (E_{3}^{2} + B_{3}^{2}) = 1 \end{matrix}

(159)

which is the defining condition for the

SU (3)

symmetry group.

Finally, as the SU(3) norm is a consequence of preserving the Dirac current, it follows that the SU(3) generators provided by Lasenby, acting on

f

, cannot change the SU(3) norm, hence must also preserve the Dirac current. □

Theorem 13

(Unitary invariance). Let U be an

n \times n

unitary matrix. Then unitary invariance:

\begin{matrix} 〈 〈ψ | γ_{μ} ψ | ψ | γ_{ν} ψ〉 〉 = 〈 〈U ψ | γ_{μ} U ψ | U ψ | γ_{ν} U ψ〉 〉 \Rightarrow U^{†} U = I \end{matrix}

(160)

is individually satisfied for each copied bilinear form.

Proof.

Equation 160 is satisfied if

U^{‡} γ_{μ} U = γ_{μ}

. Since U is valued in complex numbers, then

U^{‡} = U^{T}

, and since

γ_{μ} γ_{0} γ_{1} γ_{2} γ_{3} = - γ_{0} γ_{1} γ_{2} γ_{3} γ_{μ}

, it follows that:

\begin{matrix} γ_{μ} U^{†} U = γ_{μ} \end{matrix}

(161)

which is satisfied when

U^{†} U = I

. □

The invariances SU(3), SU(2) and U(1) discussed above can be promoted to local symmetries using the usual gauge symmetry construction techniques, along with the Dirac equation or field Lagrangian.

In conventional QM, the Born rule naturally leads to a U(1)-valued gauge theory due to the following symmetry:

\begin{matrix} {(e^{- i θ (x)} ψ (x))}^{†} e^{- i θ (x)} ψ (x) = ψ {(x)}^{†} ψ (x) \end{matrix}

(162)

However, the

SU (3)

and

SU (2)

symmetries do not emerge from the probability measure in the same straightforward manner and are typically introduced manually, justified by experimental observations. This raises the question: why these specific symmetries and not others? In contrast, within the double-copy product framework, all three symmetry groups—

U (1)

,

SU (2)

, and

SU (3)

—as well as the

Spin (3, 1)

and unitary symmetries, follow naturally from the invariance of the probability measure, in the same way that the

U (1)

symmetry follows from the Born rule. This suggests a deeper underlying principle governing the symmetries in fundamental physics.

2.3.4. A Starting Point for a Theory of Quantum Gravity

In the previous section, we developed a quantum theory valued in Spin^c(3,1), which served as the arena for RQM. We then demonstrated how a single copy of this theory leads to the gauge symmetries of the Standard Model, the Dirac current, and other features of RQM. The goal of this section is to extend this methodology to basis vectors, in which the metric tensor emerges as an observable. To achieve this, we will utilize both copies.

We recall the definition of the metric tensor in terms of basis vectors of geometric algebra, as follows:

\begin{matrix} g_{μ ν} = \frac{1}{2} (e_{μ} e_{ν} + e_{ν} e_{μ}) \end{matrix}

(163)
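Equation (163) can be evaluated directly with a small geometric-algebra product. The sketch below is an illustration under stated assumptions: it takes the signature e_0² = -1, e_1² = e_2² = e_3² = +1, sets ρ = 1, and uses a single boost rotor R with reverse R̃ to build the rotated basis vectors R̃ e_μ R. The rapidity 0.6 and all helper names are illustrative choices.

```python
import math

SIG = (-1, 1, 1, 1)  # assumed signature: e_0^2 = -1, e_1..3^2 = +1

def blade_product(a, b):
    """Multiply two basis blades encoded as bitmasks; return (sign, blade)."""
    sign, t = 1, a >> 1
    while t:                                  # parity of swaps needed to sort the factors
        if bin(t & b).count("1") % 2:
            sign = -sign
        t >>= 1
    for i in range(4):                        # repeated basis vectors square to SIG[i]
        if ((a & b) >> i) & 1:
            sign *= SIG[i]
    return sign, a ^ b

def gp(u, v):
    """Geometric product of multivectors stored as {blade_bitmask: coefficient}."""
    out = {}
    for a, x in u.items():
        for b, y in v.items():
            s, c = blade_product(a, b)
            out[c] = out.get(c, 0.0) + s * x * y
    return out

def metric(vectors):
    """g[mu][nu] = scalar part of (e_mu e_nu + e_nu e_mu) / 2."""
    return [[(gp(a, b).get(0, 0.0) + gp(b, a).get(0, 0.0)) / 2 for b in vectors]
            for a in vectors]

basis = [{1 << mu: 1.0} for mu in range(4)]

phi = 0.6                                                       # sample rapidity
R = {0: math.cosh(phi / 2), 0b0011: math.sinh(phi / 2)}         # boost rotor in the e_0 e_1 plane
R_rev = {0: math.cosh(phi / 2), 0b0011: -math.sinh(phi / 2)}    # its reverse
boosted = [gp(gp(R_rev, e), R) for e in basis]

g, g_boosted = metric(basis), metric(boosted)
print(g)
```

Under these assumptions the unrotated basis gives the diagonal metric (-1, 1, 1, 1), and the rotated basis gives the same components up to rounding, since the rotor cancels inside the symmetrized product.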

Then, we note that the double-copy product acts on a pair of basis elements

e_{μ}

and

e_{ν}

, as follows: [equations shown as an image in the source; not recovered]

\begin{matrix} = \frac{1}{2} ρ^{2} (\tilde{R} e_{μ} R \tilde{R} e_{ν} R + \tilde{R} e_{ν} R \tilde{R} e_{μ} R) \end{matrix}

(166)

\begin{matrix} = \underset{probability}{\underset{︸}{ρ^{2}}} \underset{metric tensor}{\underset{︸}{\frac{1}{2} ({\tilde{e}}_{μ} {\tilde{e}}_{ν} + {\tilde{e}}_{ν} {\tilde{e}}_{μ})}} \end{matrix}

(167)

where

{\tilde{e}}_{μ}

and

{\tilde{e}}_{ν}

are SO(3,1) rotated basis vectors, and where

ρ^{2}

is a probability measure.

As one can swap

e_{μ}

and

e_{ν}

and obtain the same metric tensor, the double-copy product guarantees that

g_{μ ν}

is symmetric.

Furthermore, since

e_{μ}^{‡} = - e_{μ}

, we get:

\begin{matrix} {⌊ {(e_{μ} ψ)}^{‡} ψ ⌋}_{3, 4} {(e_{ν} ψ)}^{‡} ψ \end{matrix}

(168)

\begin{matrix} = {⌊ ψ^{‡} (- 1) e_{μ}^{‡} ψ ⌋}_{3, 4} ψ^{‡} (- 1) e_{ν}^{‡} ψ \end{matrix}

(169)

\begin{matrix} = {⌊ ψ^{‡} e_{μ} ψ ⌋}_{3, 4} ψ^{‡} e_{ν} ψ \end{matrix}

(170)

which allows us to conclude that

e_{μ}

and

e_{ν}

are self-adjoint within the double-copy product, entailing the interpretation of

g_{μ ν}

as an observable.

In the double-copy product, the metric tensor emerges as a double copy of Dirac currents. This formulation suggests that the metric tensor encodes the probabilistic structure of a quantum theory of gravity in the form of a rank-2 tensor, analogous to how the Dirac current encodes the probabilistic structure of a special relativistic quantum theory in the form of a 4-vector.

Let us now investigate the dynamics. We recall that the evolution operator (Definition 20) is:

\begin{matrix} T = [\begin{matrix} e^{- \frac{1}{2} ζ (f_{1} + b_{1})} \\ ⋱ \\ e^{- \frac{1}{2} ζ (f_{n} + b_{n})} \end{matrix}] \end{matrix}

(171)

Acting on the wavefunction, the effect of this operator cascades down to the basis vectors via the double-copy product: [equation shown as an image in the source; not recovered]

which realizes an

SO (3, 1)

transformation of the metric tensor via the action of the exponential of a bivector, and a double-copy unitary-invariant transformation via the action of the exponential of a pseudo-scalar: [equation shown as an image in the source; not recovered]

In summary, this initial investigation has identified a scenario in which the metric tensor is measured using basis vectors. The evolution operator, governed by the Schrödinger equation, dynamically realizes SO(3,1) transformations on the metric tensor. Furthermore, the amplitudes associated with possible metric tensors are derived from a double-copy of unitary quantum theories acting on the basis vectors. This formulation simultaneously preserves the SO(3,1) symmetry, essential for describing spacetime structure, and the unitary symmetry, fundamental to quantum mechanics. It describes all changes of basis transformations that an observer in 3+1D spacetime can perform prior to measuring (in the quantum sense) a basis system in spacetime, and attributes a probability to the outcome (the outcome being the metric tensor).

2.3.5. The Einstein Field Equation

In the previous section, we established that the metric tensor

g_{μ ν}

emerges as an observable through the double-copy mechanism acting on basis vectors. This probability measure inherently possesses an

SO (3, 1)

invariance due to the Lorentz transformations applied via the evolution operator. While Lorentz invariance is fundamental to the structure of spacetime, in our context, this symmetry introduces redundancies that can change the physics by affecting the physical degrees of freedom of the gravitational field.

To eliminate this unwanted symmetry, we seek to construct a quantity that remains unaffected by

SO (3, 1)

transformations acting on

g_{μ ν}

. Specifically, we consider forming functions of the probability measure and its derivatives that are invariant under such transformations.

The Einstein Field Equations (EFE) emerge naturally as the equations of motion resulting from the simplest possible Lagrangian—the Einstein-Hilbert action—where this invariance property is manifest. The Einstein-Hilbert action is given by

\begin{matrix} S_{EH} = \frac{1}{2 κ} \int \sqrt{- g} R d^{4} x, \end{matrix}

(174)

where

κ = 8 π G

with G being Newton’s gravitational constant, and g is the determinant of the metric tensor. Varying this action with respect to

g_{μ ν}

yields the EFE. This demonstrates that by constructing an appropriate equation of motion involving the Einstein tensor—a function of

g_{μ ν}

invariant under

SO (3, 1)

transformations—we effectively remove the unwanted symmetry and recover the fundamental equations governing gravitational interactions.

This result aligns with our goal of unifying fundamental physical laws through an information-theoretic approach. It shows that gravity—as described by general relativity—can be integrated seamlessly into our unified framework by addressing and eliminating extraneous symmetries, using the simplest possible Lagrangian that permits this elimination.

2.4. Dimensional Obstructions

In this section, we explore the dimensional obstructions that arise when attempting to solve the entropy maximization problem for other dimensional configurations. We found that all geometric configurations except those we have explored here (namely

GA (0) ≅ R

,

GA (0, 1) ≅ C

,

GA (2, 0)

and

GA (3, 1)

) are obstructed. By obstructed, we mean that the solution to the entropy maximization problem,

ρ

, does not satisfy all axioms of probability theory.

\begin{matrix} Dimensions & Obstruction \end{matrix}

\begin{matrix} GA (0) & Unobstructed \Rightarrow statistical mechanics \end{matrix}

(175)

\begin{matrix} GA (0, 1) & Unobstructed \Rightarrow quantum mechanics \end{matrix}

(176)

\begin{matrix} GA (1, 0) & Negative probabilities in the RQM \end{matrix}

(177)

\begin{matrix} GA (2, 0) & Unobstructed \Rightarrow toy model \end{matrix}

(178)

\begin{matrix} GA (1, 1) & Negative probabilities in the RQM \end{matrix}

(179)

\begin{matrix} GA (0, 2) & Not isomorphic to a real matrix algebra \end{matrix}

(180)

\begin{matrix} GA (3, 0) & Not isomorphic to a real matrix algebra \end{matrix}

(181)

\begin{matrix} GA (2, 1) & Not isomorphic to a real matrix algebra \end{matrix}

(182)

\begin{matrix} GA (1, 2) & Not isomorphic to a real matrix algebra \end{matrix}

(183)

\begin{matrix} GA (0, 3) & Not isomorphic to a real matrix algebra \end{matrix}

(184)

\begin{matrix} GA (4, 0) & Not isomorphic to a real matrix algebra \end{matrix}

(185)

\begin{matrix} GA (3, 1) & Unobstructed \Rightarrow quantum gravity \land SU (3) \times SU (2) \times U (1) \end{matrix}

(186)

\begin{matrix} GA (2, 2) & Negative probabilities in the RQM \end{matrix}

(187)

\begin{matrix} GA (1, 3) & Not isomorphic to a real matrix algebra \end{matrix}

(188)

\begin{matrix} GA (0, 4) & Not isomorphic to a real matrix algebra \end{matrix}

(189)

\begin{matrix} GA (5, 0) & Not isomorphic to a real matrix algebra \\ ⋮ & ⋮ \end{matrix}

(190)

\begin{matrix} GA (6, 0) & No probability measure as a self - product \\ ⋮ & ⋮ \end{matrix}

(191)

\begin{matrix} \infty \end{matrix}

(192)

Let us now demonstrate the obstructions mentioned above.

Theorem 14

(Not isomorphic to a real matrix algebra). The determinant of the matrix representation of the geometric algebras in this category is either complex-valued or quaternion-valued, making them unsuitable as a probability.

Proof.

These geometric algebras are classified as follows:

\begin{matrix} GA (0, 2) ≅ H \end{matrix}

(193)

\begin{matrix} GA (3, 0) ≅ M_{2} (C) \end{matrix}

(194)

\begin{matrix} GA (2, 1) ≅ M_{2}^{2} (R) \end{matrix}

(195)

\begin{matrix} GA (1, 2) ≅ M_{2} (C) \end{matrix}

(196)

\begin{matrix} GA (0, 3) ≅ H^{2} \end{matrix}

(197)

\begin{matrix} GA (4, 0) ≅ M_{2} (H) \end{matrix}

(198)

\begin{matrix} GA (1, 3) ≅ M_{2} (H) \end{matrix}

(199)

\begin{matrix} GA (0, 4) ≅ M_{2} (H) \end{matrix}

(200)

\begin{matrix} GA (5, 0) ≅ M_{2}^{2} (H) \end{matrix}

(201)

The determinant of these objects is valued in

C

or in

H

, where

C

are the complex numbers, and where

H

are the quaternions. □

Theorem 15

(Negative Probabilities in the RQM). The even sub-algebra of these dimensional configurations, which is associated with the RQM part of the theory, allows for negative probabilities, making them unsuitable as an RQM.

Proof.

This category contains three dimensional configurations:

$GA (1, 0)$ : Let $ψ = a + b e_{1}$ , then:

$\begin{matrix} {(a + b e_{1})}^{‡} (a + b e_{1}) = (a - b e_{1}) (a + b e_{1}) = a^{2} - b^{2} e_{1} e_{1} = a^{2} - b^{2} \end{matrix}$

(202)

which is valued in $R$ .
$GA (1, 1)$ :: Let $ψ = a + b e_{0} e_{1}$ , then:

$\begin{matrix} {(a + b e_{0} e_{1})}^{‡} (a + b e_{0} e_{1}) = (a - b e_{0} e_{1}) (a + b e_{0} e_{1}) = a^{2} - b^{2} e_{0} e_{1} e_{0} e_{1} = a^{2} - b^{2} \end{matrix}$

(203)

which is valued in $R$ .
$GA (2, 2)$ :: Let $ψ = a + \mathbf{b}$ with $\mathbf{b} = b e_{0} e_{\emptyset} e_{1} e_{2}$ , where $e_{0}^{2} = - 1, e_{\emptyset}^{2} = - 1, e_{1}^{2} = 1, e_{2}^{2} = 1$ , then:

$\begin{matrix} {⌊ {(a + \mathbf{b})}^{‡} (a + \mathbf{b}) ⌋}_{3, 4} {(a + \mathbf{b})}^{‡} (a + \mathbf{b}) \end{matrix}$

(204)

$\begin{matrix} = {⌊ a^{2} + 2 a \mathbf{b} + \mathbf{b}^{2} ⌋}_{3, 4} (a^{2} + 2 a \mathbf{b} + \mathbf{b}^{2}) \end{matrix}$

(205)

We note that $\mathbf{b}^{2} = b^{2} e_{0} e_{\emptyset} e_{1} e_{2} e_{0} e_{\emptyset} e_{1} e_{2} = b^{2}$ , therefore:

$\begin{matrix} = (a^{2} + b^{2} - 2 a \mathbf{b}) (a^{2} + b^{2} + 2 a \mathbf{b}) \end{matrix}$

(206)

$\begin{matrix} = {(a^{2} + b^{2})}^{2} - 4 a^{2} \mathbf{b}^{2} \end{matrix}$

(207)

$\begin{matrix} = {(a^{2} + b^{2})}^{2} - 4 a^{2} b^{2} \end{matrix}$

(208)

which is valued in $R$ .

In all of these cases the RQM probability can be negative. □
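The products in Equations (202) and (203), and the pseudoscalar square noted for $GA (2, 2)$, can be checked numerically with a small geometric-algebra routine. The following is a minimal sketch, assuming basis blades are encoded as bitmasks and multivectors as dictionaries from blade to coefficient; the helper names (`blade_product`, `geometric_product`, `clifford_conjugate`) and the sample values $a = 1$, $b = 2$ are illustrative choices, not taken from the original work.

```python
from itertools import product


def blade_product(a, b, metric):
    """Multiply two basis blades given as bitmasks; return (sign, blade)."""
    swaps = 0
    t = a >> 1
    while t:  # count transpositions needed to reach canonical order
        swaps += bin(t & b).count("1")
        t >>= 1
    sign = -1 if swaps % 2 else 1
    common, i = a & b, 0
    while common:  # repeated basis vectors contract with the metric
        if common & 1:
            sign *= metric[i]
        common >>= 1
        i += 1
    return sign, a ^ b


def geometric_product(u, v, metric):
    """Geometric product of multivectors stored as {blade: coefficient}."""
    out = {}
    for (ba, ca), (bb, cb) in product(u.items(), v.items()):
        sign, blade = blade_product(ba, bb, metric)
        out[blade] = out.get(blade, 0) + sign * ca * cb
    return {blade: c for blade, c in out.items() if c != 0}


def clifford_conjugate(u):
    """Clifford conjugate: grade k picks up the sign (-1)^(k(k+1)/2)."""
    out = {}
    for blade, c in u.items():
        k = bin(blade).count("1")
        out[blade] = c * (-1) ** (k * (k + 1) // 2)
    return out


a, b = 1, 2  # sample values with |b| > |a|

# GA(1,0): e1^2 = +1, psi = a + b e1
psi = {0b0: a, 0b1: b}
p10 = geometric_product(clifford_conjugate(psi), psi, (1,))

# GA(1,1): e0^2 = -1, e1^2 = +1, psi = a + b e0 e1
psi = {0b00: a, 0b11: b}
p11 = geometric_product(clifford_conjugate(psi), psi, (-1, 1))

# GA(2,2): square of the unit pseudoscalar e0 e_emptyset e1 e2
unit_ps = {0b1111: 1}
pseudoscalar_sq = geometric_product(unit_ps, unit_ps, (-1, -1, 1, 1))

print(p10, p11, pseudoscalar_sq)
```

With these sample values the first two products evaluate to the scalar $a^{2} - b^{2} = -3$, and the unit pseudoscalar of $GA (2, 2)$ squares to $+1$, in agreement with the note preceding Equation (206).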

Conjecture 1 (No probability measures as a self-product (in 6D)). The multivector representation of the norm in 6D admits no non-trivial observables.

Argument.

In six dimensions and above, the self-product patterns found in Definition 15 collapse. The research by Acus et al. [18] on 6D geometric algebra demonstrates that the determinant, so far defined through self-products of the multivector, fails to extend to 6D. The crux of the difficulty is evident in the reduced case of a 6D multivector containing only scalar and grade-4 elements:

\begin{matrix} s (B) = b_{1} B f_{5} (f_{4} (B) f_{3} (f_{2} (B) f_{1} (B))) + b_{2} B g_{5} (g_{4} (B) g_{3} (g_{2} (B) g_{1} (B))) \end{matrix}

(209)

This equation is not a multivector self-product but a linear sum of two multivector self-products [18].

The full expression is given in the form of a system of 4 equations, which is too long to list in its entirety. A small characteristic part is shown:

\begin{matrix} a_{0}^{4} - 2 a_{0}^{2} a_{47}^{2} + b_{2} a_{0}^{2} a_{47}^{2} p_{412} p_{422} + 〈 72 monomials 〉 = 0 \end{matrix}

(210)

\begin{matrix} b_{1} a_{0}^{3} a_{52} + 2 b_{2} a_{0} a_{47}^{2} a_{52} p_{412} p_{422} p_{432} p_{442} p_{452} + 〈 72 monomials 〉 = 0 \end{matrix}

(211)

\begin{matrix} 〈 74 monomials 〉 = 0 \end{matrix}

(212)

\begin{matrix} 〈 74 monomials 〉 = 0 \end{matrix}

(213)

From Equation 209, one can see that no non-trivial observable $O$ can satisfy the following equality, because the linear combination does not allow $O$ to be factored out:

\begin{matrix} b_{1} O B f_{5} (f_{4} (B) f_{3} (f_{2} (B) f_{1} (B))) + b_{2} B g_{5} (g_{4} (B) g_{3} (g_{2} (B) g_{1} (B))) = b_{1} B f_{5} (f_{4} (B) f_{3} (f_{2} (B) f_{1} (B))) + b_{2} O B g_{5} (g_{4} (B) g_{3} (g_{2} (B) g_{1} (B))) \end{matrix}

(214)

Any equality of the above type between $b_{1} O$ and $b_{2} O$ is frustrated by the factors $b_{1}$ and $b_{2}$ , forcing $O = 1$ as the only satisfying observable. Since the obstruction occurs within grade-4, which is part of the even sub-algebra, it is questionable whether a satisfactory theory (with non-trivial observables) can be constructed in 6D using our method. □

This conjecture proposes that the multivector representation of the determinant in 6D does not allow for the construction of non-trivial observables, which is a crucial requirement for a relevant quantum formalism. The linear combination of multivector self-products in the 6D expression prevents the factorization of observables, limiting their role to the identity operator.

Conjecture 2 (No probability measures as a self-product (above 6D)). The norms beyond 6D are progressively more complex than the 6D case, which is already obstructed.

These theorems and conjectures provide additional insights into the unique role of the unobstructed 3+1D signature in our proposal.

It is also interesting that our proposal is able to rule out $GA (1, 3)$ even though, in relativity, the choice of metric signature $(+, -, -, -)$ versus $(-, -, -, +)$ does not influence the physics. However, in geometric algebra, $GA (1, 3)$ represents 1 space dimension and 3 time dimensions. Therefore, it is not the signature itself that is ruled out but rather the specific arrangement of 3 time and 1 space dimensions, as this configuration yields quaternion-valued "probabilities" (i.e., $GA (1, 3) ≅ M_{2} (H)$ and $det M_{2} (H) \in H$ ).

Consequently, the only dimensional configuration (other than the purely scalar configurations of $GA (0) ≅ R$ and $GA (0, 1) ≅ C$ ) in which a 'least biased' solution to the problem of maximizing the Shannon entropy of quantum measurements relative to an initial preparation exists is 3+1D.

3. Conclusion

In this work, we have introduced a novel framework that derives fundamental physical laws from a single, unifying principle: the maximization of the Shannon entropy of all possible measurements relative to a system’s initial state, subject to a universal measurement constraint. This universal measurement constraint, which encapsulates all possible linear measurements, serves as the cornerstone of our approach, allowing us to recover various pillars of modern physics within a unified and parsimonious framework.

By applying the universal measurement constraint in different contexts, we demonstrate how fundamental theories naturally emerge. Specifically, when considering measurements constrained by the U(1) generator, we recover quantum mechanics, incorporating complex probability amplitudes and interference effects intrinsic to quantum phenomena. When we extend the measurement structure to include measurements in 3+1 dimensions—the largest permissible measurement structure—we derive a relativistic quantum theory that inherently includes Lorentz symmetry. Remarkably, this approach uniquely yields the metric tensor of general relativity through a double-copy mechanism applied to the Dirac current. Moreover, it naturally incorporates the SU(3)×SU(2)×U(1) gauge symmetries of the Standard Model, providing a unified description of fundamental interactions.

Our findings suggest that the foundational structures of physics—quantum mechanics, special relativity, general relativity, and the gauge symmetries governing particle interactions—are not independent axioms but can be understood as natural consequences of an underlying principle of entropy maximization under the universal measurement constraint. This formulation implies that the laws of physics are determined by the requirement of least informational bias consistent with the measurements allowed by nature, rather than being arbitrary.

Furthermore, the emergence of a 3+1-dimensional spacetime within this framework is particularly noteworthy. The dimensionality arises not from empirical input but as the most sophisticated solution to the entropy maximization problem under the universal measurement constraint. This provides a potential explanation for why our universe exhibits precisely four spacetime dimensions and suggests a profound connection between information theory and the fundamental structure of reality.

By reducing the complexity of fundamental physics to a single, parsimonious principle based on entropy maximization under the universal measurement constraint, this work opens new avenues for understanding the interconnectedness of physical laws. It highlights the power of information-theoretic principles as foundational tools in theoretical physics and invites further exploration into how other physical phenomena might be derived from similar informational and measurement principles. We believe that this framework not only unifies existing theories but also holds the potential to guide future research in uncovering deeper insights into the nature of the universe.

Statements and Declarations

Competing Interests: The author declares that he has no competing financial or non-financial interests that are directly or indirectly related to the work submitted for publication.
Data Availability Statement: No datasets were generated or analyzed during the current study.
During the preparation of this manuscript, we utilized a Large Language Model (LLM) for assistance with spelling and grammar corrections, as well as for minor improvements to the text to enhance clarity and readability. This AI tool did not contribute to the conceptual development of the work, data analysis, interpretation of results, or the decision-making process in the research. Its use was limited to language editing and minor textual enhancements to ensure the manuscript met the required linguistic standards.

Appendix A SM

Here, we solve the Lagrange multiplier equation of SM (Equation 2).

We solve the maximization problem as follows:

\begin{matrix} 0 & = \frac{\partial L (ρ_{1}, \dots, ρ_{n})}{\partial ρ_{i}} \end{matrix}

(A2)

\begin{matrix} = - ln ρ_{i} - 1 - λ - β E_{i} \end{matrix}

(A3)

\begin{matrix} = ln ρ_{i} + 1 + λ + β E_{i} \end{matrix}

(A4)

\begin{matrix} \Rightarrow ln ρ_{i} & = - 1 - λ - β E_{i} \end{matrix}

(A5)

\begin{matrix} \Rightarrow ρ_{i} & = exp (- 1 - λ) exp (- β E_{i}) \end{matrix}

(A6)

\begin{matrix} = \frac{1}{Z (β)} exp (- β E_{i}) \end{matrix}

(A7)

The partition function is obtained from the normalization condition as follows:

\begin{matrix} 1 & = \sum_{i} exp (- 1 - λ) exp (- β E_{i}) \end{matrix}

(A8)

\begin{matrix} \Rightarrow {(exp (- 1 - λ))}^{- 1} & = \sum_{i} exp (- β E_{i}) \end{matrix}

(A9)

\begin{matrix} Z (β) & : = \sum_{i} exp (- β E_{i}) \end{matrix}

(A10)

Finally, the probability measure is:

\begin{matrix} ρ_{i} = \frac{1}{\sum_{j} exp (- β E_{j})} exp (- β E_{i}) \end{matrix}

(A11)
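The result can be checked numerically. The following is a minimal sketch, assuming a hypothetical three-level system and an arbitrary value of $β$ (both chosen only for illustration): it builds the measure of Equation (A11) and verifies that it is normalized and that $ln ρ_{i} + β E_{i}$ takes the same value, $- ln Z$, for every $i$, as Equations (A5) and (A9) require.

```python
import math


def gibbs_measure(energies, beta):
    """Return (rho, Z) with rho_i = exp(-beta * E_i) / Z."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)
    return [w / z for w in weights], z


# Hypothetical energy levels and multiplier, for illustration only.
energies = [0.0, 1.0, 2.0]
beta = 1.5

rho, z = gibbs_measure(energies, beta)

# Stationarity: ln(rho_i) + beta * E_i + ln(Z) should vanish for every i.
residuals = [math.log(r) + beta * e + math.log(z)
             for r, e in zip(rho, energies)]

print("sum of rho:", sum(rho))
print("largest stationarity residual:", max(abs(r) for r in residuals))
```

Both checks should hold up to floating-point rounding for any finite choice of energies and $β$.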

Appendix B RQM in 3+1D

The solution is obtained using the same step-by-step process as the 2D case; the derivation below arrives at Equations (A19) and (A22).

Proof.

The Lagrange multiplier equation can be solved as follows:

\begin{matrix} 0 & = \frac{\partial L (ρ_{1}, \dots, ρ_{n})}{\partial ρ_{i}} \end{matrix}

(A14)

\begin{matrix} = - ln \frac{ρ_{i}}{p_{i}} - 1 - λ - ζ tr \frac{1}{2} M_{i} \end{matrix}

(A15)

\begin{matrix} = ln \frac{ρ_{i}}{p_{i}} + 1 + λ + ζ tr \frac{1}{2} M_{i} \end{matrix}

(A16)

\begin{matrix} \Rightarrow ln \frac{ρ_{i}}{p_{i}} & = - 1 - λ - ζ tr \frac{1}{2} M_{i} \end{matrix}

(A17)

\begin{matrix} \Rightarrow ρ_{i} & = p_{i} exp (- 1 - λ) exp (- ζ tr \frac{1}{2} M_{i}) \end{matrix}

(A18)

\begin{matrix} = \frac{1}{Z (ζ)} p_{i} exp (- ζ tr \frac{1}{2} M_{i}) \end{matrix}

(A19)

The partition function $Z (ζ)$ , serving as a normalization constant, is determined as follows:

\begin{matrix} 1 & = \sum_{i} p_{i} exp (- 1 - λ) exp (- ζ tr \frac{1}{2} M_{i}) \end{matrix}

(A20)

\begin{matrix} \Rightarrow {(exp (- 1 - λ))}^{- 1} & = \sum_{i} p_{i} exp (- ζ tr \frac{1}{2} M_{i}) \end{matrix}

(A21)

\begin{matrix} Z (ζ) & : = \sum_{i} p_{i} exp (- ζ tr \frac{1}{2} M_{i}) \end{matrix}

(A22)

□
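The role of the initial preparation $p_{i}$ in Equation (A19) can be illustrated numerically. The following is a minimal sketch, assuming a hypothetical three-outcome prior and hypothetical real numbers `m` standing in for the scalar values $tr \frac{1}{2} M_{i}$ (neither is taken from the original work): it checks that the measure is normalized and that setting $ζ = 0$ returns the prior unchanged.

```python
import math


def relative_measure(prior, m, zeta):
    """Return rho_i = p_i * exp(-zeta * m_i) / Z(zeta)."""
    weights = [p * math.exp(-zeta * mi) for p, mi in zip(prior, m)]
    z = sum(weights)
    return [w / z for w in weights]


# Hypothetical prior and stand-in values for tr(M_i / 2), for illustration only.
prior = [0.5, 0.3, 0.2]
m = [0.0, 1.0, -1.0]

rho_zero = relative_measure(prior, m, 0.0)  # no constraint: prior is recovered
rho_some = relative_measure(prior, m, 0.7)  # constraint reweights the prior

print("zeta = 0.0:", rho_zero)
print("zeta = 0.7:", rho_some)
```

In this sketch the multiplier $ζ$ only reweights the prior: outcomes with larger `m` are suppressed and outcomes with smaller `m` are enhanced, while the total probability stays equal to one.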

Appendix C SageMath program showing ⌊u ‡ u⌋ 3,4 u ‡ u=detM u

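# Import var explicitly so the script also runs outside an interactive Sage session
from sage.all import var
from sage.algebras.clifford_algebra import CliffordAlgebra
from sage.quadratic_forms.quadratic_form import QuadraticForm
from sage.symbolic.ring import SR
from sage.matrix.constructor import Matrix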
# Define the quadratic form for GA(3,1) over the Symbolic Ring
Q = QuadraticForm(SR, 4, [-1, 0, 0, 0, 1, 0, 0, 1, 0, 1])
# Initialize the GA(3,1) algebra over the Symbolic Ring
algebra = CliffordAlgebra(Q)
# Define the basis vectors
e0, e1, e2, e3 = algebra.gens()
# Define the scalar variables for each basis element
a = var('a')
t, x, y, z = var('t x y z')
f01, f02, f03, f12, f23, f13 = var('f01 f02 f03 f12 f23 f13')
v, w, q, p = var('v w q p')
b = var('b')
# Create a general multivector
udegree0=a
udegree1=t*e0+x*e1+y*e2+z*e3
udegree2=f01*e0*e1+f02*e0*e2+f03*e0*e3+f12*e1*e2+f13*e1*e3+f23*e2*e3
udegree3=v*e0*e1*e2+w*e0*e1*e3+q*e0*e2*e3+p*e1*e2*e3
udegree4=b*e0*e1*e2*e3
u=udegree0+udegree1+udegree2+udegree3+udegree4
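# Product of the Clifford conjugate of u with u
u2 = u.clifford_conjugate()*u
# Split the product into its graded parts
u2degree0 = sum(term for term in u2.terms() if term.degree() == 0)
u2degree1 = sum(term for term in u2.terms() if term.degree() == 1)
u2degree2 = sum(term for term in u2.terms() if term.degree() == 2)
u2degree3 = sum(term for term in u2.terms() if term.degree() == 3)
u2degree4 = sum(term for term in u2.terms() if term.degree() == 4)
# Flip the sign of grades 3 and 4 (the ⌊ ⌋_{3,4} conjugate)
u2conj34 = u2degree0+u2degree1+u2degree2-u2degree3-u2degree4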
# 4x4 identity matrix, used for the scalar part of the matrix representation
I = Matrix(SR, [[1, 0, 0, 0],
                [0, 1, 0, 0],
                [0, 0, 1, 0],
                [0, 0, 0, 1]])
# Real Majorana matrices: y0 squares to -I; y1, y2, y3 square to +I
y0 = Matrix(SR, [[0, 0, 0, 1],
                 [0, 0, -1, 0],
                 [0, 1, 0, 0],
                 [-1, 0, 0, 0]])
y1 = Matrix(SR, [[0, -1, 0, 0],
                 [-1, 0, 0, 0],
                 [0, 0, 0, -1],
                 [0, 0, -1, 0]])
y2 = Matrix(SR, [[0, 0, 0, 1],
                 [0, 0, -1, 0],
                 [0, -1, 0, 0],
                 [1, 0, 0, 0]])
y3 = Matrix(SR, [[-1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, -1, 0],
                 [0, 0, 0, 1]])
# Matrix representation of the same general multivector, grade by grade
mdegree0 = a*I
mdegree1 = t*y0+x*y1+y*y2+z*y3
mdegree2 = f01*y0*y1+f02*y0*y2+f03*y0*y3+f12*y1*y2+f13*y1*y3+f23*y2*y3
mdegree3 = v*y0*y1*y2+w*y0*y1*y3+q*y0*y2*y3+p*y1*y2*y3
mdegree4 = b*y0*y1*y2*y3
m=mdegree0+mdegree1+mdegree2+mdegree3+mdegree4
# Compare the double-copy form with the determinant of the matrix representation
print(u2conj34*u2 == m.det())

The program outputs

True

showing, by computer-assisted symbolic manipulations, that the determinant of the real Majorana representation of a multivector u is equal to the double-copy form $det M_{u} = {⌊ u^{‡} u ⌋}_{3, 4} u^{‡} u$ .
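As a complementary sanity check that does not require SageMath, one can verify that the four matrices used in the program satisfy the defining relations of $GA (3, 1)$ , namely $y_{i} y_{j} + y_{j} y_{i} = 2 η_{ij} I$ with $η = diag (- 1, 1, 1, 1)$ , matching the quadratic form passed to `QuadraticForm`. The following is a minimal sketch in plain Python; the helper names (`matmul`, `anticommutator`, `scaled_identity`) are illustrative and not part of the original program.

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def anticommutator(a, b):
    """Return a*b + b*a."""
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] + ba[i][j] for j in range(n)] for i in range(n)]


def scaled_identity(c, n=4):
    """Return c times the n x n identity matrix."""
    return [[c if i == j else 0 for j in range(n)] for i in range(n)]


# The Majorana matrices exactly as listed in the SageMath program.
y0 = [[0, 0, 0, 1], [0, 0, -1, 0], [0, 1, 0, 0], [-1, 0, 0, 0]]
y1 = [[0, -1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, -1], [0, 0, -1, 0]]
y2 = [[0, 0, 0, 1], [0, 0, -1, 0], [0, -1, 0, 0], [1, 0, 0, 0]]
y3 = [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, 1]]

gammas = [y0, y1, y2, y3]
metric = [-1, 1, 1, 1]  # diagonal of the quadratic form used for GA(3,1)

# Check y_i y_j + y_j y_i = 2 * eta_ij * I for every pair (i, j).
clifford_relations_hold = all(
    anticommutator(gammas[i], gammas[j])
    == scaled_identity(2 * metric[i] if i == j else 0)
    for i in range(4) for j in range(4)
)
print(clifford_relations_hold)
```

This only confirms that the matrices form a representation of the algebra; the determinant identity itself is established by the symbolic computation above.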

References

1. Edwin T. Jaynes. Information theory and statistical mechanics. Physical Review 1957, 106, 620.
2. Edwin T. Jaynes. Information theory and statistical mechanics. II. Physical Review 1957, 108, 171.
3. Paul Adrien Maurice Dirac. The principles of quantum mechanics. Number 27. Oxford University Press, 1981.
4. John Von Neumann. Mathematical foundations of quantum mechanics: New edition, volume 53. Princeton University Press, 2018.
5. Kip S. Thorne, Charles W. Misner, and John Archibald Wheeler. Gravitation. Freeman, San Francisco, 2000.
6. Albert Einstein. The general theory of relativity. In The meaning of relativity, pages 54–75. Springer, 1922.
7. Sean M. Carroll. Spacetime and geometry. Cambridge University Press, 2019.
8. Steven Weinberg. The quantum theory of fields, volume 2. Cambridge University Press, 1995.
9. David J. Gross. The role of symmetry in fundamental physics. Proceedings of the National Academy of Sciences 1996, 93, 14256–14259.
10. Bryce S. DeWitt. Quantum theory of gravity. I. The canonical theory. Physical Review 1967, 160, 1113.
11. Zvi Bern, John Joseph M. Carrasco, and Henrik Johansson. Perturbative quantum gravity as a double copy of gauge theory. Physical Review Letters 2010, 105, 061602.
12. Solomon Kullback and Richard A. Leibler. On information and sufficiency. The Annals of Mathematical Statistics 1951, 22, 79–86.
13. Claude Elwood Shannon. A mathematical theory of communication. Bell System Technical Journal 1948, 27, 379–423.
14. David Hestenes. Spacetime physics with geometric algebra. American Journal of Physics 2003, 71, 691–714.
15. Douglas Lundholm. Geometric (Clifford) algebra and its applications. arXiv preprint math/0605280, 2006.
16. David Hestenes. Space-time structure of weak and electromagnetic interactions. Foundations of Physics 1982, 12, 153–168.
17. Anthony Lasenby. Some recent results for SU(3) and octonions within the geometric algebra approach to the fundamental forces of nature. arXiv preprint arXiv:2202.06733, 2022.
18. A. Acus and A. Dargys. Inverse of multivector: Beyond p+q=5 threshold. arXiv preprint arXiv:1712.05204, 2017.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
