Constructing Physics From Measurements

Alexandre Harvey-Tremblay

doi:10.20944/preprints202404.1009.v16

Submitted:

31 March 2025

Posted:

01 April 2025

Read the latest preprint version here

Abstract

We present a reformulation of fundamental physics from an enumeration of independent axioms into the solution of a single optimization problem. Any experiment begins with an initial state preparation, involves some physical operation, and ends with a final measurement. Working from this structure, we maximize the entropy of a final measurement relative to its initial preparation subject to a measurement constraint. Solving this optimization problem for the natural constraint --the most permissive constraint compatible with said problem-- identifies an optimal physical theory. Rather than existing as a collection of postulates, quantum mechanics, general relativity, and Yang-Mills emerge within a unified theory. Notably, mathematical consistency further restricts valid solutions to 3+1 dimensions only. This reformulation reveals that the apparent complexity of modern physics, with its various forces, symmetries, and dimensional constraints, emerges as the solution to an optimization problem constructed over all experiments realizable within the constraint of nature.

Keywords:

foundations of physics

Subject:

Physical Sciences - Quantum Science and Technology

1. Introduction

Statistical mechanics (SM), in the formulation developed by E.T. Jaynes [1,2], is founded on an entropy optimization principle. Specifically, the Boltzmann entropy is maximized under the constraint of a fixed average energy

\bar{E}

:

\begin{matrix} \bar{E} : = \sum_{i} ρ_{i} E_{i} \end{matrix}

(1)

The Lagrange multiplier equation defining the optimization problem is:

\begin{matrix} L : = - k_{B} \sum_{i} ρ_{i} (β) ln ρ_{i} (β) + λ (1 - \sum_{i} ρ_{i} (β)) + β (\bar{E} - \sum_{i} ρ_{i} (β) E_{i}) \end{matrix}

(2)

where

λ

and

β

are Lagrange multipliers enforcing the normalization and average energy constraints. Solving this optimization problem yields the Gibbs measure:

\begin{matrix} ρ_{i} (β) = \frac{1}{Z (β)} exp (- β E_{i}), \end{matrix}

(3)

where

Z (β) : = \sum_{i} exp (- β E_{i})

is the partition function.

For comparison, quantum mechanics (QM) is not formulated as the solution to an optimization problem, but rather consists of a collection of axioms[3,4]:

QM Axiom 1 of 5: State Space: Every physical system is associated with a complex Hilbert space, and its state is represented by a ray (an equivalence class of vectors differing by a non-zero scalar multiple) in this space.
QM Axiom 2 of 5: Observables: Physical observables correspond to Hermitian (self-adjoint) operators acting on the Hilbert space.
QM Axiom 3 of 5: Dynamics: The time evolution of a quantum system is governed by the Schrödinger equation, where the Hamiltonian operator represents the system’s total energy.
QM Axiom 4 of 5: Measurement: Measuring an observable projects the system into an eigenstate of the corresponding operator, yielding one of its eigenvalues as the measurement result.
QM Axiom 5 of 5: Probability Interpretation: The probability of obtaining a specific measurement outcome is given by the squared magnitude of the projection of the state vector onto the relevant eigenstate (Born rule).

Physical theories have traditionally been constructed in two distinct ways. Some, like QM, are defined through a set of mathematical axioms that are first postulated and then verified against experiments. Others, like SM, emerge as solutions to optimization problems with experimentally-verified constraints.

We propose to generalize the optimization methodology of E.T. Jaynes to encompass all of physics, aiming to derive a unified theory from a single optimization problem.

To that end, we introduce the following constraint:

Axiom 1

(Nature).

\begin{matrix} \bar{M} : = \sum_{i} ρ_{i} M_{i} \end{matrix}

where

M_{i}

are

n \times n

matrices, and

\bar{M}

is their average.

This constraint, as it replaces the scalar

E_{i}

with the matrix

M_{i}

, extends E.T. Jaynes’ optimization method to encompass non-commutative observables and symmetry group generators required for fundamental physics.

We then construct an optimization problem:

Definition 1

(Physics). Physics is the solution to:

\begin{matrix} \underset{\begin{matrix} an \\ optimization \\ problem \end{matrix}}{\underset{︸}{L}} : = \underset{\begin{matrix} on the entropy \\ of a measurement \\ relative to its preparation \\ over all \end{matrix}}{\underset{︸}{- \sum_{i} ρ_{i} (τ) ln \frac{ρ_{i} (τ)}{ρ_{i} (0)}}} + \underset{\begin{matrix} predictive theories \end{matrix}}{\underset{︸}{λ (1 - \sum_{i} ρ_{i} (τ))}} + \underset{\begin{matrix} of nature \end{matrix}}{\underset{︸}{τ tr (\bar{M} - \sum_{i} ρ_{i} (τ) M_{i})}} \end{matrix}

where λ and τ are Lagrange multipliers enforcing the normalization and natural constraints, respectively.

This definition constitutes our complete proposal for reformulating fundamental physics—no additional principles will be introduced. By replacing the Boltzmann entropy with the relative Shannon entropy, the optimization problem extends beyond thermodynamic variables to encompass any type of experiment. This generalization occurs because relative entropy captures the essence of any experiment: the relationship between a final measurement state and its initial preparation state.

Two key constraints shape our framework. The normalization constraint ensures we are working with a proper predictive theory, while the natural constraint spawns the domain of applicability of the theory. The crucial insight is that because our formulation maintains complete generality in the structure of experiments while optimizing over all possible predictive theories, the resulting solution holds true, by construction, for all realizable experiments within its domain.

This approach reduces our reliance on postulating axioms through trial and error, and simplifies the foundations of physics. Specifically, when we employ the natural constraint—the most permissive constraint for this problem (see Discussion for proof)—, the solution spawns its largest domain, pointing towards a unified physics where fundamental theories emerge naturally—e.g. SM when

M ≅ R

, QM when

M ≅ u (1)

, and general relativity (acting on spacetime) + Yang-Mills (acting on internal spaces) when

M ≅ R \oplus spinc (3, 1)

. As we found, these three solutions are the only possible ones, as those entailed by other algebras encounter obstructions which violates the axioms of probability theory.

Theorem 1.

The general solution of the optimization problem is:

\begin{matrix} ρ_{i} (τ) = \frac{1}{\sum_{j} det exp (- τ M_{j}) det ψ_{j} (0)} det exp (- τ M_{i}) det ψ_{i} (0) \end{matrix}

where

det ψ_{i} (τ) : = ρ_{i} (τ)

.

Proof.

We solve the maximization problem by setting the derivative of the Lagrange multiplier equation with respect to

ρ_{i} (τ)

to zero:

\begin{matrix} \frac{\partial L [ρ_{1} (τ), \dots, ρ_{i} (τ), \dots, ρ_{n} (τ)]}{\partial ρ_{i} (τ)} & = - ln \frac{ρ_{i} (τ)}{ρ_{i} (0)} - 1 - λ - τ tr M_{i} = 0 . \end{matrix}

(4)

\begin{matrix} \Rightarrow ln \frac{ρ_{i} (τ)}{ρ_{i} (0)} & = - 1 - λ - τ tr M_{i} . \end{matrix}

(5)

\begin{matrix} \Rightarrow ρ_{i} (τ) & = ρ_{i} (0) exp (- 1 - λ) exp (- τ tr M_{i}) . \end{matrix}

(6)

Normalizing the probabilities using

\sum_{j} ρ_{j} (τ) = 1

, we find:

\begin{matrix} 1 & = \sum_{j} ρ_{j} (τ) = exp (- 1 - λ) \sum_{j} ρ_{j} (0) exp (- τ tr M_{j}), \end{matrix}

(7)

\begin{matrix} \Rightarrow exp (1 + λ) & = \sum_{j} ρ_{j} (0) exp (- τ tr M_{j}) . \end{matrix}

(8)

Substituting back, we obtain:

\begin{matrix} 1 - 1 ρ_{i} (τ) & = \frac{1}{\sum_{j} ρ_{j} (0) exp (- τ tr M_{j})} exp (- τ tr M_{i}) ρ_{i} (0) . \end{matrix}

(9)

Finally, using the identity

det exp (M) \equiv exp tr M

for square matrices

M

, we get:

\begin{matrix} ρ_{i} (τ) = \frac{1}{\sum_{j} det exp (- τ M_{j}) det ψ_{j} (0)} det exp (- τ M_{i}) det ψ_{i} (0) . \end{matrix}

(10)

where

det ψ_{i} (τ) : = ρ_{i} (τ)

. □

As we will see in the results section, this solution encapsulates three distinct special cases:

Statistical Mechanics:

To recover SM from Equation 10, we consider the case where the matrices $M_{i}$ are $1 \times 1$ , i.e., real scalars. Specifically, we set:

$\begin{matrix} \bar{M} = \sum_{i} ρ_{i} M_{i}, with M_{i} = E_{i}, \end{matrix}$

(11)

and take $ρ_{i} (0)$ to be a uniform distribution. Then, Equation 10 reduces to the Gibbs distribution:

$\begin{matrix} ρ_{i} (τ) = \frac{1}{Z} exp (- τ E_{i}), \end{matrix}$

(12)

where $τ$ corresponds to the $β$ of SM. This demonstrates that our solution generalizes SM, as it recovers it when $M_{i}$ are scalars.
Quantum Mechanics:

By choosing $M_{i}$ to generate the U(1) group, we derive the axioms of QM from entropy maximization. Specifically, we set:

$\begin{matrix} \bar{M} = \sum_{i} ρ_{i} M_{i}, with M_{i} = [\begin{matrix} 0 & - E_{i} \\ E_{i} & 0 \end{matrix}], \end{matrix}$

(13)

where $E_{i}$ are energy levels. In the results section, we will detail how this choice leads to the the Born rule in lieu of the Gibbs measure, and that the partition function is unitary invariant—the solution is shown to satisfy all five axioms of QM.
Fundamental Physics:

Extending our approach, we choose $M_{i}$ to be $4 \times 4$ matrices representing the $R \oplus spinc (3, 1)$ algebra. Specifically, we consider multivectors of the form $u = a + f + b$ , where a is a scalar, where $f$ is a bivector and $b$ is a pseudoscalar of the 3+1D geometric algebra $GA (3, 1)$ . This constitute its even sub-algebra. The matrix representation of $M_{i}$ is:

$\begin{matrix} M_{i} = [\begin{matrix} a + f_{02} & b - f_{13} & - f_{01} + f_{12} & f_{03} + f_{23} \\ - b + f_{13} & a + f_{02} & f_{03} + f_{23} & f_{01} - f_{12} \\ - f_{01} - f_{12} & f_{03} - f_{23} & a - f_{02} & - b - f_{13} \\ f_{03} - f_{23} & f_{01} + f_{12} & b + f_{13} & a - f_{02} \end{matrix}], \end{matrix}$

(14)

where $f_{01}, f_{02}, f_{03}, f_{12}, f_{13}, f_{23}$ , and b correspond to the generators of the $Spinc (3, 1)$ group, which includes both Lorentz boosts/rotations and the four-volume orientation, and where a is the generator of the group $R^{+}$ . Solving the optimization problem with this choice leads to a relativistic quantum probability measure extending the Born rule from $C$ to $R^{+} \times Spinc (3, 1)$ . The solution is shown to uniquely satisfy both general relativity (acting on spacetime) and Yang-Mills (acting on its internal spaces).
Dimensional Obstructions:

Definition 1 yields valid probability measures only in specific cases of Axiom 1. Beyond the instances of statistical mechanics and quantum mechanics, Axiom 1 produces a consistent solution only in 3+1 dimensions. In other dimensional configurations, various obstructions arises violating the axioms of probability theory. The following table summarizes the geometric cases and their obstructions:

$\begin{matrix} Dimensions & Optimal Predictive Theory of Nature \end{matrix}$

$\begin{matrix} GA (0) & Statistical Mechanics \end{matrix}$

(15)

$\begin{matrix} GA (0, 1) & Quantum Mechanics \end{matrix}$

(16)

$\begin{matrix} GA (1, 0) & Obstructed (Negative probabilities) \end{matrix}$

(17)

$\begin{matrix} GA (2, 0) & Quantum Mechanics \end{matrix}$

(18)

$\begin{matrix} GA (1, 1) & Obstructed (Negative probabilities) \end{matrix}$

(19)

$\begin{matrix} GA (0, 2) & Obstructed (Non - real probabilities) \end{matrix}$

(20)

$\begin{matrix} GA (3, 0) & Obstructed (Non - real probabilities) \end{matrix}$

(21)

$\begin{matrix} GA (2, 1) & Obstructed (Non - real probabilities) \end{matrix}$

(22)

$\begin{matrix} GA (1, 2) & Obstructed (Non - real probabilities) \end{matrix}$

(23)

$\begin{matrix} GA (0, 3) & Obstructed (Non - real probabilities) \end{matrix}$

(24)

$\begin{matrix} GA (4, 0) & Obstructed (Non - real probabilities) \end{matrix}$

(25)

$\begin{matrix} GA (3, 1) & Gravity + Yang - Mills \end{matrix}$

(26)

$\begin{matrix} GA (2, 2) & Obstructed (Negative probabilities) \end{matrix}$

(27)

$\begin{matrix} GA (1, 3) & Obstructed (Non - real probabilities) \end{matrix}$

(28)

$\begin{matrix} GA (0, 4) & Obstructed (Non - real probabilities) \end{matrix}$

(29)

$\begin{matrix} GA (5, 0) & Obstructed (Non - real probabilities) \\ ⋮ & ⋮ \end{matrix}$

(30)

$\begin{matrix} GA (6, 0) & Suspected Obstructed (No observables) \\ ⋮ & ⋮ \end{matrix}$

(31)

where $GA (p, q)$ means the geometric algebra of $p + q$ dimensions, where p is the number of positive signature dimensions and q of negative signature dimensions. QM shows up twice because both $GA (0, 1)$ and the even-subalgebra of $GA (2, 0)$ are isomorphic to $C$ .

We will first investigate the unobstructed cases in Section 2.1, Section 2.2 and Section 2.3 and then demonstrate the obstructions in Section 2.4. These obstructions are desirable because they automatically limit the theory to 3+1D, thus providing a built-in mechanism for the observed dimensionality of our universe.

2. Results

2.1. $u (1)$ -constraint: Quantum Mechanics

In SM, the central observation is that energy measurements of a thermally equilibrated system tend to cluster around a fixed average value (Equation 1). In contrast, QM is characterized by the presence of interference effects in measurement outcomes. To capture these features, we introduce the following special case of Axiom 1:

Definition 2

(

u (1)

constraint).We reduce the generality of Axiom 1 to the generator of the

U (1)

group. Specifically, we replace

\begin{matrix} \bar{M} = \sum_{i} ρ_{i} M_{i} with M_{i} = [\begin{matrix} 0 & - E_{i} \\ E_{i} & 0 \end{matrix}] = I E_{i} \end{matrix}

where

E_{i}

are scalar values (e.g., energy levels),

ρ_{i}

are the probabilities of outcomes, the matrices

M_{i}

generate the

U (1)

group, and where

I : = [\begin{matrix} 0 & - 1 \\ 1 & 0 \end{matrix}]

and

I^{2} = - [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}] ≅ - 1

.

Let us also substitute

τ : = t / ℏ

analogously to

β : = 1 / (k_{B} T)

. Then, the Lagrange multiplier equation (Definition 1) becomes:

\begin{matrix} L = - \sum_{i} ρ_{i} (t) ln \frac{ρ_{i} (t)}{ρ_{i} (0)} + λ (1 - \sum_{i} ρ_{i} (t)) + \frac{t}{ℏ} tr (I \bar{E} - \sum_{i} ρ_{i} (t) I E_{i}) \end{matrix}

(32)

The general solution of the optimization problem (Theorem 1), with the above-mentioned replacements, reduces to:

\begin{matrix} ρ_{i} (t) = \frac{1}{\sum_{j} det exp (- I t E_{j} / ℏ) det ψ_{j} (0)} det exp (- I t E_{i} / ℏ) det ψ_{i} (0) \end{matrix}

(33)

Though initially unfamiliar, this form effectively establishes a comprehensive formulation of QM, as we will demonstrate.

To align our results with conventional QM notation, we note the following equivalence with the square modulus:

\begin{matrix} det exp [\begin{matrix} a & - b \\ b & a \end{matrix}] & \equiv r^{2} det [\begin{matrix} cos (b) & - sin (b) \\ sin (b) & cos (b) \end{matrix}], where r : = exp a \end{matrix}

(34)

\begin{matrix} \equiv r^{2} ({cos}^{2} (b) + {sin}^{2} (b)) \end{matrix}

(35)

\begin{matrix} \equiv {| r cos (b) + r i sin (b) |}^{2} \end{matrix}

(36)

\begin{matrix} \equiv {| r exp (i b) |}^{2} \end{matrix}

(37)

Consequently, changing to the square modulus representation for both the numerator and to the denominator, consolidates a unitary invariant ensemble, an evolution operator, and an initial preparation into the Born rule:

\begin{matrix} ρ_{i} (t) = \underset{\begin{matrix} Unitary Invariant \\ Ensemble \end{matrix}}{\underset{︸}{\frac{1}{\sum_{j} {| exp (- i t E_{j} / ℏ) ψ_{j} (0) |}^{2}}}} {| \underset{\begin{matrix} Evolution \\ Operator \end{matrix}}{\underset{︸}{exp (- i t E_{i} / ℏ)}} |}^{2} \underset{\begin{matrix} Initial \\ Preparation \end{matrix}}{\underset{︸}{{| ψ_{i} (0) |}^{2}}} \end{matrix}

(38)

where

{| ψ_{i} (τ) |}^{2} : = ρ_{i} (τ)

.

This equation describes what occurs at the instant of measurement—specifically, a part of the state space structure (e.g. the complex phase) is erased under measurement via action of the square modulus, thus yielding a classical probability measure.

By inspection, we can recover the definition of the state space, which is a wavefunction. To obtain it, we decompose the square modulus into a complex number and its conjugate. The state space is then identified as a vector within a complex n-dimensional Hilbert space. The partition function acts as the inner product. This relationship is articulated as follows:

\begin{matrix} Z (t) : = \sum_{j} {| exp (- i t E_{j} / ℏ) ψ_{j} (0) |}^{2} = 〈 ψ | ψ 〉 \end{matrix}

(39)

where

\begin{matrix} [\begin{matrix} ψ_{1} (t) \\ ⋮ \\ ψ_{n} (t) \end{matrix}] : = [\begin{matrix} exp (- i t E_{1} / ℏ) \\ ⋱ \end{matrix}] \begin{matrix} exp (- i t E_{n} / ℏ) \end{matrix} [\begin{matrix} ψ_{1} (0) \\ ⋮ \\ ψ_{n} (0) \end{matrix}] \end{matrix}

(40)

Let us now investigate how the axioms of QM are recovered from this result:

The entropy maximization procedure inherently normalizes the vectors $| ψ 〉$ with $1 / Z = 1 / 〈 ψ | ψ 〉$ . This normalization links $| ψ 〉$ to a unit vector in Hilbert space. Furthermore, as physical states associate to the probability measure, and the probability is defined up to a phase, we conclude that physical states map to Rays within Hilbert space. This demonstrates QM Axiom 1 of 5.
An observable of the ensemble must satisfy:

$\begin{matrix} \bar{O} : = \sum_{j} O_{j} {| exp (- i t E_{j} / ℏ) ψ_{j} (0) |}^{2} \end{matrix}$

(41)

Since $Z = 〈 ψ | ψ 〉$ , then any self-adjoint operator satisfying the condition $〈 O ψ | ϕ 〉 = 〈 ψ | O ϕ 〉$ will equate the above equation, simply because $〈 O 〉 : = 〈 ψ | O | ψ 〉$ . This demonstrates QM Axiom 2 of 5.
Upon transforming Equation 40 out of its eigenbasis through unitary operations, we find that the energy $E_{i}$ transforms in the manner of a Hamiltonian operator:

$\begin{matrix} | ψ (t) 〉 = exp (- i t H / ℏ) | ψ (0) 〉 \end{matrix}$

(42)

The system’s dynamics emerge from differentiating the solution with respect to the Lagrange multiplier. This is manifested as:

$\begin{matrix} \frac{\partial}{\partial t} | ψ (t) 〉 & = \frac{\partial}{\partial t} (exp (- i t H / ℏ) | ψ (0) 〉) \end{matrix}$

(43)

$\begin{matrix} = - i H / ℏ exp (- i t H / ℏ) | ψ (0) 〉 \end{matrix}$

(44)

$\begin{matrix} = - i H / ℏ | ψ (t) 〉 \end{matrix}$

(45)

$\begin{matrix} \Rightarrow H | ψ (t) 〉 & = i ℏ \frac{\partial}{\partial t} | ψ (t) 〉 \end{matrix}$

(46)

which is the Schrödinger equation. This demonstrates QM Axiom 3 of 5.
From Equation 40 it follows that the possible microstates $E_{i}$ of the system correspond to specific eigenvalues of $H$ . An observation can thus be conceptualized as sampling from $ρ$ , with the measured state being the occupied microstate i. Consequently, when a measurement occurs, the system invariably emerges in one of these microstates, which directly corresponds to an eigenstate of $H$ . Measured in the eigenbasis, the probability measure is:

$\begin{matrix} ρ_{i} (t) = \frac{1}{〈 ψ | ψ 〉} {| ψ_{i} (t) |}^{2} . \end{matrix}$

(47)

In scenarios where the probability measure $ρ_{i} (τ)$ is expressed in a basis other than its eigenbasis, the probability $P (λ_{i})$ of obtaining the eigenvalue $λ_{i}$ is given as a projection on a eigenstate:

$\begin{matrix} P (λ_{i}) = {| 〈 λ_{i} | ψ 〉 |}^{2} \end{matrix}$

(48)

Here, $| 〈 λ_{i} | ψ 〉 |^{2}$ signifies the squared magnitude of the amplitude of the state $| ψ 〉$ when projected onto the eigenstate $| λ_{i} 〉$ . As this argument hold for any observable, this demonstrates QM Axiom 4 of 5.
Finally, since the probability measure (Equation 38) replicates the Born rule, QM Axiom 5 of 5 is also demonstrated.

Revisiting QM with this perspective offers a coherent and unified narrative. Specifically, the

U (1)

generating constraint is sufficient to entail the foundations of QM through the principle of entropy maximization—in this formulation, QM Axioms 1, 2, 3, 4, and 5 are not fundamental, but the solution to an optimization problem.

2.2. $R \oplus spin (2)$ -constraint: Euclidean QM in 2D

In this section, we investigate a model, isomorphic to QM, that lives in 2D—it provides a valuable starting point before addressing the more complex 3+1D case. Before we solve the optimization problem, we will first introduce tools to express a bilinear form over

M

using the multivectors of

GA (2, 0)

.

2.2.1. Bilinear Form

In general a multivector

u : = a + x + b

of

GA (2, 0)

, where a is a scalar,

x : = x e_{1} + y e_{2}

is a vector and

b : = b e_{1} e_{2}

a pseudo-scalar, can be represented as a real

2 \times 2

matrix via an isomorphism:

Definition 3

(Pauli Algebra Isomorphism). The map

φ : GA (2, 0) \to Mat (2, R)

defined by:

\begin{matrix} φ (1) : = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}], φ (e_{x}) : = σ_{x} = [\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}], φ (e_{y}) : = σ_{y} = [\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}], \end{matrix}

(49)

extends linearly and multiplicatively to an isomorphism between

GA (2, 0)

and the algebra of real

2 \times 2

matrices. In particular, the basis bivector maps to:

\begin{matrix} φ (e_{x} e_{y}) = σ_{x} σ_{y} = [\begin{matrix} 0 & - 1 \\ 1 & 0 \end{matrix}] . \end{matrix}

(50)

Definition 4

(Matrix Representation). For a multivector

u : = a + x + b

, its matrix representation under φ is:

φ (u) = a φ (1) + x φ (e_{x}) + y φ (e_{y}) + b φ (e_{x} e_{y}) = [\begin{matrix} a + x & y - b \\ y + b & a - x \end{matrix}],

We now introduce the multivector conjugate, also known as the Clifford conjugate, which generalizes the concept of complex conjugation to multivectors.

Definition 5

(Multivector Conjugate—in

GA (2, 0)

).Let

u : = a + x + b

be in

GA (2, 0)

. Then its multivector conjugate is defined as:

\begin{matrix} u^{‡} : = a - x - b \end{matrix}

The determinant of the matrix representation of a multivector can be expressed as a multivector self-product:

Theorem 2

(Multivector Determinant—in

GA (2, 0)

).Let

u : = a + x + b

be in

GA (2, 0)

, then:

\begin{matrix} u^{‡} u \equiv det φ (u) \end{matrix}

(51)

Proof.

Let

u : = a + x + b

thus

φ (u) = [\begin{matrix} a + x & y - b \\ y + b & a - x \end{matrix}]

. Then:

\begin{matrix} 1 : u^{‡} u & = {(a + x + b)}^{‡} (a + x + b) \end{matrix}

(52)

\begin{matrix} = (a - x - b) (a + x + b) \end{matrix}

(53)

\begin{matrix} = a^{2} + a x + a b - x a - x^{2} - x b - b a - b x - b^{2} \end{matrix}

(54)

\begin{matrix} = a^{2} - x^{2} + b^{2} \end{matrix}

(55)

\begin{matrix} = a^{2} - x^{2} - y^{2} + b^{2}, \sin ce x^{2} = x^{2} + y^{2} \end{matrix}

(56)

\begin{matrix} 2 : det φ (u) & = det [\begin{matrix} a + x & y - b \\ y + b & a - x \end{matrix}] \end{matrix}

(57)

\begin{matrix} = (a + x) (a - x) - (y - b) (y + b) \end{matrix}

(58)

\begin{matrix} = a^{2} - x^{2} - y^{2} + b^{2} \end{matrix}

(59)

□

Building upon the concept of the multivector conjugate, we introduce the multivector conjugate transpose, which serves as an extension of the Hermitian conjugate to the domain of multivectors.

Definition 6

(Multivector Conjugate Transpose). Let

|V〉 {〉 \in (GA (2, 0))}^{n}

:

\begin{matrix} |V〉 〉 : = [\begin{matrix} a_{1} + x_{1} + b_{1} \\ ⋮ \\ a_{n} + x_{n} + b_{n} \end{matrix}] \end{matrix}

(60)

The multivector conjugate transpose of

|V〉 〉

is defined as first taking the transpose and then the element-wise multivector conjugate:

\begin{matrix} 〈 〈V| : = [\begin{matrix} a_{1} - x_{1} - b_{1} & \dots & a_{n} - x_{n} - b_{n} \end{matrix}] \end{matrix}

(61)

Definition 7

(Bilinear Form). Let

|V〉 〉

and

|W〉 〉

be two vectors valued in

GA (2, 0)

:

\begin{matrix} |V〉 〉 : = [\begin{matrix} a_{1} + x_{1} + b_{1} \\ ⋮ \\ a_{n} + x_{n} + b_{n} \end{matrix}] & |W〉 〉 : = [\begin{matrix} a_{1}^{'} + x_{1}^{'} + b_{1}^{'} \\ ⋮ \\ a_{n}^{'} + x_{n}^{'} + b_{n}^{'} \end{matrix}] \end{matrix}

(62)

We introduce the following bilinear form:

\begin{matrix} 〈 〈V | W〉 〉 = (a_{1} - x_{1} - b_{1}) (a_{1}^{'} + x_{1}^{'} + b_{1}^{'}) + \dots (a_{n} - x_{n} - b_{n}) (a_{n}^{'} + x_{n}^{'} + b_{n}^{'}) \end{matrix}

(63)

Theorem 3

(Inner Product). Restricted to the even sub-algebra of

GA (2, 0)

, the bilinear form is an inner product.

Proof.

\begin{matrix} {〈 〈V | W〉 〉}_{x \to 0} & = (a_{1} - b_{1}) (a_{1} + b_{1}) + \dots (a_{n} - b_{n}) (a_{n} + b_{n}) \end{matrix}

(64)

This is isomorphic to the inner product of a complex Hilbert space, with the identification

{(σ_{x} σ_{y})}^{2} ≅ - 1

□

2.2.2. 1+1D Obstruction

The reader may wonder why we are using 2D instead of the more physically relevant 1+1D for the lower dimensional example. As stated in the introduction the 1+1D is obstructed. Specifically, the 1+1D theory results in a split-complex quantum theory due to the bilinear form

(a - b e_{0} \land e_{1}) (a + b e_{0} \land e_{1})

, which yields negative probabilities:

a^{2} - b^{2} \in R

for certain wavefunction states, in contrast to the non-negative probabilities

a^{2} + b^{2} \in R^{\geq 0}

obtained in the Euclidean 2D case. As such, neither 1+1D nor any of its sub-algebras satisfy all axioms of probability theory, hence it is obstructed. In contrast, the even sub-algebra of 2D is unobstructed simply because it is isomorphic to the complex numbers.

2.2.3. $spin (2)$ -constraint: ≅ Quantum Mechanics

Let us first investigate the

spin (2)

-constraint, then we will investigate the more general

R \oplus spin (2)

-constraint. This constraint is recovered by posing

a \to 0

and

x \to 0

then

φ (u)

reduces as follows:

\begin{matrix} {u = a + x + b |}_{a \to 0, x \to 0} = b \Rightarrow φ (u) = [\begin{matrix} 0 & - b \\ b & 0 \end{matrix}] \end{matrix}

(65)

The fundamental Lagrange Multiplier Equation:

\begin{matrix} L : = - \sum_{i} ρ_{i} (θ) ln \frac{ρ_{i} (θ)}{ρ_{i} (0)} + λ (1 - \sum_{i} ρ_{i} (θ)) + \frac{1}{2} θ tr (\bar{b} - \sum_{i} ρ_{i} (θ) b_{i}) \end{matrix}

(66)

where

(1): $λ$ and $θ$ are the Lagrange multipliers
(2): $b_{i}$ are the multivectors of $GA (2, 0)$ , reduced by $a \to 0$ and $x \to 0$
(3): the factor (1/2) is there to regularize the adjoint action on a vector $e^{- (1 / 2) b_{i}} v e^{(1 / 2) b_{i}} = v^{'}$

It yields the following solution:

\begin{matrix} ρ_{i} = \underset{\begin{matrix} Spin (2) Invariant \\ Ensemble \end{matrix}}{\underset{︸}{\frac{1}{\sum_{j} det exp (- \frac{1}{2} θ b_{j}) det ψ_{j} (θ)}}} det \underset{\begin{matrix} Evolution \\ Operator \end{matrix}}{\underset{︸}{exp (- \frac{1}{2} θ b_{i})}} \underset{\begin{matrix} Initial \\ Preparation \end{matrix}}{\underset{︸}{det ψ_{i} (0)}} \end{matrix}

(67)

As with the

u (1)

-constraint case, this equation describes what must be erased from the state space structure at the instant of measurement, to yield a classical probability measure—in this case a Spin(2) phase. The wavefunction with this structure is:

Definition 8

(Spin(2)-valuedWavefunction).

\begin{matrix} [\begin{matrix} ψ_{1} (θ) \\ ⋮ \\ ψ_{n} (θ) \end{matrix}] : = [\begin{matrix} exp (- \frac{1}{2} θ b_{1}) \\ ⋱ \\ exp (- \frac{1}{2} θ b_{n}) \end{matrix}] [\begin{matrix} ψ_{1} (0) \\ ⋮ \\ ψ_{n} (0) \end{matrix}] \end{matrix}

The dynamics are described by a variant of the Schrödinger equation, which is derived by taking the derivative of the wavefunction with respect to the Lagrange multiplier,

θ

:

Definition 9

(Spin(2)-valued Schrödinger Equation).

\begin{matrix} \frac{d}{d θ} [\begin{matrix} ψ_{1} (θ) \\ ⋮ \\ ψ_{n} (θ) \end{matrix}] = - \frac{1}{2} [\begin{matrix} b_{1} \\ ⋱ \\ b_{n} \end{matrix}] [\begin{matrix} ψ_{1} (θ) \\ ⋮ \\ ψ_{n} (θ) \end{matrix}] \end{matrix}

where

θ represents a global one-parameter evolution parameter akin to time
$b_{i}$ is the generator of $Spin (2)$ transformations.

Since

Spin (2) ≅ U (1)

, then it should come to no surprise that the theory resulting from the

spin (2)

-constraint is of the same mathematical form as QM, obtained from the

u (1)

-constraint.

2.2.4. $R \oplus spin (2)$ -constraint: Euclidean QM in 2D

Now, we solve the optimization problem for the curvilinear case where

\frac{1}{2} (e_{μ} e_{ν} + e_{ν} e_{μ}) = g_{μ ν}

. We will also move to the continuum

\sum \to \int

. The optimization problem is described as follows:

\begin{matrix} L : = - \int_{L} ρ (θ, r) ln \frac{ρ (θ, r)}{ρ (0, r)} \sqrt{h_{θ}} d r + λ (1 - \int_{L} ρ (θ, r) \sqrt{h_{θ}} d r) + \frac{1}{2} θ (\bar{M} - \int_{L} ρ (θ, r) M (r) \sqrt{h_{θ}} d r) \end{matrix}

(68)

where

$λ$ and $θ$ are the Lagrange multipliers
$\sqrt{h_{θ}}$ describes the metric of the 2D space foliated in slices of constant $θ$ . This foliation results in radial lines from the origin to infinity with angle $θ$ . A less sophisticated way of saying this is that we use polar coordinates in curved space (specifically, for flat space $\sqrt{h_{θ}} = r$ ).
L is the integration length for the slices.
$M$ is the lie algebra $R \oplus spin (2)$ .

We note that the lie algebra

R \oplus spin (2)

is isomorphic to the even subalgebra of

GA (2, 0)

:

\begin{matrix} R \oplus spin (2) ≅ a + b \end{matrix}

(69)

where a generates dilation (corresponding to translations on the

θ

-foliated slices) and

b

generates rotations (corresponding to circular motion on the

θ

-foliated slices). However, this isomorphism is only valid in the absence of curvature. In the presence of curvature, these generators are replaced with:

\begin{matrix} σ_{1} e^{μ} (\partial_{μ} + \frac{1}{2} ω_{μ}^{12} e_{1} e_{2}) \end{matrix}

(70)

where

D_{μ} : = \partial_{μ} + \frac{1}{2} ω_{μ}^{12} e_{1} e_{2}

is the covariant derivate. The expression includes

\partial_{μ}

, the partial derivative, and

\frac{1}{2} ω_{μ}^{12} e_{1} e_{2}

, the

Spin (2)

connection—acting on the wavefunction to transform its components in curved spacetime. The term

e^{μ}

is required to contract the indices of

\partial_{μ}

and

ω_{μ}

, producing an odd multivector—and the term

σ_{1}

converts it back to an even multivector. The term

σ_{1}

also selects a preferred frame—the laboratory frame. Its role is similar to

γ_{0}

in the Dirac Lagrangian.

Solving the optimization problem for this value of

M

, yields:

\begin{matrix} ρ (θ, r) = \frac{1}{\int_{L} det exp (- \frac{1}{2} θ σ_{1} e^{μ} D_{μ}) det ψ (0, r) \sqrt{h_{θ}} d r} det exp (- \frac{1}{2} θ σ_{1} e^{μ} D_{μ}) det ψ (0, r) \end{matrix}

(71)

Probabilities, normalized for a given initial slice, are conserved as the system evolves along

θ

.

As before, this equation identifies the structure of the state space that a measurement must erase to yield a classical probability measure. This structure yields the wavefunction, given as follows:

\begin{matrix} ψ (θ, r) : = exp (- \frac{1}{2} θ σ_{1} e^{μ} D_{μ}) ψ (0, r) \end{matrix}

(72)

Then, taking the derivative with respect to

θ

yields the Schrödinger equation:

\begin{matrix} \frac{\partial}{\partial θ} ψ (θ, r) = - \frac{1}{2} σ_{1} e^{μ} D_{μ} ψ (θ, r) \end{matrix}

(73)

Revealing that the Hamiltonian, curiously expressed as a multivector, is

H : = \frac{1}{2} σ_{1} e^{μ} D_{μ}

(more on that in Theorem 4).

Definition 10

(David Hestenes’ Formulation). In 3+1D, the David Hestenes’ formulation [5] of the wavefunction is

ψ = \sqrt{ρ} R e^{i b / 2}

, where

R = e^{f / 2}

is a Lorentz boost or rotation and where

e^{i b / 2}

is a phase. In 2D, as the algebra only admits a bivector, his formulation would reduce to

ψ = \sqrt{ρ} R

, where ρ is a probability density and R is a rotor—this is the form we have recovered.

\begin{matrix} \sqrt{ρ} R \equiv exp (a / 2) exp (f / 2) \end{matrix}

The definition of the Dirac current applicable to our wavefunction follows the formulation of David Hestenes:

Definition 11

(Dirac Current). The Dirac current for the 2D theory is defined as:

\begin{matrix} J : = ψ^{‡} e_{μ} ψ = ρ \underset{SO (2)}{\underset{︸}{R^{‡} e_{μ} R}} = ρ e_{μ}^{'} \end{matrix}

where

e_{μ}^{'}

is a

SO (2)

-rotated basis vector.

We recall that in QM, the Hamiltonian relate to the Lagrangian as follows:

\begin{matrix} L = tr (\frac{ψ^{‡} H ψ}{ψ^{‡} ψ}) \end{matrix}

(74)

With that, we can prove the following:

Theorem 4

(Dirac equation). The equation of motion of the Schrödinger equation (Equation 73), is the Dirac equation. Since

H = \frac{1}{2} σ_{1} e^{μ} D_{μ}

(Equation 73), we write:

\begin{matrix} \frac{δ}{δ ψ^{‡}} \int_{M} tr (\frac{ψ^{‡} σ_{1} e^{μ} D_{μ} ψ}{2 ψ^{‡} ψ}) \sqrt{g} d θ d r = 0 \Rightarrow \underset{Dirac}{\underset{︸}{e^{μ} D_{μ} ϕ = 0}} Equation \end{matrix}

(75)

Proof.

\begin{matrix} \frac{δ}{δ ψ^{‡}} \int_{M} tr (\frac{ψ^{‡} σ_{1} e^{μ} D_{μ} ψ}{2 ψ^{‡} ψ}) \sqrt{g} d θ d r = 0 \end{matrix}

(76)

We can already recognize the numerator term

ψ^{‡} σ_{1} e^{μ} D_{μ} ψ

as the Dirac Lagrangian (in 2D). Nonetheless, let us show explicitly:

\begin{matrix} \Rightarrow & \frac{δ}{δ ψ^{‡}} tr (\frac{ψ^{‡} σ_{1} e^{μ} D_{μ} ψ}{2 ψ^{‡} ψ}) = 0 \end{matrix}

(77)

\begin{matrix} \Rightarrow & \frac{σ_{1} e^{μ} D_{μ} ψ 2 ψ^{‡} ψ}{2 ψ^{‡} ψ} δ ψ^{‡} + \frac{ψ^{‡} σ_{1} e^{μ} D_{μ} ψ 2 ψ}{2 ψ^{‡} ψ} δ ψ^{‡} = 0 \end{matrix}

(78)

\begin{matrix} \Rightarrow & σ_{1} e^{μ} D_{μ} ψ δ ψ^{‡} + σ_{1} e^{μ} D_{μ} ψ δ ψ^{‡} = 0 \end{matrix}

(79)

\begin{matrix} \Rightarrow & e^{μ} D_{μ} ψ = 0 \end{matrix}

(80)

which is the Dirac equation in 2D. □

Observations: One might be tempted to simply extend

e^{μ} D_{μ} ψ = 0

from 2D to 3+1D by adding the basis vectors

e_{0}

and

e_{3}

. This approach would produce the conventional Dirac equation in spacetime. However, when we solve the optimization problem directly in 3+1D, we discover that the Dirac equation represents only a subsolution of a more comprehensive probabilistic structure. This complete structure naturally incorporates both gravity (acting on spacetime) and Yang-Mills (acting on internal spaces). These results, presented in the following section, suggest that the conventional Dirac equation in 3+1D, in a sense, represents an extension of 2D quantum theory that captures only a part of the full probabilistic structure available in 3+1D spacetime.

2.3. $R \oplus spinc (3, 1)$ -constraint: Gravity + Yang-Mills

Extending the framework to relativistic quantum mechanics begins by considering a measurement constraint having the

R^{+} \times Spinc (3, 1)

symmetry. This allows for transformations that include dilations, boosts/rotations, and re-orientations (David Hestene describes "re-orientation" as representing the changing orientation of the spin plane due to Zitterbewegung).

Another necessary change regards the interpretation of

ψ

from a probability amplitude, to that of a field amplitude

ϕ

. As such, and consistently with usual quantum field theory (QFT) interpretation, the notion of charge conservation will replace that of probability conservation. The notation will be changed as follows:

\begin{matrix} ψ & \to ϕ \end{matrix}

(81)

\begin{matrix} \sqrt{ρ} R e^{- i b / 2} & \to \sqrt{χ} R e^{- i b / 2} \end{matrix}

(82)

M

will represent the algebra of

R \oplus spinc (3, 1)

:

\begin{matrix} M = [\begin{matrix} a + f_{02} & b - f_{13} & - f_{01} + f_{12} & f_{03} + f_{23} \\ - b + f_{13} & a + f_{02} & f_{03} + f_{23} & f_{01} - f_{12} \\ - f_{01} - f_{12} & f_{03} - f_{23} & a - f_{02} & - b - f_{13} \\ f_{03} - f_{23} & f_{01} + f_{12} & b + f_{13} & a - f_{02} \end{matrix}] \end{matrix}

(83)

Using

GA (3, 1)

notation,

M

can be equivalently represented as:

\begin{matrix} M ≅ a + f + b \end{matrix}

(84)

where a is a scalar,

f

is a bivector and

b

is a pseudo-scalar.

As we did for the

R \oplus spin (2)

-constraint, we will develop the framework for the continuum case

\sum \to \int

which can be obtained by solving the following Lagrange multiplier equation:

\begin{matrix} L : = - \int_{V} χ (ζ, x) ln \frac{χ (ζ, x)}{χ (0, x)} \sqrt{h_{ζ}} d^{3} x + \frac{1}{2} ζ tr (\bar{M} - \int_{V} χ (ζ, x) M (x) \sqrt{h_{ζ}} d^{3} x) \end{matrix}

(85)

where

$ζ$ is the twisted-rapidity acting on $R \oplus spinc (3, 1)$ to form a one-parameter group via the exponential map.
$h_{ζ}$ is the determinant of the induced spatial metric on the hypersurface of constant twisted-rapidity $ζ$ . Is is a general curvature version of Rindler’s coordinates.
V represents the causally accessible region
the normalization constraint $λ (1 - \int_{V} χ (x, ζ) \sqrt{h_{ζ}} d^{3} x)$ has been dropped, consistently with a conserved charge interpretation (which will come from the Lagrangian) replacing probability conservation (which comes from a constraint in the optimization problem).

The integration is performed over a foliation of spacetime by surfaces of constant

ζ

, where twisted-rapidity is measured relative to a specified reference frame. This ensures the solution exists only over events that are causally accessible to observers characterized by a specific rapidity, respecting both quantum principles and relativistic causal structure.

Using the same technique as Theorem 1, solving the optimization problem here yields:

\begin{matrix} χ (ζ, x) = det \underset{\begin{matrix} Spinc (3, 1) Evolution \\ Operator \end{matrix}}{\underset{︸}{exp (- \frac{1}{2} ζ M (x))}} \underset{\begin{matrix} Initial \\ Preparation \end{matrix}}{\underset{︸}{det ϕ (0, x)}} \end{matrix}

(86)

where

det ϕ (ζ, x) : = χ (ζ, x)

. The partition function is absent because we dropped the normalization constraint.

As before this equation describes the structure that is erased from the state space at the instant of measurement to yield a classical measure–in the present case a

R^{+} \times Spinc (3, 1)

-valued phase is erased. In what follows, we will describe the wavefunction with this structure. But first, in the next section, we will express the determinant of

4 \times 4

real matrices using the multivectors of

GA (3, 1)

.

2.3.1. The Multivector Determinant

As we did in the beginning of the 2D case, our goal here will be to express

det M

as a multivector self-product. To achieve that, we begin by defining a general multivector in the geometric algebra

GA (3, 1)

:

\begin{matrix} u : = a + x + f + v + b \end{matrix}

(87)

where a is a scalar,

x

a vector,

f

a bivector,

v

is pseudo-vector and

b

a pseudo-scalar. Explicitly,

\begin{matrix} u & : = a \end{matrix}

(88)

\begin{matrix} + t γ_{0} + x γ_{1} + y γ_{2} + z γ_{3} \end{matrix}

(89)

\begin{matrix} + f_{01} γ_{0} γ_{1} + f_{02} γ_{0} γ_{2} + f_{03} γ_{0} γ_{3} + f_{12} γ_{1} γ_{2} + f_{13} γ_{1} γ_{3} + f_{23} γ_{2} γ_{3} \end{matrix}

(90)

\begin{matrix} + p γ_{1} γ_{2} γ_{3} + q γ_{0} γ_{2} γ_{3} + v γ_{0} γ_{1} γ_{3} + w γ_{0} γ_{1} γ_{2} \end{matrix}

(91)

\begin{matrix} + b γ_{0} γ_{1} γ_{2} γ_{3} \end{matrix}

(92)

Definition 12

(Real-Majorana Algebra Isomorphism). The map

φ : GA (3, 1) \to Mat (4, R)

defined by:

\begin{matrix} φ (1) & : = d i a g (1, 1, 1, 1) \end{matrix}

(93)

\begin{matrix} φ (γ_{0}) & : = [\begin{matrix} 0 & 0 & 0 & 1 \\ 0 & 0 & - 1 & 0 \\ 0 & 1 & 0 & 0 \\ - 1 & 0 & 0 & 0 \end{matrix}] & φ (γ_{2}) : = [\begin{matrix} 0 & - 1 & 0 & 0 \\ - 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & - 1 \\ 0 & 0 & - 1 & 0 \end{matrix}] \end{matrix}

(94)

\begin{matrix} φ (γ_{0}) & : = [\begin{matrix} 0 & 0 & 0 & 1 \\ 0 & 0 & - 1 & 0 \\ 0 & - 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \end{matrix}] & φ (γ_{2}) : = [\begin{matrix} - 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & - 1 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}] \end{matrix}

(95)

\begin{matrix} φ (γ_{μ} γ_{ν}) & : = φ (γ_{μ}) φ (γ_{ν}) \end{matrix}

(96)

\begin{matrix} φ (γ_{μ} γ_{ν} γ_{κ}) & : = φ (γ_{μ}) φ (γ_{ν}) φ (γ_{κ}) \end{matrix}

(97)

\begin{matrix} φ (γ_{0} γ_{1} γ_{2} γ_{3}) & : = φ (γ_{0}) φ (γ_{1}) φ (γ_{2}) φ (γ_{3}) \end{matrix}

(98)

extends linearly and multiplicatively to an isomorphism between

GA (3, 1)

and the algebra of real

4 \times 4

matrices.

Definition 13

(Matrix Representation).

\begin{matrix} φ (u) = [\begin{matrix} a + f_{02} - q - z & b - f_{13} + w - x & - f_{01} + f_{12} - p + v & f_{03} + f_{23} + t + y \\ - b + f_{13} + w - x & a + f_{02} + q + z & f_{03} + f_{23} - t - y & f_{01} - f_{12} - p + v \\ - f_{01} - f_{12} + p + v & f_{03} - f_{23} + t - y & a - f_{02} + q - z & - b - f_{13} - w - x \\ f_{03} - f_{23} - t + y & f_{01} + f_{12} + p + v & b + f_{13} - w - x & a - f_{02} - q + z \end{matrix}] \end{matrix}

To manipulate and analyze multivectors in

GA (3, 1)

, we introduce several important operations, such as the multivector conjugate, the pseudo-blade conjugate, and the multivector determinant.

Definition 14

(Multivector Conjugate—in

GA (3, 1)

).

\begin{matrix} u^{‡} : = a - x - f + v + b \end{matrix}

Definition 15

(Pseudo-Blade Conjugate—in

GA (3, 1)

). The pseudo-blade conjugate of

u

is

\begin{matrix} u^{†} : = a + x + f - v - b \end{matrix}

Lundholm[6] proposes a number the multivector norms, and shows that they are the unique forms which carries the properties of the determinants such as

N (u v) = N (u) N (v)

to the domain of multivectors:

Definition 16.

The self-products associated with low-dimensional geometric algebras are:

\begin{matrix} GA (0, 1) : & u^{*} u \end{matrix}

(99)

\begin{matrix} GA (2, 0) : & u^{‡} u \end{matrix}

(100)

\begin{matrix} GA (3, 0) : & {(u^{‡} u)}^{*} u^{‡} u \end{matrix}

(101)

\begin{matrix} GA (3, 1) : & {(u^{‡} u)}^{†} u^{‡} u \end{matrix}

(102)

\begin{matrix} GA (4, 1) : & {({(u^{‡} u)}^{†} u^{‡} u)}^{*} ({(u^{‡} u)}^{†} u^{‡} u) \end{matrix}

(103)

where

u^{*}

is a conjugate that reverses the sign of pseudo-scalar blade (i.e. the highest degree blade of the algebra).

We can now express the determinant of the matrix representation of a multivector via a self-product. This choice is unique:

Theorem 5

(The Multivector Determinant—in GA(3,1)).

\begin{matrix} {(u^{‡} u)}^{†} u^{‡} u \equiv det φ (u) \end{matrix}

Proof.

Please find a computer assisted proof of this equality in Annex Appendix B. □

As can be seen from this theorem, the relationship between determinants and multivector products becomes more sophisticated in 3+1D. Unlike the 2D case where the determinant could be expressed using a product of two terms, in

GA (3, 1)

the determinant requires two products involving four copies of the multivector. This is reflected in the structure

{(u^{‡} u)}^{†} u^{‡} u

, which cannot be reduced to a simpler self-product of two terms.

Theorem 6

(Positive-Definiteness over

R^{+} \times Spinc (3, 1)

).Let

u = exp (\frac{1}{2} (a + f + b))

be a general invertible element of the even-subalgebra of

GA (3, 1)

. As such,

u

is in

R^{+} \times Spinc (3, 1)

. Then the multivector determinant

{(u^{‡} u)}^{†} u^{‡} u

is positive-definite.

Proof.

Since scalars, bivectors and pseudoscalars commute, we have:

\begin{matrix} exp (\frac{1}{2} (a + f + b)) = e^{a / 2} e^{f / 2} e^{b / 2} \end{matrix}

(104)

Using this convenient form, the proof is as follows:

\begin{matrix} {(u^{‡} u)}^{†} u^{‡} u & = e^{a / 2} e^{- f / 2} e^{- b / 2} e^{a / 2} e^{f / 2} e^{- b / 2} e^{a / 2} e^{- f / 2} e^{b / 2} e^{a / 2} e^{f / 2} e^{b / 2} \end{matrix}

(105)

\begin{matrix} = e^{2 a} \end{matrix}

(106)

which is positive-definite—the exponential of a real number a is in

R^{+}

. □

2.3.2. The $R^{+} \times {Spin}^{c} (3, 1)$ -valued Field

By inspection, the solution to the optimisation problem for the

R \oplus spinc (3, 1)

-constraint, identifies the following field:

Definition 17

(

R^{+} \times {Spin}^{c} (3, 1)

-valued Field).

\begin{matrix} ϕ (ζ) : = exp (\frac{1}{2} ζ (a + f + b)) ϕ (0) \end{matrix}

Theorem 7

(David Hestenes’ Wavefunction). The

R^{+} \times {Spin}^{c} (3, 1)

-valued field is formulated using the same geometric structure as David Hestenes’[5] formulation of the wavefunction within GA(3,1). Specifically, David Hestenes’ wavefunction is a special case of our result, where the field magnitude sums to 1.

Proof.

\begin{matrix} \underset{ours}{\underset{︸}{e^{\frac{1}{2} (a (x) + f (x) + b (x))}}} \propto \underset{Hestenes ’}{\underset{︸}{\sqrt{ρ (x)} R (x) e^{- i b (x) / 2}}} \end{matrix}

where

e^{\frac{1}{2} a (x)} \propto \sqrt{ρ (x)}

,

e^{\frac{1}{2} f (x)} = R (x)

and

e^{\frac{1}{2} b (x)} = e^{- i b (x) / 2}

. Here,

ρ (x)

is a probability density (versus a field magnitude),

R (x)

is a rotor and

e^{- i b (x) / 2}

describes the four-volume orientation. Adding the normalisation constraint to the optimisation problem forces the field magnitude to sum to 1, which recovers David Hestenes’ wavefunction as a special case. □

This field leads to a variant of the Schrödinger equation obtained by taking its derivative with respect to the Lagrange multiplier

ζ

:

Definition 18

(

R^{+} \times {Spin}^{c} (3, 1)

-valued Schrödinger equation).

\begin{matrix} \frac{d}{d ζ} ϕ (ζ) = \frac{1}{2} (a + f + b) ϕ (ζ) \end{matrix}

(107)

This Schrödinger equation is able to act on the wavefunction to dilate its probability measure, rotate or boost its rotor and to jiggle its orientation in flat spacetime. In curved spacetime, it generalizes to:

\begin{matrix} \frac{d}{d ζ} ϕ (ζ, x) & = \frac{1}{2} γ_{0} e^{μ} (\partial_{μ} + \frac{1}{2} ω_{μ}^{a b} γ_{a b} + I V_{μ}) ϕ (ζ, x) \end{matrix}

(108)

where

D_{μ} : = \partial_{μ} + \frac{1}{2} ω_{μ}^{a b} γ_{a b} + I V_{μ}

is the covariant derivative. The expression includes

\partial_{μ}

, the partial derivative,

\frac{1}{2} ω_{μ}^{a b} γ_{a b}

the

Spinc (3, 1)

connection and

I V_{μ}

a

U (1)

connection acting on the four-volume orientation—all acting on the wavefunction to transform its components in curved spacetime. Likewise to the 2D case,

e^{μ}

is used to contract with

D_{μ}

, leaving no free indices. But since it produces an odd-multivector in the process, the term

γ_{0}

is added converting the result back into an even-multivector. It also picks a preferred frame—the laboratory frame. Its effect is similar to the presence of

γ_{0}

in the Dirac Lagrangian.

2.3.3. Geometry

Definition 19

(Dirac Current). Using a single-copy of the multivector determinant, the definition of the Dirac current is the same as Hestenes’:

\begin{matrix} J & : = \overset{one}{\overset{︷}{ϕ^{‡} e_{0} ϕ}} copy \end{matrix}

(109)

\begin{matrix} = χ R^{‡} e^{- i b / 2} e_{0} e^{- i b / 2} R \end{matrix}

(110)

\begin{matrix} = χ R^{‡} e_{0} e^{i b / 2} e^{- i b / 2} R \end{matrix}

(111)

\begin{matrix} = χ R^{‡} e_{0} R \end{matrix}

(112)

\begin{matrix} = χ e_{0}^{'} \end{matrix}

(113)

where

e_{0}^{'}

is a SO(3,1) rotated basis vector.

Theorem 8

(Metric Tensor). Taking advantage of the multivector determinant formulation, we utilize both copies to obtain the metric tensor as a basis vectors measurement:

\begin{matrix} tr (\frac{{(\overset{copy 1}{\overset{︷}{ϕ^{‡} e_{μ} ϕ}})}^{†} \overset{copy 2}{\overset{︷}{ϕ^{‡} e_{ν} ϕ}}}{\underset{χ^{2}}{\underset{︸}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}}}) = g_{μ ν} \end{matrix}

Proof.

\begin{matrix} tr (\frac{{(ϕ^{‡} e_{μ} ϕ)}^{†} ϕ^{‡} e_{ν} ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}) & = tr (R^{‡} e^{- i b / 2} e_{μ} e^{- i b / 2} R R^{‡} e^{- i b / 2} e_{ν} e^{- i b / 2} R) \end{matrix}

(114)

\begin{matrix} = tr (R^{‡} e_{μ} e^{i b / 2} e^{- i b / 2} R R^{‡} e_{ν} e^{i b / 2} e^{- i b / 2} R) \end{matrix}

(115)

\begin{matrix} = tr (R^{‡} e_{μ} R R^{‡} e_{ν} R) \end{matrix}

(116)

\begin{matrix} = tr (e_{μ}^{'} e_{ν}^{'}) \end{matrix}

(117)

\begin{matrix} = tr (e_{μ}^{'} \cdot e_{ν}^{'} + e_{ν}^{'} \land e_{μ}^{'}) \end{matrix}

(118)

\begin{matrix} = tr (g_{μ ν} + e_{ν}^{'} \land e_{μ}^{'}) \end{matrix}

(119)

\begin{matrix} = g_{μ ν} \end{matrix}

(120)

□

2.3.4. Dynamics

In the

R \oplus spin (2)

-constraint section we utilized the correspondance between the Hamiltonian of the Schrodinger equation and the Lagrangian (Equation 74):

\begin{matrix} L = tr (\frac{ψ^{‡} H ψ}{ψ^{‡} ψ}) \end{matrix}

(121)

This definition generalizes to the multivector determinant in 3+1D as follows:

\begin{matrix} L : = tr (\frac{{(ϕ^{‡} H ϕ)}^{†} ϕ^{‡} H ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}) \end{matrix}

(122)

which contains two copies.

Definition 20

(Kinetic Energy). Applying the Hamiltonian

H = \frac{1}{2} γ_{0} e^{μ} D_{μ}

to each bilinear copy, yields:

\begin{matrix} T : = tr (\frac{{(\overset{copy 1}{\overset{︷}{ϕ^{‡} γ_{0} e^{μ} D_{μ} ϕ}})}^{†} \overset{copy 2}{\overset{︷}{ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ}}}{4 \underset{χ^{2}}{\underset{︸}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}}}) \end{matrix}

Theorem 9

(Dirac Equation). Varying the action yields the Dirac equation as a sufficient (but not necessary) equation of motion:

\begin{matrix} δ \int_{M} tr (\frac{{(ϕ^{‡} γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ}{4 {(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}) \sqrt{- | g |} d^{4} x = 0 \Rightarrow \underset{Dirac Equation}{\underset{︸}{e^{μ} D_{μ} ϕ = 0}} \end{matrix}

Proof.

\begin{matrix} \frac{δ}{δ ϕ^{‡ †}} tr (\frac{{(ϕ^{‡} γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ}{4 {(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}) = 0 \end{matrix}

(123)

\begin{matrix} \Rightarrow & tr \frac{δ}{δ ϕ^{‡ †}} (\frac{{(ϕ^{‡} γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ}{4 {(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}) = 0 \end{matrix}

(124)

\begin{matrix} \Rightarrow & tr (\frac{{(γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ {(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}{{({(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ)}^{2}} δ ϕ^{‡ †} + \frac{{(ϕ^{‡} γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ ϕ^{†} ϕ^{‡} ϕ}{{({(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ)}^{2}} δ ϕ^{‡ †}) = 0 \end{matrix}

(125)

\begin{matrix} \Rightarrow & tr (\frac{{(γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ} δ ϕ^{‡ †} + \frac{{(γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ} δ ϕ^{‡ †}) = 0 \end{matrix}

(126)

\begin{matrix} \Rightarrow & {(γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ = 0 \end{matrix}

(127)

For the condition to be satisfied, it is sufficient but not necessary that

e^{μ} D_{μ} ϕ = 0

, which is the Dirac equation. A second condition

ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ = 0

also reduces to the Dirac equation because

ϕ^{‡}

is invertible by definition. □

The multivector determinant formulation thus contains the solutions that satisfy the Dirac equation. However, broader solutions where the trace condition

tr ({(ϕ^{‡} γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ) = 0

holds without

e^{μ} D_{μ} ϕ = 0

, also exist. We will now investigate these broader solutions.

2.3.5. Gravity

Theorem 10

(Quantum Action). Let us investigate a subspace of the field where

R = 1

and

e^{- i b / 2} = 1

, such that

ϕ = \sqrt{χ}

. Due to its non-linearity, the kinetic energy produces a quantum potential in addition to the usual kinetic energy term:

\begin{matrix} tr (\frac{{(\sqrt{χ} e^{μ} D_{μ} χ)}^{†} e^{ν} D_{ν} \sqrt{χ}}{χ^{2}}) = \underset{Quantum Kinetics}{\underset{︸}{\frac{1}{2 χ^{2}} {(\partial χ)}^{2}}} - (\underset{Quantum Potential}{\underset{︸}{\frac{1}{4 χ^{2}} {(\partial χ)}^{2} - \frac{\partial^{2} χ}{2 χ}}}) \end{matrix}

(128)

The quantum potential herein described is the relativistic version of the quantum potential found in the Bohm-Broglie reformulation of QM, whereas the quantum kinetics can be understood as a scalar field kinetic term. When integrated, they define a quantity that we refer to as the quantum action:

\begin{matrix} S = \underset{Quantum Action}{\underset{︸}{\int (\frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ}) \sqrt{- | g |} d^{4} x}} \end{matrix}

(129)

Proof.

\begin{matrix} tr ({(χ^{- 2} \sqrt{χ} e^{μ} D_{μ} χ)}^{†} e^{ν} D_{ν} \sqrt{χ}) \end{matrix}

(130)

\begin{matrix} = - tr (χ^{- 2} \sqrt{χ} (e^{μ} \partial_{μ} χ) e^{ν} \partial_{ν} \sqrt{χ} + χ^{- 2} \sqrt{χ} χ e^{μ} \partial_{μ} e^{ν} \partial_{ν} \sqrt{χ}) \\ = - tr (χ^{- 2} 2^{- 1} (e^{μ} \partial_{μ} χ) (e^{ν} \partial_{ν} χ)) + tr (χ^{- 1} \sqrt{χ} 4^{- 1} χ^{- 3 / 2} e^{μ} \partial_{μ} e^{ν} \partial_{ν} χ) \end{matrix}

(131)

\begin{matrix} - tr (χ^{- 1} \sqrt{χ} 2^{- 1} χ^{- 1 / 2} e^{μ} \partial_{μ} e^{ν} \partial_{ν} χ) \end{matrix}

(132)

\begin{matrix} = - tr (\frac{(e^{μ} \partial_{μ} χ) (e^{ν} \partial_{ν} χ)}{2 χ^{2}} - \frac{(e^{μ} \partial_{μ} χ) (e^{ν} \partial_{ν} χ)}{4 χ^{2}} + \frac{e^{μ} \partial_{μ} e^{ν} \partial_{ν} χ}{2 χ}) \end{matrix}

(133)

\begin{matrix} = \frac{1}{2 χ^{2}} {(\partial χ)}^{2} - \frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ} \end{matrix}

(134)

\begin{matrix} = \frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ} \end{matrix}

(135)

□

Theorem 11

(Equation of Motion). Varying the quantum action:

\begin{matrix} S = \int (\frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ}) \sqrt{- | g |} d^{4} x \end{matrix}

(136)

produces:

\begin{matrix} \partial^{2} χ = χ □ χ \end{matrix}

(137)

as the equation of motion.

Proof.

\begin{matrix} δ (\frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ}) = 0 \end{matrix}

(138)

\begin{matrix} \Rightarrow & - \frac{{(\partial χ)}^{2}}{2 χ^{3}} δ χ - \partial_{μ} (\frac{\partial^{μ} χ}{2 χ^{2}}) δ χ + \frac{\partial^{2} (δ χ)}{2 χ} - \frac{\partial^{2} χ}{2 χ^{2}} δ χ = 0 \end{matrix}

(139)

\begin{matrix} \Rightarrow & - \frac{{(\partial χ)}^{2}}{2 χ^{3}} δ χ + \frac{{(\partial χ)}^{2}}{χ^{3}} δ χ - \frac{\partial^{2} χ}{2 χ^{2}} δ χ + \frac{\partial^{2} (δ χ)}{2 χ} - \frac{\partial^{2} χ}{2 χ^{2}} δ χ = 0 \end{matrix}

(140)

\begin{matrix} \Rightarrow & \frac{{(\partial χ)}^{2}}{2 χ^{3}} δ χ - \frac{\partial^{2} χ}{χ^{2}} δ χ + \frac{\partial^{2} (δ χ)}{2 χ} = 0 \end{matrix}

(141)

\begin{matrix} \Rightarrow & \frac{{(\partial χ)}^{2}}{2 χ^{3}} δ χ - \frac{\partial^{2} χ}{χ^{2}} δ χ - \frac{\partial^{2} χ}{2 χ^{2}} δ χ + \frac{{(\partial χ)}^{2}}{χ^{3}} δ χ = 0 \end{matrix}

(144)

\begin{matrix} \Rightarrow & \frac{3 {(\partial χ)}^{2}}{2 χ^{3}} δ χ - \frac{3 \partial^{2} χ}{2 χ^{2}} δ χ = 0 \end{matrix}

(145)

\begin{matrix} \Rightarrow & \partial^{2} χ = \frac{{(\partial χ)}^{2}}{χ} \end{matrix}

(146)

\begin{matrix} \Rightarrow & χ □ χ = {(\partial χ)}^{2} \end{matrix}

(147)

□

To interpret this action and resulting equation of motion, let us now introduce the surprisal field and associated definitions.

Definition 21

(Surprisal Field). We define a change of variable:

\begin{matrix} φ : = - ln χ \end{matrix}

We call φ the surprisal field.

Definition 22

(Surprisal Equation of Motion). We note that the change of variable

φ = - ln χ

, changes the equation of motion as follows:

\begin{matrix} χ □ χ = {(\partial χ)}^{2} \underset{φ = - ln χ}{\underset{︸}{\to}} □ φ = 0 \end{matrix}

which is the Klein-Gordon equation in curved spacetime, applied to the surprisal field.

Definition 23

(Surprisal Conservation). The following current:

\begin{matrix} \nabla_{μ} (\partial^{μ} φ) = 0 \end{matrix}

identifies the surprisal as the conserved charge of this action.

Definition 24

(Surprisal Expectation Value). The surprisal expectation value is merely the entropy H of a region V of the manifold:

\begin{matrix} \underset{expectation value}{\underset{︸}{〈 ln χ 〉}} : = \underset{Definition of Entropy}{\underset{︸}{- \int_{V} χ (x) \underset{observable}{\underset{︸}{ln χ (x)}} \sqrt{h_{ζ}} d^{3} x}} \end{matrix}

Interpretation:

In information theory, the surprisal of an event x with probability density

ρ (x)

is defined as

- ln ρ (x)

, and the entropy

H = - \int ρ ln ρ d^{4} x

represents its expectation value. As the unit of surprisal is the bit, it represents the quantity of information associated to the event—and it is conserved by

□ φ = 0

. In contrast, also in information theory, the units of entropy are the bits per symbol—this is not conserved.

In our framework, the field

χ

replaces

ρ

—it has most of its properties, but differs critically as follows:

$χ$ is not a probability density—it lacks a conserved current ( $\nabla_{μ} (χ u^{μ}) \neq 0$ ) and is not normalized—but it is positive-definite.
Instead, $χ$ is interpreted as an information density, encoding spacetime’s local information content.

The surprisal is defined as

φ = - ln χ

, which in this theory satisfies the Klein-Gordon equation

□ φ = 0

. This ensures:

Conservation: The current $j^{μ} = \partial^{μ} φ$ is conserved ( $\nabla_{μ} j^{μ} = 0$ ), making $Q = \int_{V} j^{μ} \sqrt{h_{ζ}} d^{3} x$ a conserved charge.
Causal Propagation: Surprisal propagates at light speed, enforcing that bits of information cannot spread superluminally—a core tenet of relativity.

Before we continue the interpretation of this theory, let us introduce a few more theorems.

Theorem 12

(Ricci Scalar). Let us investigate another subspace of the field where

\sqrt{χ} = 1

and

e^{- i b / 2} = 1

, such that

ϕ = R

. Then the kinetic energy T reduces to the Ricci scalar

R

.

\begin{matrix} tr ({(\tilde{R} e^{μ} D_{μ} R)}^{†} \tilde{R} e^{ν} D_{ν} R) = R \end{matrix}

Proof.

\begin{matrix} tr ({(\tilde{R} e^{μ} D_{μ} R)}^{†} \tilde{R} e^{ν} D_{ν} R) \end{matrix}

(148)

\begin{matrix} = - tr (\tilde{R} e^{μ} D_{μ} e^{ν} D_{ν} R) & via \tilde{R} R = 1 \end{matrix}

(149)

\begin{matrix} = - tr (\tilde{R} D^{2} R) \end{matrix}

(150)

\begin{matrix} = - tr (R \tilde{R} D^{2}) \end{matrix}

(151)

\begin{matrix} = tr (D^{2}) \end{matrix}

(152)

\begin{matrix} = R & via Lichnerowicz - Weitzenb ö ck identity \end{matrix}

(153)

which is the Ricci scalar. □

Definition 25

(Gravity). Let us now consider the full space of the wavefunction

ψ = \sqrt{ρ} R e^{- i b / 2}

. We are automatically lead into a theory of gravity:

\begin{matrix} S & = \int_{M} tr (\frac{{(ϕ^{‡} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} e^{ν} D_{ν} ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ^{‡}}) \sqrt{- | g |} d^{4} x \end{matrix}

(154)

which expands, via Theorem 10 and 12, as follows:

\begin{matrix} 1 - 1 S & = \int_{M} (R + cross - terms + \frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ}) \sqrt{- | g |} d^{4} x \end{matrix}

(155)

We note the following equations of motion which must be simultaneously satisfied:

Varying with respect to $g_{μ ν}$ yields the EFE with the Einstein tensor from $R$ , and is sourced by the quantum action variation yielding the stress-energy tensor.
Varying with respect to χ gives equations of motion that define the flow of χ in spacetime.

Interpretation (cont’d):

Thus, while quantum mechanics relies on probabilistic amplitudes

ψ

, our formulation recasts general relativity as a deterministic theory of information dynamics, where spacetime geometry and surprisal flux are dual aspects of

R

and

χ

. The distribution of surprisal in spacetime dictates its geometric structure, which in turns dictates how it propagates. General relativity is to information, what quantum mechanics is to probability. Revisiting General Relativity with this perspective shows that the natural constraint is sufficient to entail the theory through the principle of entropy maximization—in this formulation, the speed of light as a limit on the propagation of the quantity of information (via the surprisal obeying the Klein-Gordon equation), and even the Einstein field equations are not fundamental, but the solution to an optimization problem on entropy.

2.3.6. Yang-Mills

In QFT, the standard method to identify a local gauge symmetry is to start with a global symmetry of the action or probability measure and then localize it by introducing gauge fields. For example, the

U (1)

gauge symmetry arises naturally in electromagnetism as the group preserving the probability density (Born rule) under local phase transformations. However, the non-Abelian

SU (2)

and

SU (3)

gauge symmetries of the Standard Model are not derived from first principles in this way; their inclusion is empirically motivated by particle physics experiments.

Improvement via Multivector Determinant Formulation: Our framework demonstrates that Yang-Mills theories emerge naturally from constraints on the wavefunction’s probability measure and Dirac current. Specifically:

Probability Measure: The quadratic form ${(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ = χ^{2}$ enforces rotor invariance $ϕ \to R ϕ$ , restricting transformations to those satisfying $R^{‡} R = 1$ , for some rotor R of a geometric algebra of n dimensions:

$\begin{matrix} {(ϕ^{‡} R^{‡} R ϕ)}^{†} ϕ^{‡} R^{‡} R ϕ = {(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ψ \Rightarrow R^{‡} R = 1 . \end{matrix}$

(156)

Solutions to $R^{‡} R = 1$ are rotor transformations generated by bivectors in the Clifford algebra. For a $2 n$ -dimensional algebra, these generate $Spin (2 n)$ , whose subgroups include $SU (n)$ .
Dirac Current: The spacetime current $ϕ^{‡} e_{0} ϕ = e_{0}$ requires gauge generators to commute with $e_{0}$ , confining them to an internal space. This implies:

$\begin{matrix} ϕ^{‡} e^{- θ^{i} f_{i}} e_{0} e^{θ^{i} f_{i}} ϕ = ϕ^{‡} e_{0} ϕ \Rightarrow [f_{i}, e_{0}] = 0, \end{matrix}$

(157)

where $f_{i}$ are bivector generators. Thus, $f_{i}$ act only on internal degrees of freedom, orthogonal to spacetime.
Spacetime: The origin of the multivector determinant from STA, defines the resulting internal space againts spacetime.

These constraints limit the allowable symmetry to groups generated by bivector exponentials (which are compact Lie groups), and acting on the internal spaces of spacetime. Since

SU (n) \subset Spin (2 n)

, this framework inherently includes the Standard Model within its landscape but also generalizes to larger symmetries such as those found in condensed matter systems with emergent

SU (n)

symmetries.

Wavefunction and Symmetry Structure:

The total wavefunction is a tensor product of spacetime (STA) and internal space components:

For $SU (n)$ Yang-Mills:

$\begin{matrix} ϕ_{STA} \otimes ϕ_{C^{n}} . \end{matrix}$

(158)
For the Standard Model $SU (3) \times SU (2) \times U (1)$ :

$\begin{matrix} ϕ_{STA} \otimes ϕ_{C} \otimes ϕ_{C^{2}} \otimes ϕ_{C^{3}} . \end{matrix}$

(159)

Action:

Our previous gravitational action is reconstructed with a spectral function f:

\begin{matrix} S = \int_{M} tr (f (\frac{1}{Λ^{2}} \frac{{(ϕ^{‡} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} e^{μ} D_{μ} ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ})) \sqrt{- | g |} d^{4} x . \end{matrix}

(160)

A heat kernel expansion yields the invariants of the theory (more on that in a moment).

Covariant Derivative (Ex. Standard Model):

Taking the Standard Model as an example, the covariant derivative incorporates spacetime curvature (gravity) and gauge fields:

\begin{matrix} D_{μ} : = (\begin{matrix} \partial_{μ} + \frac{ω_{μ}^{a b}}{2} γ_{a b} + i g^{'} Y B_{μ} + i g \frac{σ^{a}}{2} W_{μ}^{a} + i g_{s} \frac{λ^{a}}{2} G_{μ}^{a} & Φ \\ Φ^{†} & \partial_{μ} + \frac{ω_{μ}^{a b}}{2} γ_{a b} + i g^{'} Y B_{μ} + i g_{s} \frac{λ^{a}}{2} G_{μ}^{a} \end{matrix}), \end{matrix}

(161)

where:

$γ_{a b}$ : Generators of $Spin (3, 1)$ (gravitational spin connection).
$B_{μ}, W_{μ}^{a}, G_{μ}^{a}$ : $U (1)$ , $SU (2)$ , and $SU (3)$ gauge fields.
$Φ$ : Higgs field (SU(2) doublet).

It acts on the left/right split of the field.

Expanding f yield the field strength term

tr (f (D^{2} / Λ^{2}))

which via the Heat kernel further yields the Standard Model + gravity (see A. H. Chamseddine and Alain Connes [7] for method). The invariants recovered are:

Leading Terms:

(a)

Cosmological constant: $\propto Λ^{4} \int \sqrt{- | g |} d^{4} x$ .

(b)

Einstein-Hilbert term: $\propto Λ^{2} \int R \sqrt{- | g |} d^{4} x$ .
Yang-Mills and Higgs:

(a)

Gauge kinetic terms: $\propto \int \frac{1}{4} F_{μ ν}^{a} F^{μ ν a} \sqrt{- | g |} d^{4} x$ .

(b)

Higgs kinetic and potential terms:

$\begin{matrix} \propto \int (| D_{μ} {Φ |}^{2} + Λ^{2} {| Φ |}^{2} + \frac{1}{Λ^{2}} {| Φ |}^{4}) \sqrt{- | g |} d^{4} x . \end{matrix}$

(162)
Yukawa Couplings (from matter fields):

$\begin{matrix} \propto \int y_{i j} {\bar{ϕ}}_{i} Φ ϕ_{j} \sqrt{- | g |} d^{4} x . \end{matrix}$

(163)

Key Notes:

Higher-Order Terms: Higher order field strength terms appear but are suppressed by $Λ^{- 2}$ , making them negligible at low energies.
Uniqueness: The Standard Model is not uniquely selected by the optimization problem but resides within the landscape of allowed Yang-Mills theories.
Experimental Consistency: The framework ressembles Connes’ spectral action (see A. H. Chamseddine and Alain Connes [7]), recovering the Standard Model and general relativity while allowing for testable extensions (e.g., higher-curvature gravity).

This formulation unifies gauge symmetries and gravity within the multivector determinant structure.

2.3.7. Yang-Mills Axioms as Theorems

In Section 2.1, we demonstrated that all 5 axioms of quantum mechanics are derivable from the solution to the optimization problem in

GA (0, 1)

. Here, our aim is to do the same but for the axioms of Yang-Mills theory. First, let us list the axioms:

Compact Gauge Group: The symmetry group is a compact Lie group G.
Local Gauge Invariance: Fields transform under spacetime-dependent (local) group elements $T (x) \in G$ .
Gauge Connections: Gauge fields $A_{μ}$ are introduced as connections in the covariant derivative $D_{μ} = \partial_{μ} + A_{μ}$ .
Field Strength: The curvature $F_{μ ν} = [D_{μ}, D_{ν}]$ defines the dynamics.
Yang-Mills Action: The action depends on $F_{μ ν}$ , e.g., $\int tr (F_{μ ν} F^{μ ν})$ .

Now for the theorems.

Theorem 13

(Compact Gauge Group). The allowed symmetries form a compact Lie group

G \subset Spin (2 n)

.

Proof.

:

Constraint: ${(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ = χ^{2}$ implies invariance of arbitrary n-dimentional rotors: $R^{‡} R = 1$ .
Structure of Solutions: Rotor transformations in finite-dimensional Clifford algebras are generated by bivectors. These generate Spin( $2 n$ ) and its subgroups, which are compact Lie groups.

Thus, the gauge group G is inherently compact and derived from the algebra structure. □

Theorem 14

(Local Gauge Invariance). The theory is invariant under spacetime-dependent

T (x) \in G

.

Proof. Wavefunction Transformation:

ϕ \to R (x) ϕ

, where

R (x) = e^{θ^{i} (x) f_{i}}

(exponentials of spacetime-dependent bivectors).

Probability Measure:

{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ \to {(ϕ^{‡} R^{‡} R ϕ)}^{†} ϕ^{‡} R^{‡} R ϕ = χ^{2}

.

Dirac Current:

ϕ^{‡} e_{0} ϕ \to ϕ^{‡} R^{‡} e_{0} R ϕ = ϕ^{‡} e_{0} ϕ

, since

[f_{i}, e_{0}] = 0

. □

Theorem 15

(Gauge Connections). The covariant derivative

D_{μ} = \partial_{μ} + A_{μ}

emerges to maintain invariance under local

R (x)

.

Proof.

:

Minimal Coupling: To preserve $D_{μ} ϕ \to R (x) D_{μ} ϕ$ , the derivative must transform as $\partial_{μ} \to \partial_{μ} + A_{μ}$ , where $A_{μ} = f_{i} A_{μ}^{i} (x)$ .
Gauge Field Definition: Let $\partial_{μ} R (x) = A_{μ} R (x)$ , then: $D_{μ} ϕ = \partial_{μ} ϕ + A_{μ} ϕ \Rightarrow D_{μ} (R ϕ) = R D_{μ} ϕ .$
Clifford Algebra Embedding: The $A_{μ}$ are bivector fields in $C ℓ (2 n)$ , ensuring $A_{μ} \in g$ (the Lie algebra of G)).

□

Theorem 16

(Field Strength). The commutator

F_{μ ν} = [D_{μ}, D_{ν}]

defines the field strength.

Proof.

:

Kinetic Energy: The kinetic energy expands to include the field strength tensor:

$\begin{matrix} \frac{{(ϕ^{‡} γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{μ} D_{μ} ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ} = kinetic terms + F_{μ ν} \end{matrix}$

(164)

where

F_{μ ν}

is the field strength (Shown in Definition 25). □

Theorem 17

(Yang-Mills Action). The spectral action over the kinetic energy includes the kinetic term

\int tr (F_{μ ν} F^{μ ν})

.

Proof.

: Heat Kernel Expansion: As shown in Equation 160 (see A. H. Chamseddine and Alain Connes [7] for method), the field strength term of the spectral action

S = tr (f (D^{2} / Λ^{2}))

expands as:

S \sim \int (\dots + F_{μ ν}^{a} F^{a μ ν} + \dots) \sqrt{- | g |} d^{4} x .

□

Revisiting Yang-Mills with this perspective shows that the natural constraint is sufficient to entail the theory through the principle of entropy maximization—in this formulation, Yang-Mills axioms 1, 2, 3, 4, and 5 are not fundamental, but the solution to this optimization problem.

2.4. Dimensional Obstructions

In this section, we explore the dimensional obstructions that arise when attempting to solve the entropy maximization problem for other dimensional configurations. We found that all geometric configurations except the previously explored cases are obstructed. By obstructed, we mean that the solution to the entropy maximization problem,

ρ

, does not satisfy all axioms of probability theory. These obstructions also holds for the less restrictive interpretation in 3+1D of

χ

as an information density, because this interpretation nonetheless requires positive-definiteness which is not satisfied in other dimensional configurations.

\begin{matrix} Dimensions & Optimal Predictive Theory of Nature \end{matrix}

\begin{matrix} GA (0) & Statistical Mechanics \end{matrix}

(165)

\begin{matrix} GA (0, 1) & Quantum Mechanics \end{matrix}

(166)

\begin{matrix} GA (1, 0) & Obstructed (Negative probabilities) \end{matrix}

(167)

\begin{matrix} GA (2, 0) & Quantum Mechanics \end{matrix}

(168)

\begin{matrix} GA (1, 1) & Obstructed (Negative probabilities) \end{matrix}

(169)

\begin{matrix} GA (0, 2) & Obstructed (Non - real probabilities) \end{matrix}

(170)

\begin{matrix} GA (3, 0) & Obstructed (Non - real probabilities) \end{matrix}

(171)

\begin{matrix} GA (2, 1) & Obstructed (Non - real probabilities) \end{matrix}

(172)

\begin{matrix} GA (1, 2) & Obstructed (Non - real probabilities) \end{matrix}

(173)

\begin{matrix} GA (0, 3) & Obstructed (Non - real probabilities) \end{matrix}

(174)

\begin{matrix} GA (4, 0) & Obstructed (Non - real probabilities) \end{matrix}

(175)

\begin{matrix} GA (3, 1) & Gravity + Yang - Mills \end{matrix}

(176)

\begin{matrix} GA (2, 2) & Obstructed (Negative probabilities) \end{matrix}

(177)

\begin{matrix} GA (1, 3) & Obstructed (Non - real probabilities) \end{matrix}

(178)

\begin{matrix} GA (0, 4) & Obstructed (Non - real probabilities) \end{matrix}

(179)

\begin{matrix} GA (5, 0) & Obstructed (Non - real probabilities) \\ ⋮ & ⋮ \end{matrix}

(180)

\begin{matrix} GA (6, 0) & Suspected Obstructed (No observables) \\ ⋮ & ⋮ \end{matrix}

(181)

Let us now demonstrate the obstructions mentioned above.

Theorem 18

(Non-real probabilities). The determinant of the matrix representation of the geometric algebras in this category is either complex-valued or quaternion-valued, making them unsuitable as a probability.

Proof.

These geometric algebras are classified as follows:

\begin{matrix} GA (0, 2) ≅ H \end{matrix}

(182)

\begin{matrix} GA (3, 0) ≅ M_{2} (C) \end{matrix}

(183)

\begin{matrix} GA (2, 1) ≅ M_{2}^{2} (R) \end{matrix}

(184)

\begin{matrix} GA (1, 2) ≅ M_{2} (C) \end{matrix}

(185)

\begin{matrix} GA (0, 3) ≅ H^{2} \end{matrix}

(186)

\begin{matrix} GA (4, 0) ≅ M_{2} (H) \end{matrix}

(187)

\begin{matrix} GA (1, 3) ≅ M_{2} (H) \end{matrix}

(188)

\begin{matrix} GA (0, 4) ≅ M_{2} (H) \end{matrix}

(189)

\begin{matrix} GA (5, 0) ≅ M_{2}^{2} (H) \end{matrix}

(190)

The determinant of these objects is valued in

C

or in

H

, where

C

are the complex numbers, and where

H

are the quaternions. □

Theorem 19

(Negative probabilities). The even sub-algebra of these dimensional configurations allows for negative probabilities, making them unsuitable.

Proof.

This category contains three dimensional configurations:

$GA (1, 0)$ :: Let $ψ = a + b e_{1}$ , then:

$\begin{matrix} {(a + b e_{1})}^{‡} (a + b e_{1}) = (a - b e_{1}) (a + b e_{1}) = a^{2} - b^{2} e_{1} e_{1} = a^{2} - b^{2} \end{matrix}$

(191)

which is valued in $R$ .
$GA (1, 1)$ :: Let $ψ = a + b e_{0} e_{1}$ , then:

$\begin{matrix} {(a + b e_{0} e_{1})}^{‡} (a + b e_{0} e_{1}) = (a - b e_{0} e_{1}) (a + b e_{0} e_{1}) = a^{2} - b^{2} e_{0} e_{1} e_{0} e_{1} = a^{2} - b^{2} \end{matrix}$

(192)

which is valued in $R$ .
$GA (2, 2)$ :: Let $ψ = a + b e_{0} e_{\emptyset} e_{1} e_{2}$ , where $e_{0}^{2} = - 1, e_{\emptyset}^{2} = - 1, e_{1}^{2} = 1, e_{2}^{2} = 1$ , then:

$\begin{matrix} {({(a + b)}^{‡} (a + b))}^{†} {(a + b)}^{‡} (a + b) \end{matrix}$

(193)

$\begin{matrix} = {(a^{2} + 2 a b + b^{2})}^{†} (a^{2} + 2 a b + b^{2}) \end{matrix}$

(194)

We note that $b^{2} = b^{2} e_{0} e_{\emptyset} e_{1} e_{2} e_{0} e_{\emptyset} e_{1} e_{2} = b^{2}$ , therefore:

$\begin{matrix} 1 - 1 & = (a^{2} + b^{2} - 2 a b) (a^{2} + b^{2} + 2 a b) \end{matrix}$

(195)

$\begin{matrix} = {(a^{2} + b^{2})}^{2} - 4 a^{2} b^{2} \end{matrix}$

(196)

$\begin{matrix} = {(a^{2} + b^{2})}^{2} - 4 a^{2} b^{2} \end{matrix}$

(197)

which is valued in $R$ .

In all of these cases the probability can be negative. □

Conjecture 1

(No observables (6D)).The multivector representation of the norm in 6D cannot satisfy any observables.

Argument.

In six dimensions and above, the self-product patterns found in Definition 16 collapse. The research by Acus et al.[7] in 6D geometric algebra concludes that the determinant, so far defined through a self-products of the multivector, fails to extend into 6D. The crux of the difficulty is evident in the reduced case of a 6D multivector containing only scalar and grade-4 elements:

\begin{matrix} s (B) = b_{1} B f_{5} (f_{4} (B) f_{3} (f_{2} (B) f_{1} (B))) + b_{2} B g_{5} (g_{4} (B) g_{3} (g_{2} (B) g_{1} (B))) \end{matrix}

(198)

This equation is not a multivector self-product but a linear sum of two multivector self-products[7].

The full expression is given in the form of a system of 4 equations, which is too long to list in its entirety. A small characteristic part is shown:

\begin{matrix} a_{0}^{4} - 2 a_{0}^{2} a_{47}^{2} + b_{2} a_{0}^{2} a_{47}^{2} p_{412} p_{422} + 〈 72 monomials 〉 = 0 \end{matrix}

(199)

\begin{matrix} b_{1} a_{0}^{3} a_{52} + 2 b_{2} a_{0} a_{47}^{2} a_{52} p_{412} p_{422} p_{432} p_{442} p_{452} + 〈 72 monomials 〉 = 0 \end{matrix}

(200)

\begin{matrix} 〈 74 monomials 〉 = 0 \end{matrix}

(201)

\begin{matrix} 〈 74 monomials 〉 = 0 \end{matrix}

(202)

From Equation 198, it is possible to see that no observable

O

can satisfy this equation because the linear combination does not allow one to factor it out of the equation.

\begin{matrix} b_{1} O B f_{5} (f_{4} (B) f_{3} (f_{2} (B) f_{1} (B))) + b_{2} B g_{5} (g_{4} (B) g_{3} (g_{2} (B) g_{1} (B))) = b_{1} B f_{5} (f_{4} (B) f_{3} (f_{2} (B) f_{1} (B))) + b_{2} O B g_{5} (g_{4} (B) g_{3} (g_{2} (B) g_{1} (B))) \end{matrix}

(203)

Any equality of the above type between

b_{1} O

and

b_{2} O

is frustrated by the factors

b_{1}

and

b_{2}

, forcing

O = 1

as the only satisfying observable. Since the obstruction occurs within grade-4, which is part of the even sub-algebra it is questionable that a satisfactory theory (with non-trivial observables) be constructible in 6D, using our method. □

This conjecture proposes that the multivector representation of the determinant in 6D does not allow for the construction of non-trivial observables, which is a crucial requirement for a relevant quantum formalism. The linear combination of multivector self-products in the 6D expression prevents the factorization of observables, limiting their role to the identity operator.

Conjecture 2

(No observables (above 6D)). The norms beyond 6D are progressively more complex than the 6D case, which is already obstructed.

These theorems and conjectures provide additional insights into the unique role of the unobstructed 3+1D signature in our proposal.

It is also interesting that our proposal is able to rule out

GA (1, 3)

even if in relativity, the signature of the metric

(+, -, -, -)

versus

(-, -, -, +)

does not influence the physics. However, in geometric algebra,

GA (1, 3)

represents 1 space dimension and 3 time dimensions. Therefore, it is not the signature itself that is ruled out but rather the specific arrangement of 3 time and 1 space dimensions, as this configuration yields quaternion-valued "probabilities" (i.e.

GA (1, 3) ≅ M_{2} (H)

and

det M_{2} (H) \in H

).

3. Discussion

When asked to define what a physical theory is, an informal answer may be that it is a measurements-constrained mathematical framework that applies to all possible experiments realizable within a domain, with nature as a whole being the most general domain. While physicists have expressed these theories through sets of axioms, we propose a more direct approach—mathematically realizing this fundamental definition itself. This definition is realized as an optimization problem (Definition 1) that can be solved directly (Theorem 1). The solution to this optimization problem yields precisely those structures that realize the physical theory over said domain. Succinctly, physics is the solution to:

\begin{matrix} \underset{\begin{matrix} an \\ optimization \\ problem \end{matrix}}{\underset{︸}{L}} : = \underset{\begin{matrix} on the entropy \\ of a measurement \\ relative to its preparation \\ over all \end{matrix}}{\underset{︸}{- \sum_{i} ρ_{i} (τ) ln \frac{ρ_{i} (τ)}{ρ_{i} (0)}}} + \underset{\begin{matrix} predictive theories \end{matrix}}{\underset{︸}{λ (1 - \sum_{i} ρ_{i} (τ))}} + \underset{\begin{matrix} of nature \end{matrix}}{\underset{︸}{τ tr (\bar{M} - \sum_{i} ρ_{i} (τ) M_{i})}} \end{matrix}

(204)

The relative Shannon entropy represents the basic structure of any experiment, quantifying the informational difference between its initial preparation and its final measurement.

The natural constraint is chosen to be the most general structure that admits a solution to this optimization problem. This generality follows from key mathematical requirements. The constraint must involve quantities that form an algebra, as the solution requires taking exponentials:

\begin{matrix} exp X = 1 + X + \frac{1}{2} X^{2} + \dots \end{matrix}

(205)

which involves addition, powers, and scalar multiplication of X. The use of the trace operation further necessitates that X must be represented by square

n \times n

matrices. Thus Axiom 1 involves

n \times n

matrices:

\begin{matrix} \bar{M} : = \sum_{i} ρ_{i} M_{i} \end{matrix}

(206)

The trace operation is utilized because the constraint must be converted back to a scalar for use in the Lagrange multiplier equation; while any function that maps an algebra to a scalar would achieve that, picking the trace recovers QM in the

GA (0, 1) ≅ C

case and SM in the

GA (0) ≅ R

case.

These mathematical requirements demonstrate that the natural constraint, as it admits the minimal mathematical structure required to solve an arbitrary entropy maximization problem, can be understood as the most general extension of the statistical mechanics average energy constraint which contains QM and SM (as induced by the trace) as specific solutions.

Thus, having established both the mathematical structure and its generality, we can understand how this minimal ontology operates. Since our formulation keeps the structure of experiments completely general, our optimization considers all possible predictive theories for that structure, and the constraint is the most general constraint possible for that structure, the resulting optimal physical theory applies, by construction, to all realizable experiments within its domain.

This ontology is both operational, being grounded in the basic structure of experiments rather than abstract entities, and constructive, showing how physical laws emerge from optimization over all possible predictive theories subject to the natural constraint. Physics is encapsulated not as a pre-defined collection of fundamental axioms but as the optimal solution to a well-defined optimization problem over all experiments realizable within the domain. This represents a significant philosophical shift from traditional physical ontologies where laws are typically taken as primitive.

The next step in our derivation is to represent the determinant of the

n \times n

matrices through a self-product of multivectors involving various conjugate structures. By examining the various dimensional configurations of geometric algebras, we find that GA(3,1), representing

4 \times 4

real matrices, admits a sub-algebra whose determinant is positive-definite for its invertible members. All other dimensional configurations fail to admit such a positive-definite structure, with two exceptions: statistical mechanics (found in GA(0)) and quantum mechanics (found in GA(0,1) and in a sub-algebra of GA(2,0)).

The solution reveals that the 3+1D case harbours a new type of field amplitude structure analogous to complex amplitudes, one that exhibits the characteristic elements of a quantum mechanical theory. Instead of complex-valued amplitudes, we have amplitudes valued in the invertible subset of the even sub-algebra of GA(3,1). When normalized, this amplitude is identical to David Hestenes’ wavefunction, but comes with an extended Born rule represented by the determinant, and rather than a complex Hilbert space, it lives in a "double-product structure". This double-product structure automatically incorporates gravity via the Spin(3,1) connection and local gauge theories as Yang-Mills theories. The square of the Dirac operator, automatically generated by the Lagrangian, then generates the invariants of gravity and of the Yang-Mills theory via a heat kernel expansion, along with the matter fields quantifying the system’s information via surprisal and limiting its propagation speed.

Interpretation: At the foundation of any experiment lie two empirically irreducible elements: an initial preparation described by a classical probability distribution

p_{i}

, and a final measurement described by a classical probability distribution

ρ_{i}

. Between these endpoints, traditional physics posits dynamical laws governing the evolution of intermediate states (e.g., wavefunction). Our framework inverts this ontology: quantum mechanics, spacetime geometry, and gauge symmetries are not fundamental entities but emergent tools that optimally interpolate between

p_{i}

and

ρ_{i}

under the constraint of nature (Axiom 1).

Dissolving the Measurement Problem: Traditional interpretations reify intermediate quantum states, demanding ad hoc "collapse" rules to reconcile unitary evolution with definite outcomes. By contrast, our framework eliminates ontological commitment to the wavefunction entirely. The Schrödinger equation and Born rule describe not physical dynamics but inferential relationships between

p_{i}

and

ρ_{i}

. Measurement is not a physical process but a consistency condition: the optimized solution must align initial and final classical distributions. This negates the need for collapse, as intermediate states exist only as interpolational devices.

4. Conclusion

E.T. Jaynes fundamentally reoriented statistical mechanics by recasting it as a problem of inference rather than mechanics. His approach revealed that the equations of thermodynamics are not arbitrary physical laws but necessary consequences of maximizing entropy subject to measured constraints. This work extends Jaynes’ inferential paradigm to address a more fundamental question: what is a physical theory itself?

A physical theory, at its essence, is a measurements-constrained mathematical framework that applies to all possible experiments realizable within a domain. While this definition is informal, our contribution lies in mathematizing this concept directly. By formulating it as an optimization problem—maximizing the relative entropy of measurement outcomes subject to the natural constraint—we transform an abstract definition into a precise, solvable mathematical structure.

This approach represents a profound methodological shift. Rather than constructing physical theories through trial and error enumerations of axioms, we derive them as necessary solutions to a well-defined optimization problem. Physics thus emerges not as a collection of independently discovered laws but as the unique optimal bridge between experimental preparation and measurement under the constraint of nature.

The power of this formulation lies in its generality: by varying only the algebraic structure of the constraint, we recover established physical theories as special cases of the same optimization principle. Jaynes showed that statistical inference with minimal assumptions yields thermodynamics; we suggest that this same principle, properly generalized, has the potential to yield the very foundation of all physics.

Statements and Declarations

Funding: This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Competing Interests: The author declares that he has no competing financial or non-financial interests that are directly or indirectly related to the work submitted for publication.
Data Availability Statement: No datasets were generated or analyzed during the current study.
During the preparation of this manuscript, we utilized a Large Language Model (LLM), for assistance with spelling and grammar corrections, as well as for minor improvements to the text to enhance clarity and readability. This AI tool did not contribute to the conceptual development of the work, data analysis, interpretation of results, or the decision-making process in the research. Its use was limited to language editing and minor textual enhancements to ensure the manuscript met the required linguistic standards.

Appendix E SM

Here, we solve the Lagrange multiplier equation of SM.

\begin{matrix} L : = \underset{Boltzmann Entropy}{\underset{︸}{- k_{B} \sum_{i} ρ_{i} ln ρ_{i}}} + \underset{2 c m Normalization Constraint}{\underset{︸}{λ (1 - \sum_{i} ρ_{i})}} + \underset{Average Energy Constraint}{\underset{︸}{β (\bar{E} - \sum_{i} ρ_{i} E_{i})}} \end{matrix}

(A1)

We solve the maximization problem as follows:

\begin{matrix} 0 & = \frac{\partial L (ρ_{1}, \dots, ρ_{i}, \dots, ρ_{n})}{\partial ρ_{i}} \end{matrix}

(A2)

\begin{matrix} = - ln ρ_{i} - 1 - λ - β E_{i} \end{matrix}

(A3)

\begin{matrix} = ln ρ_{i} + 1 + λ + β E_{i} \end{matrix}

(A4)

\begin{matrix} \Rightarrow ln ρ_{i} & = - 1 - λ - β E_{i} \end{matrix}

(A5)

\begin{matrix} \Rightarrow ρ_{i} & = exp (- 1 - λ) exp (- β E_{i}) \end{matrix}

(A6)

\begin{matrix} = \frac{1}{Z (τ)} exp (- β E_{i}) \end{matrix}

(A7)

The partition function, is obtained as follows:

\begin{matrix} 1 & = \sum_{j} exp (- 1 - λ) exp (- β E_{j}) \end{matrix}

(A8)

\begin{matrix} \Rightarrow {(exp (- 1 - λ))}^{- 1} & = \sum_{j} exp (- β E_{j}) \end{matrix}

(A9)

\begin{matrix} Z (τ) & = \sum_{j} exp (- β E_{j}) \end{matrix}

(A10)

Finally, the probability measure is:

\begin{matrix} ρ_{i} = \frac{1}{\sum_{j} exp (- β E_{j})} exp (- β E_{i}) \end{matrix}

(A11)

Appendix F SageMath program showing ⌊u ‡ u⌋ 3,4 u ‡ u=detM u

from sage.algebras.clifford_algebra import CliffordAlgebra
from sage.quadratic_forms.quadratic_form import QuadraticForm
from sage.symbolic.ring import SR
from sage.matrix.constructor import Matrix
# Define the quadratic form for GA(3,1) over the Symbolic Ring
Q = QuadraticForm(SR, 4, [-1, 0, 0, 0, 1, 0, 0, 1, 0, 1])
# Initialize the GA(3,1) algebra over the Symbolic Ring
algebra = CliffordAlgebra(Q)
# Define the basis vectors
e0, e1, e2, e3 = algebra.gens()
# Define the scalar variables for each basis element
a = var(’a’)
t, x, y, z = var(’t x y z’)
f01, f02, f03, f12, f23, f13 = var(’f01 f02 f03 f12 f23 f13’)
v, w, q, p = var(’v w q p’)
b = var(’b’)
# Create a general multivector
udegree0=a
udegree1=t*e0+x*e1+y*e2+z*e3
udegree2=f01*e0*e1+f02*e0*e2+f03*e0*e3+f12*e1*e2+f13*e1*e3+f23*e2*e3
udegree3=v*e0*e1*e2+w*e0*e1*e3+q*e0*e2*e3+p*e1*e2*e3
udegree4=b*e0*e1*e2*e3
u=udegree0+udegree1+udegree2+udegree3+udegree4
u2 = u.clifford_conjugate()*u
u2degree0 = sum(x for x in u2.terms() if x.degree() == 0)
u2degree1 = sum(x for x in u2.terms() if x.degree() == 1)
u2degree2 = sum(x for x in u2.terms() if x.degree() == 2)
u2degree3 = sum(x for x in u2.terms() if x.degree() == 3)
u2degree4 = sum(x for x in u2.terms() if x.degree() == 4)
u2conj34 = u2degree0+u2degree1+u2degree2-u2degree3-u2degree4
I = Matrix(SR, [[1, 0, 0, 0],
[0, 1, 0, 0],
[0, 0, 1, 0],
[0, 0, 0, 1]])
#MAJORANA MATRICES
y0 = Matrix(SR, [[0, 0, 0, 1],
[0, 0, -1, 0],
[0, 1, 0, 0],
[-1, 0, 0, 0]])
y1 = Matrix(SR, [[0, -1, 0, 0],
[-1, 0, 0, 0],
[0, 0, 0, -1],
[0, 0, -1, 0]])
y2 = Matrix(SR, [[0, 0, 0, 1],
[0, 0, -1, 0],
[0, -1, 0, 0],
[1, 0, 0, 0]])
y3 = Matrix(SR, [[-1, 0, 0, 0],
[0, 1, 0, 0],
[0, 0, -1, 0],
[0, 0, 0, 1]])
mdegree0 = a
mdegree1 = t*y0+x*y1+y*y2+z*y3
mdegree2 = f01*y0*y1+f02*y0*y2+f03*y0*y3+f12*y1*y2+f13*y1*y3+f23*y2*y3
mdegree3 = v*y0*y1*y2+w*y0*y1*y3+q*y0*y2*y3+p*y1*y2*y3
mdegree4 = b*y0*y1*y2*y3
m=mdegree0+mdegree1+mdegree2+mdegree3+mdegree4
print(u2conj34*u2 == m.det())

The program outputs

True

showing, by computer assisted symbolic manipulations, that the determinant of the real Majorana representation of a multivector u is equal to the double-product:

det M_{u} = {⌊ u^{‡} u ⌋}_{3, 4} u^{‡} u

.

References

Jaynes, E.T. Information theory and statistical mechanics. Physical review 1957, 106, 620. [Google Scholar] [CrossRef]
Jaynes, E.T. Information theory and statistical mechanics. II. Physical review 1957, 108, 171. [Google Scholar] [CrossRef]
Dirac, P.A.M. The principles of quantum mechanics; Number 27, Oxford university press, 1981.
Von Neumann, J. Mathematical foundations of quantum mechanics: New edition; Vol. 53, Princeton university press, 2018.
Hestenes, D. Spacetime physics with geometric algebra. American Journal of Physics 2003, 71, 691–714. [Google Scholar] [CrossRef]
Lundholm, D. Geometric (Clifford) algebra and its applications. arXiv preprint math/0605280, 2006. [Google Scholar]
Acus, A.; Dargys, A. Inverse of multivector: Beyond p+ q= 5 threshold. arXiv preprint arXiv:1712.05204, 2017. [Google Scholar]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Constructing Physics From Measurements

Abstract

Keywords:

Subject:

1. Introduction

2. Results

2.1. $u (1)$ -constraint: Quantum Mechanics

2.2. $R \oplus spin (2)$ -constraint: Euclidean QM in 2D

2.2.1. Bilinear Form

2.2.2. 1+1D Obstruction

2.2.3. $spin (2)$ -constraint: ≅ Quantum Mechanics

2.2.4. $R \oplus spin (2)$ -constraint: Euclidean QM in 2D

2.3. $R \oplus spinc (3, 1)$ -constraint: Gravity + Yang-Mills

2.3.1. The Multivector Determinant

2.3.2. The $R^{+} \times {Spin}^{c} (3, 1)$ -valued Field

2.3.3. Geometry

2.3.4. Dynamics

2.3.5. Gravity

2.3.6. Yang-Mills

2.3.7. Yang-Mills Axioms as Theorems

2.4. Dimensional Obstructions

3. Discussion

4. Conclusion

Statements and Declarations

Appendix E SM

Appendix F SageMath program showing ⌊u ‡ u⌋ 3,4 u ‡ u=detM u

References

MDPI Initiatives

Important Links

Subscribe

Constructing Physics From Measurements

Abstract

Keywords:

Subject:

1. Introduction

2. Results

2.1. u ( 1 ) -constraint: Quantum Mechanics

2.2. R ⊕ spin ( 2 ) -constraint: Euclidean QM in 2D

2.2.1. Bilinear Form

2.2.2. 1+1D Obstruction

2.2.3. spin ( 2 ) -constraint: ≅ Quantum Mechanics

2.2.4. R ⊕ spin ( 2 ) -constraint: Euclidean QM in 2D

2.3. R ⊕ spinc ( 3 , 1 ) -constraint: Gravity + Yang-Mills

2.3.1. The Multivector Determinant

2.3.2. The R + × Spin c ( 3 , 1 ) -valued Field

2.3.3. Geometry

2.3.4. Dynamics

2.3.5. Gravity

2.3.6. Yang-Mills

2.3.7. Yang-Mills Axioms as Theorems

2.4. Dimensional Obstructions

3. Discussion

4. Conclusion

Statements and Declarations

Appendix E SM

Appendix F SageMath program showing ⌊u ‡ u⌋ 3,4 u ‡ u=detM u

References

MDPI Initiatives

Important Links

Subscribe

2.1. $u (1)$ -constraint: Quantum Mechanics

2.2. $R \oplus spin (2)$ -constraint: Euclidean QM in 2D

2.2.3. $spin (2)$ -constraint: ≅ Quantum Mechanics

2.2.4. $R \oplus spin (2)$ -constraint: Euclidean QM in 2D

2.3. $R \oplus spinc (3, 1)$ -constraint: Gravity + Yang-Mills

2.3.2. The $R^{+} \times {Spin}^{c} (3, 1)$ -valued Field