Constructing Physics From Measurements

Alexandre Harvey-Tremblay

doi:10.20944/preprints202404.1009.v17

Submitted:

14 April 2025

Posted:

16 April 2025


Abstract

We present a reformulation of fundamental physics, transitioning from an enumeration of independent axioms to the solution of a single optimization problem derived from the structure of experiments. Any experiment comprises an initial preparation, a physical evolution, and a final measurement. Grounded in this structure, we determine the final measurement distribution by minimizing its entropy relative to its initial preparation distribution, subject to a natural constraint. Solving this optimization problem identifies a unified theory encompassing quantum mechanics, general relativity (acting on spacetime geometry), and Yang-Mills gauge theories (acting on internal spaces). Notably, consistency requirements restrict valid solutions to 3+1 dimensions, thus deriving spacetime dimensionality. This reformulation suggests that the established laws of physics, including their specific forces, symmetries, and dimensionality, emerge naturally from the requirement of the minimal informational change from preparation to measurement, consistent with the natural constraint.

Keywords:

foundations of physics

Subject:

Physical Sciences - Quantum Science and Technology

1. Introduction

Statistical mechanics (SM), in the formulation developed by E.T. Jaynes [1,2], is founded on an entropy optimization principle. Specifically, the Boltzmann entropy is maximized under the constraint of a fixed average energy

\bar{E}

:

\begin{matrix} \bar{E} : = \sum_{i} ρ_{i} E_{i} \end{matrix}

(1)

The Lagrange multiplier equation defining the optimization problem is:

\begin{matrix} L : = - k_{B} \sum_{i} ρ_{i} (β) ln ρ_{i} (β) + λ (1 - \sum_{i} ρ_{i} (β)) + β (\bar{E} - \sum_{i} ρ_{i} (β) E_{i}) \end{matrix}

(2)

where

λ

and

β

are Lagrange multipliers enforcing the normalization and average energy constraints. Solving this optimization problem yields the Gibbs measure:

\begin{matrix} ρ_{i} (β) = \frac{1}{Z (β)} exp (- β E_{i}), \end{matrix}

(3)

where

Z (β) : = \sum_{i} exp (- β E_{i})

is the partition function.
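As a minimal numerical sketch of Equations 1 to 3, the following Python snippet builds the Gibbs measure and checks that a small perturbation respecting both constraints lowers the entropy. The energy levels, the value of β, the choice k_B = 1, and all function names are assumptions made for illustration only.

```python
import math

def gibbs(energies, beta):
    """Gibbs measure rho_i = exp(-beta * E_i) / Z(beta) (Equation 3)."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)  # partition function Z(beta)
    return [w / z for w in weights]

def entropy(p):
    """Entropy -sum p ln p, with k_B = 1 (an assumption of this sketch)."""
    return -sum(q * math.log(q) for q in p if q > 0)

energies = [0.0, 1.0, 2.0]  # hypothetical energy levels
beta = 0.7                  # hypothetical Lagrange multiplier

def mean_energy(p):
    return sum(q * e for q, e in zip(p, energies))

rho = gibbs(energies, beta)

# For these levels the direction (1, -2, 1) changes neither the
# normalization nor the average energy, so the perturbed distribution
# still satisfies both constraints.
eps = 0.01
perturbed = [r + eps * d for r, d in zip(rho, (1.0, -2.0, 1.0))]
```

Comparing `entropy(perturbed)` with `entropy(rho)` illustrates, for this one direction, that the Gibbs measure sits at the constrained maximum.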

For comparison, quantum mechanics (QM) is not formulated as the solution to an optimization problem, but rather consists of a collection of axioms [3,4]:

QM Axiom 1 of 5: State Space: Every physical system is associated with a complex Hilbert space, and its state is represented by a ray (an equivalence class of vectors differing by a non-zero scalar multiple) in this space.
QM Axiom 2 of 5: Observables: Physical observables correspond to Hermitian (self-adjoint) operators acting on the Hilbert space.
QM Axiom 3 of 5: Dynamics: The time evolution of a quantum system is governed by the Schrödinger equation, where the Hamiltonian operator represents the system’s total energy.
QM Axiom 4 of 5: Measurement: Measuring an observable projects the system into an eigenstate of the corresponding operator, yielding one of its eigenvalues as the measurement result.
QM Axiom 5 of 5: Probability Interpretation: The probability of obtaining a specific measurement outcome is given by the squared magnitude of the projection of the state vector onto the relevant eigenstate (Born rule).

Physical theories have traditionally been constructed in two distinct ways. Some, like QM, are defined through a set of mathematical axioms that are first postulated and then verified against experiments. Others, like SM, emerge as solutions to optimization problems with experimentally-verified constraints.

We propose to generalize the optimization methodology of E.T. Jaynes to encompass all of physics, aiming to derive a unified theory from a single optimization problem.

To that end, we introduce the following constraint:

Axiom 1

(Nature).

\begin{matrix} \bar{M} : = \sum_{i} ρ_{i} M_{i} \end{matrix}

where

M_{i}

are

n \times n

matrices, and

\bar{M}

is their average.

This constraint, as it replaces the scalar

E_{i}

with the matrix

M_{i}

, extends E.T. Jaynes’ optimization method to encompass non-commutative observables and symmetry group generators required for fundamental physics.

We then construct an optimization problem:

Definition 1

(Physics). Physics is the solution to:

\begin{matrix} \underbrace{L}_{\begin{matrix} \text{an} \\ \text{optimization} \\ \text{problem} \end{matrix}} : = \underbrace{- \sum_{i} ρ_{i} (t) ln \frac{ρ_{i} (t)}{ρ_{i} (0)}}_{\begin{matrix} \text{on the entropy} \\ \text{of a measurement} \\ \text{relative to its preparation} \\ \text{over all} \end{matrix}} + \underbrace{λ (1 - \sum_{i} ρ_{i} (t))}_{\begin{matrix} \text{predictive theories} \end{matrix}} + \underbrace{t tr (\bar{M} - \sum_{i} ρ_{i} (t) M_{i})}_{\begin{matrix} \text{of nature} \end{matrix}} \end{matrix}

where λ and t are Lagrange multipliers enforcing the normalization and natural constraints, respectively.

This definition constitutes our complete proposal for reformulating fundamental physics—no additional principles will be introduced. By replacing the Boltzmann entropy with the relative Shannon entropy, the optimization problem extends beyond thermodynamic variables to encompass any type of experiment. This generalization occurs because relative entropy captures the essence of any experiment: the relationship between a final measurement and its initial preparation.

Two key constraints shape our framework. The normalization constraint ensures we are working with a proper predictive theory, while the natural constraint spawns the domain of applicability of the theory. The crucial insight is that because our formulation maintains complete generality in the structure of experiments while optimizing over all possible predictive theories, the resulting solution holds true, by construction, for all realizable experiments within its domain.

This approach reduces our reliance on postulating axioms through trial and error, and simplifies the foundations of physics. Specifically, when we employ the natural constraint, which is the most permissive constraint for this problem (see Discussion for proof), the solution spawns its largest domain, pointing towards a unified physics where fundamental theories emerge naturally: for example, SM when

M ≅ R

, QM when

M ≅ u (1)

, and general relativity (acting on spacetime) + Yang-Mills (acting on internal spaces) when

M ≅ spin (3, 1) \oplus u (1)

. As we found, these three solutions are the only possible ones, because those entailed by other algebras encounter obstructions that violate the axioms of probability theory.

Theorem 1.

The general solution of the optimization problem is:

\begin{matrix} ρ_{i} (t) = \frac{1}{\sum_{j} det exp (- t M_{j}) ρ_{j} (0)} det exp (- t M_{i}) ρ_{i} (0) \end{matrix}

Proof.

We solve the entropy minimization problem by setting the derivative of the Lagrange multiplier equation with respect to

ρ_{i} (t)

to zero:

\begin{matrix} \frac{\partial L [ρ_{1} (t), \dots, ρ_{i} (t), \dots, ρ_{n} (t)]}{\partial ρ_{i} (t)} & = - ln \frac{ρ_{i} (t)}{ρ_{i} (0)} - 1 - λ - t tr M_{i} = 0 . \end{matrix}

(4)

\begin{matrix} \Rightarrow ln \frac{ρ_{i} (t)}{ρ_{i} (0)} & = - 1 - λ - t tr M_{i} . \end{matrix}

(5)

\begin{matrix} \Rightarrow ρ_{i} (t) & = ρ_{i} (0) exp (- 1 - λ) exp (- t tr M_{i}) . \end{matrix}

(6)

Normalizing the probabilities using

\sum_{j} ρ_{j} (t) = 1

, we find:

\begin{matrix} 1 & = \sum_{j} ρ_{j} (t) = exp (- 1 - λ) \sum_{j} ρ_{j} (0) exp (- t tr M_{j}), \end{matrix}

(7)

\begin{matrix} \Rightarrow exp (1 + λ) & = \sum_{j} ρ_{j} (0) exp (- t tr M_{j}) . \end{matrix}

(8)

Substituting back, we obtain:

\begin{matrix} ρ_{i} (t) & = \frac{1}{\sum_{j} ρ_{j} (0) exp (- t tr M_{j})} exp (- t tr M_{i}) ρ_{i} (0) \end{matrix}

(9)

then using the identity

det exp (M) \equiv exp tr M

for square matrices

M

, we get:

\begin{matrix} ρ_{i} (t) & = \frac{1}{\sum_{j} det exp (- t M_{j}) ρ_{j} (0)} det exp (- t M_{i}) ρ_{i} (0) . \end{matrix}

(10)

□
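The following Python sketch evaluates the solution of Theorem 1 for two small matrices and compares the determinant form (Equation 10) with the trace form (Equation 9). The matrices, the prior, the value of t, and the helper names are hypothetical, and the matrix exponential is approximated by a truncated Taylor series, which is only adequate for matrices of small norm.

```python
import math

def mat_mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_exp(m, terms=40):
    """Matrix exponential via a truncated Taylor series."""
    n = len(m)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        # term now holds m^k / k!
        term = [[x / k for x in row] for row in mat_mul(term, m)]
        result = [[r + s for r, s in zip(r_row, t_row)]
                  for r_row, t_row in zip(result, term)]
    return result

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def general_solution(prior, matrices, t):
    """rho_i(t) of Theorem 1 for 2x2 matrices M_i and prior rho_i(0)."""
    weights = [det2(mat_exp([[-t * x for x in row] for row in m])) * p
               for m, p in zip(matrices, prior)]
    z = sum(weights)
    return [w / z for w in weights]

# Hypothetical inputs, chosen only for illustration.
matrices = [[[1.0, 2.0], [0.0, 0.5]], [[0.2, -1.0], [1.0, 0.3]]]
prior = [0.4, 0.6]
t = 0.3
rho_t = general_solution(prior, matrices, t)

# The same quantity written with the trace form of Equation 9.
trace_weights = [math.exp(-t * (m[0][0] + m[1][1])) * p
                 for m, p in zip(matrices, prior)]
rho_trace = [w / sum(trace_weights) for w in trace_weights]
```

Agreement between `rho_t` and `rho_trace` is a numerical instance of the identity det exp(M) = exp(tr M) used in the last step of the proof.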

As we will see in the results section, this solution encapsulates three distinct special cases:

Statistical Mechanics: To recover SM from Equation 10, we consider the case where the matrices $M_{i}$ are $1 \times 1$ , i.e., real scalars. Specifically, we set:

$\begin{matrix} \bar{M} = \sum_{i} ρ_{i} M_{i}, with M_{i} = E_{i}, \end{matrix}$

(11)

and take $ρ_{i} (0)$ to be a uniform distribution. Then, Equation 10 reduces to the Gibbs distribution:

$\begin{matrix} ρ_{i} (t) = \frac{1}{Z} exp (- t E_{i}), \end{matrix}$

(12)

where t corresponds to the $β$ of SM. This demonstrates that our solution generalizes SM, as it recovers it when $M_{i}$ are scalars.
Quantum Mechanics: By choosing $M_{i}$ to represent the $u (1)$ algebra, we derive the axioms of QM from optimization. Specifically, we set:

$\begin{matrix} \bar{M} = \sum_{i} ρ_{i} M_{i}, with M_{i} = [\begin{matrix} 0 & - E_{i} \\ E_{i} & 0 \end{matrix}], \end{matrix}$

(13)

In the results section, we will detail how this choice leads to the Born rule in lieu of the Gibbs measure and show that the partition function is unitary invariant; the solution is shown to satisfy all five axioms of QM.
Unified Theory: Extending our approach, we choose $M_{i}$ to be $4 \times 4$ matrices representing the $spin (3, 1) \oplus u (1)$ algebra. Specifically, we consider multivectors of the form $u = f + b$ , where $f$ is a bivector and $b$ is a pseudoscalar of the 3+1D geometric algebra $GA (3, 1)$ . The matrix representation of $M_{i}$ is:

$\begin{matrix} M_{i} = [\begin{matrix} f_{02} & b - f_{13} & - f_{01} + f_{12} & f_{03} + f_{23} \\ - b + f_{13} & f_{02} & f_{03} + f_{23} & f_{01} - f_{12} \\ - f_{01} - f_{12} & f_{03} - f_{23} & - f_{02} & - b - f_{13} \\ f_{03} - f_{23} & f_{01} + f_{12} & b + f_{13} & - f_{02} \end{matrix}], \end{matrix}$

(14)

where $f_{01}, f_{02}, f_{03}, f_{12}, f_{13}, f_{23}$ , and b correspond to the generators of the $Spin^{c} (3, 1)$ group, which includes both Lorentz boosts/rotations and the four-volume orientation. Solving the optimization problem with this choice leads to a relativistic quantum probability measure extending the Born rule from $C$ to $Spin^{c} (3, 1)$ . The solution is shown to uniquely satisfy both general relativity (acting on spacetime) and Yang-Mills (acting on its internal spaces).
Dimensional Obstructions: Definition 1 yields valid probability measures only in specific cases of Axiom 1. Beyond the instances of statistical mechanics and quantum mechanics, Axiom 1 produces a consistent solution only in 3+1 dimensions. In other dimensional configurations, various obstructions arise that violate the axioms of probability theory. The following table summarizes the geometric cases and their obstructions:

$\begin{matrix} \text{Dimensions} & \text{Optimal Predictive Theory of Nature} \\ GA (0) & \text{Statistical Mechanics} \\ GA (0, 1) & \text{Quantum Mechanics} \\ GA (1, 0) & \text{Obstructed (Negative probabilities)} \\ GA (2, 0) & \text{Quantum Mechanics} \\ GA (1, 1) & \text{Obstructed (Negative probabilities)} \\ GA (0, 2) & \text{Obstructed (Non-real probabilities)} \\ GA (3, 0) & \text{Obstructed (Non-real probabilities)} \\ GA (2, 1) & \text{Obstructed (Non-real probabilities)} \\ GA (1, 2) & \text{Obstructed (Non-real probabilities)} \\ GA (0, 3) & \text{Obstructed (Non-real probabilities)} \\ GA (4, 0) & \text{Obstructed (Non-real probabilities)} \\ GA (3, 1) & \text{Gravity + Yang-Mills} \\ GA (2, 2) & \text{Obstructed (Negative probabilities)} \\ GA (1, 3) & \text{Obstructed (Non-real probabilities)} \\ GA (0, 4) & \text{Obstructed (Non-real probabilities)} \\ GA (5, 0) & \text{Obstructed (Non-real probabilities)} \\ ⋮ & ⋮ \\ GA (6, 0) & \text{Suspected Obstructed (No observables)} \\ ⋮ & ⋮ \end{matrix}$

(15)-(31)

where $GA (p, q)$ denotes the geometric algebra of $p + q$ dimensions, with p the number of positive-signature dimensions and q the number of negative-signature dimensions. QM appears twice because both $GA (0, 1)$ and the even-subalgebra of $GA (2, 0)$ are isomorphic to $C$ .

We will first investigate the unobstructed cases in Section 2.1, Section 2.2 and Section 2.3 and then demonstrate the obstructions in Section 2.4. These obstructions are desirable because they automatically limit the theory to 3+1D, thus providing a built-in mechanism for the observed dimensionality of our universe.

2. Results

2.1. $u (1)$ -constraint: Quantum Mechanics

In SM, the central observation is that energy measurements of a thermally equilibrated system tend to cluster around a fixed average value (Equation 1). In contrast, QM is characterized by the presence of interference effects in measurement outcomes. To capture these features, we introduce the following special case of Axiom 1:

Definition 2

(

u (1)

constraint). We reduce the generality of Axiom 1 to the generator of the

U (1)

group. Specifically, we replace

\begin{matrix} \bar{M} = \sum_{i} ρ_{i} M_{i} with M_{i} = \frac{1}{ℏ} [\begin{matrix} 0 & - E_{i} \\ E_{i} & 0 \end{matrix}] : = \frac{1}{ℏ} I E_{i} \end{matrix}

where

E_{i}

are scalar values (e.g., energy levels),

ρ_{i}

are the probabilities of outcomes, the matrices

M_{i}

generate the

U (1)

group, and where

I : = [\begin{matrix} 0 & - 1 \\ 1 & 0 \end{matrix}]

and

I^{2} = - [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}] ≅ - 1

.

Then, the Lagrange multiplier equation (Definition 1) becomes:

\begin{matrix} L = - \sum_{i} ρ_{i} (t) ln \frac{ρ_{i} (t)}{ρ_{i} (0)} + λ (1 - \sum_{i} ρ_{i} (t)) + \frac{t}{ℏ} tr (I \bar{E} - \sum_{i} ρ_{i} (t) I E_{i}) \end{matrix}

(32)

The general solution of the optimization problem (Theorem 1), with the above-mentioned replacements, reduces to the following ensemble:

\begin{matrix} ρ_{i} (t) = \frac{1}{\sum_{j} det exp (- I t E_{j} / ℏ) ρ_{j} (0)} det exp (- I t E_{i} / ℏ) ρ_{i} (0) \end{matrix}

(33)

Though initially unfamiliar, this form effectively establishes a comprehensive formulation of QM, as we will demonstrate.
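A short Python sketch can make Equation 33 less opaque. It checks that the generator of Definition 2 squares to minus the identity, that the resulting evolution operator is a rotation matrix with unit determinant, and that the ensemble therefore returns the preparation probabilities unchanged. The numerical values, the choice ħ = 1, and the helper names are assumptions of this sketch.

```python
import math

I = [[0.0, -1.0], [1.0, 0.0]]  # generator of Definition 2

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def evolution(theta):
    """exp(-theta * I) = cos(theta) * 1 - sin(theta) * I, valid because I*I = -1."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s], [-s, c]]

hbar = 1.0                  # units with hbar = 1 (assumption)
energies = [0.5, 1.3, 2.1]  # hypothetical E_i
prior = [0.2, 0.5, 0.3]     # hypothetical rho_i(0)
t = 0.8

# Equation 33: each weight is det exp(-I t E_i / hbar) * rho_i(0).
weights = [det2(evolution(t * e / hbar)) * p for e, p in zip(energies, prior)]
rho_t = [w / sum(weights) for w in weights]
I_squared = mat_mul(I, I)
```

Because every determinant equals one, `rho_t` coincides with `prior`; the nontrivial content of the ensemble only appears once the probabilities are rewritten in terms of amplitudes, as done next.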

Let us introduce a definition for

ρ_{i} (0) : = det ψ_{i} (0)

. Since

ρ_{i} (0)

is a real number greater than or equal to 0, it follows that

ψ_{i} (0)

is a structure whose determinant is non-negative. Furthermore, the evolution operator

exp (- I t E_{i} / ℏ)

must preserve this structure when acting on it. These two requirements mean that

ψ_{i} (0) : = [\begin{matrix} a & - b \\ b & a \end{matrix}] ≅ a + i b

. We also note that

det [\begin{matrix} a & - b \\ b & a \end{matrix}] = a^{2} + b^{2} ≅ {| a + i b |}^{2}

, and that

det A B = det A det B

and

| z_{1} z_{2} |^{2} = | z_{1} |^{2} {| z_{2} |}^{2}

. Consequently, we can reposition

ρ_{i} (0)

inside the determinant where it becomes

ψ_{i} (0)

:

\begin{matrix} ρ_{i} (t) = \frac{1}{\sum_{j} det (exp (- I t E_{j} / ℏ) ψ_{j} (0))} det (exp (- I t E_{i} / ℏ) ψ_{i} (0)) \end{matrix}

(34)

In this matrix representation, the determinant is equivalent to the complex norm. Replacing the former with the latter yields:

\begin{matrix} ρ_{i} (t) = \underbrace{\frac{1}{\sum_{j} {| exp (- i t E_{j} / ℏ) ψ_{j} (0) |}^{2}}}_{\begin{matrix} \text{Unitary Invariant} \\ \text{Ensemble} \end{matrix}} {| \underbrace{exp (- i t E_{i} / ℏ)}_{\begin{matrix} \text{Evolution} \\ \text{Operator} \end{matrix}} |}^{2} \underbrace{{| ψ_{i} (0) |}^{2}}_{\begin{matrix} \text{Initial} \\ \text{Preparation} \end{matrix}} \end{matrix}

(35)

This equation describes the time evolution of a quantum system expressed in its eigenbasis, where

ρ_{i} (t) = ρ_{i} (0)

is the expected outcome. The full gamut of unitary evolution is obtained as a property of the ensemble, which is unitary invariant. In fact, the partition function is a map

C^{n} \to R_{\geq 0}

, and as such it defines the inner product of an n-dimensional Hilbert space. This relationship is articulated as follows:

\begin{matrix} Z (t) : = \sum_{j} {| exp (- i t E_{j} / ℏ) ψ_{j} (0) |}^{2} = 〈 ψ | ψ 〉 = 〈 U ψ | U ψ 〉 \end{matrix}

(36)

where

\begin{matrix} [\begin{matrix} ψ_{1} (t) \\ ⋮ \\ ψ_{n} (t) \end{matrix}] : = [\begin{matrix} exp (- i t E_{1} / ℏ) \\ ⋱ \\ exp (- i t E_{n} / ℏ) \end{matrix}] [\begin{matrix} ψ_{1} (0) \\ ⋮ \\ ψ_{n} (0) \end{matrix}] \end{matrix}

(37)
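The identification between the 2x2 matrices and complex numbers used in Equations 34 to 37 can be checked numerically. The Python sketch below compares the determinant form of the probabilities with the complex-norm form and verifies that the partition function does not depend on t. The amplitudes, the energies, the choice ħ = 1, and the function names are hypothetical.

```python
import cmath

def as_matrix(z):
    """Represent a + i b as [[a, -b], [b, a]]."""
    return [[z.real, -z.imag], [z.imag, z.real]]

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

hbar = 1.0                         # units with hbar = 1 (assumption)
energies = [0.5, 1.3]              # hypothetical E_j
psi0 = [0.6 + 0.3j, -0.2 + 0.7j]   # hypothetical, deliberately unnormalized

def psi_t(t):
    """Equation 37: each component acquires the phase exp(-i t E_j / hbar)."""
    return [cmath.exp(-1j * t * e / hbar) * p for e, p in zip(energies, psi0)]

def partition(t):
    """Equation 36: Z(t) as a sum of squared norms."""
    return sum(abs(p) ** 2 for p in psi_t(t))

def rho_det(t):
    """Equation 34: determinants of the 2x2 representation."""
    w = [det2(mat_mul(as_matrix(cmath.exp(-1j * t * e / hbar)), as_matrix(p)))
         for e, p in zip(energies, psi0)]
    return [x / sum(w) for x in w]

def rho_norm(t):
    """Equation 35: squared complex norms."""
    w = [abs(p) ** 2 for p in psi_t(t)]
    return [x / sum(w) for x in w]
```

Calling `rho_det(t)` and `rho_norm(t)` for any t gives the same numbers, and `partition(t)` stays equal to `partition(0.0)`.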

Furthermore, since

Z (t)

is unitary invariant, this relation generalizes as follows, by any change of basis

U^{†} U = I

:

\begin{matrix} [\begin{matrix} ψ_{1} (t) \\ ⋮ \\ ψ_{n} (t) \end{matrix}] & : = U^{†} [\begin{matrix} exp (- i t E_{1} / ℏ) \\ ⋱ \\ exp (- i t E_{n} / ℏ) \end{matrix}] U [\begin{matrix} ψ_{1} (0) \\ ⋮ \\ ψ_{n} (0) \end{matrix}] \end{matrix}

(38)

Thus yielding the general solution:

\begin{matrix} | ψ (t) 〉 & = exp (- i t H / ℏ) | ψ (0) 〉 \end{matrix}

(39)
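To illustrate how a change of basis turns the diagonal phases into genuine dynamics, the following Python sketch applies Equation 38 to a two-level system. The unitary U (a real rotation by π/4), the energies, the initial state, and the choice ħ = 1 are assumptions made for this example only.

```python
import cmath
import math

def mat_vec(m, v):
    return [sum(m[i][k] * v[k] for k in range(len(v))) for i in range(len(m))]

def dagger(m):
    """Conjugate transpose."""
    return [[m[j][i].conjugate() for j in range(len(m))]
            for i in range(len(m[0]))]

hbar = 1.0             # units with hbar = 1 (assumption)
energies = [0.0, 1.0]  # hypothetical eigenvalues of H
angle = math.pi / 4    # hypothetical change of basis
U = [[complex(math.cos(angle)), complex(-math.sin(angle))],
     [complex(math.sin(angle)), complex(math.cos(angle))]]

def evolve(psi0, t):
    """Equation 38: psi(t) = U^dagger diag(exp(-i t E_j / hbar)) U psi(0)."""
    in_eigenbasis = mat_vec(U, psi0)
    phased = [cmath.exp(-1j * t * e / hbar) * c
              for e, c in zip(energies, in_eigenbasis)]
    return mat_vec(dagger(U), phased)

psi0 = [1.0 + 0j, 0.0 + 0j]
psi1 = evolve(psi0, 1.0)
probabilities = [abs(c) ** 2 for c in psi1]
```

In the rotated basis the populations change with t while their sum stays equal to one, which is the unitary invariance of Z(t) stated above. For this particular choice the first population works out to (1 + cos 1)/2.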

Let us now investigate how the axioms of QM are recovered from this result:

The optimization procedure inherently normalizes physical states with $1 / Z$ . Furthermore, as physical states are associated with the probability measure, and the probability is defined up to a phase, we conclude that physical states map to rays within Hilbert space. This demonstrates QM Axiom 1 of 5.
An observable of the ensemble must satisfy:

$\begin{matrix} \bar{O} : = \sum_{j} O_{j} {| exp (- i t E_{j} / ℏ) ψ_{j} (0) |}^{2} \end{matrix}$

(40)

Since $Z = 〈 ψ | ψ 〉$ , any self-adjoint operator satisfying the condition $〈 O ψ | ϕ 〉 = 〈 ψ | O ϕ 〉$ will reproduce the above equation, simply because $〈 O 〉 : = 〈 ψ | O | ψ 〉$ . This demonstrates QM Axiom 2 of 5.
The system’s dynamics emerge from differentiating Equation 39 with respect to the Lagrange multiplier. This is manifested as:

$\begin{matrix} \frac{\partial}{\partial t} | ψ (t) 〉 & = \frac{\partial}{\partial t} (exp (- i t H / ℏ) | ψ (0) 〉) \end{matrix}$

(41)

$\begin{matrix} = - i H / ℏ exp (- i t H / ℏ) | ψ (0) 〉 \end{matrix}$

(42)

$\begin{matrix} = - i H / ℏ | ψ (t) 〉 \end{matrix}$

(43)

$\begin{matrix} \Rightarrow i ℏ \frac{\partial}{\partial t} | ψ (t) 〉 & = H | ψ (t) 〉 \end{matrix}$

(44)

which is the Schrödinger equation. This demonstrates QM Axiom 3 of 5.
From Equation 39 it follows that the possible microstates $E_{i}$ of the system correspond to specific eigenvalues of $H$ . An observation can thus be conceptualized as sampling from $ρ$ , with the measured state being the occupied microstate i. Consequently, when a measurement occurs, the system invariably emerges in one of these microstates, which directly corresponds to an eigenstate of $H$ . Measured in the eigenbasis, the probability measure is:

$\begin{matrix} ρ_{i} (t) = \frac{1}{〈 ψ | ψ 〉} {| ψ_{i} (t) |}^{2} . \end{matrix}$

(45)

In scenarios where the probability measure $ρ_{i} (t)$ is expressed in a basis other than its eigenbasis, the probability $P (λ_{i})$ of obtaining the eigenvalue $λ_{i}$ is given by a projection onto an eigenstate:

$\begin{matrix} P (λ_{i}) = {| 〈 λ_{i} | ψ 〉 |}^{2} \end{matrix}$

(46)

Here, $| 〈 λ_{i} | ψ 〉 |^{2}$ signifies the squared magnitude of the amplitude of the state $| ψ 〉$ when projected onto the eigenstate $| λ_{i} 〉$ . As this argument holds for any observable, this demonstrates QM Axiom 4 of 5.
Finally, since the probability measure (Equation 35) replicates the Born rule, QM Axiom 5 of 5 is also demonstrated.

These results show that the

u (1)

-constraint is sufficient to entail the foundations of QM through the principle of relative entropy minimization.
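As a small worked instance of Equations 40 and 46, the Python sketch below computes Born-rule probabilities for an observable that is not diagonal in the working basis and compares the two ways of writing its expectation value. The observable, the state, and the helper names are hypothetical choices made for illustration.

```python
import math

def inner(u, v):
    """Hilbert-space inner product <u|v>, conjugate-linear in the first slot."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

# Hypothetical observable O = [[0, 1], [1, 0]] with eigenvalues +1 and -1.
observable = [[0j, 1 + 0j], [1 + 0j, 0j]]
eigenvalues = [1.0, -1.0]
s = 1 / math.sqrt(2)
eigenstates = [[s + 0j, s + 0j], [s + 0j, -s + 0j]]

psi = [0.6 + 0j, 0.8 + 0j]  # hypothetical normalized state

# Born rule (Equation 46): P(lambda_i) = |<lambda_i|psi>|^2.
probabilities = [abs(inner(lam, psi)) ** 2 for lam in eigenstates]

# Expectation value written as sum_i lambda_i P(lambda_i) ...
expectation_from_probabilities = sum(
    l * p for l, p in zip(eigenvalues, probabilities))

# ... and as <psi|O|psi>.
o_psi = [sum(observable[i][k] * psi[k] for k in range(2)) for i in range(2)]
expectation_from_operator = inner(psi, o_psi).real
```

For this state the two probabilities are 0.98 and 0.02, and both expressions for the expectation value give 0.96.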

2.2. $spin (2)$ -constraint: Euclidean QM in 2D

In this section, we investigate a model, isomorphic to QM, that lives in 2D—it provides a valuable starting point before addressing the more complex 3+1D case. Since

Spin (2) ≅ U (1)

, then this model is isomorphic to QM. Before we solve the optimization problem, we will first define the determinant of a multivector of

GA (2, 0)

.

2.2.1. Multivector Determinant

In general a multivector of

GA (2, 0)

can be written as

u : = a + x + b

, where a is a scalar,

x : = x e_{1} + y e_{2}

is a vector and

b : = b e_{1} e_{2} = b I

is a pseudo-scalar. Such a multivector can be represented as a real

2 \times 2

matrix via an isomorphism:

Definition 3

(Pauli Algebra Isomorphism). The map

φ : GA (2, 0) \to Mat (2, R)

defined by:

\begin{matrix} φ (1) : = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}], φ (e_{x}) : = σ_{1} = [\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}], φ (e_{y}) : = σ_{3} = [\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}], \end{matrix}

(47)

extends linearly and multiplicatively to an isomorphism between

GA (2, 0)

and the algebra of real

2 \times 2

matrices. In particular, the basis bivector maps to:

\begin{matrix} φ (e_{x} e_{y}) = σ_{1} σ_{3} = [\begin{matrix} 0 & - 1 \\ 1 & 0 \end{matrix}] . \end{matrix}

(48)

Definition 4

(Matrix Representation). For a multivector

u : = a + x + b

, its matrix representation under φ is:

φ (u) = a φ (1) + x φ (e_{x}) + y φ (e_{y}) + b φ (e_{x} e_{y}) = [\begin{matrix} a + y & x - b \\ x + b & a - y \end{matrix}],

We now introduce the multivector conjugate, also known as the Clifford conjugate, which generalizes the concept of complex conjugation to multivectors.

Definition 5

(Multivector Conjugate—in

GA (2, 0)

). Let

u : = a + x + b

be in

GA (2, 0)

. Then its multivector conjugate is defined as:

\begin{matrix} u^{‡} : = a - x - b \end{matrix}

The determinant of the matrix representation of a multivector can be expressed as a multivector self-product:

Theorem 2

(Multivector Determinant—in

GA (2, 0)

). Let

u : = a + x + b

be in

GA (2, 0)

, then:

\begin{matrix} u^{‡} u \equiv det φ (u) \end{matrix}

(49)

Proof.

Let

u : = a + x + b

thus

φ (u) = [\begin{matrix} a + y & x - b \\ x + b & a - y \end{matrix}]

. Then:

\begin{matrix} 1 : u^{‡} u & = {(a + x + b)}^{‡} (a + x + b) \end{matrix}

(50)

\begin{matrix} = (a - x - b) (a + x + b) \end{matrix}

(51)

\begin{matrix} = a^{2} + a x + a b - x a - x^{2} - x b - b a - b x - b^{2} \end{matrix}

(52)

\begin{matrix} = a^{2} - x^{2} + b^{2} \end{matrix}

(53)

\begin{matrix} = a^{2} - x^{2} - y^{2} + b^{2}, \text{ since } \mathbf{x}^{2} = x^{2} + y^{2} \end{matrix}

(54)

\begin{matrix} 2 : det φ (u) & = det [\begin{matrix} a + y & x - b \\ x + b & a - y \end{matrix}] \end{matrix}

(55)

\begin{matrix} = (a + y) (a - y) - (x - b) (x + b) \end{matrix}

(56)

\begin{matrix} = a^{2} - x^{2} - y^{2} + b^{2} \end{matrix}

(57)

□
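Theorem 2 can also be spot-checked numerically through the matrix representation of Definition 4. The Python sketch below multiplies φ(u‡) by φ(u) and compares the result with det φ(u) times the identity. The coefficients of u and the helper names are hypothetical.

```python
def phi(a, x, y, b):
    """Matrix representation of u = a + x e_x + y e_y + b e_x e_y (Definition 4)."""
    return [[a + y, x - b], [x + b, a - y]]

def conjugate(a, x, y, b):
    """Multivector conjugate of Definition 5: the vector and bivector parts flip sign."""
    return (a, -x, -y, -b)

def mat_mul(m, n):
    return [[sum(m[i][k] * n[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

u = (1.5, 0.5, -2.0, 0.75)  # hypothetical coefficients (a, x, y, b)
a, x, y, b = u

self_product = mat_mul(phi(*conjugate(*u)), phi(*u))
determinant = det2(phi(*u))
closed_form = a**2 - x**2 - y**2 + b**2
```

Here `self_product` is `determinant` times the identity matrix, and both agree with `closed_form`. For this particular u the value is negative (-1.4375), which illustrates that the self-product of a general multivector need not be positive.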

2.2.2. Inner Product

Building upon the concept of the multivector conjugate, we introduce the multivector conjugate transpose, which serves as an extension of the Hermitian conjugate to the domain of multivectors.

Definition 6

(Multivector Conjugate Transpose). Let

|V〉 〉 \in {(GA (2, 0))}^{n}

:

\begin{matrix} |V〉 〉 : = [\begin{matrix} a_{1} + x_{1} + b_{1} \\ ⋮ \\ a_{n} + x_{n} + b_{n} \end{matrix}] \end{matrix}

(58)

The multivector conjugate transpose of

|V〉 〉

is defined as first taking the transpose and then the element-wise multivector conjugate:

\begin{matrix} 〈 〈V| : = [\begin{matrix} a_{1} - x_{1} - b_{1} & \dots & a_{n} - x_{n} - b_{n} \end{matrix}] \end{matrix}

(59)

Definition 7

(Bilinear Form). Let

|V〉 〉

and

|W〉 〉

be two vectors valued in

GA (2, 0)

:

\begin{matrix} |V〉 〉 : = [\begin{matrix} a_{1} + x_{1} + b_{1} \\ ⋮ \\ a_{n} + x_{n} + b_{n} \end{matrix}] & |W〉 〉 : = [\begin{matrix} a_{1}^{'} + x_{1}^{'} + b_{1}^{'} \\ ⋮ \\ a_{n}^{'} + x_{n}^{'} + b_{n}^{'} \end{matrix}] \end{matrix}

(60)

We introduce the following bilinear form:

\begin{matrix} 〈 〈V | W〉 〉 = (a_{1} - x_{1} - b_{1}) (a_{1}^{'} + x_{1}^{'} + b_{1}^{'}) + \dots + (a_{n} - x_{n} - b_{n}) (a_{n}^{'} + x_{n}^{'} + b_{n}^{'}) \end{matrix}

(61)

Theorem 3

(Inner Product). Restricted to the even sub-algebra of

GA (2, 0)

, the bilinear form is an inner product.

Proof.

\begin{matrix} {〈 〈V | V〉 〉}_{x \to 0} & = (a_{1} - b_{1}) (a_{1} + b_{1}) + \dots + (a_{n} - b_{n}) (a_{n} + b_{n}) \end{matrix}

(62)

\begin{matrix} = a_{1}^{2} + b_{1}^{2} + \dots + a_{n}^{2} + b_{n}^{2} \end{matrix}

(63)

This is isomorphic to the inner product of a complex Hilbert space, with the identification

a_{i} + b_{i} \leftrightarrow a_{i} + i b_{i}

where

I = e_{x} e_{y}

corresponds to i. □
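The identification with the complex inner product can be illustrated with the matrix representation restricted to the even sub-algebra. In the Python sketch below, each even element a + bI is stored as an (a, b) pair, and the form of Equation 61 is evaluated with 2x2 matrices. The sample vectors and helper names are assumptions of this sketch.

```python
def even(a, b):
    """phi(a + b I) with the vector part set to zero."""
    return [[a, -b], [b, a]]

def conj(m):
    """Multivector conjugate a + b I -> a - b I, a transpose in this representation."""
    return [[m[0][0], m[1][0]], [m[0][1], m[1][1]]]

def mat_mul(m, n):
    return [[sum(m[i][k] * n[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def form(v, w):
    """Equation 61 restricted to even elements, returned as a 2x2 matrix."""
    total = [[0.0, 0.0], [0.0, 0.0]]
    for (a, b), (c, d) in zip(v, w):
        term = mat_mul(conj(even(a, b)), even(c, d))
        total = [[total[i][j] + term[i][j] for j in range(2)] for i in range(2)]
    return total

V = [(1.0, 2.0), (0.5, -1.0)]  # hypothetical components (a_k, b_k)
W = [(3.0, -1.0), (2.0, 4.0)]

norm_matrix = form(V, V)
cross_matrix = form(V, W)

# The same quantity computed with ordinary complex numbers.
complex_cross = sum(complex(a, b).conjugate() * complex(c, d)
                    for (a, b), (c, d) in zip(V, W))
```

`norm_matrix` is 6.25 times the identity, that is the sum of the a_k² + b_k², and `cross_matrix` is the matrix representation of `complex_cross`, here -2 - 3i.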

2.2.3. The Optimization Problem

The

spin (2)

-constraint is recovered by posing

a \to 0

and

x \to 0

then

φ (u)

reduces as follows:

\begin{matrix} {u = a + x + b |}_{a \to 0, x \to 0} = b \Rightarrow φ (u) = [\begin{matrix} 0 & - b \\ b & 0 \end{matrix}] \end{matrix}

(64)

The fundamental Lagrange Multiplier Equation:

\begin{matrix} L : = - \sum_{i} ρ_{i} (θ) ln \frac{ρ_{i} (θ)}{ρ_{i} (0)} + λ (1 - \sum_{i} ρ_{i} (θ)) + \frac{1}{2} θ tr (\bar{b} - \sum_{i} ρ_{i} (θ) b_{i}) \end{matrix}

(65)

where

$λ$ and $θ$ are the Lagrange multipliers
$b_{i}$ are the multivectors of $GA (2, 0)$ , reduced by $a \to 0$ and $x \to 0$
the factor (1/2) is there to regularize the adjoint action on a vector $e^{- (1 / 2) b_{i}} v e^{(1 / 2) b_{i}} = v^{'}$

It yields the following solution:

\begin{matrix} ρ_{i} (θ) = \underbrace{\frac{1}{\sum_{j} det exp (- \frac{1}{2} θ b_{j}) det ψ_{j} (0)}}_{\begin{matrix} \text{Spin(2) Invariant} \\ \text{Ensemble} \end{matrix}} det \underbrace{exp (- \frac{1}{2} θ b_{i})}_{\begin{matrix} \text{Evolution} \\ \text{Operator} \end{matrix}} \underbrace{det ψ_{i} (0)}_{\begin{matrix} \text{Initial} \\ \text{Preparation} \end{matrix}} \end{matrix}

(66)

where

det ψ_{i} (0) : = ρ_{i} (0)

.

As with the

u (1)

-constraint case, the partition function defines the inner product of an n-dimensional Hilbert space. The wavefunction is:

Definition 8

(

R^{+} \times Spin (2)

-valued Wavefunction).

\begin{matrix} [\begin{matrix} ψ_{1} (θ) \\ ⋮ \\ ψ_{n} (θ) \end{matrix}] : = [\begin{matrix} exp (- \frac{1}{2} θ b_{1}) \\ ⋱ \\ exp (- \frac{1}{2} θ b_{n}) \end{matrix}] [\begin{matrix} ψ_{1} (0) \\ ⋮ \\ ψ_{n} (0) \end{matrix}] \end{matrix}

where

ψ_{i} (0) = exp (\frac{1}{2} (a_{i} + b_{i}))

.

The dynamics are described by a variant of the Schrödinger equation, which is derived by taking the derivative of the wavefunction with respect to the Lagrange multiplier,

θ

:

Definition 9

(

spin (2)

-valued Schrödinger Equation).

\begin{matrix} I \frac{d}{d θ} [\begin{matrix} ψ_{1} (θ) \\ ⋮ \\ ψ_{n} (θ) \end{matrix}] = \frac{1}{2} [\begin{matrix} b_{1} \\ ⋱ \\ b_{n} \end{matrix}] [\begin{matrix} ψ_{1} (θ) \\ ⋮ \\ ψ_{n} (θ) \end{matrix}] \end{matrix}

Since

Spin (2) ≅ U (1)

, it should come as no surprise that the theory resulting from the

spin (2)

-constraint is of the same mathematical form as QM, obtained from the

u (1)

-constraint.

One difference, however, is that we gain these extra structures:

Definition 10

(David Hestenes’ Formulation). In 3+1D, David Hestenes’ formulation [5] of the wavefunction is

ψ = \sqrt{ρ} R e^{i b / 2}

, where

R = e^{f / 2}

is a Lorentz boost or rotation and where

e^{i b / 2}

is a phase. In 2D, as the algebra only admits a bivector, his formulation would reduce to

ψ = \sqrt{ρ} R

, where ρ is a probability density and R is a Spin(2)-valued rotor—this is the form we have recovered:

\begin{matrix} \sqrt{ρ} R \equiv exp (a / 2) exp (b / 2) \end{matrix}

The definition of the Dirac current applicable to our wavefunction follows the formulation of David Hestenes:

Definition 11

(Dirac Current). The Dirac current for the 2D theory is defined as:

\begin{matrix} J : = ψ^{‡} e_{1} ψ = ρ \underbrace{R^{‡} e_{1} R}_{SO (2)} = ρ e_{1}^{'} \end{matrix}

where

e_{1}^{'}

is an

SO (2)

-rotated basis vector.
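The rotation in Definition 11 can be made explicit with the matrix representation of Definition 3. The Python sketch below assumes that e_1 is identified with e_x (so that φ(e_1) = σ_1) and that the wavefunction is exp(a/2) exp(bI/2); the numerical values and function names are hypothetical.

```python
import math

E_X = [[0.0, 1.0], [1.0, 0.0]]  # phi(e_x), identified with e_1 in this sketch

def mat_mul(m, n):
    return [[sum(m[i][k] * n[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def wavefunction(a, b):
    """phi(exp(a/2) exp(b I / 2)) = sqrt(rho) R, with rho = exp(a)."""
    scale = math.exp(a / 2)
    c, s = math.cos(b / 2), math.sin(b / 2)
    return [[scale * c, -scale * s], [scale * s, scale * c]]

def conj_even(m):
    """Multivector conjugate of an even element, a transpose in this representation."""
    return [[m[0][0], m[1][0]], [m[0][1], m[1][1]]]

def dirac_current(a, b):
    """Components (J_x, J_y) of J = psi^conj e_1 psi."""
    psi = wavefunction(a, b)
    j = mat_mul(mat_mul(conj_even(psi), E_X), psi)
    # phi(x e_x + y e_y) = [[y, x], [x, -y]], so the components can be read off.
    return j[0][1], j[0][0]

jx, jy = dirac_current(math.log(2.0), math.pi / 3)
```

With ρ = 2 and b = π/3, the current is 2 (cos b, sin b) = (1, √3): the basis vector is rotated by the angle b and scaled by the probability density, under the identifications stated above.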

2.3. $spin (3, 1) \oplus u (1)$ -constraint: Gravity + Yang-Mills

Extending the framework to relativistic quantum mechanics begins by considering a measurement constraint having a

Spin^{c} (3, 1)

-phase symmetry. This allows for transformations that include boosts, rotations, and re-orientations (David Hestenes describes "re-orientation" as representing the changing orientation of the spin plane due to Zitterbewegung). We begin with a definition of the determinant for a multivector of

GA (3, 1)

.

2.3.1. The Multivector Determinant

As we did in the beginning of the 2D case, our goal here will be to express

det M

as a multivector self-product. To achieve that, we begin by defining a general multivector in the geometric algebra

GA (3, 1)

:

\begin{matrix} u : = a + x + f + v + b \end{matrix}

(67)

where a is a scalar,

x

a vector,

f

a bivector,

v

a pseudo-vector and

b

a pseudo-scalar. Explicitly,

\begin{matrix} u & : = a \end{matrix}

(68)

\begin{matrix} + t γ_{0} + x γ_{1} + y γ_{2} + z γ_{3} \end{matrix}

(69)

\begin{matrix} + f_{01} γ_{0} γ_{1} + f_{02} γ_{0} γ_{2} + f_{03} γ_{0} γ_{3} + f_{12} γ_{1} γ_{2} + f_{13} γ_{1} γ_{3} + f_{23} γ_{2} γ_{3} \end{matrix}

(70)

\begin{matrix} + p γ_{1} γ_{2} γ_{3} + q γ_{0} γ_{2} γ_{3} + v γ_{0} γ_{1} γ_{3} + w γ_{0} γ_{1} γ_{2} \end{matrix}

(71)

\begin{matrix} + b γ_{0} γ_{1} γ_{2} γ_{3} \end{matrix}

(72)

Definition 12

(Real-Majorana Algebra Isomorphism). The map

φ : GA (3, 1) \to Mat (4, R)

defined by:

\begin{matrix} φ (1) & : = d i a g (1, 1, 1, 1) \end{matrix}

(73)

\begin{matrix} φ (γ_{0}) & : = [\begin{matrix} 0 & 0 & 0 & 1 \\ 0 & 0 & - 1 & 0 \\ 0 & 1 & 0 & 0 \\ - 1 & 0 & 0 & 0 \end{matrix}] & φ (γ_{1}) : = [\begin{matrix} 0 & - 1 & 0 & 0 \\ - 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & - 1 \\ 0 & 0 & - 1 & 0 \end{matrix}] \end{matrix}

(74)

\begin{matrix} φ (γ_{2}) & : = [\begin{matrix} 0 & 0 & 0 & 1 \\ 0 & 0 & - 1 & 0 \\ 0 & - 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \end{matrix}] & φ (γ_{3}) : = [\begin{matrix} - 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & - 1 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}] \end{matrix}

(75)

\begin{matrix} φ (γ_{μ} γ_{ν}) & : = φ (γ_{μ}) φ (γ_{ν}) \end{matrix}

(76)

\begin{matrix} φ (γ_{μ} γ_{ν} γ_{κ}) & : = φ (γ_{μ}) φ (γ_{ν}) φ (γ_{κ}) \end{matrix}

(77)

\begin{matrix} φ (γ_{0} γ_{1} γ_{2} γ_{3}) & : = φ (γ_{0}) φ (γ_{1}) φ (γ_{2}) φ (γ_{3}) \end{matrix}

(78)

extends linearly and multiplicatively to an isomorphism between

GA (3, 1)

and the algebra of real

4 \times 4

matrices.
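A quick way to gain confidence in this representation is to check the Clifford relations directly. The Python sketch below takes the four generator matrices of Equations 74 and 75, labeled γ_0 to γ_3 so as to agree with the coefficients of t, x, y and z in the matrix representation of Definition 13, and tests that they anticommute and square to the signature (-, +, +, +). The helper names are illustrative.

```python
GAMMA = [
    [[0, 0, 0, 1], [0, 0, -1, 0], [0, 1, 0, 0], [-1, 0, 0, 0]],    # gamma_0
    [[0, -1, 0, 0], [-1, 0, 0, 0], [0, 0, 0, -1], [0, 0, -1, 0]],  # gamma_1
    [[0, 0, 0, 1], [0, 0, -1, 0], [0, -1, 0, 0], [1, 0, 0, 0]],    # gamma_2
    [[-1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, 1]],    # gamma_3
]
METRIC = [-1, 1, 1, 1]  # one negative and three positive signature dimensions

def mat_mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def anticommutator(a, b):
    ab, ba = mat_mul(a, b), mat_mul(b, a)
    return [[ab[i][j] + ba[i][j] for j in range(4)] for i in range(4)]

def clifford_relations_hold():
    """Check gamma_mu gamma_nu + gamma_nu gamma_mu = 2 eta_{mu nu} * identity."""
    for mu in range(4):
        for nu in range(4):
            diagonal = 2 * METRIC[mu] if mu == nu else 0
            anti = anticommutator(GAMMA[mu], GAMMA[nu])
            for i in range(4):
                for j in range(4):
                    if anti[i][j] != (diagonal if i == j else 0):
                        return False
    return True
```

Since all entries are integers, the comparison is exact; `clifford_relations_hold()` returning `True` confirms that these four matrices generate a representation of GA(3, 1).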

Definition 13

(Matrix Representation).

\begin{matrix} φ (u) = [\begin{matrix} a + f_{02} - q - z & b - f_{13} + w - x & - f_{01} + f_{12} - p + v & f_{03} + f_{23} + t + y \\ - b + f_{13} + w - x & a + f_{02} + q + z & f_{03} + f_{23} - t - y & f_{01} - f_{12} - p + v \\ - f_{01} - f_{12} + p + v & f_{03} - f_{23} + t - y & a - f_{02} + q - z & - b - f_{13} - w - x \\ f_{03} - f_{23} - t + y & f_{01} + f_{12} + p + v & b + f_{13} - w - x & a - f_{02} - q + z \end{matrix}] \end{matrix}

To manipulate and analyze multivectors in

GA (3, 1)

, we introduce several important operations, such as the multivector conjugate, the pseudo-blade conjugate, and the multivector determinant.

Definition 14

(Multivector Conjugate—in

GA (3, 1)

).

\begin{matrix} u^{‡} : = a - x - f + v + b \end{matrix}

Definition 15

(Pseudo-Blade Conjugate—in

GA (3, 1)

). The pseudo-blade conjugate of

u

is

\begin{matrix} u^{†} : = a + x + f - v - b \end{matrix}

Lundholm [6] proposes a number of multivector norms and shows that they are the unique forms that carry the properties of the determinant, such as

N (u v) = N (u) N (v)

to the domain of multivectors:

Definition 16.

The self-products associated with low-dimensional geometric algebras are:

\begin{matrix} GA (0, 1) : & u^{*} u \end{matrix}

(79)

\begin{matrix} GA (2, 0) : & u^{‡} u \end{matrix}

(80)

\begin{matrix} GA (3, 0) : & {(u^{‡} u)}^{*} u^{‡} u \end{matrix}

(81)

\begin{matrix} GA (3, 1) : & {(u^{‡} u)}^{†} u^{‡} u \end{matrix}

(82)

\begin{matrix} GA (4, 1) : & {({(u^{‡} u)}^{†} u^{‡} u)}^{*} ({(u^{‡} u)}^{†} u^{‡} u) \end{matrix}

(83)

where

u^{*}

is a conjugate that reverses the sign of the pseudoscalar blade (i.e., the highest-grade blade of the algebra).
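The multiplicative property can be checked numerically in the simplest case. The following is a minimal sketch, assuming that a multivector $a + b e$ of GA(0,1) (with $e^{2} = -1$) is modelled by the Python complex number `a + b*1j`, so that the conjugate $u^{*}$ becomes complex conjugation. The helper name `norm_ga01` is introduced here for illustration only.

```python
def norm_ga01(u: complex) -> float:
    """Self-product u* u of GA(0,1), modelled with complex numbers (assumption)."""
    # Complex conjugation flips the sign of the pseudoscalar (imaginary) part.
    return (u.conjugate() * u).real


u = complex(1.5, -2.0)
v = complex(0.25, 3.0)

lhs = norm_ga01(u * v)              # N(uv)
rhs = norm_ga01(u) * norm_ga01(v)   # N(u) N(v)
print(lhs, rhs)
```

Both printed values agree up to floating-point rounding, which is the property $N (u v) = N (u) N (v)$ for this algebra.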

We can now express the determinant of the matrix representation of a multivector via a self-product. This choice is unique:

Theorem 4 (The Multivector Determinant in GA (3, 1)).

\begin{matrix} {(u^{‡} u)}^{†} u^{‡} u \equiv det φ (u) \end{matrix}

Proof.

A computer-assisted proof of this equality is given in Annex F. □

As can be seen from this theorem, the relationship between determinants and multivector products becomes more sophisticated in 3+1D. Unlike the 2D case where the determinant could be expressed using a product of two terms, in

GA (3, 1)

the determinant requires two products involving four copies of the multivector. This is reflected in the structure

{(u^{‡} u)}^{†} u^{‡} u

, which cannot be reduced to a simpler self-product of two terms.

Theorem 5

(Positive-Definiteness over

R^{+} \times Spinc (3, 1)

). Let

u = exp (\frac{1}{2} (a + f + b))

be a general invertible element of the even-subalgebra of

GA (3, 1)

. As such,

u

is in

R^{+} \times Spinc (3, 1)

. Then the multivector determinant

{(u^{‡} u)}^{†} u^{‡} u

is positive-definite.

Proof.

Since scalars, bivectors and pseudoscalars commute, we have:

\begin{matrix} exp (\frac{1}{2} (a + f + b)) = e^{a / 2} e^{f / 2} e^{b / 2} \end{matrix}

(84)

Using this convenient form, the proof is as follows:

\begin{matrix} {(u^{‡} u)}^{†} u^{‡} u & = e^{a / 2} e^{- f / 2} e^{- b / 2} e^{a / 2} e^{f / 2} e^{- b / 2} e^{a / 2} e^{- f / 2} e^{b / 2} e^{a / 2} e^{f / 2} e^{b / 2} \end{matrix}

(85)

\begin{matrix} = e^{2 a} \end{matrix}

(86)

which is positive-definite—the exponential of a real number a is in

R^{+}

. □
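The result can also be cross-checked through the matrix representation. The sketch below assumes Theorem 4 (the multivector determinant equals $det φ (u)$) and uses Definition 13 with the vector and trivector components set to zero. Since that matrix has trace $4 a$, the identity $det exp (A) = exp (tr A)$ predicts $det φ (u) = e^{2 a}$. The helper names (`even_element_matrix`, `mat_exp`, `det`) and the sample coefficients are illustrative choices, not part of the original text.

```python
import math


def even_element_matrix(a, b, f01, f02, f03, f12, f13, f23):
    """Definition 13 restricted to scalar, bivector and pseudoscalar parts."""
    return [
        [a + f02, b - f13, -f01 + f12, f03 + f23],
        [-b + f13, a + f02, f03 + f23, f01 - f12],
        [-f01 - f12, f03 - f23, a - f02, -b - f13],
        [f03 - f23, f01 + f12, b + f13, a - f02],
    ]


def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mat_exp(A, terms=40):
    """Matrix exponential by a truncated Taylor series (fine for small norms)."""
    n = len(A)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, A)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result


def det(A):
    """Determinant by Gaussian elimination with partial pivoting."""
    M = [row[:] for row in A]
    n = len(M)
    d = 1.0
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        if abs(M[p][c]) < 1e-300:
            return 0.0
        if p != c:
            M[c], M[p] = M[p], M[c]
            d = -d
        d *= M[c][c]
        for r in range(c + 1, n):
            factor = M[r][c] / M[c][c]
            for k in range(c, n):
                M[r][k] -= factor * M[c][k]
    return d


a = 0.3  # scalar coefficient; the other coefficients are arbitrary samples
generator = even_element_matrix(a, 0.7, 0.2, -0.4, 0.1, 0.5, -0.3, 0.25)
half = [[0.5 * x for x in row] for row in generator]
det_u = det(mat_exp(half))          # det of phi(u), with u = exp((a + f + b) / 2)
print(det_u, math.exp(2 * a))
```

The two printed numbers agree to rounding error for any choice of the bivector and pseudoscalar coefficients, since only the scalar part contributes to the trace.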

2.3.2. The Optimization Problem

A number of technical modifications are required to the general structure of our optimization problem:

We will solve the optimization problem for the continuum $\sum \to \int$ .
We will adjust the interpretation of $ψ$ from a probability amplitude to that of a field amplitude $ϕ$ . As such, and consistently with usual quantum field theory (QFT) interpretation, the notion of charge conservation will replace that of probability conservation. The notation will be changed as follows:

$\begin{matrix} ψ & \to ϕ \end{matrix}$

(87)

$\begin{matrix} ρ & \to χ \end{matrix}$

(88)
In 3+1D, we are interested in the case where $M$ is an element of the algebra of $spin (3, 1) \oplus u (1)$ :

$\begin{matrix} M = [\begin{matrix} f_{02} & b - f_{13} & - f_{01} + f_{12} & f_{03} + f_{23} \\ - b + f_{13} & f_{02} & f_{03} + f_{23} & f_{01} - f_{12} \\ - f_{01} - f_{12} & f_{03} - f_{23} & - f_{02} & - b - f_{13} \\ f_{03} - f_{23} & f_{01} + f_{12} & b + f_{13} & - f_{02} \end{matrix}] \end{matrix}$

(89)

However, since our field $ϕ$ will be parametrized in spacetime, we must replace $M$ with a connection valued in $spin (3, 1) \oplus u (1)$ :

$\begin{matrix} M \to ω_{μ} = \frac{1}{2} ω_{μ}^{a b} γ_{a b} + I V_{μ} \end{matrix}$

(90)
We also consider translations $\partial_{x}, \partial_{y}$ and $\partial_{z}$ . The covariant derivative is:

$\begin{matrix} D_{i} : = \partial_{i} + \frac{1}{2} ω_{i}^{a b} γ_{a b} + I V_{i} \end{matrix}$

(91)
As in the 2D case, $e^{μ}$ is used here to contract with $D_{μ}$ , leaving no free indices. Because this contraction produces an odd multivector, the factor $γ_{0}$ is added to convert the result back into an even multivector. It also picks a preferred frame, the laboratory frame. Its effect is similar to the presence of $γ_{0}$ in the Dirac Lagrangian.
We will drop the normalization constraint $λ (1 - \int_{- \infty}^{\infty} χ (t, \vec{x}) d^{3} x)$ , consistently with a conserved charge interpretation.

Flat Spacetime:

The optimization problem will be as follows:

\begin{matrix} L : = - \int_{- \infty}^{\infty} p (t, \vec{x}) ln \frac{p (t, \vec{x})}{p (0, \vec{x})} d^{3} x + \frac{i t}{ℏ} tr (\bar{ω} - \int_{- \infty}^{\infty} p (t, \vec{x}) γ_{0} (e^{i} D_{i} + e^{t} ω_{t}) d^{3} x) \end{matrix}

(92)

where

p (t, \vec{x}) = χ {(t, \vec{x})}^{†} χ (t, \vec{x})

.

The solution is:

\begin{matrix} p (t, \vec{x}) = det exp (- \frac{i t}{ℏ} γ_{0} (e^{i} D_{i} + e^{t} ω_{t})) p (0, \vec{x}) \end{matrix}

(93)

The base field is:

\begin{matrix} ϕ (t, \vec{x}) = exp (- \frac{i t}{ℏ} γ_{0} (e^{i} D_{i} + e^{t} ω_{t})) ϕ (0, \vec{x}) \end{matrix}

(94)

where

ϕ (0, \vec{x})

is defined as:

Definition 17

(

R^{+} \times Spinc (3, 1)

-valued Field).

\begin{matrix} ϕ (0, \vec{x}) = exp (\frac{1}{2} a (\vec{x})) exp (\frac{1}{2} f (\vec{x})) exp (\frac{1}{2} b (\vec{x})) \end{matrix}

(95)

As such, the covariant derivative can act on all components of

ϕ (0, \vec{x})

.

Applying the determinant to

ϕ (0, \vec{x})

causes the

exp (\frac{1}{2} f) exp (\frac{1}{2} b)

terms to vanish, leaving

{(exp (\frac{1}{2} a))}^{4} = exp 2 a

, which we define as

exp 2 a : = χ

. The result is positive-definite since

\forall a \in R : exp 2 a > 0

.

Theorem 6

(David Hestenes’ Wavefunction). The

R^{+} \times {Spin}^{c} (3, 1)

-valued field is formulated using the same geometric structure as David Hestenes’[5] formulation of the wavefunction within GA(3,1). Specifically, David Hestenes’ wavefunction is a special case of our result, where the field magnitude sums to 1.

Proof.

\begin{matrix} \underset{ours}{\underset{︸}{exp (\frac{1}{2} a (\vec{x})) exp (\frac{1}{2} f (\vec{x})) exp (\frac{1}{2} b (\vec{x}))}} \propto \underset{Hestenes ’}{\underset{︸}{\sqrt{ρ (x)} R (x) e^{- i b (x) / 2}}} \end{matrix}

where

exp (\frac{1}{2} a (\vec{x})) \propto \sqrt{ρ (x)}

,

exp (\frac{1}{2} f (\vec{x})) = R (x)

and

exp (\frac{1}{2} b (\vec{x})) = e^{- i b (x) / 2}

. Here,

ρ (x)

is a probability density (versus a field magnitude),

R (x)

is a rotor (same as ours) and

e^{- i b (x) / 2}

describes the four-volume orientation (also same as ours). Adding the normalisation constraint to the optimisation problem forces the field magnitude to sum to 1, which recovers David Hestenes’ wavefunction as a special case. □

Definition 18

(Alternative Notation).

\begin{matrix} exp (\frac{1}{2} a) exp (\frac{1}{2} f) exp (\frac{1}{2} b) \equiv \sqrt{χ} R e^{- i b / 2} \end{matrix}

(96)

This field leads to a variant of the Schrödinger equation obtained by taking its derivative with respect to the Lagrange multiplier:

Definition 19

(

spin (3, 1) \oplus u (1)

-valued Schrödinger equation).

\begin{matrix} i ℏ \frac{\partial ϕ (t, \vec{x})}{\partial t} = γ_{0} (e^{i} D_{i} + e^{t} ω_{t}) ϕ (t, \vec{x}) \end{matrix}

(97)

This Schrödinger equation is simply the massless Dirac equation in Hamiltonian form (with an additional

Spinc (3, 1)

connection).

The Dirac equation is obtained as follows:

\begin{matrix} i ℏ \frac{\partial ϕ (t, \vec{x})}{\partial t} = γ_{0} (e^{i} D_{i} + e^{t} ω_{t}) ϕ (t, \vec{x}) \end{matrix}

(98)

\begin{matrix} \Rightarrow & 0 = γ_{0} (e^{i} D_{i} + e^{t} ω_{t}) ϕ (t, \vec{x}) - i ℏ \frac{\partial ϕ (t, \vec{x})}{\partial t} \end{matrix}

(99)

\begin{matrix} \Rightarrow & 0 = e^{μ} D_{μ} ϕ (t, \vec{x}) \end{matrix}

(100)

where

D_{μ}

is the covariant derivative over all 4 spacetime coordinates.

Curved Spacetime:

In curved spacetime, we use the ADM formalism. We foliate spacetime into hypersurfaces

Σ_{t}

of constant t:

\begin{matrix} d s^{2} = - N^{2} d t^{2} + h_{i j} (d x^{i} + N^{i} d t) (d x^{j} + N^{j} d t) \end{matrix}

(101)
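Expanding the line element gives the metric components $g_{0 0} = - N^{2} + h_{i j} N^{i} N^{j}$ , $g_{0 i} = h_{i j} N^{j}$ and $g_{i j} = h_{i j}$ . The following minimal sketch assembles this metric and checks the standard identity $det g = - N^{2} det h$ , which is why the spacetime volume element factors as $N \sqrt{h}$ . The function names and the sample lapse, shift and spatial metric are assumptions chosen for illustration.

```python
import math


def adm_metric(lapse, shift, h):
    """Assemble the 4x4 metric from N, N^i and h_ij (index 0 is t)."""
    # Lower the shift index: N_i = h_ij N^j.
    shift_low = [sum(h[i][j] * shift[j] for j in range(3)) for i in range(3)]
    g = [[0.0] * 4 for _ in range(4)]
    g[0][0] = -lapse ** 2 + sum(shift_low[i] * shift[i] for i in range(3))
    for i in range(3):
        g[0][i + 1] = g[i + 1][0] = shift_low[i]
        for j in range(3):
            g[i + 1][j + 1] = h[i][j]
    return g


def det(A):
    """Determinant by Gaussian elimination with partial pivoting."""
    M = [row[:] for row in A]
    n = len(M)
    d = 1.0
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        if abs(M[p][c]) < 1e-300:
            return 0.0
        if p != c:
            M[c], M[p] = M[p], M[c]
            d = -d
        d *= M[c][c]
        for r in range(c + 1, n):
            factor = M[r][c] / M[c][c]
            for k in range(c, n):
                M[r][k] -= factor * M[c][k]
    return d


lapse = 1.5
shift = [0.1, -0.2, 0.3]
h = [[2.0, 0.1, 0.0],
     [0.1, 1.5, 0.2],
     [0.0, 0.2, 1.0]]

g = adm_metric(lapse, shift, h)
print(det(g), -lapse ** 2 * det(h))
print(math.sqrt(-det(g)), lapse * math.sqrt(det(h)))
```

Each printed pair agrees up to rounding, independently of the shift vector.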

The optimization problem Lagrangian remains similar, but the constraint now acquires lapse and shift functions:

\begin{matrix} L : = - \int_{- \infty}^{\infty} p (t, \vec{x}) ln \frac{p (t, \vec{x})}{p (0, \vec{x})} \sqrt{h} d^{3} x + \frac{i t}{ℏ} tr (\bar{ω} - \int_{- \infty}^{\infty} p (t, \vec{x}) (N γ_{0} (e^{i} D_{i} + e^{t} ω_{t}) - i ℏ N^{i} D_{i}) \sqrt{h} d^{3} x) \end{matrix}

(102)

where

h = det h_{i j}

.

The problem is solved in a manner similar to the flat case and leads to the Schrödinger equation:

\begin{matrix} i ℏ \frac{\partial ϕ (t, \vec{x})}{\partial t} = (N γ_{0} (e^{i} D_{i} + e^{t} ω_{t}) - i ℏ N^{i} D_{i}) ϕ (t, \vec{x}) \end{matrix}

(103)

This is the Hamiltonian form of the massless Dirac equation

e^{μ} D_{μ} ϕ = 0

with covariant derivative

D_{μ}

expressed with lapse and shift functions and containing a spin and pseudoscalar connection.

Field Functionals:

We now consider the functional integral version of the optimization problem:

\begin{matrix} L : = - \int D [e] p_{t} [e] ln \frac{p_{t} [e]}{p_{0} [e]} + \frac{i t}{ℏ} tr (\bar{ω} - \int D [e] p_{t} [e] (N γ_{0} (e^{i} D_{i} + e^{t} ω_{t}) - i ℏ N^{i} D_{i})) \end{matrix}

(104)

where

e = e_{i}^{a} (t, \vec{x})

and

h_{i j} (\vec{x}) = δ_{a b} e_{i}^{a} (\vec{x}) e_{j}^{b} (\vec{x})

.

The problem is solved in a manner similar to the flat case and leads to a field functional parametrized in terms of frame fields. The resulting Schrödinger equation is:

\begin{matrix} i ℏ \frac{\partial ϕ_{t} [e]}{\partial t} = (N γ_{0} (e^{i} D_{i} + e^{t} ω_{t}) - i ℏ N^{i} D_{i}) ϕ_{t} [e] \end{matrix}

(105)

This is the Hamiltonian form of the massless Dirac equation

e^{μ} D_{μ} ϕ [e] = 0

with covariant derivative

D_{μ}

expressed with lapse and shift functions and containing a spin and pseudoscalar connection.

Let us now investigate how geometry and gravity emerge from these solutions. The results of the next sections apply generally to all three cases of the optimization problem: flat space, curved space, and field functional.

2.3.3. Geometry

Definition 20

(Dirac Current). Using a single-copy of the multivector determinant, the definition of the Dirac current is the same as Hestenes’:

\begin{matrix} J & : = \overset{one copy}{\overset{︷}{ϕ^{‡} e_{0} ϕ}} \end{matrix}

(106)

\begin{matrix} = χ R^{‡} e^{- i b / 2} e_{0} e^{- i b / 2} R \end{matrix}

(107)

\begin{matrix} = χ R^{‡} e_{0} e^{i b / 2} e^{- i b / 2} R \end{matrix}

(108)

\begin{matrix} = χ R^{‡} e_{0} R \end{matrix}

(109)

\begin{matrix} = χ e_{0}^{'} \end{matrix}

(110)

where

e_{0}^{'}

is an SO(3,1)-rotated basis vector.

Theorem 7

(Metric Tensor). Taking advantage of the multivector determinant formulation, we use both copies to obtain the metric tensor as a measurement of the basis vectors:

\begin{matrix} tr (\frac{{(\overset{copy 1}{\overset{︷}{ϕ^{‡} e_{μ} ϕ}})}^{†} \overset{copy 2}{\overset{︷}{ϕ^{‡} e_{ν} ϕ}}}{\underset{χ^{2}}{\underset{︸}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}}}) = g_{μ ν} \end{matrix}

Proof.

\begin{matrix} tr (\frac{{(ϕ^{‡} e_{μ} ϕ)}^{†} ϕ^{‡} e_{ν} ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}) & = tr (R^{‡} e^{- i b / 2} e_{μ} e^{- i b / 2} R R^{‡} e^{- i b / 2} e_{ν} e^{- i b / 2} R) \end{matrix}

(111)

\begin{matrix} = tr (R^{‡} e_{μ} e^{i b / 2} e^{- i b / 2} R R^{‡} e_{ν} e^{i b / 2} e^{- i b / 2} R) \end{matrix}

(112)

\begin{matrix} = tr (R^{‡} e_{μ} R R^{‡} e_{ν} R) \end{matrix}

(113)

\begin{matrix} = tr (e_{μ}^{'} e_{ν}^{'}) \end{matrix}

(114)

\begin{matrix} = tr (e_{μ}^{'} \cdot e_{ν}^{'} + e_{μ}^{'} \land e_{ν}^{'}) \end{matrix}

(115)

\begin{matrix} = tr (g_{μ ν} + e_{μ}^{'} \land e_{ν}^{'}) \end{matrix}

(116)

\begin{matrix} = g_{μ ν} \end{matrix}

(117)

□

The definition of the kinetic energy also exploits the double-structure:

Definition 21

(Kinetic Energy). The kinetic energy is defined as

\begin{matrix} tr (\frac{{(\overset{copy 1}{\overset{︷}{ϕ^{‡} e^{μ} D_{μ} ϕ}})}^{†} \overset{copy 2}{\overset{︷}{ϕ^{‡} e^{ν} D_{ν} ϕ}}}{\underset{χ^{2}}{\underset{︸}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}}}) = T \end{matrix}

(118)

We now give an example:

Theorem 8

(Kinetic Energy of GR). Let us calculate the kinetic energy for a subspace of the field where

\sqrt[4]{χ} = 1

and

e^{- i b / 2} = 1

, such that

ϕ = R

. It reduces to the Ricci scalar

R

, which is the kinetic energy of the Einstein-Hilbert action.

\begin{matrix} tr ({(\tilde{R} e^{μ} D_{μ} R)}^{†} \tilde{R} e^{ν} D_{ν} R) = R \end{matrix}

Proof.

\begin{matrix} tr ({(\tilde{R} e^{μ} D_{μ} R)}^{†} \tilde{R} e^{ν} D_{ν} R) \end{matrix}

(119)

\begin{matrix} = tr (\tilde{R} e^{μ} D_{μ} R \tilde{R} e^{ν} D_{ν} R) \end{matrix}

(120)

\begin{matrix} = tr (\tilde{R} e^{μ} D_{μ} e^{ν} D_{ν} R) & via \tilde{R} R = 1 \end{matrix}

(121)

\begin{matrix} = tr (\tilde{R} D^{2} R) \end{matrix}

(122)

\begin{matrix} = tr (R \tilde{R} D^{2}) \end{matrix}

(123)

\begin{matrix} = tr (D^{2}) \end{matrix}

(124)

\begin{matrix} = R & via Lichnerowicz - Weitzenb ö ck identity \end{matrix}

(125)

which is the Ricci scalar. □

2.3.4. Gravity

Theorem 9

(Quantum Action). Let us investigate a subspace of the field where

R = 1

and

e^{- i b / 2} = 1

, such that

ϕ = \sqrt{χ}

. Due to its non-linearity, the kinetic energy produces a quantum potential in addition to a kinetic energy term:

\begin{matrix} tr (\frac{{(\sqrt{χ} e^{μ} D_{μ} χ)}^{†} e^{ν} D_{ν} \sqrt{χ}}{χ^{2}}) = \underset{Quantum Kinetics}{\underset{︸}{\frac{1}{2 χ^{2}} {(\partial χ)}^{2}}} - (\underset{Quantum Potential}{\underset{︸}{\frac{1}{4 χ^{2}} {(\partial χ)}^{2} - \frac{\partial^{2} χ}{2 χ}}}) \end{matrix}

(126)

The quantum potential described here is the relativistic version of the quantum potential found in the de Broglie-Bohm reformulation of QM, whereas the quantum kinetics can be understood as a scalar field kinetic term. When integrated, they define a quantity that we refer to as the quantum action:

\begin{matrix} S = \underset{Quantum Action}{\underset{︸}{\int (\frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ}) \sqrt{- | g |} d^{4} x}} \end{matrix}

(127)

Proof.

\begin{matrix} tr ({(χ^{- 2} \sqrt{χ} e^{μ} D_{μ} χ)}^{†} e^{ν} D_{ν} \sqrt{χ}) \end{matrix}

(128)

\begin{matrix} = - tr (χ^{- 2} \sqrt{χ} (e^{μ} \partial_{μ} χ) e^{ν} \partial_{ν} \sqrt{χ} + χ^{- 2} \sqrt{χ} χ e^{μ} \partial_{μ} e^{ν} \partial_{ν} \sqrt{χ}) \\ = - tr (χ^{- 2} 2^{- 1} (e^{μ} \partial_{μ} χ) (e^{ν} \partial_{ν} χ)) + tr (χ^{- 1} \sqrt{χ} 4^{- 1} χ^{- 3 / 2} e^{μ} \partial_{μ} e^{ν} \partial_{ν} χ) \end{matrix}

(129)

\begin{matrix} - tr (χ^{- 1} \sqrt{χ} 2^{- 1} χ^{- 1 / 2} e^{μ} \partial_{μ} e^{ν} \partial_{ν} χ) \end{matrix}

(130)

\begin{matrix} = - tr (\frac{(e^{μ} \partial_{μ} χ) (e^{ν} \partial_{ν} χ)}{2 χ^{2}} - \frac{(e^{μ} \partial_{μ} χ) (e^{ν} \partial_{ν} χ)}{4 χ^{2}} + \frac{e^{μ} \partial_{μ} e^{ν} \partial_{ν} χ}{2 χ}) \end{matrix}

(131)

\begin{matrix} = \frac{1}{2 χ^{2}} {(\partial χ)}^{2} - \frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ} \end{matrix}

(132)

\begin{matrix} = \frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ} \end{matrix}

(133)

□

Theorem 10

(Equation of Motion). Varying the quantum action:

\begin{matrix} S = \int (\frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ}) \sqrt{- | g |} d^{4} x \end{matrix}

(134)

produces:

\begin{matrix} {(\partial χ)}^{2} = χ □ χ \end{matrix}

(135)

as the equation of motion.

Proof.

\begin{matrix} δ (\frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ}) = 0 \end{matrix}

(136)

\begin{matrix} \Rightarrow & - \frac{{(\partial χ)}^{2}}{2 χ^{3}} δ χ - \partial_{μ} (\frac{\partial^{μ} χ}{2 χ^{2}}) δ χ + \frac{\partial^{2} (δ χ)}{2 χ} - \frac{\partial^{2} χ}{2 χ^{2}} δ χ = 0 \end{matrix}

(137)

\begin{matrix} \Rightarrow & - \frac{{(\partial χ)}^{2}}{2 χ^{3}} δ χ + \frac{{(\partial χ)}^{2}}{χ^{3}} δ χ - \frac{\partial^{2} χ}{2 χ^{2}} δ χ + \frac{\partial^{2} (δ χ)}{2 χ} - \frac{\partial^{2} χ}{2 χ^{2}} δ χ = 0 \end{matrix}

(138)

\begin{matrix} \Rightarrow & \frac{{(\partial χ)}^{2}}{2 χ^{3}} δ χ - \frac{\partial^{2} χ}{χ^{2}} δ χ + \frac{\partial^{2} (δ χ)}{2 χ} = 0 \end{matrix}

(139)

\begin{matrix} \Rightarrow & \frac{{(\partial χ)}^{2}}{2 χ^{3}} δ χ - \frac{\partial^{2} χ}{χ^{2}} δ χ - \frac{\partial^{2} χ}{2 χ^{2}} δ χ + \frac{{(\partial χ)}^{2}}{χ^{3}} δ χ = 0 \end{matrix}

(142)

\begin{matrix} \Rightarrow & \frac{3 {(\partial χ)}^{2}}{2 χ^{3}} δ χ - \frac{3 \partial^{2} χ}{2 χ^{2}} δ χ = 0 \end{matrix}

(143)

\begin{matrix} \Rightarrow & \partial^{2} χ = \frac{{(\partial χ)}^{2}}{χ} \end{matrix}

(144)

\begin{matrix} \Rightarrow & χ □ χ = {(\partial χ)}^{2} \end{matrix}

(145)

□

To interpret this action and resulting equation of motion, let us now introduce the surprisal field and associated definitions.

Definition 22

(Surprisal Field). We define a change of variable:

\begin{matrix} φ : = - ln χ \end{matrix}

We call φ the surprisal field.

Definition 23

(Surprisal Equation of Motion). We note that the change of variable

φ = - ln χ

, changes the equation of motion as follows:

\begin{matrix} χ □ χ = {(\partial χ)}^{2} \underset{φ = - ln χ}{\underset{︸}{\to}} □ φ = 0 \end{matrix}

which is the Klein-Gordon equation in curved spacetime, applied to the surprisal field.
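The change of variable follows from the chain rule: with $χ = e^{- φ}$ one has $\partial_{μ} χ = - χ \partial_{μ} φ$ and $□ χ = χ ({(\partial φ)}^{2} - □ φ)$ , so that $χ □ χ - {(\partial χ)}^{2} = - χ^{2} □ φ$ . The sketch below checks this numerically with finite differences. It assumes flat 1+1D spacetime with signature $(-, +)$ , and the two test fields are arbitrary illustrative choices rather than solutions taken from the text.

```python
import math


def phi_wave(t, x):
    """Solves the flat 1+1D wave equation -phi_tt + phi_xx = 0."""
    return math.sin(x - t) + 0.5 * math.cos(x + t)


def phi_not_wave(t, x):
    """Comparison field with box(phi) = 2, so it is not a solution."""
    return x * x


def chi(phi, t, x):
    """Information density chi = exp(-phi)."""
    return math.exp(-phi(t, x))


def residual(phi, t, x, h=1e-3):
    """Finite-difference estimate of chi * box(chi) - (d chi)^2."""
    c = chi(phi, t, x)
    chi_t = (chi(phi, t + h, x) - chi(phi, t - h, x)) / (2 * h)
    chi_x = (chi(phi, t, x + h) - chi(phi, t, x - h)) / (2 * h)
    chi_tt = (chi(phi, t + h, x) - 2 * c + chi(phi, t - h, x)) / h ** 2
    chi_xx = (chi(phi, t, x + h) - 2 * c + chi(phi, t, x - h)) / h ** 2
    box_chi = -chi_tt + chi_xx          # signature (-, +)
    grad_sq = -chi_t ** 2 + chi_x ** 2
    return c * box_chi - grad_sq


t0, x0 = 0.3, 0.7
r_wave = residual(phi_wave, t0, x0)
r_other = residual(phi_not_wave, t0, x0)
print(r_wave, r_other)
```

The residual is close to zero for the wave solution and close to $- 2 e^{- 2 x_{0}^{2}}$ for the comparison field, as the identity above predicts.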

Definition 24

(Surprisal Conservation). The following current:

\begin{matrix} \nabla_{μ} (\partial^{μ} φ) = 0 \end{matrix}

identifies the surprisal as the conserved charge of this action.

Definition 25

(Surprisal Expectation Value). The surprisal expectation value is merely the entropy H of a region V of the manifold:

\begin{matrix} \underset{expectation value}{\underset{︸}{〈 ln χ 〉}} : = \underset{Definition of Entropy}{\underset{︸}{- \int_{V} χ (x) \underset{observable}{\underset{︸}{ln χ (x)}} \sqrt{h_{ζ}} d^{3} x}} \end{matrix}

Interpretation:

In information theory, the surprisal of an event x with probability density

ρ (x)

is defined as

- ln ρ (x)

, and the entropy

H = - \int ρ ln ρ d^{4} x

represents its expectation value. Surprisal is measured in units of information (nats for the natural logarithm used here, bits for base 2), so it represents the quantity of information associated with the event, and it is conserved by

□ φ = 0

. In contrast, also in information theory, the units of entropy are the bits per symbol—this is not conserved.
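For a discrete distribution, the relation between surprisal and entropy can be written in a few lines. This is a minimal sketch using the natural logarithm, so values are in nats (dividing by $ln 2$ converts to bits). The two sample distributions are illustrative assumptions.

```python
import math


def surprisal(p: float) -> float:
    """Surprisal -ln p of an event with probability p, in nats."""
    return -math.log(p)


def entropy(probs) -> float:
    """Entropy as the expectation value of the surprisal."""
    return sum(p * surprisal(p) for p in probs if p > 0.0)


uniform = [0.25] * 4
skewed = [0.7, 0.1, 0.1, 0.1]

print(entropy(uniform), math.log(4))   # uniform case attains ln 4
print(entropy(skewed))                 # lower than the uniform case
```

The uniform distribution maximizes the entropy, while a rare event (probability 0.1) carries more surprisal than a common one (probability 0.7).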

In our framework, the field

χ

replaces

ρ

—it has most of its properties, but differs critically as follows:

$χ$ is not a probability density—it lacks a conserved current ( $\nabla_{μ} (χ u^{μ}) \neq 0$ ) and is not normalized—but it is positive-definite.
Instead, $χ$ is interpreted as an information density, encoding spacetime’s local information content.

The surprisal is defined as

φ = - ln χ

, which in this theory satisfies the Klein-Gordon equation

□ φ = 0

. This ensures:

Conservation: The current $j^{μ} = \partial^{μ} φ$ is conserved ( $\nabla_{μ} j^{μ} = 0$ ), making $Q = \int_{V} j^{μ} \sqrt{h_{ζ}} d^{3} x$ a conserved charge.
Causal Propagation: Surprisal propagates at light speed, enforcing that bits of information cannot spread superluminally—a core tenet of relativity.

Before we continue the interpretation of this theory, let us introduce another theorem.

Definition 26

(Gravity). Let us now consider the full space of the wavefunction

ϕ = \sqrt{χ} R e^{- i b / 2}

. We are automatically led to a theory of gravity:

\begin{matrix} S & = \int_{M} tr (\frac{{(ϕ^{‡} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} e^{ν} D_{ν} ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ}) \sqrt{- | g |} d^{4} x \end{matrix}

(146)

which expands, via Theorems 8 and 9, as follows:

\begin{matrix} S & = \int_{M} (R + cross-terms + \frac{1}{4 χ^{2}} {(\partial χ)}^{2} + \frac{\partial^{2} χ}{2 χ}) \sqrt{- | g |} d^{4} x \end{matrix}

(147)

We note the following equations of motion which must be simultaneously satisfied:

Varying with respect to $g_{μ ν}$ yields the Einstein field equations (EFE): the Einstein tensor arises from $R$ , and the variation of the quantum action supplies the stress-energy tensor that sources it.
Varying with respect to χ gives equations of motion that define the flow of information density χ in spacetime.

Interpretation (cont’d): Thus, while quantum mechanics relies on probabilistic amplitudes

ψ

, our formulation recasts general relativity as a deterministic theory of information dynamics, where spacetime geometry and surprisal flux are dual aspects of

R

and

χ

. The distribution of surprisal in spacetime dictates its geometric structure, which in turn dictates how surprisal propagates. General relativity is to information what quantum mechanics is to probability. Revisiting general relativity from this perspective shows that the natural constraint is sufficient to entail the theory through the principle of entropy maximization. In this formulation, neither the speed of light as a limit on the propagation of information (via the surprisal obeying the Klein-Gordon equation) nor the Einstein field equations are fundamental; both emerge naturally as the solution to an optimization problem on entropy. The

spinc (3, 1)

-valued Schrödinger equation thus describes gravity.

2.3.5. Yang-Mills

In QFT, the standard method to identify a local gauge symmetry is to start with a global symmetry of the action or probability measure and then localize it by introducing gauge fields. For example, the

U (1)

gauge symmetry arises naturally in electromagnetism as the group preserving the probability density (Born rule) under local phase transformations. However, the non-Abelian

SU (2)

and

SU (3)

gauge symmetries of the Standard Model are not derived from first principles in this way; their inclusion is empirically motivated by particle physics experiments.

Improvement via Multivector Determinant Formulation: Our framework demonstrates that Yang-Mills theories emerge naturally from constraints on the wavefunction’s probability measure and Dirac current. Specifically:

Probability Measure: The quadratic form ${(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ = χ^{2}$ enforces rotor invariance $ϕ \to R ϕ$ , restricting transformations to those satisfying $R^{‡} R = 1$ , for some rotor R of a geometric algebra of n dimensions:

$\begin{matrix} {(ϕ^{‡} R^{‡} R ϕ)}^{†} ϕ^{‡} R^{‡} R ϕ = {(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ \Rightarrow R^{‡} R = 1 . \end{matrix}$

(148)

Solutions to $R^{‡} R = 1$ are rotor transformations generated by bivectors in the Clifford algebra. For a $2 n$ -dimensional algebra, these generate $Spin (2 n)$ , whose subgroups include $SU (n)$ .
Dirac Current: The spacetime current $ϕ^{‡} e_{0} ϕ = e_{0}$ requires gauge generators to commute with $e_{0}$ , confining them to an internal space. This implies:

$\begin{matrix} ϕ^{‡} e^{- θ^{i} f_{i}} e_{0} e^{θ^{i} f_{i}} ϕ = ϕ^{‡} e_{0} ϕ \Rightarrow [f_{i}, e_{0}] = 0, \end{matrix}$

(149)

where $f_{i}$ are bivector generators. Thus, $f_{i}$ act only on internal degrees of freedom, orthogonal to spacetime.
Spacetime: Because the multivector determinant originates from the spacetime algebra (STA), the resulting internal space is defined relative to spacetime.

These constraints limit the allowable symmetries to groups generated by bivector exponentials (which are compact Lie groups) acting on the internal spaces of spacetime. Since

SU (n) \subset Spin (2 n)

, this framework inherently includes the Standard Model within its landscape but also generalizes to larger symmetries such as those found in condensed matter systems with emergent

SU (n)

symmetries.

Wavefunction and Symmetry Structure:

The total wavefunction is a tensor product of spacetime (STA) and internal space components:

For $SU (n)$ Yang-Mills:

$\begin{matrix} ϕ_{STA} \otimes ϕ_{C^{n}} . \end{matrix}$

(150)
For the Standard Model $SU (3) \times SU (2) \times U (1)$ :

$\begin{matrix} ϕ_{STA} \otimes ϕ_{C} \otimes ϕ_{C^{2}} \otimes ϕ_{C^{3}} . \end{matrix}$

(151)

Action:

Our previous gravitational action is reconstructed with a spectral function f:

\begin{matrix} S = \int_{M} tr (f (\frac{1}{Λ^{2}} \frac{{(ϕ^{‡} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} e^{ν} D_{ν} ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ})) \sqrt{- | g |} d^{4} x . \end{matrix}

(152)

A heat kernel expansion yields the invariants of the theory (more on that in a moment).

Covariant Derivative (Ex. Standard Model):

Taking the Standard Model as an example, the covariant derivative incorporates spacetime curvature (gravity) and gauge fields:

\begin{matrix} D_{μ} : = (\begin{matrix} \partial_{μ} + \frac{ω_{μ}^{a b}}{2} γ_{a b} + i g^{'} Y B_{μ} + i g \frac{σ^{a}}{2} W_{μ}^{a} + i g_{s} \frac{λ^{a}}{2} G_{μ}^{a} & Φ \\ Φ^{†} & \partial_{μ} + \frac{ω_{μ}^{a b}}{2} γ_{a b} + i g^{'} Y B_{μ} + i g_{s} \frac{λ^{a}}{2} G_{μ}^{a} \end{matrix}), \end{matrix}

(153)

where:

$γ_{a b}$ : Generators of $Spin (3, 1)$ (gravitational spin connection).
$B_{μ}, W_{μ}^{a}, G_{μ}^{a}$ : $U (1)$ , $SU (2)$ , and $SU (3)$ gauge fields.
$Φ$ : Higgs field (SU(2) doublet).

It acts on the left/right split of the field.

Expanding f yields the field strength term

tr (f (D^{2} / Λ^{2}))

which, via the heat kernel expansion, further yields the Standard Model plus gravity (see A. H. Chamseddine and Alain Connes [7] for heat kernel expansion details). The invariants recovered are:

1.

Leading Terms:

(a): Cosmological constant: $\propto Λ^{4} \int \sqrt{- | g |} d^{4} x$ .
(b): Einstein-Hilbert term: $\propto Λ^{2} \int R \sqrt{- | g |} d^{4} x$ .

2.

Yang-Mills and Higgs:

(a): Gauge kinetic terms: $\propto \int \frac{1}{4} F_{μ ν}^{a} F^{μ ν a} \sqrt{- | g |} d^{4} x$ .
(b): Higgs kinetic and potential terms:

$\begin{matrix} \propto \int ({| D_{μ} Φ |}^{2} + Λ^{2} {| Φ |}^{2} + \frac{1}{Λ^{2}} {| Φ |}^{4}) \sqrt{- | g |} d^{4} x . \end{matrix}$

(154)

3.

Yukawa Couplings (from matter fields):

\begin{matrix} \propto \int y_{i j} {\bar{ϕ}}_{i} Φ ϕ_{j} \sqrt{- | g |} d^{4} x . \end{matrix}

(155)

Key Notes:

Higher-Order Terms: Higher order field strength terms appear but are suppressed by $Λ^{- 2}$ , making them negligible at low energies.
Uniqueness: The Standard Model is not uniquely selected by the optimization problem but resides within the landscape of allowed Yang-Mills theories.

2.3.6. Yang-Mills Axioms as Theorems

In Section 2.1, we demonstrated that all 5 axioms of quantum mechanics are derivable from the solution to the optimization problem in

GA (0, 1)

. Here, our aim is to do the same but for the axioms of Yang-Mills theory. First, let us list the axioms:

Compact Gauge Group: The symmetry group is a compact Lie group G.
Local Gauge Invariance: Fields transform under spacetime-dependent (local) group elements $T (x) \in G$ .
Gauge Connections: Gauge fields $A_{μ}$ are introduced as connections in the covariant derivative $D_{μ} = \partial_{μ} + A_{μ}$ .
Field Strength: The curvature $F_{μ ν} = [D_{μ}, D_{ν}]$ defines the dynamics.
Yang-Mills Action: The action depends on $F_{μ ν}$ , e.g., $\int tr (F_{μ ν} F^{μ ν})$ .

Now for the theorems.

Theorem 11

(Compact Gauge Group). The allowed symmetries form a compact Lie group

G \subset Spin (2 n)

.

Proof.

Constraint: ${(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ = χ^{2}$ implies invariance under arbitrary n-dimensional rotors: $R^{‡} R = 1$ .
Structure of Solutions: Rotor transformations in finite-dimensional Clifford algebras are generated by bivectors. These generate Spin( $2 n$ ) and its subgroups, which are compact Lie groups.

Thus, the gauge group G is inherently compact and derived from the algebra structure. □
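The link between bivector exponentials and compactness can be illustrated in the smallest case. The sketch below uses the $2 \times 2$ vector representation as a stand-in for the rotor itself (an assumption made for illustration): a bivector of positive-definite signature acts as an antisymmetric generator, whose exponential is an orthogonal matrix with bounded entries. For contrast, a symmetric (boost-like) generator produces entries that grow without bound. All function and variable names are illustrative.

```python
import math


def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mat_exp(A, terms=60):
    """Matrix exponential by a truncated Taylor series."""
    n = len(A)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, A)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result


theta = 3.0
rotation = mat_exp([[0.0, -theta], [theta, 0.0]])   # antisymmetric generator
boost = mat_exp([[0.0, theta], [theta, 0.0]])       # symmetric generator

# Orthogonality check: R^T R should be the identity.
gram = mat_mul([list(col) for col in zip(*rotation)], rotation)
largest_rotation_entry = max(abs(x) for row in rotation for x in row)

print(gram)
print(largest_rotation_entry, boost[0][0])
```

The rotation entries never exceed 1 in magnitude for any `theta`, whereas the boost entry equals `cosh(theta)` and is unbounded; this is the matrix-level counterpart of the compact versus non-compact distinction.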

Theorem 12

(Local Gauge Invariance). The theory is invariant under spacetime-dependent

T (x) \in G

.

Proof.

Wavefunction Transformation: $ϕ \to R (x) ϕ$ , where $R (x) = e^{θ^{i} (x) f_{i}}$ (exponentials of spacetime-dependent bivectors).
Probability Measure: ${(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ \to {(ϕ^{‡} R^{‡} R ϕ)}^{†} ϕ^{‡} R^{‡} R ϕ = χ^{2}$ .
Dirac Current: $ϕ^{‡} e_{0} ϕ \to ϕ^{‡} R^{‡} e_{0} R ϕ = ϕ^{‡} e_{0} ϕ$ , since $[f_{i}, e_{0}] = 0$ .

□

Theorem 13

(Gauge Connections). The covariant derivative

D_{μ} = \partial_{μ} + A_{μ}

emerges to maintain invariance under local

R (x)

.

Proof.

Minimal Coupling: To preserve $D_{μ} ϕ \to R (x) D_{μ} ϕ$ , the derivative must transform as $\partial_{μ} \to \partial_{μ} + A_{μ}$ , where $A_{μ} = f_{i} A_{μ}^{i} (x)$ .
Gauge Field Definition: Let $\partial_{μ} R (x) = A_{μ} R (x)$ , then: $D_{μ} ϕ = \partial_{μ} ϕ + A_{μ} ϕ \Rightarrow D_{μ} (R ϕ) = R D_{μ} ϕ .$
Clifford Algebra Embedding: The $A_{μ}$ are bivector fields in $C ℓ (2 n)$ , ensuring $A_{μ} \in g$ (the Lie algebra of G).

□
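The covariance property $D_{μ} (R ϕ) = R D_{μ} ϕ$ can be checked numerically in the simplest Abelian case. The sketch below assumes a $U (1)$ phase $R (x) = e^{i θ (x)}$ in one dimension, a connection $A = i a (x)$ , and the standard textbook transformation law $a \to a - θ^{'}$ ; these choices, the sample functions and all names are illustrative assumptions rather than the construction used in the proof above.

```python
import cmath
import math

H = 1e-5  # finite-difference step


def derivative(f, x):
    """Central-difference derivative of a complex-valued function."""
    return (f(x + H) - f(x - H)) / (2 * H)


def field(x):
    return cmath.exp(2.0j * x) * (1.0 + 0.3 * math.sin(x))


def connection(x):
    return 0.5 * math.cos(x)          # a(x), with A = i a(x)


def angle(x):
    return 0.8 * x * x                # theta(x)


def angle_prime(x):
    return 1.6 * x                    # theta'(x), computed by hand


def covariant_derivative(f, a, x):
    """D f = f' + i a f."""
    return derivative(f, x) + 1j * a(x) * f(x)


def rotated_field(x):
    return cmath.exp(1j * angle(x)) * field(x)


def rotated_connection(x):
    return connection(x) - angle_prime(x)


x0 = 0.4
lhs = covariant_derivative(rotated_field, rotated_connection, x0)
rhs = cmath.exp(1j * angle(x0)) * covariant_derivative(field, connection, x0)
print(lhs, rhs)
```

The two values agree to within the finite-difference error, whereas the plain derivative of the rotated field would pick up an extra term proportional to $θ^{'}$ .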

Theorem 14

(Field Strength). The commutator

F_{μ ν} = [D_{μ}, D_{ν}]

defines the field strength.

Proof. Kinetic Energy: The kinetic energy expands to include the field strength tensor:

\begin{matrix} \frac{{(ϕ^{‡} γ_{0} e^{μ} D_{μ} ϕ)}^{†} ϕ^{‡} γ_{0} e^{ν} D_{ν} ϕ}{{(ϕ^{‡} ϕ)}^{†} ϕ^{‡} ϕ} = kinetic terms + F_{μ ν} \end{matrix}

(156)

where

F_{μ ν}

is the field strength (shown in Definition 26). □
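That the commutator of covariant derivatives acts as multiplication by the field strength can be verified numerically for an Abelian connection. The following minimal sketch assumes a two-dimensional $U (1)$ example with $D_{μ} = \partial_{μ} + i a_{μ}$ , for which $[D_{x}, D_{y}] ϕ = i (\partial_{x} a_{y} - \partial_{y} a_{x}) ϕ$ . The connection components, the test field and the helper names are hypothetical choices for illustration.

```python
import cmath
import math

H = 1e-3  # finite-difference step


def a_x(x, y):
    return math.sin(y)


def a_y(x, y):
    return x * x


def phi(x, y):
    return cmath.exp(1j * (x + 2.0 * y)) * (1.0 + 0.1 * x * y)


def d_x(f):
    """Covariant derivative along x, returned as a new function of (x, y)."""
    return lambda x, y: ((f(x + H, y) - f(x - H, y)) / (2 * H)
                         + 1j * a_x(x, y) * f(x, y))


def d_y(f):
    """Covariant derivative along y, returned as a new function of (x, y)."""
    return lambda x, y: ((f(x, y + H) - f(x, y - H)) / (2 * H)
                         + 1j * a_y(x, y) * f(x, y))


x0, y0 = 0.6, -0.2
commutator = d_x(d_y(phi))(x0, y0) - d_y(d_x(phi))(x0, y0)

# F_xy = d_x a_y - d_y a_x, differentiated by hand for the sample connection.
field_strength = 2.0 * x0 - math.cos(y0)
expected = 1j * field_strength * phi(x0, y0)
print(commutator, expected)
```

All derivative terms acting on the field cancel in the commutator, leaving only the curvature of the connection multiplied by the field.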

Theorem 15

(Yang-Mills Action). The spectral action over the kinetic energy includes the kinetic term

\int tr (F_{μ ν} F^{μ ν})

.

Proof. Heat Kernel Expansion: As shown in Equation 152 (see A. H. Chamseddine and Alain Connes [7] for method), the field strength term of the spectral action

S = tr (f (D^{2} / Λ^{2}))

expands as:

S \sim \int (\dots + F_{μ ν}^{a} F^{a μ ν} + \dots) \sqrt{- | g |} d^{4} x .

□

Revisiting Yang-Mills with this perspective shows that the natural constraint is sufficient to entail the theory through the principle of entropy maximization—in this formulation, Yang-Mills axioms 1, 2, 3, 4, and 5 are not fundamental, but the solution to the optimization problem.

2.4. Dimensional Obstructions

In this section, we explore the dimensional obstructions that arise when attempting to solve the entropy maximization problem for other dimensional configurations. We found that all geometric configurations except the previously explored cases are obstructed. By obstructed, we mean that the solution to the entropy maximization problem,

ρ

, does not satisfy all axioms of probability theory. These obstructions also hold for the less restrictive interpretation in 3+1D of

χ

as an information density, because this interpretation nonetheless requires positive-definiteness which is not satisfied in other dimensional configurations.

\begin{matrix}
Dimensions & Optimal Predictive Theory of Nature & \\
GA (0) & Statistical Mechanics & (157) \\
GA (0, 1) & Quantum Mechanics & (158) \\
GA (1, 0) & Obstructed (Negative probabilities) & (159) \\
GA (2, 0) & Quantum Mechanics & (160) \\
GA (1, 1) & Obstructed (Negative probabilities) & (161) \\
GA (0, 2) & Obstructed (Non-real probabilities) & (162) \\
GA (3, 0) & Obstructed (Non-real probabilities) & (163) \\
GA (2, 1) & Obstructed (Non-real probabilities) & (164) \\
GA (1, 2) & Obstructed (Non-real probabilities) & (165) \\
GA (0, 3) & Obstructed (Non-real probabilities) & (166) \\
GA (4, 0) & Obstructed (Non-real probabilities) & (167) \\
GA (3, 1) & Gravity + Yang-Mills & (168) \\
GA (2, 2) & Obstructed (Negative probabilities) & (169) \\
GA (1, 3) & Obstructed (Non-real probabilities) & (170) \\
GA (0, 4) & Obstructed (Non-real probabilities) & (171) \\
GA (5, 0) & Obstructed (Non-real probabilities) & (172) \\
⋮ & ⋮ & \\
GA (6, 0) & Suspected Obstructed (No observables) & (173) \\
⋮ & ⋮ &
\end{matrix}

Let us now demonstrate the obstructions mentioned above.

Theorem 16

(Non-real probabilities). The determinant of the matrix representation of the geometric algebras in this category is either complex-valued or quaternion-valued, making them unsuitable as a probability.

Proof.

These geometric algebras are classified as follows:

\begin{matrix} GA (0, 2) ≅ H \end{matrix}

(174)

\begin{matrix} GA (3, 0) ≅ M_{2} (C) \end{matrix}

(175)

\begin{matrix} GA (2, 1) ≅ M_{2}^{2} (R) \end{matrix}

(176)

\begin{matrix} GA (1, 2) ≅ M_{2} (C) \end{matrix}

(177)

\begin{matrix} GA (0, 3) ≅ H^{2} \end{matrix}

(178)

\begin{matrix} GA (4, 0) ≅ M_{2} (H) \end{matrix}

(179)

\begin{matrix} GA (1, 3) ≅ M_{2} (H) \end{matrix}

(180)

\begin{matrix} GA (0, 4) ≅ M_{2} (H) \end{matrix}

(181)

\begin{matrix} GA (5, 0) ≅ M_{2}^{2} (H) \end{matrix}

(182)

The determinant of these objects is valued in $C$ or in $H$, where $C$ denotes the complex numbers and $H$ the quaternions. □
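The obstruction can be made concrete with a small numerical check. The sketch below is not part of the original derivation: it assumes the standard Pauli-matrix representation as one realization of $GA (3, 0) ≅ M_{2} (C)$, and the helper names `matmul`, `det2`, and `u_matrix` are hypothetical. It evaluates the determinant of the multivector $u = a + b e_{1} e_{2} e_{3}$ and shows that it is not real.

```python
# Sketch: GA(3,0) represented by the Pauli matrices (an assumed, standard choice
# of the isomorphism GA(3,0) ≅ M_2(C)); helper names are illustrative only.

def matmul(A, B):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def det2(A):
    """Determinant of a 2x2 matrix."""
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]

IDENTITY = [[1, 0], [0, 1]]
SIGMA_1 = [[0, 1], [1, 0]]        # represents e_1
SIGMA_2 = [[0, -1j], [1j, 0]]     # represents e_2
SIGMA_3 = [[1, 0], [0, -1]]       # represents e_3

# The pseudoscalar e_1 e_2 e_3 is represented by i times the identity.
PSEUDOSCALAR = matmul(matmul(SIGMA_1, SIGMA_2), SIGMA_3)

def u_matrix(a, b):
    """Matrix representing the multivector u = a + b e_1 e_2 e_3."""
    return [[a * IDENTITY[i][j] + b * PSEUDOSCALAR[i][j] for j in range(2)]
            for i in range(2)]

det_u = det2(u_matrix(1.0, 1.0))  # equals (1 + 1j)**2, which is not real
print(det_u)
```

In this representation the determinant is $(a + i b)^{2}$, which has a non-zero imaginary part whenever both $a$ and $b$ are non-zero.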

Theorem 17

(Negative probabilities). The even sub-algebra of these dimensional configurations allows for negative probabilities, making them unsuitable.

Proof.

This category contains three dimensional configurations:

$GA (1, 0)$ :: Let $ψ = a + b e_{1}$ , then:

$\begin{matrix} {(a + b e_{1})}^{‡} (a + b e_{1}) = (a - b e_{1}) (a + b e_{1}) = a^{2} - b^{2} e_{1} e_{1} = a^{2} - b^{2} \end{matrix}$

(183)

which is valued in $R$ .
$GA (1, 1)$ :: Let $ψ = a + b e_{0} e_{1}$ , then:

$\begin{matrix} {(a + b e_{0} e_{1})}^{‡} (a + b e_{0} e_{1}) = (a - b e_{0} e_{1}) (a + b e_{0} e_{1}) = a^{2} - b^{2} e_{0} e_{1} e_{0} e_{1} = a^{2} - b^{2} \end{matrix}$

(184)

which is valued in $R$ .
$GA (2, 2)$ :: Let $ψ = a + b e_{0} e_{\emptyset} e_{1} e_{2}$ , where $e_{0}^{2} = - 1, e_{\emptyset}^{2} = - 1, e_{1}^{2} = 1, e_{2}^{2} = 1$ , then:

$\begin{matrix} {({(a + b)}^{‡} (a + b))}^{†} {(a + b)}^{‡} (a + b) \end{matrix}$

(185)

$\begin{matrix} = {(a^{2} + 2 a b + b^{2})}^{†} (a^{2} + 2 a b + b^{2}) \end{matrix}$

(186)

We note that ${(b e_{0} e_{\emptyset} e_{1} e_{2})}^{2} = b^{2} e_{0} e_{\emptyset} e_{1} e_{2} e_{0} e_{\emptyset} e_{1} e_{2} = b^{2}$ , therefore:

$\begin{matrix} = (a^{2} + b^{2} - 2 a b) (a^{2} + b^{2} + 2 a b) \end{matrix}$

(187)

$\begin{matrix} = {(a^{2} + b^{2})}^{2} - 4 a^{2} b^{2} \end{matrix}$

(188)

$\begin{matrix} = {(a^{2} + b^{2})}^{2} - 4 a^{2} b^{2} \end{matrix}$

(189)

which is valued in $R$ .

In all of these cases the probability can be negative. □
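The blade squares used in the proof above can be checked mechanically. The following is a minimal sketch, assuming basis blades are encoded as bitmasks and that basis vectors are ordered with the negative-square ones first (following the convention stated for $GA (2, 2)$); the function names `blade_product`, `multiply`, and `conjugated_product` are hypothetical. It evaluates the products in (183) and (184) for $a = 1$, $b = 2$, and the square of the $GA (2, 2)$ pseudoscalar.

```python
# Sketch: a tiny geometric-product routine for checking the squares used above.
# Basis blades are bitmasks (bit k set means the k-th basis vector is present);
# signature[k] is the square of the k-th basis vector. All names are illustrative.

def blade_product(a, b, signature):
    """Return (sign, blade) for the geometric product of basis blades a and b."""
    # Count the transpositions needed to bring the product into canonical order.
    swaps, shifted = 0, a >> 1
    while shifted:
        swaps += bin(shifted & b).count("1")
        shifted >>= 1
    sign = -1 if swaps % 2 else 1
    # Each basis vector common to both blades contracts to its square.
    for k, square in enumerate(signature):
        if ((a & b) >> k) & 1:
            sign *= square
    return sign, a ^ b

def multiply(u, v, signature):
    """Geometric product of multivectors stored as {blade: coefficient} dicts."""
    result = {}
    for blade_u, coeff_u in u.items():
        for blade_v, coeff_v in v.items():
            sign, blade = blade_product(blade_u, blade_v, signature)
            result[blade] = result.get(blade, 0) + sign * coeff_u * coeff_v
    return result

def conjugated_product(a, b, blade, signature):
    """Evaluate (a - b B)(a + b B) for a basis blade B, as in (183) and (184)."""
    return multiply({0: a, blade: -b}, {0: a, blade: b}, signature)

a, b = 1, 2
ga10 = conjugated_product(a, b, 0b1, [1])         # B = e_1 in GA(1,0)
ga11 = conjugated_product(a, b, 0b11, [-1, 1])    # B = e_0 e_1 in GA(1,1)
pseudoscalar_square = blade_product(0b1111, 0b1111, [-1, -1, 1, 1])  # GA(2,2)
print(ga10[0], ga11[0], pseudoscalar_square)
```

For this choice of $a$ and $b$ the scalar parts of the first two products equal $a^{2} - b^{2} = - 3$, a negative value, and the pseudoscalar of $GA (2, 2)$ squares to $+ 1$.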

Conjecture 1 (No observables (6D)). The multivector representation of the norm in 6D cannot satisfy any observables.

(Argument). In six dimensions and above, the self-product patterns found in Definition 16 collapse. The research by Acus et al. [8] in 6D geometric algebra concludes that the determinant, so far defined through a self-product of the multivector, fails to extend into 6D. The crux of the difficulty is evident in the reduced case of a 6D multivector containing only scalar and grade-4 elements:

\begin{matrix} s (B) = b_{1} B f_{5} (f_{4} (B) f_{3} (f_{2} (B) f_{1} (B))) + b_{2} B g_{5} (g_{4} (B) g_{3} (g_{2} (B) g_{1} (B))) \end{matrix}

(190)

This equation is not a multivector self-product but a linear sum of two multivector self-products [8].

The full expression is given in the form of a system of 4 equations, which is too long to list in its entirety. A small characteristic part is shown:

\begin{matrix} a_{0}^{4} - 2 a_{0}^{2} a_{47}^{2} + b_{2} a_{0}^{2} a_{47}^{2} p_{412} p_{422} + 〈 72 monomials 〉 = 0 \end{matrix}

(191)

\begin{matrix} b_{1} a_{0}^{3} a_{52} + 2 b_{2} a_{0} a_{47}^{2} a_{52} p_{412} p_{422} p_{432} p_{442} p_{452} + 〈 72 monomials 〉 = 0 \end{matrix}

(192)

\begin{matrix} 〈 74 monomials 〉 = 0 \end{matrix}

(193)

\begin{matrix} 〈 74 monomials 〉 = 0 \end{matrix}

(194)

From Equation 190, it is possible to see that no observable $O$ can satisfy this equation because the linear combination does not allow one to factor it out of the equation.

\begin{matrix} b_{1} O B f_{5} (f_{4} (B) f_{3} (f_{2} (B) f_{1} (B))) + b_{2} B g_{5} (g_{4} (B) g_{3} (g_{2} (B) g_{1} (B))) = b_{1} B f_{5} (f_{4} (B) f_{3} (f_{2} (B) f_{1} (B))) + b_{2} O B g_{5} (g_{4} (B) g_{3} (g_{2} (B) g_{1} (B))) \end{matrix}

(195)

Any equality of the above type between $b_{1} O$ and $b_{2} O$ is frustrated by the factors $b_{1}$ and $b_{2}$, forcing $O = 1$ as the only satisfying observable. Since the obstruction occurs within grade-4, which is part of the even sub-algebra, it is questionable whether a satisfactory theory (with non-trivial observables) can be constructed in 6D using our method. □

This conjecture proposes that the multivector representation of the determinant in 6D does not allow for the construction of non-trivial observables, which is a crucial requirement for a relevant quantum formalism. The linear combination of multivector self-products in the 6D expression prevents the factorization of observables, limiting their role to the identity operator.

Conjecture 2 (No observables (above 6D)). The norms beyond 6D are progressively more complex than the 6D case, which is already obstructed.

These theorems and conjectures provide additional insights into the unique role of the unobstructed 3+1D signature in our proposal.

It is also interesting that our proposal is able to rule out $GA (1, 3)$ even though, in relativity, the signature of the metric $(+, -, -, -)$ versus $(-, -, -, +)$ does not influence the physics. However, in geometric algebra, $GA (1, 3)$ represents 1 space dimension and 3 time dimensions. Therefore, it is not the signature itself that is ruled out but rather the specific arrangement of 3 time and 1 space dimensions, as this configuration yields quaternion-valued "probabilities" (i.e., $GA (1, 3) ≅ M_{2} (H)$ and $det M_{2} (H) \in H$).

3. Discussion

When asked to define what a physical theory is, an informal answer might be that it is a set of equations that applies to all experiments realizable within a domain, with nature as a whole being the most general domain. While physicists have expressed these theories through sets of axioms, we propose a more direct approach—mathematically realizing the fundamental definition itself. This definition is realized as a constrained optimization problem (Axiom 1 and Definition 1) that can be solved directly (Theorem 1). The solution to this optimization problem yields precisely those structures that realize the physical theory over said domain. Succinctly, physics is the solution to:

\begin{matrix} \underbrace{L}_{\text{an optimization problem}} : = \underbrace{- \sum_{i} ρ_{i} (τ) ln \frac{ρ_{i} (τ)}{ρ_{i} (0)}}_{\text{on the entropy of a measurement relative to its preparation over all}} + \underbrace{λ (1 - \sum_{i} ρ_{i} (τ))}_{\text{predictive theories}} + \underbrace{τ tr (\bar{M} - \sum_{i} ρ_{i} (τ) M_{i})}_{\text{of nature}} \end{matrix}

(196)

The relative Shannon entropy represents the basic structure of any experiment, quantifying the informational difference between its initial preparation and its final measurement.

The natural constraint is chosen to be the most general structure that admits a solution to this optimization problem. This generality follows from key mathematical requirements. The constraint must involve quantities that form an algebra, as the solution requires taking exponentials:

\begin{matrix} exp X = 1 + X + \frac{1}{2} X^{2} + \dots \end{matrix}

(197)

which involves addition, powers, and scalar multiplication of $X$. The use of the trace operation further necessitates that $X$ must be represented by square $n \times n$ matrices. Thus Axiom 1 involves $n \times n$ matrices:

\begin{matrix} \bar{M} : = \sum_{i} ρ_{i} M_{i} \end{matrix}

(198)
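The requirement that the exponential needs only addition, powers, and scalar multiplication can be illustrated numerically. The sketch below is an illustration only, not the paper's construction: the helper names (`mat_add`, `mat_scale`, `mat_mul`, `mat_exp`) and the example matrix are hypothetical choices. It evaluates a truncated version of the series (197) for a $2 \times 2$ matrix using nothing but those three operations, then takes the trace to obtain a scalar.

```python
import math

def mat_add(A, B):
    """Entrywise sum of two square matrices."""
    return [[A[i][j] + B[i][j] for j in range(len(A))] for i in range(len(A))]

def mat_scale(c, A):
    """Scalar multiple of a matrix."""
    return [[c * x for x in row] for row in A]

def mat_mul(A, B):
    """Product of two square matrices."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def mat_exp(X, terms=30):
    """Truncated power series exp X = 1 + X + X^2/2 + ... (illustrative helper)."""
    n = len(X)
    identity = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    result, power = identity, identity
    for k in range(1, terms):
        power = mat_scale(1.0 / k, mat_mul(power, X))   # now equals X^k / k!
        result = mat_add(result, power)
    return result

theta = 0.5
X = [[0.0, -theta], [theta, 0.0]]      # example 2x2 matrix with zero trace
R = mat_exp(X)                         # a rotation by the angle theta
trace_R = R[0][0] + R[1][1]            # the trace maps the matrix back to a scalar
print(R, trace_R)
```

For this particular $X$ the series sums to the rotation matrix with entries $cos θ$ and $\pm sin θ$, so the trace evaluates to $2 cos θ$.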

The trace operation is utilized because the constraint must be converted back to a scalar for use in the Lagrange multiplier equation; while any function that maps an algebra to a scalar would achieve that, picking the trace recovers QM in the $GA (0, 1) ≅ C$ case and SM in the $GA (0) ≅ R$ case.

These mathematical requirements demonstrate that the natural constraint, as it admits the minimal mathematical structure required to solve an arbitrary entropy maximization problem, can be understood as the most general extension of the statistical mechanics average energy constraint which contains QM and SM (as induced by the trace) as specific solutions.

Thus, having established both the mathematical structure and its generality, we can understand how this minimal ontology operates. Since our formulation keeps the structure of experiments completely general, our optimization considers all possible predictive theories for that structure, and the constraint is the most general constraint possible for that structure, the resulting optimal physical theory applies, by construction, to all realizable experiments within its domain.

This ontology is both operational, being grounded in the basic structure of experiments rather than abstract entities, and constructive, showing how physical laws emerge from optimization over all possible predictive theories subject to the natural constraint. This represents a significant philosophical shift from traditional physical ontologies where laws are typically taken as primitive.

The next step in our derivation is to represent the determinant of the $n \times n$ matrices through a self-product of multivectors involving various conjugate structures. By examining the various dimensional configurations of geometric algebras, we find that $GA (3, 1)$, representing $4 \times 4$ real matrices, admits a sub-algebra whose determinant is positive-definite for its invertible members. All other dimensional configurations fail to admit such a positive-definite structure, with two exceptions: statistical mechanics (found in $GA (0)$) and quantum mechanics (found in $GA (0, 1)$ and in a sub-algebra of $GA (2, 0)$).

The solution reveals that the 3+1D case harbours a new type of field amplitude structure analogous to complex amplitudes, one that exhibits the characteristic elements of a quantum mechanical theory. Instead of complex-valued amplitudes, we have amplitudes valued in the invertible subset of the even sub-algebra of $GA (3, 1)$. When normalized, this amplitude is identical to David Hestenes’ wavefunction, but comes with an extended Born rule represented by the determinant, and rather than a complex Hilbert space, it lives in a "double-product structure". This double-product structure automatically incorporates gravity via the $Spin (3, 1)$ connection and local gauge theories as Yang-Mills theories. The square of the Dirac operator, automatically generated by the Lagrangian, then generates the invariants of gravity and of the Yang-Mills theory via a heat kernel expansion, along with the matter fields quantifying the system’s information via surprisal and limiting its propagation speed.

3.1. Proposed Interpretation of QM

An experiment begins with a known initial preparation $ρ (0)$, evolves under a constraint (Axiom 1), and ends with a final measurement $ρ (τ)$. By treating the experiment as the fundamental ontic entity, we resolve a redundancy inherent in traditional physical theories. Specifically, physics is not a set of laws that are simultaneously axiomatic and validated by experiments (a redundancy, since that which is validated by something else is not axiomatic) but an optimal interpolation device connecting $ρ (0)$ to $ρ (τ)$ under the constraint of nature. The experiment is fundamental, but the physical laws that are derived from it are not.

3.1.1. Demystifying the Measurement Problem

Given a statistical ensemble $E$, and some probability measure $ρ$ over $E$, our derivation demonstrates that QM is the optimal interpolation device that connects $ρ (0)$ to $ρ (τ)$, under the constraint of nature. This is different from an interpolation from $ρ (τ)$ to some $q \in E$, which would be required for a 'collapse' to occur. Thus, the final sampling (from $ρ (τ)$ to $q \in E$) exists outside of QM (defined from $ρ (0)$ to $ρ (τ)$).

If QM cannot account for the collapse, what can? Foundational to our framework is the notion of the experiment. This notion supersedes QM (the latter being its derived product) and is sufficient to demystify the collapse. In the introduction, we have stated that Definition 1 represents the set of all experiments realizable within a domain. In practice, however, we must perform each experiment atomically—the set of all realizable experiments is derived from many such experiments.

An atomic experiment will be defined as a pair of elements of $E$, where the first element of the pair is the initial measurement outcome, and the second element is the final measurement outcome. As an example, let us consider an experimental run comprising $n$ atomic experiments over a two-state ensemble $E = \{q_{1}, q_{2}\}$:

\begin{matrix} E_{1} & = (q_{1}, q_{1}) \end{matrix}

(199)

\begin{matrix} E_{2} & = (q_{1}, q_{1}) \end{matrix}

(200)

\begin{matrix} E_{3} & = (q_{2}, q_{1}) \\ ⋮ & ⋮ \end{matrix}

(201)

\begin{matrix} E_{n} & = (q_{2}, q_{2}) \end{matrix}

(202)

Assuming the law of large numbers, one can construct representative probability measures $ρ (0)$ and $ρ (τ)$. Specifically, $ρ_{i} (0)$ is obtained by counting the occurrences of $q_{i}$ in the first element of the pairs and dividing by $n$, and $ρ_{i} (τ)$ by counting the occurrences of $q_{i}$ in the second element of the pairs and also dividing by $n$. This gives us the starting and ending points to define the set of all realizable experiments using the probability measure representation $ρ (0)$ and $ρ (τ)$.

We can show that the map from experimental runs to probability measure representation is many-to-one, making it non-invertible. Indeed, consider two experimental runs:

\begin{matrix} Run 1 & Run 2 \end{matrix}

\begin{matrix} E_{1} = (q_{1}, q_{1}) & E_{1}^{'} = (q_{2}, q_{1}) \end{matrix}

(203)

\begin{matrix} E_{2} = (q_{2}, q_{2}) & E_{2}^{'} = (q_{1}, q_{2}) \end{matrix}

(204)

Since both of these runs, although different, produce the same $ρ (0)$ and the same $ρ (τ)$, the map must in general be non-invertible.
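The counting procedure and the two runs above can be reproduced directly. The following is a minimal sketch, assuming outcomes are labelled by the strings `"q1"` and `"q2"`; the function name `empirical_measures` is hypothetical. It computes the relative frequencies of the initial and final outcomes for each run and compares the resulting measures.

```python
from collections import Counter

def empirical_measures(run, ensemble):
    """Return (rho_0, rho_tau) as dicts of relative frequencies for a run.

    `run` is a list of atomic experiments (initial outcome, final outcome).
    """
    n = len(run)
    initial = Counter(first for first, _ in run)
    final = Counter(second for _, second in run)
    rho_0 = {q: initial[q] / n for q in ensemble}
    rho_tau = {q: final[q] / n for q in ensemble}
    return rho_0, rho_tau

ensemble = ["q1", "q2"]
run_1 = [("q1", "q1"), ("q2", "q2")]   # equations (203)-(204), left column
run_2 = [("q2", "q1"), ("q1", "q2")]   # equations (203)-(204), right column

measures_1 = empirical_measures(run_1, ensemble)
measures_2 = empirical_measures(run_2, ensemble)
print(measures_1 == measures_2)  # the runs differ, the measures do not
```

Both runs map to the uniform measure for $ρ (0)$ and for $ρ (τ)$, so the run cannot be recovered from the pair of measures alone.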

From this, we can deduce that the measurement problem is an artifact of idealized statistical inference. Specifically, claiming a probability measure representation from the law of large numbers allows us to discard the notion of atomic experiments, yielding a tractable but imperfect representation of reality. The measurement collapse problem is then an attempt to make this representation perfect again by inverting the map (i.e., to express reality in terms of atomic experiments rather than probability measures), but failing to do so because the map is non-invertible.

3.1.2. Dissolving the Measurement Problem

To dissolve the measurement problem, it is important to understand that our approach reframes the preparation of quantum states as an initial measurement: the initial preparation is $ρ (0)$, not $ψ (0)$. Fundamental physical evolution is then understood in terms of atomic experiments mapping an initial measurement outcome to a final measurement outcome. At this fundamental level, the measurement problem is entirely dissolved. This operational perspective aligns with laboratory practice but challenges the standard formulation, which takes $ψ (0)$ as its initial preparation instead of $ρ (0)$.

Core Argument:

1. We propose that a well-defined experiment begins with a measurement outcome $q \in E$, not an abstract quantum state $ψ (0)$.

2. Example: Preparing $| ψ 〉 = \frac{1}{\sqrt{2}} (| 0 〉 + | 1 〉)$ requires:

(a): Measure systems to collapse to $| 0 〉$ or $| 1 〉$ .
(b): Discard all systems in state $| 1 〉$ .
(c): Apply a Hadamard gate H to $| 0 〉$ .
(d): The preparation is complete.

Neglecting the initial measurement (a) implies that systems of unknown states are sent into the Hadamard gate—the resulting experiment is ill-defined.

Challenges and Solutions:

1. Objection 1: Preparation Without Collapse

(a) Issue: Traditional QM superficially appears to allow preparing $| ψ 〉$ without collapsing it (e.g., via unitary gates, cooling, etc.).

(b) Response: In practice, all preparations are validated by measurement (or an equivalent).

(c) Example:

i.: Cooling various qubits $| ψ 〉$ to $| 0 〉$ is non-invertible (one cannot return to the initial $| ψ 〉$ because of dissipative effects). The end result is mathematically equivalent to a measurement yielding $| 0 〉$ or $| 1 〉$ followed by a discard of $| 1 〉$ .
ii.: Creating $| + 〉 = H | 0 〉$ requires assuming the initial $| 0 〉$ , validated by prior conditions.

2. Objection 2: Loss of Quantum Coherence

(a) Issue: If preparation starts with a measurement, how do we account for coherence (e.g., interference)?

(b) Response: Coherence emerges operationally.

(c) Example:

i.: Measure systems to collapse to $| 0 〉$ or $| 1 〉$ .
ii.: Discard all systems in state $| 1 〉$ .
iii.: Apply H to many initial $| 0 〉$ -verified states.
iv.: Aggregate final measurements ( $q \in E$ ) show interference patterns, even though individual experiments start with collapsed states.

3. Objection 3: Entanglement and Nonlocality

(a) Issue: Entangled states require joint preparation of superpositions.

(b) Response: Entanglement is preparable from an initial measurement like any other state.

(c) Example:

i.: Measure systems to collapse to $| 00 〉$ , $| 01 〉$ , $| 10 〉$ , or $| 11 〉$ .
ii.: Discard all systems in state $| 01 〉$ , $| 10 〉$ , and $| 11 〉$ .
iii.: Apply a Hadamard gate to the first qubit: $(H \otimes I) | 00 〉 = \frac{1}{\sqrt{2}} (| 0 〉 + | 1 〉) \otimes | 0 〉 = \frac{1}{\sqrt{2}} (| 00 〉 + | 10 〉)$
iv.: Apply a $CNOT$ gate (with first qubit as control, second as target): $CNOT [\frac{1}{\sqrt{2}} (| 00 〉 + | 10 〉)] = \frac{1}{\sqrt{2}} (| 00 〉 + | 11 〉)$

The final state $\frac{1}{\sqrt{2}} (| 00 〉 + | 11 〉)$ is an entangled state; specifically, it is one of the Bell states (sometimes denoted as $| Φ^{+} 〉$).
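The entanglement example in Objection 3 can be traced numerically. The sketch below is illustrative only: it assumes the basis order $| 00 〉$, $| 01 〉$, $| 10 〉$, $| 11 〉$ and uses hypothetical helpers `kron` and `apply`. It starts from the post-selected state $| 00 〉$, applies $H \otimes I$, and then applies the $CNOT$ gate.

```python
import math

def kron(A, B):
    """Kronecker product of two matrices given as nested lists."""
    return [[x * y for x in row_a for y in row_b] for row_a in A for row_b in B]

def apply(M, v):
    """Matrix-vector product."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

s = 1 / math.sqrt(2)
H = [[s, s], [s, -s]]                     # Hadamard gate
I2 = [[1, 0], [0, 1]]                     # single-qubit identity
CNOT = [[1, 0, 0, 0],                     # first qubit control, second target,
        [0, 1, 0, 0],                     # basis order |00>, |01>, |10>, |11>
        [0, 0, 0, 1],
        [0, 0, 1, 0]]

ket_00 = [1, 0, 0, 0]                     # state kept after steps i and ii
after_hadamard = apply(kron(H, I2), ket_00)   # step iii
bell = apply(CNOT, after_hadamard)            # step iv
probabilities = [abs(c) ** 2 for c in bell]
print(bell, probabilities)
```

The resulting amplitudes are non-zero only on $| 00 〉$ and $| 11 〉$, each with probability one half, which matches the Bell state obtained in step iv.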

In all cases, neglecting the initial measurement results in systems of unknown state entering the experiment, making it ill-defined. An ill-defined experiment is still potentially insightful but not sufficient to uniquely entail QM from entropy optimization; we may call an ill-defined experiment an observation (see footnotes 1, 2, and 3).

The complete picture is that QM is an optimal interpolation device derived from a limiting case of atomic experiments mapping initial measurements to final measurements. The measurement problem is entirely dissolved at the level of atomic experiments, but emerges in QM proper due to the non-invertibility of the limiting process.

4. Conclusion

E.T. Jaynes fundamentally reoriented statistical mechanics by recasting it as a problem of inference rather than mechanics. His approach revealed that the equations of thermodynamics are not arbitrary physical laws but necessary consequences of maximizing entropy subject to constraints. This work extends Jaynes’ inferential paradigm to address a more fundamental question: what is a physical theory itself?

A physical theory, at its essence, is a set of equations that applies to all experiments realizable within a domain. While this definition is informal, our contribution lies in making this concept mathematical. By formulating it as an optimization problem—minimizing the relative entropy of measurement outcomes subject to the natural constraint—we transform an abstract definition into a precise, solvable mathematical problem.

This approach represents a profound methodological shift. Rather than constructing physical theories through trial and error enumerations of axioms, we derive them as necessary solutions to a well-defined optimization problem. Physics thus emerges not as a collection of independently discovered laws but as the unique optimal interpolation device between arbitrary experimental preparation and measurement under the constraint of nature.

The power of this formulation lies in its generality: by varying only the algebraic structure of the constraint, we recover established physical theories as special cases of the same optimization principle. Jaynes showed that statistical inference with minimal assumptions yields thermodynamics; we suggest that this same principle, properly generalized, may yield the foundation to all of physics.

Statements and Declarations

Funding: This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
Competing Interests: The author declares that he has no competing financial or non-financial interests that are directly or indirectly related to the work submitted for publication.
Data Availability Statement: No datasets were generated or analyzed during the current study.
During the preparation of this manuscript, we utilized a Large Language Model (LLM), for assistance with spelling and grammar corrections, as well as for minor improvements to the text to enhance clarity and readability. This AI tool did not contribute to the conceptual development of the work, data analysis, interpretation of results, or the decision-making process in the research. Its use was limited to language editing and minor textual enhancements to ensure the manuscript met the required linguistic standards.

Appendix E SM

Here, we solve the Lagrange multiplier equation of SM.

\begin{matrix} L : = \underbrace{- k_{B} \sum_{i} ρ_{i} ln ρ_{i}}_{\text{Boltzmann Entropy}} + \underbrace{λ (1 - \sum_{i} ρ_{i})}_{\text{Normalization Constraint}} + \underbrace{β (\bar{E} - \sum_{i} ρ_{i} E_{i})}_{\text{Average Energy Constraint}} \end{matrix}

(A205)

We solve the maximization problem as follows, working in units where $k_{B} = 1$:

\begin{matrix} 0 & = \frac{\partial L (ρ_{1}, \dots, ρ_{i}, \dots, ρ_{n})}{\partial ρ_{i}} \end{matrix}

(A206)

\begin{matrix} = - ln ρ_{i} - 1 - λ - β E_{i} \end{matrix}

(A207)

\begin{matrix} = ln ρ_{i} + 1 + λ + β E_{i} \end{matrix}

(A208)

\begin{matrix} \Rightarrow ln ρ_{i} & = - 1 - λ - β E_{i} \end{matrix}

(A209)

\begin{matrix} \Rightarrow ρ_{i} & = exp (- 1 - λ) exp (- β E_{i}) \end{matrix}

(A210)

\begin{matrix} = \frac{1}{Z (β)} exp (- β E_{i}) \end{matrix}

(A211)

The partition function is obtained as follows:

\begin{matrix} 1 & = \sum_{j} exp (- 1 - λ) exp (- β E_{j}) \end{matrix}

(A212)

\begin{matrix} \Rightarrow {(exp (- 1 - λ))}^{- 1} & = \sum_{j} exp (- β E_{j}) \end{matrix}

(A213)

\begin{matrix} Z (β) & = \sum_{j} exp (- β E_{j}) \end{matrix}

(A214)

Finally, the probability measure is:

\begin{matrix} ρ_{i} = \frac{1}{\sum_{j} exp (- β E_{j})} exp (- β E_{i}) \end{matrix}

(A215)
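The result (A215) can be checked numerically on a small example. The sketch below is an illustration under stated assumptions: the energy levels, the value of $β$, the perturbation direction `d`, and the function names `gibbs_measure` and `entropy` are all hypothetical choices, and $k_{B}$ is set to 1. It builds the Gibbs measure, then perturbs it along a direction that preserves both the normalization and the average energy, and compares the entropies.

```python
import math

def gibbs_measure(energies, beta):
    """Return (rho, Z) with rho_i = exp(-beta E_i) / Z, as in (A214)-(A215)."""
    weights = [math.exp(-beta * e) for e in energies]
    Z = sum(weights)
    return [w / Z for w in weights], Z

def entropy(rho):
    """Entropy -sum_i rho_i ln rho_i (with k_B set to 1)."""
    return -sum(p * math.log(p) for p in rho if p > 0)

energies = [0.0, 1.0, 2.0]   # illustrative energy levels
beta = 0.7                   # illustrative multiplier value
rho, Z = gibbs_measure(energies, beta)
mean_energy = sum(p * e for p, e in zip(rho, energies))

# Perturb along a direction that preserves normalization and mean energy:
# d sums to zero and sum_i d_i E_i = 0 for these particular energies.
d = [1.0, -2.0, 1.0]
eps = 0.01
perturbed = [p + eps * di for p, di in zip(rho, d)]
perturbed_mean = sum(p * e for p, e in zip(perturbed, energies))

print(sum(rho), entropy(rho) > entropy(perturbed))
```

Because the perturbed distribution satisfies the same two constraints, its lower entropy is consistent with the Gibbs measure being the constrained maximizer.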

Appendix F SageMath program showing $⌊u^{‡} u⌋_{3, 4} u^{‡} u = det ϕ (u)$

References

[1] Edwin T. Jaynes. Information theory and statistical mechanics. Physical Review, 106(4):620, 1957.
[2] Edwin T. Jaynes. Information theory and statistical mechanics. II. Physical Review, 108(2):171, 1957.
[3] Paul Adrien Maurice Dirac. The principles of quantum mechanics. Number 27. Oxford University Press, 1981.
[4] John von Neumann. Mathematical foundations of quantum mechanics: New edition, volume 53. Princeton University Press, 2018.
[5] David Hestenes. Spacetime physics with geometric algebra (page 6). American Journal of Physics, 71(7):691–714, 2003.
[6] Douglas Lundholm. Geometric (Clifford) algebra and its applications. arXiv preprint math/0605280, 2006.
[7] Ali H. Chamseddine and Alain Connes. The spectral action principle. Communications in Mathematical Physics, 186(3):731–750, 1997.
[8] A. Acus and A. Dargys. Inverse of multivector: Beyond p + q = 5 threshold. arXiv preprint arXiv:1712.05204, 2017.

1	The author suggests that observations, so defined, may constitute a broader conceptual category that could entail a richer landscape of effective theories beyond what experiments alone feasibly entail. Observations allow us to study parts of the universe whose complexity far exceeds our ability to precisely connect an initial preparation to a final measurement via unitary transformations in the laboratory. Accounting for this observed complexity suggests the development of effective theories across various domains, including biology, chemistry, complex systems theory, emergent phenomena, and cosmology. This extension of the optimization problem to observations, however, falls outside the scope of the current paper.
2	As statistical mechanics’ optimization problem does not reference an initial preparation, it could be argued, from these definitions, that it is based on observations and not on experiments.
3	This definition should not be taken as pejorative of observations.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
