1. Introduction
The current classical theory of probability distributions have been based on the presumptive Euclidean space of parameters and random variables. In this sense the joint probability densities being derived from sequential partial derivatives of corresponding cumulative distribution function [
1,
2,
3]. The non-Euclidean manifolds endowed with Riemannian metrics in the context of probability theory, have been introduced by many authors for phase and parameter space [
4,
5,
6,
7]. Moreover using differential geometry, some variations of information theory having been devised on non-Euclidean (Riemannian) metric spaces by Fisher, Rao, Amari and others [
5,
8]. In these approaches, probability distributions of various models exhibited as points on some Riemannian manifolds. By this background, differential geometry technique could be applied to analyze the probability distribution manifolds. The most applicable metric tensor on these spaces that first introduced by Fisher and Rao, is the so-called Fisher information metric [
9]. Applying the Riemannian metric to define the basic concepts in statistics such as mean and covariance matrix of random variable, has also been introduced in other works [
6,
7,
10]. In this short article we introduce a new model based on binary data arrays and matrices to analyse the variables distributions of a system of particles on the Riemannian manifold. A cloud of a large number of particles at a certain time is considered in the space of variables without analysis of its dynamical evolution and quantum physics uncertainties restrictions. The coordinates of variables in this space, is divided to infinitesimal intervals while each interval labelled by an ordered integer number. The variables of each particle occupies just one infinitesimal interval on each variable coordinate
labelled by some integer number
i. For any specified infinitesimal interval on each coordinates, there is a specific array with binary entries
that determines the particles whose
variable restricted to this interval. Based on this background, in sec (2) we develop the concept of particle oriented coordinates which span a flat Euclidean space on which we embed Riemannian manifolds of variable coordinates. Collection of all binary arrays of all variables, yield a binary matrix containing entire information of the particles system. By introducing a generalized inner product for a set of vectors, joint probability densities of variables, is calculated as inner product of the binary arrays that stands for some vectors in cotangent space at each point on manifold and consequently the tensor density properties of joint probability density is proved. In sec (3) based on tensor density property of the joint probabilities of variables at any point on manifold, we present a new definition for joint probability densities by symmetrized covariant derivative of cumulative distribution function. A new method to connect concepts in continuous and discrete probability theory and a novel interpretation of covariant derivative by generalized inner product has been proposed. In section (4) some examples that reveal the tensor density properties of famous physical entities are presented.
Definition 1.
The space (manifold) of variables spanned by coordinates with where d is the dimension of manifold. So the number of involved independent variables (degree of freedom) is d .The variable space in present article, generally presumed to be a Riemannian manifold with local coordinates and basis vectors at any point . If we divide the coordinate into small intervals , while is a large number, then integer i refers to the interval of this parameter. This means that i stands for the coordinate value of point p along and ranges between 0 and a large integer :
The whole manifold includes all coordinates and associated basis vectors. The overall number of intervals reads as:
In this setting the vector space is a lattice space where the coordinates of points on manifold specified by d digit numbers, therefor the related field of will be . At the limit manifold , is a smooth one.
Definition 2.
Regarding the definition of cumulative distribution or joint distribution function (CDF) [1,2]. We may define a function on any point as follows:
Axiom
For a system consisting of a large number particles, there is a smooth and differentiable probability density function on the manifold that yields the density of particles at any volume element in manifold. This postulate is consistent with the accepted postulates of the kinetic theory of gases.
2. Introducing Particle Dependent Coordinates
Assume a system consisting of a large number
N of particles confined in an interval of space-time. Taking into account of such a system of particles, brings us the advantage of choosing a sufficient huge number of particles, moreover we could substitute particles by any kind of systems defined by their arbitrary points in parametric space. Suppose a set of independent parameters labelled by
is to be considered in a small interval of time
. We label the particles by integer numbers up to
N , and divide the possible range of each parameter into such small intervals that satisfy the accuracy of the experiment. These intervals defined as in definition 1, denoted by
where
i stands for the ordered location number of an intervals on the coordinate
:
So the variable’s value of each particle falls in just one of these intervals.
Definition 3. Let the basis vectors span a vector space of the dimension N over the field , where N is the number of particles. regarded a lattice Euclidean spaces endowed with Cartesian coordinates. the particles are labelled by ordered integer numbers, then is the basis for the first particle and is the basis for second particle and so on. Here any particle specifies an independent (basis) coordinate with two possible values 0 and 1. Obviously these basis are orthogonal. We call these set of basis as particle oriented coordinate that as a coordinate chart is homeomorphic to a sub-space of Euclidean lattice space . In this case the dual basis presentation coincides the same original basis i.e. . The dual basis could also be represented as a binary array like ; . At any point on manifold the basis span the tangent space . The basis acts on the basis of tangent space at the point by the relation because is specified to the variable and acts on to return the component "1" for the related particle, while its action on which is independent from returns "0". Consequently, the vectors are living in cotangent space .
For each interval
, there are some particles that their variable
places in this interval. Let record the results of these outcomes in an array whose entries are 0 or 1 in such a way that for particles with parameter value in interval
,the corresponding value in array reads 1 and otherwise 0. For instance for the first particle, if its value of parameter
falls in the range of
, it returns 1, and otherwise 0. If this process iterates for all particles then we obtain an array of entries for this interval which could be arranged as a vector
. This vector is a binary array which carries the information of this interval for system of particles e.g.
Each of these 0 and 1 connected to a specific particle in the system and the total number of entries equals the total number of particles
N. In this example the first entry 1 means that the value of variable
for particle with label 1 is in the
ith interval. therefor these vectors are members of a vector space of dimension
N. As is defined in definition 3, One may attribute to any particle, an independent basis
for
i th particle as
where only the
i th entry takes the value 1. The result of projection of a set of
on the coordinate
turns out the array
in Equation (
5). This means that the sum of basis
of all particles with the common value of
yields the vector
. For example the vector in Equation (
5), is the sum
.
The vectors at each point on belongs to a sub-space of defined in definition 3. Obviously the dimension of is d. At any point P on , If the tangent space spanned by denoted by and tangent space at the same point on denoted by , then there is a one to one mapping between these spaces.
Remark 1. Respect to definition 3, the vectors as the sum over at any point is a functional on P with local coordinates , thus belongs to the dual space of i.e.
Lemma 1. The set of basis for fixed ν are mutually orthogonal.
Proof. Any particle takes just one value and consequently one interval on
coordinate. So as was shown in definition 4, columns of
carries just one entry 1 while other entries takes 0. If the
n th component of
be denoted by
, then the inner product
can be read as a sum over all particles:
The components
and
do not take the 1 value simultaneously, because a particle can not take two values on
, thus it is easy to conclude that inner product
vanishes for
and orthogonality of these bases is proved. □
Definition 4. For a fixed ν, we define the matrix with as row. as described in definition 1, is the number of infinitesimal intervals on . Obviously each column of this matrix contains just one entry 1 and other entries are 0, because each column belongs to a particle which occupies just one value (interval) at .
Because of orthogonality, the set of
for a fixed
span a tangent space
which is in one to one mapping with tangent
with coordinates
.
Manifold is an lattice Euclidean space but the manifold as a lattice space, is not necessarily flat and may be endowed by a general Riemannian metric.
The probability density of particles within interval
which will be denoted by
, is proportional to the ratio of total number of entry "1" in the
, denoted by
, to total particles number
N :
It is straight forward to conclude that
equals the inner product
. In an infinitesimal limit of
the basis
as the dual vector of
approaches to
which belongs to the cotangent (dual) space basis of
while carries the information about that interval in considered system. Respect to coordinate transformations
in variable space we apply the rule of
transformation with summation on
:
In this equation and all tensor equations in next sections, we use Einstein’s summation convention on repeated covariant and contravariant indices.
2.1. Binary Data Matrix
Combining all the row vectors
for all d variables of
N particles, gives a matrix
with
m rows defined in Equation (
2) and
N as the total number of particles . This matrix itself is also the conjoint of
d matrices each for a specific variable. As an example if for a variable with coordinate
, there are
intervals, then a matrix
with
rows and
N columns could be identified as a part of
where
constitute the set of dual basis for coordinates
at any location
i. The columns of
carries just one entry 1 , because each column corresponds to one particle that its variable’s value falls in just one of intervals
, namely
. Obviously combining all
result in the data matrix
. As we showed in Equation (
8) the set of
span a tangent space
on a point
.
Remark 2.
Joint probability density of two variable and is proportional to the number of particles that simultaneously have the same parameter values of and or equivalently are confined in and where i and j indicate the values of the coordinates and (i.e. coordinate values). If this number presented by the exact form of joint probability density reads as:
is the volume element.
It is noteworthy to remind that for the joint density and reduce to and respectively as will be shown in Lemma 3 .
Definition 5.
We define the generalized inner (scalar) product ⊙ for vectors in a vector space with orthogonal local coordinates as a multi-linear map:
Where , , .. are components of U, V, .. respectively. This is simply a generalization of inner products in usual definition. For inner product of two basis vector in Cartesian coordinate on Euclidean tangent or cotangent space, results in metric tensor. As an example the inner product for and results in the metric tensor
Lemma 2.
Generalized scalar operation ⊙ is linear and commutative .
Proof. Due to the definition 5, it is straightforward to derive the equations of linear and commutative properties. □
If the vector U is a binary vector, the idempotent property also holds true:
Lemma 3.
For inner product ⊙ of k vectors when , the repeated vectors reduces to one vector
Proof.Applying definition in Equation (14) and commutative property of ⊙ proves the Equation (15). □
2.2. Joint probability densities as tensor densities
Theorem 1.
The joint probability density for particles with common coordinate values , , ... can be given by generalized inner product of the basis via the equation:
Proof. By using the Equation (
13) for basis
and by omitting the location indices
i ,
j ,
k , .. we have:
Summation carried out on all particles. Since components of
take two values 0 or 1; the non-vanishing terms are those with components that simultaneously take 1, and therefore right side sum of Equation (
17) reduces to the number of particles whose parameter values simultaneously are located in the intervals
i ,
j ,
k , .. on corresponding
, .. coordinates. This number id denoted as
. Normalization of
as defined in Equation (
12) by
yields the ratio of this number to total number of particles and consequently gives the joint probability
of particles with common coordinate values
,
.. .
□
The joint probability
and
are smooth functions at the limit
and
. The Equation (
18) reveals a specific configuration or state of related system for which there is a specific set of joint probabilities
that represents the exact state of it. Any configuration (state) of this system can be represented by such a specific set
for all points on manifold
and vice versa. Evidently the order of indices
.., does not affect on the related joint probability density. The example of this symmetry could be found in symmetric properties of metric tensors
or
respect to their lower and upper indices respectively. We show the tensor properties of
in the next theorem.
Lemma 4. are covariant tensors.
Proof. Let
and
. From Equation (
18) we have
Since
, the Equation (
19) is a map
from
to
or in a brief notation:
Respect to linear properties of ⊙ in equations (14)
is a multi-linear map with the following property:
There is a one to one map between local tangent vectors
on
and tangent vectors
on
at any point
P. Since
is the common field of both vector spaces, therefore the tangent spaces
and tangent space
are isomorphic. The term
, due to the axiom in section (1) and Equation (
12) is smooth and differentiable function and as multi-linear map defined in (19) is covariant tensor at any point
, with the rank
. Thus under the coordinate transformation
we have:
□
Theorem 2. Joint probability density defined in (18) is a covariant tensor density of weight -1.
Proof. With the coordinate transformation
equations (19), (20) and (24) gives:
Respect to Equation (
19) we have:
Due to the definition of volume elements
and
we obtain:
□
Where
denoted as determinant of Jacobian. Respect to the definition of tensor density,
in Equation (
26) is a tensor density of weight -1.
Remark 3.
Considering the commutative property of ⊙ from equations (15), the tensor density is symmetric respect to covariant indices .. as expected for a joint probability. On the other hand repeated covariant indices reduces to not-repeated indices as we showed in Lemma 3: