1. Introduction
The following notations will be used in this paper: Real scalar variables, whether mathematical or random, will be denoted by lower-case letters such as . Real vector/matrix variables, whether mathematical or random, whether square matrices or rectangular matrices, will be denoted by capital letters such as . Scalar constants will be denoted by lower-case letters such as and vector/matrix constants will be denoted by etc. A tilde will be used to designate variables in the complex domain such as . No tilde will be used on constants. When Greek letters and other symbols appear, the notations will be explained then and there. Let be a matrix where the elements are functionally independent or distinct real scalar variables. Then, the wedge product of differentials will be defined as . When x and y are real scalars, then the wedge product of their differentials is defined as so that . For a square matrix A the determinant will be denoted as or as . When A is in the complex domain, then the absolute value of the determinant or modulus of the determinant will be denoted as . If real scalars, then . If is in the complex domain, then one can write real, then the wedge product of differentials in will be defined as . We will consider only real-valued scalar functions in this paper. will denote integral over X of the real-valued scalar function of X. When is a real-valued scalar function of X, whether X is scalar or vector or matrix in the real or complex domain, and if for all X and , then will be defined as a density or statistical density. When a square matrix X is positive definite then it will be denoted as where , a prime denoting the transpose. Conjugate transpose of any matrix in the complex domain will be written as . When a square matrix is in the complex domain and if then is Hermitian. If then is called Hermitian positive definite. When , then means the integral of the real-valued scalar function over the real positive definite matrix X such that (all positive definite), where and are constant matrices, and similar notation and interpretation in the complex domain also. In order to avoid multiplicity of numbers, the following procedure will be used. For a function number or equation number in the complex domain, corresponding to the same in the real domain, a letter c will be affixed to the function number and section number of the equation number. For example will correspond to in the real domain and equation number will correspond to in the real domain. This notation will enable a reader to recognize a function or equation in the complex domain instantly by recognizing the subscript c. Other notations will be explained whenever they occur for the first time.
Matrix-variate statistical distributions are widely used in all types of disciplines such as Statistics, Physics, Communication Theory, Engineering problems. A matrix-variate density where a trace with an exponent enters into the density as a product and when the exponential trace has an arbitrary power, known as Kotz’ model in the literature, is widely used in the analysis of data coming from various areas such as multi-look return signals in radar and sonar, see for example, [2] regarding the analysis of PolSAR (Polarimetric Synthetic Aperture Radar) data. Kotz’ model is a generalization of the basic matrix-variate Gaussian model or it can also be considered as a generalization of matrix-variate gamma model or Wishart model. When analysing radar data, it is found that Gaussian-based models fit well when the surface is disturbance-free. It is found that Gaussian-based models are not appropriate in certain regions such as urban area, sea surface, forests etc, see for example, [3–6]. Hence, we will consider some non-Gaussian or non-Wishart models also in this paper, along with Gaussian-based models. In most of the applications in engineering areas, each scalar variable has two components, such as time and phase, so that a complex variable is very appropriate to represent such a scalar variable. Hence, it is found that distributions in the complex domain are more important in applications in physical sciences and engineering areas. When a statistical density is used in any applied problem, computation of the normalizing constant there is the most important step because when studying all sorts of properties of such a model, the computations naturally follow the format of the evaluation of the normalizing constant in that model. Explicit evaluation of the normalizing constant in a general model, often referred to also as a Kotz’ model in the real domain, does not seem to be available in the literature. Normalizing constant in the general model in the real domain appearing in [7], which the authors claim to have been available elsewhere in earlier literature, seems to be the one widely used in all the applications where Kotz’ model in the real domain is used. But, unfortunately the normalizing constant quoted in [7] does not seem to be correct. Kotz’ type model in the complex domain does not seem to be available in the literature and the normalizing constant therein does not seem to be available also. Hence, one of the aims of this paper is to give the derivation of the normalizing constant in the general model in detail, in the real and complex domains, and also to extend the ideas to Mathai’s pathway family [1], namely matrix-variate gamma, type-1 beta and type-2 beta families of densities. Since the derivation of the normalizing constant is the most important step in the construction of any statistical model, various matrix-variate models are listed in this paper by showing the computations of the normalizing constants in each case. Some applications Kotz’ model in the real domain may be seen from [8–11].
This paper is organized as follows:
Section 1 contains the introductory material.
Section 2 gives explicit evaluation of the normalizing constant in an extended matrix-variate gamma type or Gaussian type or Wishart type or Kotz type model, both in the real and complex domains, and then deals with multivariate and matrix-variate extended Gaussian and gamma type distributions.
Section 3 examines extended matrix-variate type-2 beta type models in the real and complex domains.
Section 4 contains extended matrix-variate models of the type-1 beta type in the real and complex domains. Throughout the paper, the results in the complex domain are listed side by side with the corresponding results in the real domain. Detailed derivations are done for the real domain cases only, since most of the steps in the complex domain are parallel to those in the real domain.
2. Evaluation of Some Matrix-variate Integrals and the Resulting Models
Let us start with an example of the evaluation of an integral in the real domain which will show the different types of hurdles to overcome to achieve the final result. Let
be a
matrix of rank
p where the
elements
’s are functionally independent (distinct) real scalar variables. Suppose that we wish to evaluate the following integral, where
is a real-valued scalar function of the
matrix
X, integral over
X and the wedge product of differentials
are already explained in
Section 1:
where
is
and
is
positive definite constant matrices,
is the positive definite square root of the positive definite matrix
,
denotes the expected value of
and
means the real part of
. The first step here is to simplify the matrix
into a convenient form by making a transformation
from the Lemma 2.1 given below by observing that
since
M is a constant matrix. The corresponding integral in the complex domain is the following:
where
(both Hermitian positive definite),
A is
,
B is
and
. The transformation in the complex case is
.
Lemma 2.1.
Let the matrix be in the real domain where the elements ’s are functionally independent (distinct) real scalar variables and let A be a and B be a nonsingular constant matrices. Then,
Let the matrix be in the complex domain and where A and B be and nonsingular constant matrices respectively in the real or complex domain, then
where denotes the absolute value of the determinant of .
The proof of Lemma 2.1 and other lemmas to follow, may be seen from [12]. When a matrix X is symmetric, then we have a companion result to Lemma 2.1 which will be stated next.
Lemma 2.2.
Let be a symmetric matrix and let A be a constant nonsingular matrix. Then,
and when a matrix in the complex domain is Hermitian and when A is a nonsingular constant matrix in the real or complex domain, then
Now, under Lemma 2.1,
reduces to an integral over
Y. Let us denote it as
. Then,
The corresponding integral in the complex case is the following:
The function in the real domain when is often referred to as Kotz’ model by most of the authors who use such a model. When the exponent of the determinant then the evaluation of the integral over is very difficult, which will be seen from the computations to follow. When , and in the real domain, [7] calls the model also as Kotz’ model but the normalizing constant given by them and claimed to be available in earlier literature, does not seem to be correct. The correct normalizing constant and its evaluation in the real and complex domains will be given in detail below. Since involves a determinant and a trace where the determinant is a product of eigenvalues and trace is a sum, two elementary symmetric functions, if there a transformation involving elementary symmetric functions then one can handle the determinant and trace together. This author does not know any such transformation. Going through eigenvalues does not seem to be a good option because the Jacobian element will involve a Vandermonde determinant and not very convenient to handle. The next possibility is triangularization and in this case also one can make the determinant as a product of scalar variables and trace as a sum. Then, one can use a general polar coordinate transformation so that the trace becomes a single variable, namely the radial variable r and in the product r and sine and cosine product will be separated also. Hence, this approach will be a convenient one. Continuing with the evaluation of in the real case, we have the following situations: if and , or , then one would immediately convert into and integrate out by using a real matrix-variate gamma integral in the case of and and integrate out by using the scalar variable gamma integral in the case . This conversion can be done with the help of Lemma 2.3 given below.
Lemma 2.3.
Let the matrix X of rank m be in the real domain with distinct elements ’s. Let the matrix be denoted by which is positive definite. Then, going through a transformation involving a lower triangular matrix with positive diagonal elements and a semi-orthonormal matrix and after integrating out the differential element corresponding to the semi-orthonormal matrix, we will have the following connection between and , see the details from [12]:
where, for example, is the real matrix-variate gamma function given by
where means the trace of the square matrix . Since is associated with the above real matrix-variate gamma integral, we call a real matrix-variate gamma function. This is also known by different names in the literature. When the matrix of rank m, with distinct elements, is in the complex domain and letting , which is and Hermitian positive definite, then, going through a transformation involving a lower triangular matrix with real and positive diagonal elements and a semi-unitary matrix, we can establish the following connection between and , [12]:
where, for example, is the complex matrix-variate gamma function given by
We call the complex matrix-variate gamma because it is associated with a matrix-variate gamma integral in the complex domain.
But in our , both the determinant and trace enter as multiplicative factors and there is an exponent for the exponential trace. In order to tackle this situation, we will convert to , where T is a lower triangular matrix, by using Theorem 2.14 of [12] which is restated here as a lemma. The idea is that in this case becomes product of the squares of the diagonal elements in T only and is a sum of squares also. This conversion can also be achieved by converting of Lemma 2.3 to a by using another result, where T is lower triangular.
Lemma 2.4.
Let X be matrix of rank m with functionally independent real scalar variables as elements. Let T be a lower triangular matrix and let be a semi-orthonormal matrix, . Consider the transformation where both T and are uniquely selected, for example, with the diagonal elements positive in T and with the first column elements positive in . Then, after integrating out the differential element corresponding the semi-orthonormal matrix , one has the following connection between and , [12]:
and in the complex case, let be matrix of rank m with distinct elements in the complex domain. Let be a lower triangular matrix in the complex domain with the diagonal elements real and positive and be a semi-unitary matrix, , where and are uniquely chosen. Then, after integrating out the differential element corresponding to , one has the following connection between and :
Let us consider the evaluation of
in the real case first. Converting
in
to
by using Lemma 2.4, the integral part of
over
Y, is the following, denoted by
:
The corresponding equation in the complex domain is the following:
Note that, in the real case
where in
there are
p terms and second sum has
terms, thus a total of
terms. The corresponding quantity in the complex domain is the following:
where
real, and in the first sum there are
p square terms but in the sum
there are a total of
square terms, thus a total of
square terms in the complex case.
Let us consider a polar coordinate transformation in the real case on all the
terms by using the transformation on page 44 of [12] which is restated here for convenience, that is
,
.
for
for
in the real case and
in the complex case. The structure of the polar coordinate transformation in the complex case remains as in the real case, we will denote it as (2c.4), the only change is that in the real case
and in the complex case
. The Jacobian of the transformation in the real case is
and in the complex case the Jacobian is given by
for the same ranges for
’s as in the real case, but in the complex case
.
The normalizing constant c in the real case coming from (2.3) is quoted in [7] by citing earlier works. But none of them seems to have given the evaluation of the integral in (2.3) explicitly. The normalizing constant c given in [7] does not seem to be correct. Since the integral in (2.3) appears in very many places as Kotz’ integral, and used in many disciplines, a detailed evaluation of the integral in (2.3) is warranted. Also, none seems to have given in the complex case. Hence, the evaluations of c and in the real and complex cases will be given here in detail.
2.1,2c.1 Evaluation of the integral in (2.3) in the real case and (2c.3) in the complex case
Note that
. From the Jacobian part, the factor containing
r is
. In the product
each
contains a
. Also, the Jacobian part
. Collecting all
r, the exponent of
in the real case is the following:
Then, integration over
r gives the following:
for
. The corresponding integral over
r in the complex domain is the following:
for
.
2.2 Evaluation of the sine and cosine product in the real case
Consider the integration of factors containing the
’s in the real case. These
’s come from
and from the Jacobian part. Consider
. The exponent of
is
. The exponent of
is
and the part coming from the Jacobian
. Note that
. Then, the integral over over
, denoting the integral over
as
, gives the following, where in all the integrations over the
’s to follow, we will use the transformations
:
Now, collecting all factors containing
and proceeding as in the case of
we have the following result for the integral over
:
Note that the denominator gamma in
cancels with one numerator gamma in
. The pattern of cancellation of the denominator gamma in the next step with one numerator gamma in its previous step will continue leaving only one factor in the numerator and no factor in the denominator, except the very first step involving
and
where the first denominator gamma, namely
is left out. When, integrating
, we have the following:
Note that when considering
there is no cosine factor coming from
and the cosine factor comes only from the Jacobian part. We can see that
Again, the denominator gamma in
cancels with one numerator gamma in
. This pattern will continue for
. For the integrals over
the only contribution is from the Jacobian part, no sine factor will be there. Consider
. We see that
Again, cancellation will hold. Now, consider a few last cases of
. For
we have
and for
and the last
goes from
to
and no contribution from the Jacobian part and hence
Note that starting from
to
the gamma factor left in the numerator is
. There are
such factors and the last one is
, thus the product is
. For
the factors left out in the numerator are
and for
we have
giving
. For
there is one gamma left in the denominator, namely,
. Taking the product of integral over all
’s in the real case is
where
is the real matrix-variate gamma defined in Lemma 2.3.
2c.2 Evaluation of the integral over the ’s in the complex case
The sine and cosine functions come from the transformations corresponding to
, from the Jacobian when going from
to
and from the Jacobian in the polar coordinate transformation. The Jacobian part in the polar coordinate transformation is
Collecting factors containing
, observing that
comes from
and
comes from
and the Jacobian part. The exponent of
is
and the exponent of
is
. In all the integrals to follow,
due to evenness of the integrand. Then, we will use the transformations
, the steps parallel to those in the real case. Therefore
for
. Collecting the factors containing
, we note that the exponent of
is
and the exponent of
is
. Hence,
Note that
from the denominator of
cancels with the same in the numerator of
leaving one gamma, namely
in the numerator of
and one gamma, namely
in the denominator of
. The pattern of cancellation of the gamma in the denominator of a step canceling with a gamma in the numerator of the previous step will continue as seen in the real case. Let us check for
to see whether the pattern is continuing, where in
there is no contribution of sine function, the only cosine function coming is from the Jacobian part. For
we have
For
The pattern of cancelation is continuing. But, starting from
the factor left out in the numerator is
and the last factor gives
because the range here is
and hence the factors left out in the numerator are
and one gamma, namely
is left out in the denominator of the integration over
. Hence, the integration over all sine and cosine functions in the complex case is
where
is the complex matrix-variate gamma function defined in Lemma 2.3. Then the final result of integration over
r and integration over all
’s in the real case is the following:
for
,and
are constant matrices where
A is
and
B is
and
X is
real matrix of rank
p, and the corresponding integral in the complex case is given by
for
. Therefore the normalizing constants
c and
are the following:
for
and
for
.
From the general results in (2.6, 2c.6) we can have the following interesting special cases: ; ; ; ; ; . Since the integral over the sine and cosine product is very important in many types of applications, we will give these as theorems here.
Theorem 2.1.
Let in the real case, and integral over denoted by , is the following:
The corresponding result in the complex case is the following, where
here has the same format as in the real case but here
.
Theorem 2c.1.
Let . The integral over in the complex case is the following:
From (2.6) in the real case we have the following theorems:
Theorem 2.2.
Let Y be a matrix of rank p with the elements functionally independent real scalar variables. For ,
Note 2.1. In the widely used normalizing constant in [7], which was quoted from earlier references, and corresponding to the normalizing constant c in (2.3) above, a gamma factor, namely, in the denominator is missing. Either it is a computational slip or due to the use of some wrong results in the derivation of the normalizing constant in [7] and in earlier works of others.
Theorem 2c.2.
Let be a matrix in the complex domain with rank p where the elements are functionally independent complex scalar variables. For
Corollary 2.1.
For as defined in Theorem 2.2,
for .
The results quoted from some earlier works of others and reported in [7], corresponding to our Theorems 2.2 and Corollary 2.1, do not agree with our results.
The result corresponding Corollary 2.1 in the complex case is the following:
Corollary 2c.1.
For as defined in Theorem 2c.2,
for .
Corollary 2.2.
For as defined in Theorem 2.2,
The corresponding result in the complex domain is the following:
Corollary 2c.2.
For as defined in Theorem 2c.2,
for .
Theorem 2.3.
Let be a real positive definite matrix with functionally independent real scalar variables ’s. Then, the following integral over is equivalent to the integral over Y where Y is and of rank p with distinct real scalar elements. Then,
for .
This result enables us to go back and forth from a real full-rank rectangular matrix to a real positive definite matrix. The proof is straightforward. Let . Then . Then, from Lemma 2.3, which establishes the result. The corresponding result in the complex domain is the following:
Theorem 2c.3.
Let the matrix in the complex domain Hermitian positive definite, where the distinct scalar complex variables be the elements ’s. Then, the following integral over is equivalent to the integral over where is a matrix in the complex domain of rank p with distinct complex variables as elements. Then,
for .
The proof is parallel to that in the real case. Here we use Lemma 2.3 in the complex case, that is all the difference.
4. Matrix-variate Type-1 Beta Forms
Let
X be a
vector of distinct real scalar variables. Consider the following multivariate function
that is,
X is confined to the interior of the
p-dimensional sphere of radius
and
is assumed to be zero outside this sphere. If
is the normalizing constant there so that
is a density, let us compute
. Let
by Lemma 2.3. Let
. Then,
for
. Note that
is also the density connected with type-1 beta distributed isotropic random points in geometrical probability problems, see [15]. Corresponding format in
Section 3 is associated with type-2 beta distributed random points. A more general model is available by replacing
by
is a
real constant positive definite matrix. The only change will be that the normalizing constant
will be multiplied by
and the structure of the function remains the same.
Theorem 4.1.
Let X be a real vector of distinct real scalar variables as elements. Consider the quadratic form where A is a constant positive definite matrix. Let . Then, in
for , and elsewhere, is given by
The density and the normalizing constant in the complex case, corresponding to Theorem 4.1, are the following. The evaluation of the normalizing constant is parallel to that in the real case and hence only the results are given here.
Theorem 4c.1.
Let be a vector in the complex domain with distinct scalar complex variables as elements. Let , a Hermitian form where is a constant Hermitian positive definite matrix. Note that the Hermitian form is real and hence the following function is real-valued and hence a density when is the normalizing constant there. Let . Then, the density and the normalizing constant are the following:
and elsewhere, and
Now, consider
a
matrix of rank
p where the
elements
’s are distinct real scalar variables. Consider the model
for
and
outside this sphere. Note that
sum of squares of
real scalar variables here and
Then, proceeding as in the derivation of
in
, or in Theorem 4.1, we have the following:
Theorem 4.2.
Let be as defined in (4.2) for X a real matrix of rank p, then the normalizing constant in is given by
for .
In the corresponding complex case the result is the following:
Theorem 4c.2.
Let be matrix in the complex domain of rank p where the elements are distinct scalar complex variables. Then, the following function is a density:
for and elsewhere, where
for .
A more general model is available in the real case by replacing by where are constant and matrices respectively and . Consider the transformation by Lemma 2.1. Then, the density of Y is the same as of (4.3). Hence, the only change will be that the normalizing constant in (4.3) is multiplied by . Therefore this case is not listed here separately. A similar comment holds in the complex case also. In the complex case the multiplicative factor is .
Note 4.1. From the model in Theorem 4.2, one can easily evaluate the density of or in the general case the density of from the normalizing constant . Treating , we have for an arbitrary h. Hence, from the inverse Mellin transform one has the density of or that in the general case. The same comment holds for Theorem 4.1 also. Similar comments hold in the complex case also.
If the multiplicative factor
in the real case is replaced by a determinant
let us see what happens to such a model. Again, let
X be a
matrix of rank
p with distinct
real scalar variables as elements. Consider the model
for
and
elsewhere. Then, we have the following result:
Theorem 4.3.
Let X and the parameters be as defined in (4.4). Then,
for
Proof: Let
where
is a lower triangular matrix and
is a semi-orthonormal matrix,
, where both
T and
uniquely chosen. Then, from Lemma 2.4, after integrating out the differential element corresponding to the semi-orthonormal matrix
, one has the relationship
Note that
and
where in
there are
terms. Consider a polar coordinate transformation on all these
terms
’s.
. Then, the Jacobian element is already discussed in the proof of Theorem 2.1.
r has exponent
in the Jacobian element. Then, collecting all factors containing
r in the transformed
we have
and the integral over
r gives the following:
for
. The integral over all the sine and cosine product is available from the proof of Theorem 2.1, which is
for
. Taking the product with that in
establishes the theorem. In the complex case the density and the normalizing constant are the following:
for
a
matrix of rank
p in the complex domain with distinct
complex scalar variables as elements such that
and
elsewhere. Then, the normalizing constant
is available from the following theorem.
Theorem 4c.3.
For as defined in (4c.4) and following through the derivation parallel to that in the real case, the normalizing constant is the following:
for .
As explained in Note 4.1, arbitrary moments and exact density of or its general form in the real case are available from the normalizing constant . Corresponding comment holds in the complex case also.
Note that here also a more general model in the real case is available by replacing
by
as mentioned before. The only change will be that the normalizing constant will be multiplied by
. Hence, this general case is not listed here separately. In the complex case, the multiplicative factor is
. A more general case is available by introducing another factor containing a trace also into
. Consider the following model in the real case:
for
is
and of rank
p,
and
outside this sphere. Proceeding exactly as in the proof of Theorem 4.3, we have the following result:
Theorem 4.4.
Let be as defined in (4.5) and . Then, the normalizing constant is given by
Model corresponding to the one in (4.5) in the complex case, is the following, where
is a
matrix in the complex domain of rank
p with
distinct scalar complex variables as elements such that
, the parameters are such that
:
and
elsewhere. By using steps parallel to those in the derivation of the normalizing constant
in the real case, one can see that the normalizing constant in the complex case is given in the following theorem:
Theorem 4c.4.
For as defined in (4c.5), the normalizing constant is the following:
Again, a more general model in the real case is available by replacing by . Also, from the structure of it is clear that the exact density and arbitrary moments of the determinant or is available by replacing the parameter by and then taking the ratio of the normalizing constant as explained before. Similarly, the exact density or arbitrary moments of or its general form is available by replacing by and taking the ratio of the normalizing constant . Similar comments hold in the complex domain also, the multiplicative factor will be in the complex case.
Now, we will consider an exponentiated type-1 beta type model. Again, let
X be
matrix of rank
p with
distinct real scalar variables as elements. Consider the model
for
and
outside the sphere. Then, we have the following result:
Theorem 4.5.
Let X and the parameters be as defined in . Then, the normalizing constant is given by the following:
for , where is the extended zeta function defined in Theorem 3.5.
The proof is straightforward. Since
we can use a binomial expansion and write
Now, the exponential trace part joins with the exponential trace part remaining in
becoming
. Then, one can integrate out by using Theorem 4.4 by replacing
there by
and interpreting the result in terms of an extended zeta function, Theorem 4.5 is established. The corresponding model in the complex domain is the following:
for
a
and of rank
p matrix in the complex domain with
distinct scalar complex variables as elements,
and
elsewhere. Then, following through the derivation parallel to that in the real case, the normalizing constant
is the following:
Theorem 4c.5.
Under the conditions stated in (4c.6),