Preprint
Article

A Bivariate Extension of Type II Generalized Crack Distribution for Modelling Heavy-Tailed Losses

Altmetrics

Downloads

52

Views

21

Comments

0

A peer-reviewed article of this preprint also exists.

This version is not peer-reviewed

Submitted:

31 October 2024

Posted:

01 November 2024

You are already at the latest version

Alerts
Abstract
As an extension of the (univariate) Birnbaum-Saunders distribution, the Type II generalized crack (GCR2) distribution, built on an appropriate base density, provides a sufficient level of flexibility to fit various distributional shapes including heavy-tailed ones. In this paper, we develop a bivariate extension of the Type-II generalized crack distribution and study its dependency structure. For practical applications, several specific distributions, GCR2-Generalized Gaussian, GCR2-Student’s t and GCR2-Logistic, are constructed. The expectation-maximization algorithm is implemented to estimate the parameters in the bivariate GCR2 models. The model fitting results on the catastrophic loss dataset show that the bivariate GCR2 distribution base on the generalized Gaussian density fits the data significantly better than other alternative models.
Keywords: 
Subject: Computer Science and Mathematics  -   Probability and Statistics

1. Introduction

Researchers have examined diverse classes of distributions to study various facets of problems. Most loss datasets in the context of actuarial loss modelling, share some common characteristics, such as being skewed to the right, unimodal in form (multimodal in certain situations), and having a thin left tail and a moderate to extremely thick right tail. In recent years, various classes of heavy-tailed distributions including the subexponential distribution class, have been studied for modelling heavy-tailed (or extreme) data ([1,2,3]). On the other hand, a different stream of problems, like the periodic vibrations in commercial aircraft, motivated the introduction of the Birnbaum-Saunders (BS) distribution ([4]). The BS distribution models the total time elapsed until a critical threshold is exceeded by fatigue accumulated on a subject of interest (material), causing the failure event of the material to occur. The BS distribution is a two-parameter lifespan distribution derived for material fatigue data modelling. Due to its ability to fit right-skewed data, it is highly effective for modelling numerous scenarios. For example, in situations where there is an accumulation of a certain factor that drives a quantifiable characteristic to surpass a critical threshold ([5]).
Various classes of extension of the BS distribution have been discussed in the literature. In [6], a type of generalized Birnbaum-Saunders distribution family that replaced the standard normal with elliptically contoured density functions is introduced. The extreme value version of the generalized Birnbaum-Saunders (GBS) distribution, which proved that the tail thickness of the auxiliary distribution (i.e., standard elliptically symmetric distribution) determines that of the GBS distribution, has been discussed in [7]. Some applications of the extreme value BS models can be found in [5,8].
Another important extension of the BS distribution is the three-parameter (Gaussian) crack distribution ([9]). The constructed distribution is a two-component mixture of inverse Gaussian and length-biased inverse Gaussian distributions with a weight parameter p. The Gaussian crack distribution features increased flexibility to fit various datasets due to the additional the mixture weight parameter. However, the thin-tailedness of standard normal base density imposes restrictions on the Gaussian crack distribution when it comes to modelling heavy-tailed data. Due to the importance of the heavy-tailed distributions and extension to various applications beyond reliability theory and survival analysis, researchers looked for new statistical distributions. The introduction of heavy-tailed distributions with additional parameters addresses the shortcomings of the traditional distributions. A notable extension by [10] results in a large class of generalized crack (GCR) distributions, which contains the three-parameter Gaussian crack (lifetime) distribution as a specific member. Each member of the generalized crack distribution class is built on a specific choice of a base density function that determines the tail characteristics of the resulting GCR distribution. In [10,11], applications of GCR distributions built on the Student’s t and the generalized Gaussian base density functions to catastrophic losses and heavy-tailed precipitation time series, respectively, are given. The GCR distribution class has recently been further extended to the class of Type II generalized crack (GCR2) distribution ([12]) in which an additional shape parameter τ is included to increase flexibility over the GCR class. The tail characteristics of the base density function and the new shape parameter in the GCR2 model determine the right tail properties of the GCR2 class.
Regarding applications for bivariate data with heavy-tailed marginals, a bivariate GCR distribution with a form of four-component mixture of independent models, has been constructed in [13]. The authors demonstrate that bivariate GCR models can exhibit various dependence structures and serve as valuable models for various real-world situations.
This paper aims to extend the univariate GCR2 models to bivariate cases by employing the mixture model structure used in [13]. This study considers three specific examples of GCR2 distributions, a newly constructed GCR2 model based on the logistic density in addition to the two GCR2 models introduced in [12], to effectively fit insurance and catastrophic losses. This paper studies some theoretical properties of the constructed model such as conditional distribution and the measures of dependency (e.g., Spearman’s rho and Kendall’s tau), a method for model parameter estimation through the Expectation-Maximization (EM) algorithm, and fitting the model to natural disasters datasets to demonstrate its applicability.
The rest of this paper is organized as follows. Section 2 gives a brief review of the origin, definition, examples, and key properties of the univariate Type II generalized crack distributions for the reader’s convenience. In Section 3, the bivariate Type-II generalized crack distribution is introduced with some detailed discussions on its theoretical properties. A method of model estimation and its application on a real catastrophic loss dataset are presented in Section 4 and Section 5, respectively. Finally Section 6 provides some concluding comments.

2. Type-II Generalized Crack Distribution

2.1. Birnbaum-Saunders Distribution

Birnbaum-Saunders (BS) distribution (also known as the fatigue-life distribution) was motivated by the problem of periodic vibration in aircraft that caused the materials to fatigue [4]. The Birnbaum-Saunders distribution is a well-known distribution for modelling the time of a crack (or a failure) happening when a material specimen is used repeatedly and experiences material fatigue due to gradual accumulation of stress/damages. The failure event occurs when the accumulated stress on the material specimen hits a critical threshold w > 0 and the failure time T is the first hitting time of the accumulated stress to the critical threshold. Details of the Birnbaum-Saunders setting based on Brownian motion can be found in [12]. The Birnbaum-Saunders distribution is known to be a two-point mixture of the inverse Gaussian (IG) and the length-biased inverse Gaussian (LB-IG) distributions with equal mixture weights. Specifically the cumulative distribution function (cdf) and probability density function (pdf) of the BS distribution are as follows respectively:
F B S ( t ; α , β ) = Φ 1 α t β β t , t > 0 ,
f B S ( t ; α , β ) = 1 2 f I G ( x ; α , β ) + 1 2 f L B I G ( t ; α , β ) , t > 0
where
f I G ( t ; α , β ) = β α t 3 / 2 ϕ 1 α t β 1 / 2 β t 1 / 2
f L B I G ( t ; α , β ) = 1 α β t 1 / 2 ϕ 1 α t β 1 / 2 β t 1 / 2
Here β > 0 and α > 0 are the scale and shape parameters, respectively. Φ ( · ) and ϕ ( · ) denote the cdf and pdf of the standard normal distribution, whereas f I G and f L B I G denotes the cdf and pdf of the two-parameter IG and the LB-IG distributions, respectively.

2.2. Gaussian Crack (CR) Distribution

The Gaussian crack distribution ([9]) is an extension of BS distribution by introducing the weight parameter p [ 0 , 1 ] in place of the fixed weight of 1/2 in the BS distribution. Naturally the CR distribution allows for higher flexibility compared to the classical BS distribution. The pdf of the Gaussian crack (CR) distribution is given as follows:
f C R ( t ; α , β , p ) = p f I G ( t ; α , β ) + q f L B I G ( t ; α , β ) = p β α t 3 / 2 + ( 1 p ) 1 α β t 1 / 2 ϕ 1 α t β 1 / 2 β t 1 / 2 , t > 0 ,
where β > 0 , α > 0 and 0 p 1 are the scale, shape and mixture weight parameters, respectively, and q = 1 p .
Despite the fact that the right tail of the CR distribution thickens as p decreases, the tail maintains the shape of an exponential distribution. It is easy to see that the limit of the hazard rate function of the crack distribution converges to a constant that is greater than zero, i.e.,
lim t f C R ( t ) 1 F C R ( t ) = 1 2 β α 2 > 0 ,
which suggests that the Gaussian crack distribution does not belong to heavy-tailed distribution class ([3,14]).

2.3. Generalized Crack Distribution

In [10], the Gaussian CR distribution family has been extended to a large class of generalized crack (GCR) distribution whose members depend on the specification of base density function g ( · ) , where g ( · ) is a base (or auxiliary) density function which is symmetric about zero. The base density function replaces the standard normal density function ϕ ( · ) used for the Gaussian crack density.
Specifically, a random variable T has the generalized crack(GCR) distribution with base density g, denoted as G C R ( α , β , p ; g ) where the parameters α > 0 , β > 0 and the mixture weight 0 p 1 , if its pdf is given as: PDF of GCR:
f G C R ( t ; α , β , p ; g ) = p f I S ( t ; α , β ; g ) + q f L B I S ( t ; α , β ; g )
where
f I S ( t ; α , β ) = β α t 3 / 2 g 1 α t β 1 / 2 β t 1 / 2 f L B I S ( t ; α , β ) = 1 α β t 1 / 2 g 1 α t β 1 / 2 β t 1 / 2
Here, f I S and f L B I S are referred to as the pdfs of the inverse symmetric (IS) and the length-biased inverse symmetric (LB-IS) distribution, respectively, on the base density function g ( · ) . The cdf of the G C R ( α , β , p ; g ) distribution is given as:
F G C R g ( t ; α , β , p ) = G ¯ ( b ( t ) ) + ( 2 p 1 ) s 2 + 4 / α 2 g x 2 4 α 2 d x
where G ¯ ( · ) = 1 G ( · ) , G ( · ) is the distribution function of base density function g ( · ) , and b ( t ) = 1 α t / β β / t .
A general expression for the nth raw moments of the IS and LB-IS random variable along with details on the tail behaviour of the GCR distribution is provided in [10]. Their findings demonstrated that, with an appropriate choice of base density function, the heavy-tailed generalized crack distribution performs better than many well-known parametric distributions, such as Log-normal, Pareto type II, and Weibull distributions, which are frequently used in modelling positively skewed and heavy-tailed extreme data sets.

2.4. Type-II Birnbaum-Saunders (BS2) Distribution

The type-II Birnbaum-Saunders distribution extends the BS distribution by introducing another shape parameter τ . Applying the inverse transform method to Eq. 1, it can easily be verified that the following stochastic relationship holds. Let us define a random variable T as
T = β α X + α 2 X 2 + 4 2 2
where X is a standard normal random variable. The Type-II Birnbaum-Saunders (BS2) distribution introduces an additional shape parameter τ > 0 to the expression above, defined formally as
T = β α X + α 2 X 2 + 4 2 1 / τ .
The pdf of the Type-II Birnbaum-Saunders (BS2) distribution is
f B S 2 ( x ; α , β , τ ) = 2 τ α β 1 2 β x τ + 1 + 1 2 x β τ 1 ϕ 1 α x β τ β x τ = 1 2 f B S 2 ( 1 ) ( x ; α , β , τ ) + 1 2 f B S 2 ( 2 ) ( x ; α , β , τ ) ,
where α > 0 , β > 0 , τ > 0 are model parameters, and
f B S 2 ( 1 ) ( x ; α , β , τ ) = 2 τ α β β x τ + 1 ϕ 1 α x β τ β x τ f B S 2 ( 2 ) ( x ; α , β , τ ) = 2 τ α β x β τ 1 ϕ 1 α x β τ β x τ .
The cdf is given as
F B S 2 ( x ; α , β , τ ) = Φ 1 α x β τ β x τ .
Like the BS distribution, the BS2 distribution is a two-point mixture of densities f B S 2 ( 1 ) and f B S 2 ( 2 ) having equal weights. However, with the extra shape parameter τ , the BS2 distribution provides more flexibility over the BS distribution. The BS2 distribution reduces to the Birnbaum-Saunders distribution when τ = 1 / 2 .

2.5. Type-II Generalized Crack Distribution

The Type-II generalized crack distribution class can be seen as a natural extension of the Type-II Birnbaum-Saunders distribution by replacing the standard normal density with a symmetric base density and including the mixture weight parameter p ([12]). Specifically the pdf of GCR2 distribution with base density g is given as follows:
f G C R 2 ( x ; α , β , τ , p ; g ) = p f G C R 2 ( 1 ) ( x ; α , β , τ ; g ) + q f G C R 2 ( 2 ) ( x ; α , β , τ ; g )
where
f G C R 2 ( 1 ) ( x ; α , β , τ ; g ) = 2 τ α β β x τ + 1 g 1 α x β τ β x τ f G C R 2 ( 2 ) ( x ; α , β , τ ; g ) = 2 τ α β x β τ 1 g 1 α x β τ β x τ .
Respectively the cdf of GCR2 distribution with base density g is given as
F G C R 2 ( x ; α , β , τ , p ; g ) = p F G C R 2 ( 1 ) ( x ; α , β , τ ; g ) + q F G C R 2 ( 2 ) ( x ; α , β , τ ; g )
where
F G C R 2 ( 1 ) ( x ; α , β , τ ; g ) = b ( x ) g ( s ) 1 + s s 2 + 4 / α 2 d s F G C R 2 ( 2 ) ( x ; α , β , τ ; g ) = b ( x ) g ( s ) 1 s s 2 + 4 / α 2 d s ,
and b ( x ) : = b ( x ; α , β , τ ) = α 1 [ ( β / x ) τ ( x / β ) τ ] .

2.6. Specific Examples of GCR2 Distributions

The specification of the base density function on which GCR2 distribution is built may depend on some key distributional features (such as tail characteristics) that are required for each specific application. Two practical examples, the GCR2-t and the GCR2-GG distributions whose base densities are the Student’s t and the generalized Gaussian (normal) distributions, respectively, are given in [12]. In this paper we also consider the GCR2 distribution with the logistic base density, referred to as the GCR2-LG distribution, as another member of the GCR2 distribution class.
Example 2.1.(GCR2-t distribution).
The Student’s t distribution has a regular varying tail and its density function is given as
g t ( x ; ν ) = Γ ν + 1 2 ν π Γ ν 2 1 + x 2 ν ν + 1 2 , ν > 0 .
The corresponding density of the GCR2-t can be easily expressed as
f GCR 2 t ( t ; α , β , τ , p ; ν ) = 2 τ Γ ν + 1 2 α β ν π Γ ν 2 p β t τ + 1 + q t β τ 1 × 1 + 1 α 2 ν { t β 2 τ + β t 2 τ 2 } ν + 1 2 .
The Student’s t distribution belongs to the Maximum Domain of Attraction (MDA) of the Fréchet distribution with index ν , which means that the distribution of a properly normalized maximum of independent and identically distributed (i.i.d.) Student’s t random variables converge to a Fréchet distribution asymptotically. Bae and Volodin[12] show that the tail of the GCR2-t distribution is regular varying with index τ ν , and thus it also belongs to the MDA of the Fréchet distribution. Comparing with the tail of the GCR-t distribution that has the index 1 2 ν , the tail of GCR2-t becomes heavier than that of the GCR-t when τ gets smaller than 1 / 2 . Figure 1, Figure 2 and Figure 3 illustrate the shapes of the GCR2-t density function for a few prescribed parameter sets.
Example 2.2. (GCR2-GG distribution).
The GCR2-GG is a large distribution family which encompasses both thin to moderately heavy-tailed ones, and it can be useful for various practical applications. In particular when θ < 1 , the generalized Gaussian model is subexponential ([3]). The pdf of the Generalized Gaussian (GG) is given as
g GG ( x ; θ ) = θ 2 λ Γ 1 θ exp x μ λ θ .
For symmetry, we set μ = 0 and, for the identification of parameters, we set λ = Γ 1 / θ / Γ 3 / θ . The density function of the resulting GCR2-GG distribution has the following expression:
f GCR 2 GG t ; α , β , τ , p ; θ = τ θ α β λ Γ 1 θ p β t τ + 1 + q t β τ 1 × exp 1 α λ t β τ β t τ θ .
Figure 4Figure 6 illustrate the shapes of the GCR2-GG density function for a few prescribed parameter sets.
Example 2.3. (GCR2-LG distribution).
The Logistic (LG) distribution has the density function
g LG ( x ) = e x / s s ( 1 + e x / s ) 2 .
For the identification of parameters, we set s = 3 π . The resulting GCR2-LG distribution has the density function as follows:
f GCR 2 LG t ; α , β , τ , p = 2 τ α s β p β t τ + 1 + q t β τ 1 × exp { 1 α s t β τ β t τ } 1 + exp { 1 α s t β τ β t τ } 2 .
The GCR2-LG distribution can be an effective model to fit thin to moderate-tailed datasets for financial and insurance applications and, with one less paramter, it is simpler than the GCR2-t and GCR2-GG distributions.
Figure 7Figure 9 illustrate the shapes of the GCR2-LG density function for a few prescribed parameter sets.
As seen from these figures, various distribution forms can be constructed depending on the distributional features of the base density function and the related shape parameters. For a GCR2 distribution family with a specific base density, the right tail of the distribution becomes heavier as the mixture weight parameter p gets smaller. The larger the shape parameter α is, the heavier the tail becomes, and the opposite is true for the shape parameter τ . For details of tail properties of the GCR2 distribution, see Theorem 1 and Theorem 2 in Bae and Volodin[12].

3. Bivaraite GCR2 Distribution

We introduce a bivariate extension of the Type II generalized crack distribution and study some properties like the conditional distribution and measures of association. Let g be a probability density function symmetric about zero. For two positive random variables, we define the bivariate Type-II generalized crack distribution as follows.
Definition 1.
A pair of random variables T = ( T 1 , T 2 ) with base density g and parameters α =( α 1 , α 2 ), β = ( β 1 , β 2 ), τ = ( τ 1 , τ 2 ) and p = ( p 11 , p 12 , p 21 , p 22 ), has a bivariate Type-II generalized crack distribution, denoted as BVGCR2( α , β , τ ,p;g), if and only if its joint pdf is given as follows:
f B V G C R 2 ( t 1 , t 2 ; α , β , τ , p ; g ) = p 11 f G C R 2 ( 1 ) ( t 1 ; α 1 , β 1 , τ 1 ; g ) f G C R 2 ( 1 ) ( t 2 ; α 2 , β 2 , τ 2 ; g ) + p 12 f G C R 2 ( 1 ) ( t 1 ; α 1 , β 1 , τ 1 ; g ) f G C R 2 ( 2 ) ( t 2 ; α 2 , β 2 , τ 2 ; g ) + p 21 f G C R 2 ( 2 ) ( t 1 ; α 1 , β 1 , τ 1 ; g ) f G C R 2 ( 1 ) ( t 2 ; α 2 , β 2 , τ 2 ; g ) + p 22 f G C R 2 ( 2 ) ( t 1 ; α 1 , β 1 , τ 1 ; g ) f G C R 2 ( 2 ) ( t 2 ; α 2 , β 2 , τ 2 ; g ) , t 1 , t 2 > 0 ,
where the mixture weight parameters satisfy 0 p j 1 , j J : = { 11 , 12 , 21 , 22 } , j J p j = 1 , and, for each i = 1 , 2 ,
f G C R 2 ( 1 ) ( t i ; α i , β i , τ i ; g ) = 2 τ i α i β i β i t i τ i + 1 g 1 α i t i β i τ i β i t i τ i ,
and
f G C R 2 ( 2 ) ( t i ; α i , β i , τ i ; g ) = 2 τ i α i β i t i β i τ i 1 g 1 α i t i β i τ i β i t i τ i .
BVGCR2( α , β , τ ,p;g) is a mixture of four combinations of independent bivariate distributions. It is easy to see that the marginal distributions of T 1 and T 2 are GCR2 ( α 1 , β 1 , τ 1 , p 1 = p 11 + p 12 ; g ) and GCR2 ( α 2 , β 2 , τ 2 , p 2 = p 11 + p 21 ; g ) , respectively. Note that, for simplicity, we assume that the marginal distributions are built based on the same base density function g whose model parameter can be distinct for each marginal.
We provide the conditional distribution which is an essential property for the simulation of the pair of random variables. Following the relationship between the two mixture components of the GCR2 distribution, f G C R 2 ( 2 ) ( t ; α , β , τ ; g ) / f G C R 2 ( 1 ) ( t ; α , β , τ ; g ) = t / β 2 τ , the conditional density of T 2 given T 1 = t 1 is expressed as follows. For notational convenience, we drop subscript and the parameters in the functions and write as;
f 2 | 1 ( t 2 | t 1 ; g ) : = p 11 f ( 1 ) ( t 1 ; g ) f ( 1 ) ( t 2 ; g ) + p 12 f ( 1 ) ( t 1 ; g ) f ( 2 ) ( t 2 ; g ) p 1 f ( 1 ) ( t 1 ; g ) + q 1 f ( 2 ) ( t 1 ; g ) + p 21 f ( 2 ) ( t 1 ; g ) f ( 1 ) ( t 2 ; g ) + p 22 f ( 2 ) ( t 1 ; g ) f ( 2 ) ( t 2 ; g ) p 1 f ( 1 ) ( t 1 ; g ) + q 1 f ( 2 ) ( t 1 ; g ) = p 11 + p 21 t 1 β 1 2 τ f ( 1 ) ( t 2 ; g ) + p 12 + p 22 t 1 β 1 2 τ f ( 2 ) ( t 2 ; g ) p 1 + q 1 t 1 β 1 2 τ = p 11 + p 21 t 1 β 1 2 τ p 1 + q 1 t 1 β 1 2 τ f ( 1 ) ( t 2 ; g ) + p 12 + p 22 t 1 β 1 2 τ p 1 + q 1 t 1 β 1 2 τ f ( 2 ) ( t 2 ; g ) .
Hence, the conditional distribution of T 2 given T 1 = t 1 is also GCR2 ( α 2 , β 2 , τ 2 , p 2 | 1 ; g ) where,
p 2 | 1 = p 11 + p 21 ( t 1 / β 1 ) 2 τ 1 p 1 + q 1 ( t 1 / β 1 ) 2 τ 1 .
Using the conditional distribution, one can in sequence simulate the pairs of random variates ( T 1 , T 2 ) for the BVGCR2 models, by first simulating T 1 from the GCR2 ( α 1 , β 1 , τ 1 , p 1 ; g ) using the acceptance-rejection method as shown in Bae and Chen[10] and then, simulate T 2 from the conditional distribution GCR2 ( α 2 , β 2 , τ 2 , p 2 | 1 ; g ) .

3.1. Dependence Measures

In this section we derive expressions for Spearman’s rho and Kendall’s tau of BVGCR2 random variables.

3.1.1. Spearman’s Rho

Spearman’s rho is a commonly used measure of association between two random variables. Due to the invariance under monotone transformations, Spearman’s rho provides a broad interpretation of the dependence structure for any bivariate distributions. With marginal densities, f T 1 ( t 1 ) and f T 2 ( t 2 ) for the random variables T 1 and T 2 , respectively, and the joint distribution F ( t 1 , t 2 ) on ( t 1 , t 2 ) R + 2 , the (population version) Spearman’s rho is defined as;
ρ s = E ( U 1 , U 2 ) E ( U 1 ) E ( U 2 ) V a r ( U 1 ) V a r ( U 2 ) = 12 0 0 F ( t 1 , t 2 ) f T 1 ( t 1 ) f T 2 ( t 2 ) d t 1 d t 2 3
where U 1 = F T 1 ( T 1 ) and U 2 = F T 2 ( T 2 ) are uniform random variables. That is, Spearman’s rho is Pearson’s correlation between transformations of the variables into uniformly distributed ones. The following proposition provides an expression for Spearman’s rho of random variables having a BVGCR2 distribution.
Proposition 1.
Suppose ( T 1 , T 2 ) BVGCR2( α , β , τ ,p;g). Then, Spearman’s rho between T 1 and T 2 is given by
ρ s = 48 ( p 11 p 22 p 12 p 21 ) γ 1 γ 2 ,
where
γ i = t t 2 + 4 / α i 2 G i ( t ) g i ( t ) d t , i = 1 , 2 .
Proof. See Appendix A.

3.1.2. Kendall’s Tau

Kendall’s tau is a measure of concordance between two random variables. Formally, for the random variables T 1 and T 2 with the joint distribution F ( t 1 , t 2 ) on ( t 1 , t 2 ) R + 2 , the (population-version) Kendall’s tau is defined as
τ k = P ( ( T 1 T 1 ) ( T 2 T 2 ) > 0 ) P ( ( T 1 T 1 ) ( T 2 T 2 ) < 0 ) = 4 0 0 F ( t 1 , t 2 ) f ( t 1 , t 2 ) d t 1 d t 2 1 ,
where the pair ( T 1 , T 2 ) has the joint distribution F and is independent to ( T 1 , T 2 ) .
The following proposition provides an expression for Kendall’s tau of random variables having a BVGCR2 distribution.
Proposition 2.
Suppose ( T 1 , T 2 ) BVGCR2( α , β , τ ,p;g). Then , Kendall’s tau between T 1 and T 2 is given by
τ k = 32 ( p 11 p 22 p 12 p 21 ) γ 1 γ 2 ,
where
γ i = t t 2 + 4 / α i 2 G i ( t ) g i ( t ) d t , i = 1 , 2 .
Proof. See Appendix B.
Remark 3.1.
It is important to note that,
0 γ 1 4 .
This inequalities can be proven by using the symmetry of g ( t ) , G ( t ) = 1 G ( t ) and integration by parts. Specifically,
γ = t t 2 + 4 / α 2 G ( t ) g ( t ) d t = 0 t t 2 + 4 / α 2 G ( t ) g ( t ) d t + 0 t t 2 + 4 / α 2 G ( t ) g ( t ) d t = 0 s s 2 + 4 / α 2 G ( s ) g ( s ) d s + 0 t t 2 + 4 / α 2 G ( t ) g ( t ) d t = 0 s s 2 + 4 / α 2 ( 1 G ( s ) ) g ( s ) d s + 0 t t 2 + 4 / α 2 G ( t ) g ( t ) d t = 0 s s 2 + 4 / α 2 g ( s ) d s + 2 0 s s 2 + 4 / α 2 G ( s ) g ( s ) d s = 0 ( 2 G ( s ) 1 ) s s 2 + 4 / α 2 g ( s ) d s .
For 0 < s < , 0 2 G ( s ) 1 1 , 0 s s 2 + 4 / α 2 1 , and applying integration by parts,
0 ( 2 G ( s ) 1 ) s s 2 + 4 / α 2 g ( s ) d s 0 ( 2 G ( s ) 1 ) g ( s ) d s = 1 2 0 G ( s ) g ( s ) d s = 1 4 .
With this and by Eqs. (1) and (2), we obtain the following bounds for ρ s and τ k :
| ρ s | 3 4 , | τ k | 1 2 ,
Note that ρ s and τ k get the maximum when p 11 = p 22 =0.5 and γ attains its maximum of 1/4.
Also remark that, if p 11 p 22 = p 12 p 21 , and thus ρ s = τ k = 0 , the joint density of the BVGCR2 model can be expressed as a product of two GCR2 marginals, i.e., the two random variables are independent.

3.2. Tail Independence

As remarked in the previous section, the dependency measures of the proposed BVGCR2 model are bounded, and thus the model may not be suitable for the cases where the extreme dependency is required, i.e., market turmoils. Here we further investigate the tail dependence of the BVGCR2 model in terms of the upper tail dependence which is defined as
λ U = lim q 1 Pr [ T 1 > F T 1 1 ( q ) | T 2 > F T 2 1 ( q ) ] = lim q 1 Pr [ T 1 > F T 1 1 ( q ) , T 2 > F T 2 1 ( q ) ] 1 q = lim q 1 1 2 q + F T 1 , T 2 ( F T 1 1 ( q ) , F T 2 1 ( q ) ) 1 q = 2 lim q 1 d d q F T 1 , T 2 ( F T 1 1 ( q ) , F T 2 1 ( q ) ) ,
where the last equality holds by the L’hospital’s rule. In fact,
d d q F T 1 , T 2 ( F T 1 1 ( q ) , F T 2 1 ( q ) ) = p 11 d d q F T 1 ( 1 ) ( F T 1 1 ( q ) ) F T 2 ( 1 ) ( F T 2 1 ( q ) ) + p 12 d d q F T 1 ( 1 ) ( F T 1 1 ( q ) ) F T 2 ( 2 ) ( F T 2 1 ( q ) ) + p 21 d d q F T 1 ( 2 ) ( F T 1 1 ( q ) ) F T 2 ( 1 ) ( F T 2 1 ( q ) ) + p 22 d d q F T 1 ( 2 ) ( F T 1 1 ( q ) ) F T 2 ( 2 ) ( F T 2 1 ( q ) ) = p 11 F T 2 ( 1 ) ( F T 2 1 ( q ) ) f T 1 ( 1 ) ( F T 1 1 ( q ) ) f T 1 ( F T 1 1 ( q ) ) + F T 1 ( 1 ) ( F T 1 1 ( q ) ) f T 2 ( 1 ) ( F T 2 1 ( q ) ) f T 2 ( F T 2 1 ( q ) ) + p 12 F T 2 ( 2 ) ( F T 2 1 ( q ) ) f T 1 ( 1 ) ( F T 1 1 ( q ) ) f T 1 ( F T 1 1 ( q ) ) + F T 1 ( 1 ) ( F T 1 1 ( q ) ) f T 2 ( 2 ) ( F T 2 1 ( q ) ) f T 2 ( F T 2 1 ( q ) ) + p 21 F T 2 ( 1 ) ( F T 2 1 ( q ) ) f T 1 ( 2 ) ( F T 1 1 ( q ) ) f T 1 ( F T 1 1 ( q ) ) + F T 1 ( 2 ) ( F T 1 1 ( q ) ) f T 2 ( 1 ) ( F T 2 1 ( q ) ) f T 2 ( F T 2 1 ( q ) ) + p 22 F T 2 ( 2 ) ( F T 2 1 ( q ) ) f T 1 ( 2 ) ( F T 1 1 ( q ) ) f T 1 ( F T 1 1 ( q ) ) + F T 1 ( 2 ) ( F T 1 1 ( q ) ) f T 2 ( 2 ) ( F T 2 1 ( q ) ) f T 2 ( F T 2 1 ( q ) ) .
Since f T 1 ( t 1 ) = ( p 11 + p 12 ) f ( 1 ) ( t 1 ) + ( p 21 + p 22 ) f ( 2 ) ( t 1 ) , f T 2 ( t 2 ) = ( p 11 + p 21 ) f ( 1 ) ( t 2 ) + ( p 12 + p 22 ) f ( 2 ) ( t 2 ) , and f ( 2 ) ( t i ) = ( t i / β i ) 2 τ i , i = 1 , 2 , we have
lim q 1 F T i ( 1 ) ( F T i 1 ( q ) ) = 1 , lim q 1 f T i ( 1 ) ( F T i 1 ( q ) ) f T i ( F T i 1 ( q ) ) = 0 , i = 1 , 2 , lim q 1 f T 1 ( 2 ) ( F T 1 1 ( q ) ) f T 1 ( F T 1 1 ( q ) ) = 1 p 21 + p 22 , lim q 1 f T 2 ( 2 ) ( F T 2 1 ( q ) ) f T 2 ( F T 2 1 ( q ) ) = 1 p 12 + p 22 .
Therefore,
lim q 1 d d q F T 1 , T 2 ( F T 1 1 ( q ) , F T 2 1 ( q ) ) = p 12 p 12 + p 22 + p 21 p 21 + p 22 + p 22 p 21 + p 22 + p 22 p 12 + p 22 = 2 .
Then, by Eq. (9), we have λ U = 2 lim q 1 d d q F T 1 , T 2 ( F T 1 1 ( q ) , F T 2 1 ( q ) ) = 0 . That is, the BVGCR2 model does not have tail dependence.

4. Model Parameter Estimation

In this section, we discuss parameter estimation of the bivariate Type-II generalized crack distribution using the expectation-maximization algorithm. We briefly review the EM algorithm in a general setting and provide a specific application to the BVGCR2 model.

4.1. Maximum Likelihood Estimation

Suppose ( X 1 , X 2 , . . . , X n ) is an independent and identically distributed random sample drawn from a density f ( x | θ ) . Likelihood function is given as
L ( θ ) = f ( x ˜ | θ ) = i = 1 n f ( x i | θ ) .
The maximum likelihood estimation (MLE) aims to find the parameter estimate that maximizes the likelihood function, or equivalently, the log-likelihood function:
θ ^ M L E = arg max θ { log f ( x ˜ | θ ) } .
In the presence of latent (hidden/unobserved) values, however, the direct maximization method often does not provide reliable estimates; hence, other alternative methods are usually considered, and one of such methods is the expectation-maximization algorithm.

4.2. Expectation-Maximization Algorithm

For simple mixture models with a small number of mixture components, the direct optimization of the log-likelihood function may be used to obtain the maximum likelihood estimates of the model parameters. However, the direct optimization often fail to converge when the number of mixture components is large relative to the sample size.
The expectation maximization (EM) algorithm ([15]) is an efficient iterative procedure used to compute the maximum likelihood estimates in the presence of missing or hidden data. When applied to finite mixture model settings, the expectation step renders the separation of the mixture weight parameters from other model parameters for optimization, and the maximization step gives an explicit solution for updating the mixture weights. Due to this, the algorithm is the most widely used maximum likelihood estimation technique for finite mixture models.
Here we briefly review the general form of the EM algorithm. Suppose we have observed data x ˜ = ( x 1 , x 2 , . . . , x n ) with density p ( x ˜ | θ ) and some latent (hidden/unobserved data) z ˜ = ( z 1 , z 2 , . . . , z n ) with density p ( z ˜ | θ ) . The density of the complete data is denoted by p ( x ˜ , z ˜ | θ ) . The goal of the EM algorithm is to find the MLE, that is, the maximum of the observed data likelihood function,
L ( θ ) = z ˜ p ( x ˜ , z ˜ | θ ) .
The EM algorithm proceeds by iterating between the following two steps;
  • E-Step: This step calculates the expectation of the likelihood with respect to the conditional distribution of z ˜ given x ˜ and the initial parameter estimate θ ( m ) , that is,
    Q ( θ | θ ( m ) ) = E Z | x ˜ , θ ( m ) log p ( x ˜ , Z | θ ) .
  • M-Step: Choose θ ^ ( m + 1 ) = arg m a x θ Q ( θ | θ ( m ) ) .
Lemma 4.1.
The EM algorithm improves Q ( θ | θ ( m ) ) , if Q ( θ | θ ( m ) ) Q ( θ ( m ) | θ ( m ) ) then l ( θ ( m + 1 ) ) l ( θ ( m ) ) .
Proof. See Appendix C.
We now provide the EM algorithm for the estimation of the parameters in the BVGCR2 model. Let t = ( t 1 , t 2 , . . . , t n ) be a random sample drawn from a BVGCR2( α , β , τ , p ; g ) distribution, where t i = ( t i 1 , t i 2 ) is a pair of observations for each i = 1 , . . . , n , and the base density g may have its own parameter(s) such as ν involved in the Student’s t density. We denote the vector of parameters involved in the base densities by θ = ( θ 1 , θ 2 ) . Letting γ = ( α , β , τ , p , θ ) be the vector of all model parameters, the likelihood function based on the incomplete data is
L ( γ | t ) = i = 1 n j J q j f j ( t i 1 , t i 2 ; α , β , τ ; θ ; g )
where J = { 11 , 12 , 21 , 22 } is the set of indexes and
f j ( t i ; γ ; g ) = f G C R 2 ( 1 ) ( t i 1 ; α 1 , β 1 , τ 1 ; θ 1 ; g ) f G C R 2 ( 1 ) ( t i 2 ; α 2 , β 2 , τ 2 ; θ 2 ; g ) , j = 11 f G C R 2 ( 1 ) ( t i 1 ; α 1 , β 1 , τ 1 ; θ 1 ; g ) f G C R 2 ( 2 ) ( t i 2 ; α 2 , β 2 , τ 2 ; θ 2 ; g ) , j = 12 f G C R 2 ( 2 ) ( t i 1 ; α 1 , β 1 , τ 1 ; θ 1 ; g ) f G C R 2 ( 1 ) ( t i 2 ; α 2 , β 2 , τ 2 ; θ 2 ; g ) , j = 21 f G C R 2 ( 2 ) ( t i 1 ; α 1 , β 1 , τ 1 ; θ 1 ; g ) f G C R 2 ( 2 ) ( t i 2 ; α 2 , β 2 , τ 2 ; θ 2 ; g ) , j = 22 .
On the other hand, letting Z = ( Z 1 , Z 2 , . . . , Z n ) be a set of latent variables where Pr ( Z i = j ) = q j , j = 1 , 2 , for each i { 1 , . . . , n } , the likelihood function based on the (augmented) complete data is
L ( γ | t , Z ) = i = 1 n j [ q j f j ( t i 1 , t i 2 ; α , β , τ ; θ ; g ) ] I ( Z i = j )
where I ( · ) denotes the indicator function.
Let γ ( m ) = ( α ( m ) , β ( m ) , τ ( m ) , p ( m ) , θ ( m ) ) denote the current estimate of γ after m-th iteration of the EM algorithm. Then, by Bayes’ theorem, we have
p i j ( m ) : = Pr ( Z i = j | t i , γ ( m ) ) = q j ( m ) f j ( t i 1 , t i 2 ; α ( m ) , β ( m ) , τ ( m ) ; θ ( m ) ) j J q j ( m ) f j ( t i 1 , t i 2 ; α ( m ) , β ( m ) , τ ( m ) ; θ ( m ) ) ,
The expectation step (E-step) of the EM algorithm follows.
E Z | t , γ ( m ) log L ( γ | t , Z ) = i = 1 n E Z | t , γ ( m ) ( log j [ q j f j ( t i 1 , t i 2 ; α , β , τ ; θ ; g ) ] I ( Z i = j ) ) = i = 1 n E Z | t , γ ( m ) ( j J I ( Z i = j ) log [ q j f j ( t i 1 , t i 2 ; α , β , τ ; θ ; g ) ] ) = i = 1 n j J Pr ( Z i = j | t i , γ ( m ) ) log q j f j ( t i 1 , t i 2 ; α , β , τ ; θ ; g ) = i = 1 n j J p i j ( m ) log q j f j ( t i 1 , t i 2 ; α , β , τ ; θ ; g ) = i = 1 n j j p i j ( m ) log q j + i = 1 n j J p i j ( m ) log f j ( t i 1 , t i 2 ; α , β , τ ; θ ; g ) .
The maximization step (M-step) finds the updated parameter estimates that maximize the objective function Equation (Section 4.2) which separates q j and the other parameters ( α , β , τ ; θ ). The update of q j can be dealt with separately by applying the method of Lagrange multiplier. The updated estimate is
q j ( m + 1 ) = arg max q i = 1 n j J p i j ( m ) log q j = 1 n i = 1 n p i , 11 ( m ) , . . . , 1 n i = 1 n p i , 22 ( m ) .
The updated estimates α ( m + 1 ) , β ( m + 1 ) , τ ( m + 1 ) and θ ( m + 1 ) are the maximizers of the objective function
i = 1 n j J p i j ( m ) log f j ( t i 1 , t i 2 ; α , β , τ ; θ ; g ) = ( Q 1 ) + ( Q 2 ) ,
where
( Q 1 ) = i = 1 n [ ( p i , 11 ( m ) + p i , 12 ( m ) ) log f G C R 2 ( 1 ) ( t i 1 ; α 1 , β 1 , τ 1 ; θ 1 ; g ) + ( p i , 21 ( m ) + p i , 22 ( m ) ) log f G C R 2 ( 2 ) ( t i 1 ; α 1 , β 1 , τ 1 ; θ 1 ; g ) ] ,
and
( Q 2 ) = i = 1 n [ ( p i , 11 ( m ) + p i , 21 ( m ) ) log f G C R 2 ( 1 ) ( t i 2 ; α 2 , β 2 , τ 2 ; θ 2 ; g ) + ( p i , 12 ( m ) + p i , 22 ( m ) ) log f G C R 2 ( 2 ) ( t i 2 ; α 2 , β 2 , τ 2 ; θ 2 ; g ) ] .
That is, the maximization for ( α 1 , β 1 , τ 1 ; θ 1 ) can be proceeded separately from that for ( α 2 , β 2 , τ 2 ; θ 2 ) and thus, the dimensionality of the optimization problem is reduced significantly.

5. Applications

In [12] the authors demonstrate the usefulness of the univariate GCR2 distribution for modelling heavy-tailed loss severity data. Here, we fit some bivariate Type-II generalized crack distributions on real catastrophic loss data compiled from the International Disaster Database (EM-DAT, www.emdat.be). Particularly, we consider a set of observations composed of two variables, losses from meteorological (Meteo) and hydrological (Hydro) disasters spanning from 1950 to 2022 in Asia. For the purpose of bivariate model fitting, we remove the pairs with missing observations, resulting in a bivariate dataset with 166 observations. The bivariate data is composed of quarterly times series of (estimated) damage due to meteorological disasters such as storms and extreme temperatures, and hydrologic disasters such as floods and landslides. The loss values are inflation-adjusted to be equivalent to the US dollar value in 2021 with a unit of 100 million USD.
Table 1 presents summary statistics of losses due to the hydrological and meteorological disasters in Asia. The descriptive statistics together with the histogram and normal Q-Q plots (Figure 10 and Figure 11) suggest that the two datasets are both positively skewed and heavy-tailed. To isolate the deterministic components, the quarterly time series are decomposed under the multiplicative model assumption. Figure 12 presents the datasets decomposition which shows the presence of strong seasonal components in both variables and some evidence of weak long-term trends. Since the proposed model does not assume any deterministic seasonality, we deseasonalize by dividing each time series with its estimated seasonal component.
Figure 13 gives the scatter plots of the two deseasonalized variables and their log-transformed versions. The figure shows some evidence of a dependent relationship between the two variables. The sample Spearman’s rho and Kendall’s tau are 0.425 and 0.291, respectively.
We apply the EM algorithm described in Section 4.2 to fit the data using six specific bivariate models: GCR-GG, GCR-t, GCR-LG, GCR2-GG, GCR2-t, and GCR2-LG. For each model fitting, the EM algorithm requires the initialization of the parameter values. To obtain a reliable result, we first fit the marginal models separately using the estimation method given in [12], and the fitted values of the marginal parameters, i.e., ( α i , β i , τ i , θ ) for GCR models and ( α i , β i , τ i , θ ) , i = 1 , 2 , for the GCR2 models, are used for the initialization of the BVGCR (or BVGCR2) model parameters. For the mixture weight parameter initialization, we use a large set of possible parameter values satisfying p 11 + p 12 = p 1 and p 11 + p 21 = p 2 , where p 1 and p 2 are the mixture weight parameter estimates for the marginal models. The log-likelihood function values of the EM fits under the set of initializations are compared and the one gives the largest log-likelihood value is selected for the final fit.
Since the number of estimated model parameters differs by model, we compare the fits of candidate BVGCR2 models in terms of the Akaike information criterion (AIC) and the Bayesian information criterion (BIC), defined as
A I C = 2 ( Log - Likelihood ) + 2 k B I C = 2 ( Log - Likelihood ) + k log n ,
respectively, where k is the number of estimated parameters and n is the sample size. Amongst candidate models, the preferred model is the one with the smallest value of either of these criteria. From Table 2, we see that the fitted BVGCR2-GG model outperforms all the other alternative models based on the Akaike information criterion. However, the BVGCR-LG model is preferred in terms of the Bayesian information criterion which heavily penalizes complex models. Comparing the BVGCR-GG to the BVGCR2-GG, one can see the BVGCR2-GG significantly improves model fitting due to the additional shape parameter.
Table 3 gives parameter estimates of the marginal distributions and the bivariate model of the fitted GCR2-GG model.
Note that the values of ( p 11 + p 12 ) and ( p 11 + p 21 ) based on the fitted BVGCR2-GG model are larger than the corresponding estimates of p 1 and p 2 . Spearman’s rho and Kendall’s tau under the fitted BVGCR2-GG model are 0.173 and 0.115, respectively, which are lower than the empirical counterparts. This may be because the empirical data contains spurious dependence due to the deterministic trend.
To test the statistical significance of the parameter estimates of the BVGCR2-GG model, we construct the bootstrap confidence intervals by computing the 2.5th and the 97.5th percentiles of the estimates based on the 100 bootstrap samples. Table 4 shows that all the parameter estimates on the original dataset fall in the corresponding 95% bootstrap confidence intervals.
Figure 14 gives the contour plots of the fitted BVGCR2-GG model and that of the log-transformed random variables and Figure 15 presents the scatter plots of the simulated random variables from the BVGCR2-GG model and their log-transformations. Comparing these plots with Figure 13, we can see that the fitted model explains the dependence structure of the empirical data well.

6. Conclusions

In this paper, we constructed a bivariate extension of the Type-II generalized crack distribution and studied a few specific examples of the bivariate GCR2 distributions base on the generalized Gaussian, Student’s t and logistic densities to demonstrate the applicability of the constructed model. Specifically, our main theoretical finding is that the level of dependence of the constructed BVGCR2 model in terms of Kendall’s tau and Spearman’s rho is a weak to medium association. The model fitting to a catastrophic loss data showed that the fitted BVGCR2-GG model outperformed all the other alternative models based on the Akaike information criterion. Especially when compared to the BVGCR-GG model, the BVGCR2-GG model has shown a significant improvement due to the increased flexibility. With an appropriate choice of base density function, the propose BVGCR2 model can be effectively used for various applications that requires a weak to moderate level of dependence.

Author Contributions

Conceptualization, T. Bae; methodology, T.B., H.Q.; software, H.Q.; validation, T.B.; formal analysis, T.B, H.Q.; investigation, T.B.; resources, T.B.; data curation, H.Q.; writing—original draft preparation, T.B., H.Q.; writing—review and editing, T.B.; visualization, H.Q.; supervision, T.B.; project administration, T.B.; funding acquisition, T.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research is supported by the Discovery Grant program of the Natural Science and Engineering Research Council of Canada (NSERC) with a funding reference number RGPIN-2022-03428

Data Availability Statement

The data used in this research is available upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Proof of Proposition 1

We drop parameters in the expressions of the functions for notational convenience and write G ¯ ( · ) = 1 G ( · ) and b ( t i ) = α i 1 [ ( β i / t i ) τ ( t i / β i ) τ ] , i = 1 , 2 . By f B V G C R 2 ( t 1 , t 2 ; α , β , τ , p ) the joint distribution of ( T 1 , T 2 ) having BVGCR2( α , β , τ ,p;g), is
F ( t 1 , t 2 ) = p 11 F G C R 2 ( 1 ) ( t 1 ) F G C R 2 ( 1 ) ( t 2 ) + p 12 F G C R 2 ( 1 ) ( t 1 ) F G C R 2 ( 2 ) ( t 2 ) + p 21 F G C R 2 ( 2 ) ( t 1 ) F G C R 2 ( 1 ) ( t 2 ) + p 22 F G C R 2 ( 2 ) ( t 1 ) F G C R 2 ( 2 ) ( t 2 ) ,
and the marginal density of T i , i = 1 , 2 , is
f T i ( t i ) = p i f G C R 2 ( 1 ) ( t i ) + q i f G C R 2 ( 2 ) ( t i ) ,
also, the cdf of GCR2 ( t ; α , β , τ , p ) distribution can be written as
F G C R 2 ( t ; α , β , τ , p ) = 1 G ( b ( t ) ) + ( 2 p 1 ) H ( t ; α , β , τ , p ) ,
where p 1 = p 11 + p 12 , p 2 = p 11 + p 21 , q i = 1 p i , i = 1 , 2 , G ( x ) = x g ( s ) d s is the cdf of g and
H ( t ; α , β , τ , p ) = b ( t ) s s 2 + 4 / α 2 g ( s ) d s .
By expanding the integrand in Equation 6 using these expressions and taking the double integration, we have
0 0 F ( t 1 , t 2 ) f T 1 ( t 1 ) f T 2 ( t 2 ) d t 1 d t 2 = p 11 { p 1 p 2 E [ F S 1 ( S 1 ) ] E [ F S 2 ( S 2 ) ] + p 1 q 2 ζ 2 E [ F S 1 ( S 1 ) ] + q 1 p 2 ζ 1 E [ F S 2 ( S 2 ) ] + q 1 q 2 ζ 1 ζ 2 } + p 12 { p 1 p 2 η 2 E [ F S 1 ( S 1 ) ] + p 1 q 2 E [ F S 1 ( S 1 ) ] E [ F V 2 ( V 2 ) ] + q 1 p 2 ζ 1 η 2 + q 1 q 2 ζ 1 E [ F V 2 ( V 2 ) ] } + p 21 { p 1 p 2 η 1 E [ F S 2 ( S 2 ) ] + p 1 q 2 η 1 ζ 2 + q 1 p 2 E [ F V 1 ( V 1 ) ] E [ F S 2 ( S 2 ) ] + q 1 q 2 ζ 2 E [ F V 2 ( V 2 ) ] } + p 22 { p 1 p 2 η 1 η 2 + p 1 q 2 η 1 E [ F V 2 ( V 2 ) ] + q 1 p 2 η 2 E [ F V 1 ( V 1 ) ] + q 1 q 2 E [ F V 1 ( V 1 ) ] E [ F V 2 ( V 2 ) ] } .
where, for each i = 1 , 2 ,
E [ F S i ( S i ) ] = 0 F G C R 2 ( 1 ) ( t i ) f G C R 2 ( 1 ) ( t i ) d t i = 1 2 E [ F V i ( V i ) ] = 0 F G C R 2 ( 2 ) ( t i ) f G C R 2 ( 2 ) ( t i ) d t i = 1 2 ζ i : = 0 F G C R 2 ( 1 ) ( t i ) f G C R 2 ( 2 ) ( t i ) d t i = 0 [ F G C R 2 ( 2 ) ( t i ) + 2 H ( t i ) ] f G C R 2 ( 2 ) ( t i ) d t i = E [ F V i ( V i ) ] + 2 0 [ H ( t i ) f G C R 2 ( 2 ) ( t i ) ] d t i = 1 2 + 2 0 [ H ( t i ) f G C R 2 ( 2 ) ( t i ) ] d t i η i : = 0 F G C R 2 ( 2 ) ( t i ) f G C R 2 ( 1 ) ( t i ) d t i = 0 [ F G C R 2 ( 1 ) ( t i ) + 2 H ( t i ) ] f G C R 2 ( 1 ) ( t i ) d t i = E [ F S i ( S i ) ] 2 0 [ H ( t i ) f G C R 2 ( 1 ) ( t i ) ] d t i = 1 2 2 0 [ H ( t i ) f G C R 2 ( 1 ) ( t i ) ] d t i .
That is, the evaluating the double integral in Equation 6 reduces to the following two integrals: 0 H ( t ) f G C R 2 ( 1 ) ( t ) d t and 0 H ( t ) f G C R 2 ( 2 ) ( t ) d t . Due to the expression F G C R 2 ( 1 ) ( t ) = 1 G ( b ( t ) ) + H ( t ) and
H ( b 1 ( s ) ) s s 2 + 4 / α 2 g ( s ) d s = s z z 2 + 4 / α 2 s s 2 + 4 / α 2 g ( s ) d s = 0 ,
and changing the order of integrations, we obtain
0 H ( t ) f G C R 2 ( 1 ) ( t ) d t = 0 b ( t ) s s 2 + 4 / α 2 g ( s ) d s f G C R 2 ( 1 ) ( t ) d t = b 1 ( s ) f G C R 2 ( 1 ) ( t ) d t s s 2 + 4 / α 2 g ( s ) d s = ( 1 F G C R 2 ( 1 ) ( b 1 ( s ) ) ) s s 2 + 4 / α 2 g ( s ) d s = ( G ( s ) H ( b 1 ( s ) ) ) s s 2 + 4 / α 2 g ( s ) d s = G ( s ) s s 2 + 4 / α 2 g ( s ) d s H ( b 1 ( s ) ) s s 2 + 4 / α 2 g ( s ) d s = s s 2 + 4 / α 2 G ( s ) g ( s ) d s = γ .
Similarly, due to the expression F G C R 2 ( 2 ) ( t ) = 1 G ( b ( t ) ) H ( t ) , we obtain
0 H ( t ) f G C R 2 ( 2 ) ( t ) d t = 0 b ( t ) s s 2 + 4 / α 2 g ( s ) d s f G C R 2 ( 2 ) ( t ) d t = b 1 ( s ) f G C R 2 ( 2 ) ( t ) d t s s 2 + 4 / α 2 g ( s ) d s = ( 1 F G C R 2 ( 2 ) ( b 1 ( s ) ) ) s s 2 + 4 / α 2 g ( s ) d s = ( G ( s ) + H ( b 1 ( s ) ) ) s s 2 + 4 / α 2 g ( s ) d s = G ( s ) s s 2 + 4 / α 2 g ( s ) d s + H ( b 1 ( s ) ) s s 2 + 4 / α 2 g ( s ) d s = s s 2 + 4 / α 2 G ( s ) g ( s ) d s = γ .
The combination and some simplifications of these results give the expression Equation 7.

Appendix B. Proof of Proposition 2

As in the proof of proposition 1, expanding the integrand in 8 using the joint cdf and pdf of the BVGCR2 and taking double integrals, gives
  0 0 F ( t 1 , t 2 ) f ( t 1 , t 2 ) d t 1 d t 2 = ( p 11 ) 2 E [ F S 1 ( S 1 ) ] E [ F S 2 ( S 2 ) ] + p 11 p 12 ζ 2 E [ F S 1 ( S 1 ) ] + p 11 p 21 ζ 1 E [ F S 2 ( S 2 ) ] + p 11 p 22 ζ 1 ζ 2 + p 12 p 11 η 2 E [ F S 1 ( S 1 ) ] + ( p 12 ) 2 E [ F S 1 ( S 1 ) ] E [ F V 2 ( V 2 ) ] + p 12 p 21 ζ 1 η 2 + p 12 p 22 ζ 1 E [ F V 2 ( V 2 ) ] + p 21 p 11 η 1 E [ F S 2 ( S 2 ) ] + p 21 p 12 η 1 ζ 2 + ( p 21 ) 2 E [ F V 1 ( V 1 ) ] E [ F S 2 ( S 2 ) ] + p 21 p 22 ζ 2 E [ F V 2 ( V 2 ) ] + p 22 p 11 η 1 η 2 + p 21 p 11 η 1 E [ F V 2 ( V 2 ) ] + p 22 p 21 η 2 E [ F V 1 ( V 1 ) ] + ( p 22 ) 2 E [ F V 1 ( V 1 ) ] E [ F V 2 ( V 2 ) ] .
As given the proof of Proposition 1, E [ F S 1 ( S 1 ) ] = E [ F V 2 ( V 2 ) ] = 1 2 , ζ i = 1 2 + 2 γ i and η i = 1 2 2 γ i for each i = 1 , 2 . With these and after some simplification, the above integral can be expressed as
0 0 F ( t 1 , t 2 ) f ( t 1 , t 2 ) d t 1 d t 2 = 1 4 + 8 ( p 11 p 22 p 12 p 21 ) γ 1 γ 2 .

Appendix C. Proof of Lemma 4.1

The log-likelihood function of the observed data is given as l ( θ ) = log p ( x ˜ | θ ) . By the law of total probability,
log p ( x ˜ | θ ) = log z ˜ p ( x ˜ | z ˜ , θ ) p ( z ˜ | θ ) = log z ˜ p ( x ˜ , z ˜ | θ ) .
Multiplying and dividing by a constant p ( z ˜ | x ˜ , θ ( m ) ) , which is the probability of hidden data given the observed data and initial guess parameter. Since this is a probability function, then p ( z ˜ | x ˜ , θ ( m ) ) 0 and z ˜ p ( z ˜ | x ˜ , θ ( m ) ) = 1 . Then, the log-likelihood is;
l ( θ ) = log z ˜ p ( z ˜ | x ˜ , θ ( m ) ) p ( x ˜ , z ˜ | θ ) p ( z ˜ | x ˜ , θ ( m ) ) , = log E Z | x ˜ , θ ( m ) p ( x ˜ , Z | θ ) p ( Z | x ˜ , θ ( m ) ) , E Z | x ˜ , θ ( m ) log p ( x ˜ , Z | θ ) p ( Z | x ˜ , θ ( m ) ) b y J e n s e n i n e q u a l i t y , = E Z | x ˜ , θ ( m ) log p ( x ˜ , Z | θ ) E Z | x ˜ , θ ( m ) log p ( Z | x ˜ , θ ( m ) ) , = Q ( θ | θ ( m ) ) E Z | x ˜ , θ ( m ) log p ( Z | x ˜ , θ ( m ) ) , l ( θ ) Q ( θ | θ ( m ) ) h ( θ ( m ) ) .
Therefore, Q ( θ | θ ( m ) ) gives the lower bound on the log-likelihood, which is the only term that depends on θ . l ( θ ) Q ( θ | θ ( m ) ) h ( θ ( m ) ) is true for all θ including the situation where θ = θ ( m ) . We show the relationship is still true for θ = θ ( m ) ,
z ˜ p ( z ˜ | x ˜ , θ ( m ) ) l o g p ( x ˜ , z ˜ | θ ( m ) ) p ( z ˜ | x ˜ , θ ( m ) ) , = z ˜ p ( z ˜ | x ˜ , θ ( m ) ) l o g p ( x ˜ | θ ( m ) ) b y B a y e s t h e o r e m , = log p ( x ˜ | θ ( m ) ) z ˜ p ( z ˜ | x ˜ , θ ( m ) ) = : l ( θ ( m ) ) .
If we have θ for which Q ( θ | θ ( m ) ) Q ( θ ( m ) | θ ( m ) ) , then
l ( θ ) + h ( θ ( m ) ) Q ( θ | θ ( m ) ) Q ( θ ( m ) | θ ( m ) ) = l ( θ ( m ) ) + h ( θ ( m ) ) .
Therefore, as Q increases, it implies log p ( x ˜ | θ ) also increases. We conclude that, if θ is the MLE, then l ( θ ) l ( θ ( m + 1 ) ) l ( θ ( m ) ) , always holds.

References

  1. Bingham, N.H.; Goldie, C.M.; Teugels, J.L. Regular Variation; Cambridge University Press, 1987.
  2. Embrechts, P.; Klüppelberg, C.; Mikosch, T. Modelling Extremal Events for Insurance and Finance; Springer-Verlag: Berlin, 1997. [Google Scholar]
  3. Foss, S.; Korshunov, D.; Zachary, S. An Introduction to Heavy-Tailed and Subexponential Distributions; Springer Science+Business Media, 2013.
  4. Birnbaum, Z.W.; Saunders, S.C. A new family of life distributions. Journal of applied probability 1969, 6, 319–327. [Google Scholar] [CrossRef]
  5. Leiva, V. The Birnbaum-Saunders Distribution 2015.
  6. Díaz-García, J.A.; Leiva-Sánchez, V. A new family of life distributions based on the elliptically contoured distributions. Journal of Statistical Planning and Inference 2005, 128, 445–457. [Google Scholar] [CrossRef]
  7. Gomes, M.I.; Ferreira, M.; Leiva, V. The extreme value Birnbaum-Saunders model, its moments and an application in biometry. Biometrical Letters 2012, 49, 81–94. [Google Scholar] [CrossRef]
  8. Ferreira, M.; Gomes, M.I.; Leiva, V. On an extreme value version of the Birnbaum–Saunders distribution. REVSTAT-Statistical Journal 2012, 10, 181–210. [Google Scholar]
  9. Volodin, I.N.; Dzhungurova, O.A. On limit distributions emerging in the generalized Birnbaum-Saunders model. Journal of Mathematical Sciences 2000, 99, 1348–1366. [Google Scholar] [CrossRef]
  10. Bae, T.; Chen, J. On heavy-tailed crack distribution for loss severity modeling. International Journal of Statistics and Probability 2017, 6, 92–110. [Google Scholar] [CrossRef]
  11. Mazjini, M.; Bae, T. Statistical modelling of precipitation data in Canadian Prairies with a dynamic mixture structure. Theoretical and Applied Climatology 2023, 153, 173–192. [Google Scholar] [CrossRef]
  12. Bae, T.; Volodin, A. Type-II Generalized Crack Distribution with Application to Heavy-Tailed Data Modeling. Journal of Statistical Theory and Practice 2022, 16, 1–23. [Google Scholar] [CrossRef]
  13. Bae, T.; Choi, Y.H. A bivariate extension of three-parameter generalized crack distribution for loss severity modelling. Journal of the Korean Statistical Society 2022, 51, 378–402. [Google Scholar] [CrossRef]
  14. Klugman, S.A.; Panjer, H.H.; Willmot, G.E. Loss models: from data to decisions; Vol. 715, John Wiley & Sons, 2012.
  15. Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum likelihood from incomplete data via the EM algorithm. Journal of the royal statistical society: series B (methodological) 1977, 39, 1–22. [Google Scholar] [CrossRef]
Figure 1. Density functions of GCR2-t distribution with α = 0.5, β = 5 , τ { 0.3 , 0.5 , 0.8 } , p = 0.5, and ν = 3.
Figure 1. Density functions of GCR2-t distribution with α = 0.5, β = 5 , τ { 0.3 , 0.5 , 0.8 } , p = 0.5, and ν = 3.
Preprints 138191 g001
Figure 2. Density functions of GCR2-t distribution with α { 0.25 , 0.75 , 1 } , β = 5 , τ = 0.3 , p = 0.5, and ν = 5.
Figure 2. Density functions of GCR2-t distribution with α { 0.25 , 0.75 , 1 } , β = 5 , τ = 0.3 , p = 0.5, and ν = 5.
Preprints 138191 g002
Figure 3. Density functions of GCR2-t distribution with α = 0.75, β = 5 , τ = 0.5, p { 0.3 , 0.5 , 0.8 } and ν = 5.
Figure 3. Density functions of GCR2-t distribution with α = 0.75, β = 5 , τ = 0.5, p { 0.3 , 0.5 , 0.8 } and ν = 5.
Preprints 138191 g003
Figure 4. Density functions of GCR2-GG distribution with α = 0.75 , β = 5 , τ { 0.3 , 0.5 , 0.8 } , p = 0.5, and θ = 0.8.
Figure 4. Density functions of GCR2-GG distribution with α = 0.75 , β = 5 , τ { 0.3 , 0.5 , 0.8 } , p = 0.5, and θ = 0.8.
Preprints 138191 g004
Figure 5. Density functions of GCR2-GG distribution with α = 1 , β = 5 , τ = 0.5 , p { 0.2 , 0.5 , 0.8 } , and θ = 1.
Figure 5. Density functions of GCR2-GG distribution with α = 1 , β = 5 , τ = 0.5 , p { 0.2 , 0.5 , 0.8 } , and θ = 1.
Preprints 138191 g005
Figure 6. Density functions of GCR2-GG distribution with α { 0.5 , 0.75 , 1 } , β = 5 , τ = 0.5 , p = 0.5, and θ = 1.2.
Figure 6. Density functions of GCR2-GG distribution with α { 0.5 , 0.75 , 1 } , β = 5 , τ = 0.5 , p = 0.5, and θ = 1.2.
Preprints 138191 g006
Figure 7. Density functions of GCR2-LG distribution with α = 1, β = 5 , τ = 0.5 and p { 0.2 , 0.5 , 0.8 } .
Figure 7. Density functions of GCR2-LG distribution with α = 1, β = 5 , τ = 0.5 and p { 0.2 , 0.5 , 0.8 } .
Preprints 138191 g007
Figure 8. Density functions of GCR2-LG distribution with α =0.75 , β = 5 , τ { 0.5 , 0.75 , 1 } and p = 0.5.
Figure 8. Density functions of GCR2-LG distribution with α =0.75 , β = 5 , τ { 0.5 , 0.75 , 1 } and p = 0.5.
Preprints 138191 g008
Figure 9. Density functions of GCR2-LG distribution with α { 0.5 , 0.75 , 0.9 } , β = 5 , τ = 0.3 and p = 0.5.
Figure 9. Density functions of GCR2-LG distribution with α { 0.5 , 0.75 , 0.9 } , β = 5 , τ = 0.3 and p = 0.5.
Preprints 138191 g009
Figure 10. Histogram of Meteo loss dataset and its normal Q-Q plot.
Figure 10. Histogram of Meteo loss dataset and its normal Q-Q plot.
Preprints 138191 g010
Figure 11. Histogram of Hydro loss dataset and its normal Q-Q plot.
Figure 11. Histogram of Hydro loss dataset and its normal Q-Q plot.
Preprints 138191 g011
Figure 12. Multiplicative decompositions of Meteo and Hydro time series.
Figure 12. Multiplicative decompositions of Meteo and Hydro time series.
Preprints 138191 g012
Figure 13. Scatter plots of the deseasonalized data and their log-transformations.
Figure 13. Scatter plots of the deseasonalized data and their log-transformations.
Preprints 138191 g013
Figure 14. Contour plots of the fitted BVGCR2-GG density and the density of the log-transformed random variables.
Figure 14. Contour plots of the fitted BVGCR2-GG density and the density of the log-transformed random variables.
Preprints 138191 g014
Figure 15. Scatter plots of simulated random variables from the fitted BVGCR2-GG model and their log-transformations.
Figure 15. Scatter plots of simulated random variables from the fitted BVGCR2-GG model and their log-transformations.
Preprints 138191 g015
Table 1. Descriptive summary statistics of the dataset (Unit: 100 million USD).
Table 1. Descriptive summary statistics of the dataset (Unit: 100 million USD).
Data n Min 1st Qu. Median Mean 3rd Qu. Max Skewness Kurtosis
Meteo 166 0.003 2.005 7.275 30.799 35.282 287.090 2.820 11.304
Hydro 166 0.007 1.692 12.172 45.538 53.330 621.207 4.014 23.188
Table 2. Model comparison with AIC and BIC.
Table 2. Model comparison with AIC and BIC.
Model Log-likelihood AIC BIC
BVGCR-GG -1370.332 2758.664 2786.672
BVGCR-t -1372.381 2762.762 2790.770
BVGCR-LG -1372.066 2758.132 2779.916
BVGCR2-GG -1367.346 2756.692 2790.924
BVGCR2-t -1371.782 2765.564 2799.796
BVGCR2-LG -1371.707 2761.414 2789.422
Table 3. Parameter estimates of marginals and Bivariate GCR2-GG models.
Table 3. Parameter estimates of marginals and Bivariate GCR2-GG models.
α ^ 1 β ^ 1 τ ^ 1 θ ^ 1 p 1 ^ α ^ 2 β ^ 2 τ ^ 2 θ ^ 2 p 2 ^ p ^ 11 p ^ 12 p ^ 21 p ^ 22
Meteo 20.075 0.915 0.737 0.623 0.094
Hydro 11.152 1.964 0.669 0.809 0.189
Bivariate 22.644 1.494 0.791 0.477 11.442 1.964 0.675 0.798 0.124 0.040 0.072 0.764
Table 4. The 95% bootstrap confidence intervals for the parameters in Bivariate GCR2-GG model
Table 4. The 95% bootstrap confidence intervals for the parameters in Bivariate GCR2-GG model
α ^ 1 β ^ 1 τ ^ 1 θ ^ 1 α ^ 2 β ^ 2 τ ^ 2 θ ^ 2 p ^ 11 p ^ 12 p ^ 21 p ^ 22
Original estimate 22.644 1.494 0.791 0.477 11.442 1.964 0.675 0.798 0.124 0.040 0.072 0.764
Bootstrap mean 27.673 1.524 0.735 0.583 14.189 2.065 0.680 0.988 0.131 0.037 0.067 0.766
2.5th percentile 7.031 0.762 0.542 0.347 2.356 1.500 0.322 0.418 0.059 0.000 0.000 0.687
97.5th percentile 52.149 2.252 0.951 1.013 42.898 2.803 0.913 3.027 0.192 0.080 0.147 0.843
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated