Stochastic dominance (SD) is a stochastic ordering for two probability distributions. In this section, we give the definitions of SD in both perspectives from the distributions and the utilities. Then, we discuss the fundamental reason on the disagreement about the conventional and Fishburn’s definitions.
Throughout this paper, we adopt the following set notations. Let be the set of positive integers. Define for any non-negative integer c. Denote by and the sets of real numbers and positive real numbers respectively.
2.1. First- and Second-Order Stochastic Dominance
Let
F and
G be two random variables, and
and
be their probability density functions respectively. Define
for all
, where
and
. For two discrete random variables
F and
G, we can use a similar definition by replacing the integrals into summations [
25,
26], i.e.,
Notice that and are the cumulative distributions of F and G respectively. We can obtain by accumulating the values in , which is the second-order cumulative distribution. In a similar way, we can obtain higher-order cumulative distributions of F and G.
Definition 1 (First-Order SD). F dominates G in the first order if and only if for all x. The dominance is strict if and only if the strict inequality holds for some x.
From this definition, we can see that the cumulative distribution of
F is always no larger than the cumulative distribution of
G. Therefore, the expectation of
F is no smaller than that of
G. If we view
x as the reward and
as the chances of getting the reward, we prefer a larger expected value on the reward. That is, we prefer
F more than
G, which is one of the basic explanations of
F dominates
G in the first order. As we will discuss below, first-order SD has a stronger meaning than the above explanation. However, this explanation is one of the most useful properties that can be applied in other fields, e.g., in modeling communication channels with packet loss [
12].
In a more general sense, von Neumann-Morgenstern utility is an extension of the theory of consumer preferences. In this theory, the optimal decision is the one that maximizes the expected utility derived from the choice made, i.e., the distribution that maximizes the expected utility. The simplest case is a monotonically increasing utility, which means that we prefer a largest reward.
In this perspective, we have another definition of first-order SD. Denote by the expectation operator. Let be the set of all monotonically increasing utilities.
Definition 2 (First-Order SD (Utility)). F dominates G in the first order if and only if for all . The dominance is strict if and only if the strict inequality holds for at least one .
In fact, the two definitions are equivalent. We will see that in the discussion later. Notices that not all distributions may have a first-order SD relation.
We may further refine the relation to capture more information such as which one has a better mean while involving less risk. This is the tendency to prefer outcomes with low uncertainty than those with high uncertainty, known as risk aversion. This relation is the second-order SD. Similarly, we have two equivalent definitions. Let be the set of all monotonically increasing concave utilities.
Definition 3 (Second-Order SD). F dominates G in the second order if and only if for all x. The dominance is strict if and only if the strict inequality holds for some x.
Definition 4 (Second-Order SD (Utility)). F dominates G in the second order if and only if for all . The dominance is strict if and only if the strict inequality holds for at least one .
2.2. Higher-Order Stochastic Dominance
In a similar manner, we can define higher-order SD. First, we define the SD used by Fishburn in his works, which is a direct extension of the above definition.
Definition 5 (Fishburn’s nth-Order SD). F dominates G in the nth order if and only if for all x. The dominance is strict if and only if the strict inequality holds for some x.
For the utility-based definition, let
be the set of all increasing utilities that odd derivatives are non-negative and even derivatives are non-positive, up to the
nth derivative.
Definition 6 (nth-Order SD (Utility)). F dominates G in the nth order if and only if for all . The dominance is strict if and only if the strict inequality holds for at least one .
However, the two definitions are not equivalent. To see the reason, we consider the following evaluation of bounded continuous distributions [
25]. The evaluation for unbounded distributions can be found in [
28,
29]. Let
f and
g be two distributions whose supports are bounded by
. Note that
and
for all
n.
For first-order SD, the evaluation stops at Equation (1). Because and for all x, we know that . For second-order SD, the evaluation stops at Equation (2). This time, we have , and for all x, we know that .
The issue arises when we consider third-order SD [
10]. The evaluation stops at Equation (3). We have
,
,
, and
for all
x. However, we cannot conclude whether
or not due to the term
. Similarly, for
nth-order SD, we have the problematic terms
,
,
…,
.
In risk management, we concern about all possible utilities in . To ensure that , the convention is to assume for all , so that all problematic terms become non-negative. This leads to the conventional definition of nth-order SD below, which is equivalent to the utility-based definition.
Definition 7 (Conventional nth-Order SD). F dominates G in the nth order if and only if
for all x; and
for all .
The dominance is strict if and only if there is at least one strict inequality.
Note that this is a compromise to impose the restrictions on the distributions to fulfill the utility definition. If we start from Fishburn’s SD, then we need to restrict the choices of utilities to a specific utility class in the corresponding equivalent definition. The choice of definition depends on the application. For example, if we only concern whether the expectation of F is no less than that of G, then we do not need to use the stronger definition that works for utilities other than the identity function, thus Fishburn’s definition is sufficient.
From the evaluation, we can see that first-order SD implies second-order SD, and so on and so forth. This SD hierarchy, together with the equivalent of utility-based definition and the conventional definition, are valid for discrete distributions, no matter if the utility is continuous or discrete. For discrete utilities on
in ascending order [
26], the characterization becomes
where
When the domain is a sequence of consecutive integers, we can simplify
as
, where
is the
nth-order forward difference operator defined as
From the definition, we can see that when the order is too high, the
nth-order forward difference is undefined if the utility has a bounded domain. There is no such issue when the domain is not right-bounded.
Note that it is not guaranteed for the existence of
n such that either
F dominates
G or
G dominates
F in some integral order in the conventional definition, even when the definition is extended to infinite-order [
30]. The necessary and sufficient condition for such existence in conventional SD was shown in [
30] that applies negative moments and the Bernstein’s theorem on the totally monotone utilities. Surprisingly, such
n exists for bounded discrete random variables in the sense of Fishburn’s SD. For simplicity, we write
to denote
F dominates
G in the
nth order using the definition of Fishburn’s SD. When the dominance is strict, we write
.
Theorem 1. For any two bounded discrete random variables , there exists an such that either or . Furthermore, if the two distributions are distinct, then there exists an such that if , or if , where λ is the smallest value such that .
The idea of the proof is that, if for some and some , we can keep accumulating the sum by increasing the order of summations. Eventually, we will reach for some since .