2. Energy Efficiency and Its Measurement
This Section provides basic concepts to enable a good understanding and interpretation of the computed outputs. There exists a vast literature on the energy efficiency definition and measurement (e.g., [
8,
9]. The reason may particularly reside in the complexity of the energy efficiency technical evaluation [
10]. Issues are the choice of the right aggregation level, the appropriate variables to construct a reference energy consumption trend, the energy units to be applied and interaction between various effects. Uncertainty margins for results lack in most presentations as well. Among a vast literature, we underscore some of these works which disserve a higher attention in the context of this paper. In particular, the authors [
11] present two approaches to measuring energy efficiency. The bottom-up approach to which the ‘Odex’ index is linked. It has been developed under the EU Odyssee-Mure programme. It will be developed in this article. The second approach is that of Top-down which brings together the ‘Decomposition’ methodologies as used by e.g. Netherlands, Canada and New Zealand. Likewise, the authors [
12] propose the calculation of total factor energy efficiency (TFEE) using the concept of global production instead of domestic product which excludes intermediate consumption. A next paper worthy to cite comes from the authors [
13] . In that work, a literature review was carried out and authors found that the currently in use definitions of total-factor energy efficiency and total factor carbon emissions efficiency are confusing and misleading enough.
Regarding institutional research, the Environmental and Energy Study Institute [
14] uses a definition of energy efficiency emphasizing using less energy to perform the same task, i.e. eliminating energy waste. The European Parliamentary Research Service (EPRS) definition [
15] highlights the fact that energy efficiency should refer, in general terms, to the amount of output that can be produced with a given energy input. Most commonly, energy efficiency is measured as the amount of energy output for a given energy input . However, other kinds of output can also be used. The EU Energy Efficiency Directive uses a very broad definition: ‘energy efficiency’ means the ratio of output of performance, service, goods or energy, to input of energy. Following the International Energy Agency [
16], the energy efficiency ratio is a ratio between energy consumption (measured in energy units) and activity data (measured in physical units).
Finally, the UNDP, as an organ of the United Nations, has published a summary work which takes up different approaches to calculating energy efficiency. In fact, this institution in the third chapter [
17] begins by identifying the methodological challenges associated with defining and measuring energy efficiency. It then proposes a framework for understanding energy efficiency trends, integrating the current UNDP approach to energy efficiency developed by various international agencies and national institutions, and establishing a methodology to identify a starting point in relation at which future improvements in energy efficiency can be measured globally and at national levels.
It is worthy to pay attention to the related three next concepts to be used in the next part of the paper. For an economy-wide measure, GDP is often compared to energy use, to give the energy intensity (measured for example in kilowatt-hours per euro). Next, energy savings are the reduction of energy use, without reference to output produced. Finally, as far as energy efficiency assessment is concerned, this can be done at different levels and according to different techniques. These levels may start from economy-wide and sectoral energy intensity to individual units of activity.
We now present some details regarding the ODEX energy efficiency index from which the data to be used in our model are extracted as a priori. This index is published by the Odyssee-Mure project [
18] to measure the energy efficiency progress by main sector (industry, transport, households, services) and for the whole economy (all final consumers). The ODEX composite indicator is calculated as a weighted average of sectoral indices.
For each sector, the index is calculated as a weighted average of sub-sectoral indices of energy efficiency progress. The sub-sectors stand for industrial branches, service sector branches, end-uses for households or transport modes.
This project calculates the scores by including the following energy efficiency components:
the energy efficiency level,
the energy efficiency trends,
the energy efficiency policies,
the overall energy efficiency.
These three criteria are scored between 0 and 1 on the basis of a variety of indicators (extracted from the Odyssee Database) and of energy policies (extracted from the Mure Database). The overall energy efficiency score is obtained as an average of the three scores obtained for “energy efficiency level”, “energy efficiency progress” and “energy efficiency policies” (i.e. one third weighting). This work will use data representing the overall energy efficiency. Following [18], the energy efficiency scoring technique is based on the OECD Composite Indicator methodology . This method allows the countries or regions to be compared in a relevant range where minimum and maximum values indicators define the best and worst scores and countries or regions are ranked between these two extrema. The indicators are calculated and normalized so that they range between 0 and 1 following this formula:
where:
Indicator: The indicator value of the country\region.
Min indicator: The minimum indicator value across all countries\regions.
Max indicator: The maximum indicator value across all countries\regions.
Direction: The favored direction in the level of indicator; -1 if the decline is favored, 1 if the incline is favored.
In spite of a “one third weighting”, the most influential score is the one of the energy efficiency level to which the two remaining scores should be related. Its scoring, according to Odyssee-Mure practice, is done as follows:
Scoring is done separately for the four considered sectors (households, transport, industry and services) and for all sectors together.
The score by sector is based on scores computed for statistically selected indicators of end-uses in buildings or modes in transport. For industry sector an aggregate score is obtained from various industrial branch scores that account for the energy efficiency characteristics of each of them.
The score by sector is calculated as a weighted score of each indicator. The weights correspond to the average shares over the last 3 years of each end-use or transport mode in the sector consumption.
Finally, for comparative reasons, sector score values are normalized into interval 0-1 according to to the next formula:
where:
Indicator: The indicator value of the sector
Min indicator: The minimum indicator value across all sectors.
Max indicator: The maximum indicator value across all sectors.
To close this Section, it is worthy to briefly comment on energy efficiency estimation in the presence of recent energy system known as integrated energy system (IES) .
In the context of the energy crisis and environmental degradation, an integrated energy system (IES) based on the complementarity of multiple energy sources and the cascading use of energy is considered an effective way to mitigate these problems. Due to the different forms of energy and the different characteristics of IESs, the interrelationships between different forms of energy are complicated, which increases the difficulty of assessing the energy efficiency of IESs. A limited number of technique exists. We can send the interested authors to the authors [
19]. These authors have proposed a technique mixing Energy Use Efficiency (EUE) and Exergy Efficiency (EXE) based on the first/second law of thermodynamics [
4].
In many real-world situations, theorists and empiricists observe at a given time two or more quantifiable multivariate stochastic systems and want to infer on an unknown cross-correlation between their random elements. To illustrate that, we implement a cross-entropy formalism to forecast an interprovincial sectoral energy efficiency score matrix based on imperfect and contradictory information from province or sector aggregates.
The basic model for
1 dealing with poorly posed inverse problems is to solve the integral equation of the first type. We formulate this – in the context of a model that will be developed later – as follows:
- -
G the amounts observed in rows or columns;
- -
f is the unknown regional cross-sectoral greenhouse contamination matrix;
- -
D defines the Hilbert support space of the model,
- -
h is the transformation kernel linking measures G and f,
- -
b explains random errors.
The literature on various methodologies devoted to the recovery of ill-posed inverse problem is expansive. In addition to the well-known Tychonoff regularization theory [
20], the Gibbs–Shannon–Jaynes principle of maximal (minimum) entropy, [
21,
22] and recent extensions, [
23,
24] remain the most commonly used techniques to solve this class of problems. The general rule that applies to both approaches is to associate the linear or nonlinear problem of the least squares with the regularization rule ( a
priori or additional information) to
get to a well-posed problem. Moreover, the Gibbs–Shannon–Jaynes principle of maximum (minimum) entropy formalism tries to search for global regularity — related to the second law of thermodynamics — while producing the smoothest reconstructions consistent with the data available in the Bayesian spirit. In this research, as it is often the case in many empirical applications, the discrete form of equation 1 was implemented.
As far as the social sciences are concerned, a number of other techniques have been tried for this class of inverse problems. Examples include the pseudo-inverse Moore–Penrose problem approach or the bi-proportional RAS approach and its extension [
25]. Although the latter technique requires an initial transaction matrix, it offers a less good solution when the model studied is s
tochastic. The authors [
24] showed poorer performance of the Markov chain model compared to the generalized Gibbs–Shannon entropy for this class of inverse problems. Subsequently, the Bayesian approach showed its relative superiority, particularly that associated with the principle of maximal entropy. A neural class model can also be proposed. However, it is not based on a compact theory, its application takes time and the results are not always guaranteed. The entropy model has been successfully applied to update and balance social accounting matrices [
25] . However, on theoretical grounds, this assumes that entropy is a positive linear function of the number of possible states and then ignores the possibility of interdependencies between states and their influence. In a recent paper, the authors [
26] demonstrated the convergence of two standard regularization techniques to two special power law (PL) related q Tsallis values. For q = 2, a Tychonoff regularization is obtained, and for q = 1, the classical formulation of the Boltzmann–Gibbs–Shannon entropy is obtained. The central point is that, in addition to the well-known law of scale, PL exhibits a series of interesting characteristics related to its aggregation properties, in that it is preserved under addition, multiplication, polynomial transformation, minimum and maximum [
27].
Since we are dealing with a poorly conditioned inverse problem, we must meet all three conditions of regularity (existence, uniqueness, and stability of the solution) at the same time.
If, in general, the conditions of existence
and uniqueness remain available due to regular a priori constraints, the
stability of the optimal solution, by random or systematic errors, is much more difficult to find. In short, the problem is, among an infinite number of distributions that meet all the restrictions imposed, to find the one that best replicates the data generation system (DGS). As for the formalism of maximum entropy, thanks to Jaynes' contribution [
5], the reasonable candidate should be the one which reduces uncertainty about the system the most. Similarly, according to Kullback-Leibler, information divergence metrics, [
21,
28] the best candidate should be those posteriors who meet all the binding conditions and deviate the least from the priors.
- a)
Non extensive cross-entropy energy model and confidence interval area
Kullback-Leibler information divergence metrics are best known when they apply to Gaussian phenomena. In the case of PL phenomena - of which the Gaussian law is a particular case, the divergence metric applied is the q-Generalized Kullback-Leibler relative entropy. This metric - in its essence the best fitted to nonlinear systems, works even in the case of scale invariance, long-memory correlation, among others. These properties give it practical advantages over many other laws(e.g., [
29,
30]). The model related to this metric has been extensively presented in different publications [e.g. [
30]] in the context of macroeconomic analysis. Since this work deals with energy management, it is worthy to reformulate the model in the energy management context to enable interpretation of the outputs.
We implement
2 the usual
3 discrete form of the q-Generalized Kullback-Leibler relative entropy (Eq. 5). With the cross-entropy new constraining data, the model updates the initial information(priors in tab.1) and provides new outputs(posteriors).
It is necessary to redefine the parameterization of the generalized linear model (equation 2), which plays the role of constraints. The inside table elements to be forecasted can be meaningfully presented by columns as the discrete Bayesian joint probabilities explaining each region’s average cross-sector weight or probabilities corresponding to individual energy efficiency ratio. The ratio total per column will sum up to unit. We recall that energy efficiency coefficients to be forecasted are in the form of normalized indicators, thus belonging within the space zero and one. In this case, the parameter processing space coincides with the probability space. Under these conditions, the accuracy of the estimated parameters is greater, since a priori there is no loss of information from these data [
30]. In any case, let us briefly present the general procedure of reparametrization in the case of a general linear inverse model:
where the values of unknown parameters
are not necessarily bounded between
and
indicating the need for reparameterization. This
term is an unobservable random term for perturbations, plausibly with finite variance exhibiting observation errors from empirical measurement or random shocks that may be driven by PL. The variable
consists of data observed – with errors – of an unknown data generating system of energy efficiency coefficients by sector and
may represent known average regional energy intensity coefficients – with uncertainty – through the relational parameter matrix
and the unobservable disturbance
to be estimated through the observable error components
. Unlike classical econometric models, no binding assumptions are required – for example, about the distribution of random errors. In particular, as we deal with an ill-behaved inverse problem, the number of parameters to be estimated should be greater than the observed data points, and the quality of the information data collected should be low. The process of the true system recovery requires the entropy objective function to include all of the interacting constraining consistency moments. Thus, referring to the properties of the relative entropy principle, each new piece of constraining information will reduce the entropy level of the system in accordance with the degree of data consistency with the system. For this multi- dimensional space inverse problem , among an unlimited number of model solution candidates, the best solution will result from identifying the one that—in terms of probability—best simulates the data generating system. By taking each of
as a discrete and random variable with a compact support (Golan et al., 1996) and
possible outcomes, it can be estimated using ,
i.e.:
where
is the probability of the outcome
, and the probabilities must be non-negative and added to one. Similarly, by treating each element
(which affects the total uncertainty of the sector efficiency) as a
finite, discrete random variable with a compact support and
possible results centred at zero, we can express
as:
As mentioned, it can be assumed that each previous entire row of errors has been evaluated,
and a similar support space should be constructed as follows:
where
and
are the outcome probabilities in the support spaces respectively.
. Therefore,
and
represent respectively the indexes of the number of row and column whose coefficient sum were estimated with errors. Moreover,
the term error
is empirically set around the empirical standard error of the stated variables and a priori represent the Bayesian hypothesis. The choice of error limits, of course, depends on their own properties. In this study, their sets were determined by Chebyshev's inequality [31] with the boundaries of the support space ranging from -3 to +3. Notice that in spite of this Gaussian property of the priors, posterior probabilities in the support space may represent a class of a non-Gaussian distribution, in particular a PL.
The element
constitutes an a priori information provided by the researcher while
is an unknown probability generating the true parameter
the value of which must be determined by solving a non-extensive cross-entropy econometrics problem. In matrix notation, let us rewrite
with
and
. Also, let
with
and
for K and L the number of rows and columns and
the number of data points over the support space for the error terms inside the regional cross-sector matrix. The same conditions of normality can be easily formulated for any vector of column sums. Next, the cross-entropy econometric estimator of the Tsallis entropy can be presented as follows:
: indicates each sum per column (values observed by energy sector , including unknown errors),
: each row total (observed values per province ) corrected for ,
: total energy intensity indicators by region affected by unknown errors;
: probabilistic structure of energy efficiency ratios by sector and region,
is a positive scaling factor related to Province climatic factor(average annual temperature by province) to match the totals of energy intensity and energy efficiency indicator levels,
is the arbitrary additional scaling factor representing other random variables but to
balance row and column sums
" ": refers to a variable bound to a row or column total, depending on the context.
Non-extensive statistics use a number of binding forms in which expectations can be set. The above model uses Curado-Tsallis (C-T) constraints [
32,
33], the general form of which is as follows:
The parameter
, as already mentioned, represents the Tsallis parameter . As
Table 1 suggests, the
remains a scaling factor related to province climatic factor contributing to match the totals of energy intensity and energy efficiency indicator levels that are known with uncertainty. Unlike the factor
whose role has just been expressed above, the factor
is the arbitrary additional scaling factor representing other random variables but
to complementarily contribute balancing row and column sums. This is because the
which explains climatic differences within provinces could not alone play the balancing role since other factors may exist in explaining the difference between energy intensity and energy efficiency. This point has been alluded to in the introduction Section when talking about the relationships between energy intensity and energy efficiency. Still the sum per column ( i.e.per sector) of the posterior probabilities is constrained to unity, given
. In the above model [equations 5 - 10], their values will play the role of new Bayesian data discriminating in favor of new inferential evidence. We must recall that when no new data are included in the cross-entropy model, the results correspond to those from formulating a maximum entropy principle, without additional conditions, except those of normality.
Above is nonlinear and measures entropy in the model. The relative entropy of the three independent terms (respectively the three posteriors ,, and corresponding priors ) are then added together using the weights . These are the real positives that amount to unity within the mentioned constraints. The first term known as the one of "parameter precision" takes into account the discrepancies between the estimated parameters and the prior parameters (usually defined in the support space). The second and third terms "ex-post predictions" include the empirical error term as the difference between the predicted and observed data values(see the last row and column in tab.1) in the model. The first component of the criterion function can relate to the structure of table parameters, the second component to errors in row totals, and the last component to errors in column totals.
We must find a minimum discrepancy between priors and posteriors, while respecting the restrictions imposed by the system.
It should be noted that the estimates of the model and their variances should be influenced not only by the length of the support space, but also by the spatial scale effect, i.e. the number of affected point values [24]. The greater the number of these points, the better the prior information- i.e. nonlinear starting points, about the system.
Second,
random errors (see Equation 7) explain errors in data collection and processing and are not necessarily related to the Gaussian distribution being itself a particular case of a PL. Traditionally, when it comes to Bayesian formulations and relative entropy, it is worth noting that both models will lead to similar results if and only if the real expected error associated with the data generation system is zero in the symmetric support space around zero(see e.g. [30]). Similarly, the results of Gibbs-Shannon cross-entropy and Tsallis' non-extended cross-entropy will match when errors included in the model are not correlated, and the system distribution will then evolve towards the Gaussian attractor e.g., [
34,
35,
36].
With respect to the confidence interval of the parameters, equation 11 shows the non-additivity of the Tsallis entropy for two - probable - independent systems, one related to the probability distribution of the parameters and the other to the probability distribution of error perturbations:
is then the sum of the normalized entropy associated with the parameters of the model and the term of perturbation . Similarly, the last value is obtained for all observations, with the number of data points above the support of the estimated probabilities related to the time length of errors.
The values of these normalized entropy indices range from zero to one. The values, close to one, indicate a weak information variable, while lower values indicate a parameter that is estimated to be more informative in the model.
We will illustrate the empirical basis of the theoretical model developed in the above sections by applying Polish energy efficiency to the case.
Table 1 - whose row and column symbols correspond to equation Eq.6 - is presented to illustrate the extent to which a researcher can have limited information about quantity and quality before solving an inverse and ill-posed inverse problem such as the one in this article.
We have two aggregated energy indicators each from the Polish energy demand sectors per province. The problem in hands stands as [21] a discrete problem in a multidimensional space, leading to (
-1)x(
-1) degree of freedom that would illustrate the case of the standard inverse problem. In this problem, as illustrated in the
Table 1 [
37,
38] we have 3x15=45 degrees of freedom related to the number of energy efficiency indicators per sector and province. Moreover , as alluded to before, the row total and the column total are different as probably the result of two separate data sources with different nature and scale.
As the above pages made us aware of, this kind of problem corresponds the best to the philosophy naturally contained in the principle of maximum entropy that we have implemented to forecast the energy efficiency ratios displayed in
Table 2.
Let us now comment on the model outputs presenting the final model solution about the cross “energy efficiency ratios “ (column 5 of
Table 2) by sector and province. As presented in the theoretical model(equations 5-10), in addition to the classical normality conditions and moment consistency, the earlier explained random factors
and
allowed the system for balancing. The first factor is related to various variables differentiating energy intensity from energy efficiency except the climatical factor represented by
As already said, these calculated ratios in the above table are not deterministic since the priors are not known with certainty. Therefore, we treat them as random variables, from which the posteriors result from the optimization of the model. It is worthy to recall that the presented energy efficiency scores stand for the overall energy efficiency score, i.e. a combination of the three already presented components: the energy efficiency level, the energy efficiency progress (i.e. energy efficiency trends) and the energy efficiency policies. The not presented in this paper prior matrix of energy efficiency ratio has been initiated on the basis of knowledge of province energy intensity index and sector energy efficiency ratio averages. Next, using proportions of energy intensity index(last column of tab.1) of different provinces, we computed the initial energy efficiency ratio by sector, so as to sum up to the total of energy efficiency ratios-known with error, available from the last row of the same
Table 1. In probabilistic terms, we have assumed the energy efficiency ratio to be a uniform distribution across provinces, thus the last column of energy intensity as a marginal probability. The assumption behind this procedure is that the province with lower or higher average energy intensity index will have lower or higher average energy efficiency ratio respectively, irrespective of the considered sector. Doing so, we enabled the nonlinear mathematical system to start the search of the model global optimal solution from the best starting points leading to a quick convergence. The post-entropy posterior ratios in
Table 2 are normalised and the higher the value the lower the energy efficiency level for a given sector and\or province. The obtained forecasts are empirically close to real world expectations of energy efficiency ranking within polish provinces. We notice that in the industry sector the provinces Mazowieckie (including Warsaw) and Wielkopolski (including Poznan) have the lowest energy efficiency ratios, respectively around 0,320 and 0,333. The highest ratio is shown in the provinces Swietokrzyskie and Opole with ratios around to 0,445 and 0,482 respectively. Globally, we notice that Mazowieckie displays the highest efficiency in all sectors while Opole displays the lowest efficiency. We notice too that industry sector globally remains the most efficient among all sectors.
Table 3 presents the level at which the model has discriminated the prior in favor of the post cross-entropy solution given the new model consistency moments and normality conditions. Consequently, we notice in that table the highest post cross-entropy discriminating values through all sectors in the case of Opole in comparison with the remaining sectors. It is worthy to recall that, as described in the precedent paragraphs, the prior values presented the same structure for different sectors. The cross-entropy formalism has discounted non valuable information to just retain information fitting to corresponding sectors and province given initial information on the energy intensity indicator and the climatic factor.
Following the formulation of the equation 11, the confidence interval of the model is a normalized values ranging between zero to one. The values, close to one, indicate weak informative variables, while lower values indicate a model whose estimates are more informative. In the present case, the equivalent to the classical coefficient of determination is equal to 0.011, then much closer to zero than unit. Then, the proposed model variables have enough discriminated far from the priors, given all constraints encountering the energy efficiency system defined in this paper.
The objective of this article was to carry out a forecast of energy efficiency ratios in conditions where statistical data are not only missing but also from contradictory sources. In empirical research, this case is rather a rule than an exception. In most situations, this will lead to a nonlinear ill-posed inverse problem for which traditional statistics fail to find an analytical solution unless commodity additional hypotheses are made as is often the case in empirical econometrics.
In the present paper, we have analysed this problem at the level of the Polish 16 regions for four sectors of energy consumption. We started with sparse information concerning the annual sectoral energy efficiency ratio averages aggregated at the national level. We disposed as well as the annual averages of energy intensity aggregated at the provincial level. The problem therefore consisted of bringing together these two aggregates of different nature to produce reliable information on interprovincial sectoral energy efficiency coefficients.
The forecasts obtained on these ratios seem to be in line with the expectations of those who know realities behind the Polish energy sector.
This work more or less described in details the generalized cross-entropy technique and illustrated its applicability in the field of energy experimental statistics. Furthermore, thanks to this approach, this problem could also be solved in the case of further disaggregation, for example to the NUTS 3 level. This is of capital importance because it is at the more disaggregated levels that economic actors act.
In this work, the explanatory variable used was the climatic factor in the form of average annual temperature by province. Depending on the countries studied, this variable or few other variables presented in this paper, like the structure and nature of industry in different regions, could much better influence the difference between energy intensity and energy efficiency. It would be interesting to simulate future values of these efficiency ratios by linking energy intensity with these influential variables. Comparing such an information on energy efficiency across sectors and regions will be of significant benefit to target gray areas and redefine more appropriate energy policies fitting to social values inside a disaggregated region. In particular, the outputs of the model could allow public energy institutions for monitoring the relative progress of each region or province in their endeavour to optimize the energy use given their own specifical conditions. Finally, an extensive forecast at European or larger scale of these indicators through experimental official statistics could allow for a better management of energy saving as targeted by the European or worldwide climate change mitigation institutions or projects.