4.3. Implementation of Techniques
The standard procedure for fuel price forecasting using the MA method is illustrated in
Figure 4. The flowchart illustrates the procedure for time series forecasting using the MA method in Microsoft Excel. The forecasting process commenced by gathering historical data. The primary data utilized for forecasting fuel prices include coal, LNG and gas prices. The data is subsequently visualized during the pre-processing stage. Consequently, the data appears to be random or irregular in nature. MA was utilized as a data preparation technique due to its ability to decrease random variation in observations and reveal the more accurate structure of the underlying causal processes. The calculation involves determining the average of the most recent 12 data points. The smoothness of the trend cycle prediction is determined by the order of the moving average (MA). The subsequent phase involves identifying the seasonal component. The data is subsequently subjected to a process of di-seasonalized order to eliminate irregularities and seasonal patterns. This is achieved by dividing the actual data by the seasonal value. The application of simple linear regression is subsequently employed to identify the trend component, which is then utilized for forecasting the future price of fuel. When evaluating the model outputs that were acquired, the mean absolute percentage error (MAPE) for the time series model was employed as evaluation criteria.
Different to MA, the goal of the LSSVM algorithm is to locate a hyperplane that best separates or fits the data that has been provided for training. To learn the underlying patterns and relationships that exist between input variables and output variables (forecasted), LSSVM makes use of past data. After that, it builds a model that, based on the observed patterns in the training data, can be utilized to make accurate predictions regarding future values. The process of predicting using the LSSVM approach is illustrated in
Figure 4.
It is necessary to perform the necessary preprocessing on the historical data pertaining to fuel prices in order to train the LSSVM model. After it has been collected, the data must then undergo preprocessing, during which any outliers or inconsistencies must be resolved. In order to guarantee that the data is presented in a format that is suitable for the LSSVM method, the data must first be normalised. Data normalisation was conducted prior to training by employing Min Max Normalisation, a technique that normalises each feature component to a specific range [
26]. This ensures that larger input values do not overpower or overshadow the smaller ones. Following that step, the dataset is segmented into a training set and a testing set. The LSSVM model will be trained using the training set, and the testing set will be used to evaluate the model's performance in terms of predicting. The whole dataset of 132 sample points for coal, gas, and LNG is split into two sub-sets: (a) the training dataset, which includes 84 sample points necessary for training the model; and (b) the testing dataset, which includes 48 sample points to evaluate the performance of a model. There is a distinction in performance between the LSSVM when being tested and when being trained. Because of overtraining, LSSVM typically displays superior performance during the training phase as compared to the testing phase. Training for the LSSVM model is done with the help of historical data. The LSSVM method determines which hyperplane is the most accurate prediction based on the training data and then optimises that hyperplane. The strategy of least squares is used to solve a constrained optimisation issue, which enables this result to be obtained. After the model has been trained, the performance of the model is evaluated using MAPE [
27]. The LSSVM model that has been trained is given the input variables for the upcoming time periods, and the model then gives forecasts for the years 2023, 2024, and 2025.
On the other hand, the ARIMA model takes historical data and divides it into three components, autoregression (AR) which shows a changing variable regress on its own lagged, integrated (I), which represents linear or polynomial trends and Moving Average (MA), which represents dependency between and observation and residual error. As a result, the model has three model parameters: AR(p), I(d), and MA(q), which are all combined to form the ARIMA (p, d, q) model parameters: AR(p), I(d), and MA(q), where, p is autocorrelation order (lag), d is differencing order, q is moving average order.
The ARIMA model attempts to ensure that the future value of time series data has a practical connection with the current and historical values. ARIMA model building consists of four major steps, which are identification, estimation, diagnostic and forecasting [
28]. With the help of these four steps, the tentative model parameters are first identified using graphs of the auto-correlation function (ACF) and partial auto-correlation function (PACF) after which the coefficient is calculated, and the likely model is determined. The model must then be validated and to assess the forecast's veracity and monitor the model's performance, simple statistics were applied. The time series must be stationary before using ARIMA or any of its extensions. If a time series possesses all three of the following characteristics which are constant mean, variance, and covariance, it is said to be stationary. There are two methods to check for stationarity which are i) The autocorrelation to check the stationary and ii) The autocorrelation to check the stationarity.
Based on the analysis of
Figure 5,
Figure 6 and
Figure 7, it can be observed that the autocorrelation function (ACF) for coal, gas, and LNG exhibits a positive value with a consistent and gradual decrease over time. The lagging issues extend beyond the designated blue highlighted area. This suggests that there is a lack of zero correlation between observations at different lags for all fuel types. The presence of additional spikes beyond the blue range indicates that the series does not exhibit characteristics of 'White Noise'. An illustration of a stationary time series is the phenomenon known as 'White Noise'. Based on the analysis, it can be inferred, that the time series data for coal, LNG, and regulated gas exhibits non-stationarity.
Another method for assessing stationarity is the Augmented Dickey Fuller Test (ADF Test). This statistical test belongs to the subcategory of Unit Root tests. The analysis aims to determine the presence of non-stationarity and the existence of a unit root in the time series. The assumptions are a) Null hypothesis b) Ho is the time series is non-stationary and there is a unit root. And, Alternate Hypothesis H1 is the time series is stationary and there is no unit root.
The ADF assessment is carried out using the original dataset. The p-value from the ADF test is 0.951480 for coal and 0.267107 for gas, both of which are greater than 0.05. At 5% significance, the null hypothesis cannot be rejected. As a result, the test does not reject the null hypothesis. According to ADF, the series has a unit root and is non-stationary. The p-value for LNG is 0.012868, which is less than 0.05. As a result, the null hypothesis was rejected, and the series was determined to be stationary. The differential was done on non-stationary data, and the smoothness of the data was then assessed using the ADF test. The d-value in the ARIMA (p, d, q) model was chosen after passing the stationary test [
29]. The next step is to identify a suitable model. The p and q values of the ARIMA model must be determined. Using the auto ARIMA function to examine the range of the best model, the values of p and q were obtained by observing the sample's auto correlation function (ACF) and partial auto correlation function (PACF). For coal, LNG, and gas, the selected ARIMA models are (1,1,1), (1,0,1), and (2,0,1). An autoregressive order of 1 for coal and LNG, and an autoregressive order of 2 for gas, specify the value of the series one and two time periods in the past to be used to predict the current value. Predictions are made and compared with the testing data set to validate the model after fitting input data with a suitable ARIMA model.
Thus, the Imbalance Cost Pass-Through (ICPT) is a mechanism employed to transfer the expenses associated with fuel and other generation-related costs to consumers of electricity. The ICPT, or the Imbalance Cost Pass-Through, is assessed biannually to determine whether a rebate or a surcharge is applicable. This determination is based on a comparison between the actual fuel costs and the scheduled fuel costs. The ICPT is typically calculated on a semi-annual basis. The adjustment for each type of cost is divided into two components. The initial segment utilises the current data for the two most recent months that are available, along with estimated data for the subsequent four months. The second part corrects for differences between actual outcomes and the estimated data used to calculate the first adjustment. The process for calculating the ICPT for commercial customers involves the following steps:
- 1)
The initial step in calculating ICPT involves determining the interim fuel cost pass through adjustment for a six-month period (IFUCS) using Equation (6). The estimated and actual total fuel costs (Cm, Dm) were derived from the projected fuel cost using the ARIMA and LSSVM models. The data for Dm was not accessible due to limited resources. As a result, forecasted data was utilised instead. The estimated total qualifying sales (Fm) were acquired from websites of Single Buyers (SB). The audited total qualifying sales, to which the ICPT adjustment is applied, was obtained from the Grid System Operator (GSO) website. By using Equation (5), the average fuel cost for was calculated. The total forecasted fuel cost (FFULs) for six months is derived from the previously forecasted fuel cost. The weighted average cost of capital of Regulatory Period 3 (RP3) is set by the government at 7.3%. The forecasted total electricity sales in year 2021 as made at the time of setting the Base Average Tariff was obtained from SB.
- 2)
Then, using Equation (6), the first fuel cost pass-through adjustment (As) was determined.
- 3)
Next, Equation (7) was used to determine the interim other generation cost pass-through adjustment (IGSCs). System marginal pricing (SMP) at SB websites was used to determine the estimated and actual total other generation cost (Gm, Hm).
- 4)
To determine the average other generation cost (AGSCs) using Equation (8), the forecasted other generation cost is obtained by subtracting the generation margin (Gm) from the forecasted fuel and fuel-related costs (FFULs).
- 5)
Equation (7) is then used to compute the first other generation cost pass-through adjustment (Bs) in the six-month period.
- 6)
The next part involves calculating the secondary fuel and additional generation cost pass-through adjustment within the designated six-month timeframe. This can be achieved by utilising Equation (10)-(11).
- 7)
The remuneration rates for ICPT adjustment, specifically IARRs-1 and IARRs-2, are constantly set at 2.8738 and 2.86, respectively.
- 8)
The six-month generation cost adjustment was determined using Equation (3).
- 9)
Equation (12) was used to compute the fund contribution (FUNDs). The approved payment (FUNPm) from the Electricity Industry Fund (EIF) and the payment by the Single Buyer (FUNTm) into the EIF are fixed at MYR 1.6 billion and MYR 1.3 billion, respectively.
- 10)
The ICPT price was then determined using Equation (2).
Even though, the ICPT is a controversial mechanism, but it is an important part of the electricity market. The ICPT helps to ensure that consumers pay the true cost of electricity, and it can help to protect consumers from large fluctuations in fuel prices. Thus, the summary of the method process flow to find the forecast ICPT is demonstrated as in
Figure 8 congruently.