1. Introduction
Advancement in renewable energy technology has been on a rapid ascending trend in recent years. This has encouraged wide acceptance of the technology and thus, the subsequent boom in the installation of renewable energy-based power plants around the world especially in China, India, Europe and America. Solar photovoltaic (PV) is one of the leading renewable energy technologies on exponential rise. Due in part to the growing concern over oil depletion, environmental issues, fuel price dependence, and operational complexity associated with the production of fossil fuels, an increasing number of residential, commercial, and industrial consumers have adopted and are adopting solar PV as their source of power generation [
1]. The National Renewable Energy Lab (NREL) reported that the PV capacity installed globally in 2021 was 172GW
dc bringing the global cumulative capacity to 939GW
dc [
2]. NREL further stated that China, India and Germany significantly increased their PV installations by 106%, 51% and 22% respectively, in the first 9 months of 2022 [
3]. This remarkable rise in the significance of solar PV in the global energy sector is also reflected in [
2], where it is reported an increased annual global PV installations especially in the aforementioned countries was projected by analysts.
However there are is a downside to the technology. Solar PV systems require ongoing maintenance in order to function efficiently over time because they have been known to lose efficiency and productivity if not properly and appropriately managed and maintained [
1]. That is to say, in order for solar PV systems to operate properly over time, they need to undergo routine maintenance, which calls for the adoption of mechanisms to efficiently monitor and control these systems. Several operation and maintenance conventional methods which have helped to sustain efficient operation of PV systems have been introduced over the years. These methods however have not been able to totally prevent system failure and operation downtime which begs the need for more intelligent methods of fault detection and the subsequent adoption of AI-based methods. These techniques make use of machine learning to develop models that can quickly find various issues, track the overall health of PV systems, and assist maintenance engineers in hastening system recovery [
4].
There are so many available literatures that have carried out experiments and researches on the use of AI in fault detection and diagnosis for PV systems. In [
5], a straightforward and efficient monitoring technique for PV systems is given. It is based on parametric models and the double exponential smoothing scheme. In order to find minor deviations, the simplicity and adaptability of empirical models are combined with the sensitivity of a double exponential smoothing method. By analyzing the resulting residuals, the double exponential smoothing approach detects flaws, and its sensitivity is increased by creating a nonparametric detection threshold using kernel density estimation. Partial shading, inverter disconnections, PV string faults, soiling on PV arrays, and short circuits in PV modules are just a few of the defects that can be found utilizing the suggested method. The research's findings, demonstrated that the suggested method can be used to monitor PV system operating parameters in real-time but may not be suitable for spotting abnormalities at various scales because it was designed for one scale, namely the time scale. By implementing grid partition (GP) and subtractive clustering (SC) algorithms utilizing research data, the approach suggested in [
6] trains the adaptive neuro-fuzzy inference system (ANFIS) model for a reliable PV defect detection and classification system. Afterwards, in order to identify PV system problems, the trained models ANFIS GP and ANFIS SC were used. The resulting data were compared using statistical analysis. It was discovered that in terms of precisely identifying fault states, the ANFIS SC technique outperformed the ANFIS GP technique. The proposed method may not be appropriate for the identification of faults in various PV systems where the environmental factors are beyond the specified model range, according to the authors, who also emphasized that the method solely considers electrical defects. Using thermographic pictures, a fault detection technique is provided in [
7] that categorizes various PV module anomalies. A multi-scale convolutional neural network (CNN) with three branches is used in the technique, which is based on the transfer learning approach. The transferred network's pre-trained information is used in the convolutional branches, which also have multi-scale kernels with levels of visual perception, to enhance the network's capacity for representation. By combining an oversampling strategy with an offline augmentation method, the study was able to improve network performance while overcoming the unbalanced class distribution of the raw data.
The suggested method was used in the experiment to identify a variety of fault kinds, and the study came to the conclusion that it performs better than other deep learning methods and studies now available and provides higher classification accuracy and robustness in PV panel defects. Thermographic images are used in yet another technique described in [
8] to identify flaws in PV systems. Here, deep convolutional neural networks (DCNNs) and infrared thermographic images are used together to detect and diagnose faults. The method involves first creating a binary classifier to identify faults in PV modules, then creating a multiclass classifier to determine what kind of flaws are there. The study takes into account four typical PV module faults: short-circuiting, partial shading effects, dust deposition on PV module surfaces, and bypass diode failure. The proposed DCNN-based classifiers have been first optimized, then embedded into a low-cost microprocessor (Rasberry Pi 4), and the models are compared with three main TF-Lite optimization strategies, including simple conversion, dynamic range quantization, and float 16 quantization. The study came to the conclusion that the proposed technique can operate in real-time and can diagnose and detect anomalies with a level of accuracy that is acceptable based on the experimental findings that were obtained. Additionally, the technique is set up to send email and SMS using a GSM module to operators informing them of the status of the PV array. The authors of Study [
9], which also presents a thermographic image-based method, noted that it is crucial to quickly and affordably maintain the proper functioning of PV systems without interrupting regular operations by identifying PV module overheating through thermographic non-destructive testing. The paper then suggested a technique for convolutional neural networks that was created using open-source libraries to automatically classify thermographic images. To lower image noise, a number of preprocessing techniques were tested, including grey-scaling, thresholding discrete wavelet transform, normalizing and homogenizing pixels, normalizing and homogenizing pixels, Sobel Feldman and box blur filtering, and thresholding discrete wavelet transform. Without following any set protocols, these techniques enable the classification of thermographic images of varying quality that are taken using various pieces of equipment. The performance of neural networks was evaluated using the suggested method through a number of experiments using various parameters and overfitting mitigation techniques. In order to assess network performance and the amount of time needed to complete the thermographic inspection, images obtained by unmanned aerial vehicles and ground-based operators were compared. The foundation of the proposed method is a tool built on convolutional neural networks that enables rapid and accurate failure detection in PV panels.
According to the authors, the proposed methodology provides an alternative and a reliable tool that enhances the resolution of picture classification for issues involving remote failure detection and can be applied in any field of science. Study [
10] provides a summary of IoT and AI applications for PV systems. The most cutting-edge algorithms, including machine and deep learning, are also discussed in the paper, along with their implementation costs, accuracy, complexity, software appropriateness, and viability for real-time applications. For PV facilities located in remote locations with expensive and difficult accessibility for maintenance, the integration of AI and IoT approaches for defect detection and diagnosis into basic hardware such as inexpensive chips, may be economically and technically possible. These strategies were also provided together with challenging problems, advice, and trends. In [
11], a study examining the use of ANN in various areas of partially shaded PV systems is provided. It provided an overview of and covered the use of ANNs in MPPT, fault detection, fault mitigation, system modeling, and performance enhancement of solar PV systems exposed to partial shading. The study did not just examine the literature; it also showed how the approaches may be enhanced and applied in real-world settings. The study described in [
12] assesses various ML and ensemble learning (EL) algorithms for fault diagnosis of PV arrays, including previously untested methods for faults with numerous faults and faults with comparable I-V curves. The study created a novel method to accurately identify and classify defects based on this evaluation. According to the authors' findings, the results are positive. The study went further to demonstrate when ML & EL methods ought to be applied in practice and provided some recommendations, difficulties, and potential future directions in this area. By suggesting an early degradation detection that affects glass, EVA, wiring, etc., the work described in [
13] aims to lower operation costs of PV modules. In the suggested approach, automated self-evaluation of PV panels is created, and degradation models are integrated as software into a microcontroller that uses instantly measured parameters. In the study, it is also discussed the deterioration phenomena of each PV module's component. Modeling each recognized degradation using P-V characteristics is the basis for the Observing Degradation System (ODS) program, which is then presented. A checklist is then created for successful testing. Study [
14] presents an ensemble-based deep neural network (DNN) model for the autonomous detection of visual faults on various PV modules, including glass breakage, burn marks, snail trails, discoloration, and delamination. This method for detecting degradation faults is similar to the one described earlier. A RGB camera placed on an unmanned aerial vehicle is used in the procedure to capture the image dataset (UAV). Images are preprocessed by removing spatial and frequency domain characteristics from them, such as discrete wavelet transform, texture grey level co-occurrence matrix, rapid Fourier transform, and different grey level approach. Following that, the edited photos are input. To identify any visual defects on the PV modules, the proposed ensemble-based DNN model uses DNN. In order to assess the performance of the suggested model, the classification accuracy, receiver operating characteristic curve, and confusion matrix are utilized. The results revealed that the proposed model, coupled with random forest classifier, achieved a high classification accuracy.
Similarly, in [
15], the authors described a fuzzy diagnostic algorithm that relies on the classification of electrical characteristics, the values of which are taken from experimental measurements of crystalline modules' I-V curves. By using the suggested method, flaws like uniform dust, partial shading, and potential induced degradation can be found. Also, a brand-new approach is suggested for the detection of aberrations in the measured I-V curve brought about by bypasses that are activated as a result of partial shading. This is based on quadratic and cubic polynomial regression, whose concavities are very sensitive to noisy data. An approach for identifying, diagnosing, and categorizing short-circuit and open-circuit string errors that is based on deep learning is provided [
16]. There are four steps in the suggested technique. First, a PSIM-based simulation that seeks to accurately represent the functioning PV system using a heuristic optimization approach based on the Coyote Optimization Algorithm (COA) is used to input five unknown electrical characteristics of one diode model. The second phase involves creating a database with information on current, voltage, and power at MMP, module temperature, and solar irradiation for the PV system under both ideal and unsatisfactory working circumstances. In the third stage, new features from the old database are extracted using the unsupervised learning capabilities of the auto-encoder, and in the final step, PV defect detection and classification are accomplished using supervised learning on the new database based on ANN construction. The obtained results show how well the suggested strategy works with the aforementioned fault kinds. Study [
17] investigates the effects of various physical faults and cyberattacks in order to develop an intelligent fault/attacks detection and diagnosis system. They find that by being able to quickly identify and diagnose fault/attacks, local controllers and energy management systems can accommodate or lessen the negative effects of physical faults and cyberattacks in microgrids. The study then presented an intelligent hybrid diagnosis method for data online monitoring and diagnosing to reflect the real-time state of PV system running at microgrid level. The proposed method is based on a fuzzy inference system, a power spectrum estimator, and an adaptive neuro-fuzzy inference system. A realistic microgrid benchmark model with a range of operating conditions and dynamic electrical loads in the presence of potential microgrid disturbances is used to show the high level of efficiency of the proposed method under various types of fault/attack scenarios. In [
18], a fault detection and diagnosis (FDD) scheme design is presented that employs Wasserstein generative adversarial network (WGAN) and convolutional neural network for automatic fault feature extraction from raw electrical data of PV array, resulting in the creation of an effective FDD model with little data. A classifier, a generator, and a discriminator make up the three modules that make up the FDD model. In order to enhance the effectiveness of the CNN-based classifier, the discriminator and generator analyze sequential PV data in a two-dimensional manner to learn the distribution of PV data under different PV system operations. Then they are utilized to generate additional labelled data samples. According to the paper, the suggested FDD model could be trained with only a little amount of labeled data, and the effectiveness of the model was assessed using a lab grid-connected PV setup.
A diagnosis of line-line and open-circuit faults using the suggested method was demonstrated by the results. An implementation of a fault detection strategy based on the identification of PV systems' neuro-fuzzy models is presented in the work in [
19]. Modeling and identification of systems and the detection of operational states are the two stages of the technique. The derived neuro-fuzzy model approximates, with only a few minor differences, the properties and behavior of a genuine system. The suggested model has a very high level of accuracy and could identify errors very quickly. Three shortcomings in machine learning-based defect detection techniques were listed in Study [
20], which is what it seeks to fix. The inability of shallow network structures to effectively learn nonlinear characteristics of I-V curves is one of them. The others are that feature extraction relies on expert experience and lacks automation; artificial feature extraction readily ignores some potentially useful features; and feature extraction is not automated. As a result, the study suggested a methodology based on layered automated encoder and clustering algorithm that can automatically extract features and employ a limited amount of labeled data samples to mine data sample characteristics for defect diagnosis. Three steps make up the technique's execution. To enhance the effectiveness of the clustering approach, the effective features are first automatically retrieved from the I-V curves by the stacking encoder, and then the dimension of the features is decreased and visualized by the t-distributed stochastic neighbor embedding. Eventually, the clustering method produces clustering centers and clusters, and the membership function is utilized to diagnose faults. Eventually, the clustering method produces clustering centers and clusters, and the membership function is utilized to diagnose faults. To address the issue of fault detection of PV modules using thermographic images, a Convolutional Neural Network (CNN) model and a fine-tuned model based on Visual Geometry Group (VGG-16) have been investigated in [
21]. Binary classification and multiclass classification were employed to determine the type of fault in order to detect it. The database utilized in the study was made up of an unbalanced class distribution of thermographic images taken by infrared cameras of PV modules both in good and bad condition (such as bypass diode failure, partially covered PV module, shading effect, short-circuit and dust deposit on the PV surface). The fine-tuned model performs very well in experimental tests, but the small Deep Convolutional Neural Network (small-DCNN) model performs somewhat less well.
This study presents a novel method of fault detection in PV arrays and inverter faults by utilizing Elman Neural Network (ENN), Boosted Tree Algorithms (BTA), Multi-later Perceptron (MLP) and Gaussian Processes Regression (GPR) models to estimate the DC Power (DCP) and AC Power (ACP) of a PV system setup. Different models have different strengths and weaknesses, and what works well for one problem may not work as well for another. Therefore, it is important to experiment with different models and choose the one that is best suited for the particular task at hand [
22]. As such, we develop several model combination based on the influencing factors (see
Figure 2), existing claimed of dominancy in the literature [
23,
24,
25,
26]. Furthermore, it is important to evaluate the performance of the chosen model carefully, using appropriate metrics and validation techniques, to ensure that it is indeed performing well on the given problem. As such the study employs several performance criteria to assess the accuracy of the models. Prior to model development, data pre-processing including normalization, model validation and stationarity analysis, is carried out. Also, both standalone and hybrid models will be compared using Nash–Sutcliffe efficiency (NSE), Pearson correlation coefficient (PCC), mean absolute percentage error (MAPE), mean absolute error (MAE), root mean square error (RMSE), and Percent bias (PBAIS) to understand the strength of each model combination.
3. Proposed Intelligent Methods
According to the “no free lunch" theorem, there is no single model that is universally better than all other models for every type of problem. In other words, there is no one-size-fits-all model that can provide optimal results across all possible scenarios [
27]. In this work we proposed several AI learning based on three different scenarios (cropland, pasture, and cropland and pasture) to estimate the DCP and ACP based on solar panel and inverter, respectively. For this purpose, ENN, BTA, MLP and GPR models are utilized. The proposed modeling schematics is presented in
Figure 1. It is essential to consider the specific characteristics of the problem at hand, such as the nature of the data, the size of the dataset, and the goals of the analysis. Different models have different strengths and weaknesses, and what works well for one problem may not work as well for another. Therefore, it is important to experiment with different models and choose the one that is best suited for the particular task at hand [
22]. As such we developed several model combinations based on the influencing factors (see
Figure 2), existing claimed of dominancy in the literature [
23,
24,
2526] as follows:
Where, DCP is DC Power, ACP is AC Power, DY is Daily Yield, AT is Ambient Temperature, MT is Module Temperature, and SR is Solar Radiation.
Furthermore, it is important to evaluate the performance of the chosen model carefully, using appropriate metrics and validation techniques, to ensure that it is indeed performing well on the given problem. As such this study employed several performance criteria to assess the accuracy of the models. Prior to model development, several pre-processing including normalization (Equation (3)), model validation and stationarity analysis, were carried out. In this study, both the standalone and hybrid models were compared using NSE (Nash–Sutcliffe efficiency), PCC (Pearson correlation coefficient), MAPE (mean absolute percentage error), MAE (mean absolute error), RMSE (root mean square error), and PBAIS to understand the strength of each model combination.
Where
y denotes normalized data,
x is the actual data,
is the mean of the measured data,
xmax denotes the maximum value of the measured data, and
xmin denotes the minimum value.
Where by;, are considered as the TP loss rate predicted, observed and average values respectively.
3.1. Elman Neural Network (ENN)
As recurrent computational learning, ENN is a type of machine learning technique that uses feedback connections to retain information about previous inputs. It comprises of different three layers as presented in
Figure 2a similar to traditional neural network with the hidden layer having additional connections to itself from the previous time step [
28]. This allows the network to capture sequential information and make predictions based on past inputs. The network is trained using backpropagation through time, where the error is propagated through the network and the weights are updated accordingly [
29]. The ENN has been used in numerous purposes, both in science and engineering problems [
30,
31].
3.2. Boosted Tree Algorithms (BTA)
BTA are machine learning techniques that combine multiple decision trees to create a powerful predictive model (
Figure 2b). The term "boosting" refers to the process of iteratively improving the performance of the model by focusing on the misclassified instances in each iteration. Boosted trees can be applied to both supervised and unsupervised learning problems [
32]. The most popular boosted tree algorithms are Gradient Boosted Trees (GBTs) and Extreme Gradient Boosting (XGBoost). These algorithms work by sequentially adding trees to the model, with each subsequent tree trying to correct the errors made by the previous ones. The projection is obtained by aggregating the predictions of all the trees [
33]. GBTs and XGBoost have several hyperparameters that can be tuned to optimize performance, including the learning rate, number of trees, tree depth, and regularization. These algorithms have been successfully applied in various domains, including finance, e-commerce, and healthcare. They are known for their high accuracy and interpretability, as they provide information on the importance of each feature in the prediction [
32,
34].
3.3. Multi-later Perceptron (MLP)
As another version of ANN, MLP model consists of multiple layers of interconnected perceptron units. MLP plays the role of FFNN in terms of information processing through the three layers (input, hidden, and outputs) [
35] (
Figure 2c). Each layer contains one or more nodes (perceptrons) that receive input from the previous layer and produce an output that is transmitted to the next layer [
36]. The perceptrons in the hidden layers use activation functions to transform the inputs, allowing the network to learn complex mapping in the data. The MLP is trained using a supervised learning approach called backpropagation, hence works based on the principle of reducing the error between the estimated and observed data [
37].
3.4. Gaussian Processes Regression (GPR)
GPR model has been used to solve different problems related to regression analysis in both science and engineering [
38]. It uses Bayesian inference to make predictions and estimate uncertainties in the predictions. In contrast to other regression models, GPs do not make assumptions about the underlying function that generates the data, making them more flexible and suitable for modeling complex, nonlinear relationships. The core idea behind GPR is to assume that the function values at any set of input points are jointly Gaussian distributed [
39]. A GP is defined by a mean function and a covariance function, also called the kernel function. The mean function specifies the expected value of the function at any input point, while the kernel function captures the similarity between different input points [
40,
41].
Figure 2.
Schematic diagram of (a) ENN (b) BTA (c) MLP used in modelling process.
Figure 2.
Schematic diagram of (a) ENN (b) BTA (c) MLP used in modelling process.