A Multi-feature Fusion Method for Life Prediction of Automotive Proton Exchange Membrane Fuel Cell Based on TCN-GRU

Jiaming Zhang; Fuwu Yan; Changqing Du; Yiming Zhang; Chao Zheng; Jinhai Wang

doi:10.20944/preprints202406.1368.v1

Submitted:

19 June 2024

Posted:

20 June 2024

You are already at the latest version

Abstract

Proton Exchange Membrane Fuel Cell (PEMFC) is a fast-developing battery technology, and the key to its reliability and lifespan improvement lies in the accurate assessment of durability. However, the degradation mechanism of PEMFC is hard to determine and its internal parameters are highly coupled. Thus, the development of the more accurate life prediction model that meets the actual scenarios is urgent to investigate. To solve this problem, a multi-feature fusion life prediction method based on Temporal Convolutional Network-Gated Recurrent Unit (TCN-GRU) is proposed. A TCN algorithm is used as the prediction base model, and two GRU modules are included to the model to strengthen the model expression ability and improve its predictive accuracy. Two widely recognized datasets and two operating conditions are utilized for model training and prediction, respectively. Comparisons are made with single-feature parameter models in terms of Root Mean Square Error (RMSE) and Determination Coefficient (R2). The results show that the prediction accuracy of the TCN-GRU multi-feature fusion model is higher than that of the single-feature models in terms of stability and anti-interference under both operating conditions. Furthermore, with the increase of input feature parameters, the TCN-GRU model is closer to the real value, which proves once again that the proposed model can meet the accuracy requirements of the life prediction of PEMFC.

Keywords:

PEMFC

;

Life prediction

;

Multi-feature fusion method

;

TCN-GRU

Subject:

Engineering - Automotive Engineering

1. Introduction

In recent years, with the aggravation of environmental problems and energy crisis, more and more countries realize the importance of clean energy in future energy development, and start to research and develop infrastructures and energy storage devices around hydrogen energy [1-3]. Proton Exchange Membrane Fuel Cell (PEMFC) is a typical hydrogen fuel cell with high power density, fast startup, long sustainable operation time, zero pollution and emission, which is the mainstream hydrogen energy power device [4,5]. Therefore, it has a wide application prospect in the power field of automobile, power locomotive, and ship [6].

However, the cost and life of PEMFCs are two important factors limiting their large-scale commercial application [7], and accurate assessment of durability is key to improving reliability and extending lifespan. Therefore, many scholars have devoted to exploring the life prediction methods for PEMFCs. Nowadays, there are three main methods: model-based method, data-driven method and hybrid prediction method [8].

The model-based prediction method mainly focuses on modeling from the fuel cell degradation mechanism, which requires less data and has high accuracy. Ou et al. [9] proposed a PEMFC prediction method based on a semi-empirical model to realize the prediction of PEMFC degradation and estimation of its remaining service life under automotive environmental conditions by introducing electrochemical surface area and equivalent resistance degradation models, respectively. Lechartier et al. [10] presented a combined static and dynamic prediction model and verified the accuracy of the model with experimental data. Ao et al. [11] used a life prediction method based on a frequency-domain Kalman filter (FDKF) and a voltage degradation model. By processing the data in groups, the computation time can be greatly reduced with high accuracy. However, the model-based method requires an in-depth understanding of the aging mechanism of the stack and strong modeling ability. Meanwhile, the internal structure and materials of different stacks are not the same, and the aging mechanism is somewhat different, so some key parameters in the modeling cannot be defined directly, which makes it difficult to establish a complete and accurate mechanism model.

The data-based prediction methods are mainly used to build a system behavior model on PEMFC historical operation data directly for fault diagnosis and life prediction. These methods do not require in-depth analysis of the internal reaction mechanism of the fuel cell, but rather comprehensive analysis of a large amount of historical operation data using a variety of methods, such as statistical modeling, machine learning, deep learning, and hybrid learning [12]. Wu et al. [13] improved on the Relevance Vector Machine (RVM) and introduced the Support Vector Machine (SVM) algorithm for comparison, and the results proved that the improved RVM statistical model can form an adaptive to the prediction process due to the absence of limitations in the kernel function, and the prediction performance is better. Liu et al. [14] compared several life prediction methods based on different structural neural networks, and the results showed that the Adaptive Neuro-Fuzzy Inference System- Fuzzy C-Means (ANFIS-FCM) has the best short-term prediction performance, while the introduction of the Particle Swarm Optimization (PSO) algorithm realizes the automatic adjustment of parameters of ANFIS-FCM. Mezzi et al. [15] designed an echo state network based variable load prediction method for fuel cells that can be predicted without prior knowledge of the variable load profile. Zuo et al. [16] combined the attention mechanism and GRU for PEMFC prediction, and the results showed that the model has high prediction accuracy on both dynamic and pseudo-stable datasets. However, the data-based prediction method still has the problem of "black box" for the degradation mechanism and state changes of PEMFC, and there are some limitations in relying on the prediction results to decide the follow-up maintenance measures.

The hybrid prediction methods combine different life prediction methods to improve prediction accuracy by improving individual prediction method weaknesses. There are mainly model-data hybrid driven methods and data-data hybrid driven methods [17]. The model-data hybrid driven method retains the interpretability of the parameters in the model-based method, but also makes full use of the advantages of the two models by blurring them with the help of the data-based method when the mechanism process is not clear. Pan et al. [18] combined model-based AEKF and data-driven NARX with external inputs for predicting PEMFC performance degradation and validated the predictive power of the method using two different datasets. Liu et al. [19] proposed a 2-Stage hybrid prediction method: Stage 1 used an automated machine learning algorithm based on an evolutionary algorithm and an adaptive neuro-fuzzy inference system to achieve voltage degradation prediction in a long time series state. Stage 2 utilized a semi-empirical degradation model based on the predicted data to estimate the remaining lifetime of the battery. The data-data hybrid driven method can fully utilize the different dimensional information of the algorithm for prediction, thereby improving the prediction accuracy and robustness. Zhu et al. [20] combined Bayesian theory and self-attention mechanism to propose a B-GRU hybrid model, which can quantify the uncertainty parameters in the prediction process, and experimentally proved that the hybrid algorithm’s computational accuracy is higher than that of the commonly used neural networks when the training data is less than 380h.

The degradation of PEMFC internal is often not accurately characterized by single feature with poor robustness, while the single feature may change due to the performance degradation of the stack, which in turn affects the output characteristics of the stack. And due to the strong coupling of the internal parameters of the PEMFC, it is often difficult to determine the strong correlation parameters using multi-features to predict the degradation trend, which leads to long prediction time and low prediction accuracy. Therefore, this paper proposes a multi-feature fusion method for life prediction of automotive PEMFCs based on TCN-GRU. The correlation of feature parameters in PEMFC experiments is analyzed, and the strength of the correlation between other feature parameters and voltage is projected, so that multiple strongly correlated features are filtered to participate in the fusion life prediction. The TCN-GRU dual-feature and three-feature fusion life prediction model are constructed, and the voltage trend prediction is carried out by utilizing the FCLAB test open dataset and the dynamic cycling operating condition test dataset of the automotive PEMFC system, comparing with the evaluation indicators to judge the applicability of the models.

The rest of paper is structured as follows. In the section 2, for the characteristics of steady-state operating conditions and dynamic cycling conditions, correlation analysis is performed on the original data, and several strongly correlated feature parameters are filtered and smoothed for noise reduction and reconstruction. The section 3 presents the single-feature parameter and multi-feature parameters fusion prediction models. The life prediction under different operating conditions is carried out respectively in the section 4, and the model evaluation results are analyzed and summarized in the section 5. The conclusions and prospects for future work are summarized in section 6.

2. Multi-Feature Correlation Analysis and Data Processing

2.1. Source of Data

The aging process of internal components of PEMFC such as proton exchange membrane and bipolar plate is relatively slow. Meanwhile, due to the continuity of the durability experiments and the closure of the power stack, it is not available to find a reasonable performance index from the internal perspective, so the external index is used as a symbol for characterization [21]. In order to accurately predict the aging trend of the PEMFC in steady-state conditions, the first PEMFC dataset group (FC1) from the 2014 IEEE PHM Data Challenge durability test public dataset is used [22]. The fuel cell test bench used for this data is shown in Figure 1. The specific parameters of this stack are shown in Table 1. The monitoring parameters include air/hydrogen flow and pressure, coolant temperature, output current, output voltage, monomer voltage and so on.

The dynamic operating conditions data comes from the cycling conditions simulation experiment completed on the 60kW fuel cell bench test platform. The dynamic cycling test conditions are based on the CLTC-P driving conditions [24], with a single cycle duration of 1800s, divided into three speed intervals: low, medium and high. The test rig cumulatively tested and sampled data for 96 complete cycles with a data sampling interval of 2s. The test bench is shown in Figure 2. The specific parameters of this stack are shown in Table 2. In this experiment, the anode of the stack was supplied with hydrogen in a cyclic mode, with the hydrogen pressure maintained at about 15 bar and the pressure controlled by a proportional valve to control the flow area. The unreacted hydrogen is returned to the anode inlet via a hydrogen circulation pump and an ejector. The cathode air is pressurized, cooled, humidified and then flows into the cathode flow channel. The cathode inlet pressure and flow rate can be adjusted by air compressor, backpressure valve and exhaust throttle. The parameters that can be directly measured on the test platform include: output voltage, output power, cathode/anode inlet/exhaust pressures, air compressor speed, coolant inlet/outlet temperatures, proportional valve and backpressure valve openings, hydrogen circulating pump speed and so on.

2.2. Correlation Coefficient Analysis Method

Output voltage can be a key feature to characterize the degradation of PEMFC [25]. Therefore, when selecting other features for joint prediction, it is necessary to filter out the feature parameters that are closely related to the output voltage. There are three main methods for performing correlation analysis: the Pearson correlation coefficient, the Spearman correlation coefficient, and the Kendall correlation coefficient [26].

Pearson correlation coefficient is applicable to the comparison between two continuous variables and measures the strength and direction of their linear relationship, with a value between -1 and 1. The Pearson correlation coefficient of variables X and Y is denoted as

ρ_{X, Y}

, as shown in Eq. (1):

(1)

When

ρ_{X, Y}

=1, it indicates that there is a completely positive linear relationship between variables X and Y, i.e., the two variables are completely positively correlated. When

ρ_{X, Y}

=-1, it means that there is a completely negative linear relationship between variables X and Y, i.e., the two variables are completely negatively correlated. When

ρ_{X, Y}

=0, it means that there is no linear relationship between variables X and Y, i.e., the two variables are uncorrelated [27].

Spearman correlation coefficient is used to determine the monotonic relationship between two variables, which does not require the data to follow a normal distribution, and calculates the correlation by ranking the variables, which is applicable in the case of a nonlinear relationship. The rank of a number in a variable is the position of that number in the set of columns from smallest to largest. This method is particularly suitable for rank order data. The Spearman correlation coefficient of variables X and Y is denoted as

ρ

, which is calculated as shown in Eq. (2):

(2)

here,

d_{i}

is the rank difference obtained by subtracting

Y_{i}

from

X_{i}

. This method for the degree of linear relationship is consistent with the Pearson correlation coefficient [28].

Kendall correlation coefficient is a statistic used to measure the hierarchical relationship between two variables, which does not depend on the specific values of the data, but rather calculates the correlation based on the hierarchy of the data. The Kendall correlation coefficient is particularly applicable when the data does not satisfy a normal distribution or there are outliers. The Kendall correlation coefficient between two variables is denoted as

τ

, which is calculated as shown in Eq. (3):

(3)

here,

N_{c}

is the number of two variables that agree in rank,

N_{d}

is the number of two variables that do not agree in rank, and n is the number of elements [29].

The feature parameters of PEMFC are coupled with each other, but do not have linear relationship with each other, and at the same time, there is no normal distribution law. Therefore, in this paper, Spearman correlation coefficient and Kendall correlation coefficient are selected as multi-feature filtering methods.

2.3. Multi-Feature Filtering And Data Processing

Since the monitoring parameters of the steady-state condition dataset and the dynamic cycling condition dataset are not exactly the same, and the operating environments are different. Therefore, the multi-feature correlation analysis and data processing of the two conditions are performed separately.

2.3.1. Steady-State Condition Multi-Feature Filtering and Data Processing

In the FCLAB FC1 group dataset, the test bench monitored 24 feature parameters. Not all of them affect the degradation of PEMFC and some of the parameters are repetitive in properties, so the single voltages

U_{1}

-

U_{5}

, the current density

i

and the current

I

are excluded. Since hydrogen and air are compressible fluids, the relationship between their pressures and flow rates cannot be directly described by Bernoulli’s equation and cannot be replaced with each other. Therefore, both pressures and flow rates are filtered into the feature parameters to be analyzed. Finally, a total of 17 feature parameters are retained, as shown in Table 2.

Table 3. Feature parameters for steady-state condition.

Feature Parameters	Symbol	Unit
Stack output voltage	$U_{t o t}$	V
Inlet temperature of hydrogen	$T_{i n H 2}$	℃
Outlet temperature of hydrogen	$T_{o u t H 2}$	℃
Inlet temperature of air	$T_{i n A I R}$	℃
Outlet temperature of air	$T_{o u t A I R}$	℃
Inlet temperature of coolant	$T_{i n W A T}$	℃
Outlet temperature of coolant	$T_{o u t W A T}$	℃
Inlet pressure of hydrogen	$P_{i n H 2}$	Mbar
Outlet pressure of hydrogen	$P_{o u t H 2}$	Mbar
Inlet pressure of air	$P_{i n A I R}$	Mbar
Outlet pressure of air	$P_{o u t A I R}$	Mbar
Inlet flow rate of hydrogen	$D_{i n H 2}$	L/min
Outlet flow rate of hydrogen	$D_{o u t H 2}$	L/min
Inlet flow rate of air	$D_{i n A I R}$	L/min
Outlet flow rate of air	$D_{o u t A I R}$	L/min
Flow rate of coolant	$D_{W A T}$	L/min
Air humidity	$H_{r A I R F C}$	%

The correlation analysis of 17 feature parameters in the FC1 group dataset is shown in Figure 3. When the absolute value of the correlation coefficient is 0.8-1.0, it is called a strong correlation. When the absolute value of the correlation coefficient is 0.3-0.8, it is called weak correlation. When the absolute value of the correlation coefficient is less than 0.3, it can be regarded as no correlation between the two values [30]. Comparing with Figure 3(a) Spearman results, the

ρ

value of

D_{o u t A I R}

and

T_{i n H 2}

with

U_{t o t}

are 0.67 and -0.51, respectively, which are higher than the correlation of other feature parameters, and the same results are found in Figure 3 (b) Kendall correlation coefficient. Therefore,

D_{o u t A I R}

,

T_{i n H 2}

and

U_{t o t}

are selected as the feature parameters for the life prediction of the steady-state operating conditions.

If the original FC1 group dataset is used directly for life prediction, it is computationally intensive and highly susceptible to overfitting, so the dataset needs to be reconstructed. Firstly, interval sampling is performed, and the output voltage values are sampled every 70s. In order to reduce the effect of noise and improve the prediction accuracy, the interval sampling data is smoothed by using one-dimensional Discrete Wavelet Transform (DWT) [31,32]. The three selected feature parameters before and after the noise reduction and smoothing are shown in Figure 4. As can be seen in Figure 4, the voltage information after the noise reduction process retains both the original data change trend, with fewer data anomalies and overall smoother, which is conducive to model prediction.

2.3.2. Dynamic Cycling Condition Multi-Feature Filtering and Data Processing

The parameters that can be directly monitored in the 60kW fuel cell bench test platform include output voltage, output power, and more than ten feature parameters. The feature parameters that are not related to the degradation of PEMFC are also excluded. The final 16 feature parameters retained are shown in Table 4.

Correlation analysis is performed on 16 feature parameters in the dynamic cycling condition dataset, which are shown in Figure 5. In Figure 5 (a), it can be found that

U_{m i n}

,

P_{i n H 2}

,

P_{i n A I R}

,

n_{A C P S p d}

and

P_{o u t A I R}

are strongly correlated with

U_{o u t}

, but the difference is not obvious. From comparing the correlation coefficient in Figure 5 (b), it can be found that the absolute values of

U_{m i n}

and

P_{i n A I R}

are closer to 1, which indicates that the stronger the correlation between these two variables and

U_{o u t}

. Therefore,

U_{o u t}

,

U_{m i n}

and

P_{i n A I R}

are selected as the features parameters for the life prediction of the dynamic cycling conditions.

In the actual operation process, due to the PEMFC is affected by the environment, temperature, operating conditions, its own system and other factors, the output data collected by the dynamic cycling condition fluctuations, resulting in a large number of outliers [33]. If it is divided by operating condition trend or data threshold, there may be data overlapping calculation, which affects the prediction accuracy. Conditional Generative Adversarial Networks (CGAN) is an extension of Generative Adversarial Networks (GAN), which allows the model to introduce conditional information in the process of generating data, so that the generator can generate samples of specific categories or features based on pre-set conditions [34]. Therefore, the original data containing high-frequency noise is reconstructed using the CGAN unsupervised model to make the data preprocessing and feature extraction process adaptive and self-learning. The data before and after processing is shown in Figure 6. It can be found in Figure 6 that all three feature parameters are able to retain the change trend of the original data after data reconstruction, while the noise is significantly suppressed and the curve is smoother.

3. Design of Life Prediction Model

The commonly used deep learning algorithms and their usage scenarios are shown in Table 4.

Table 5. Comparison of deep learning algorithms.

Name of Algorithm	Advantage	Drawback	Usage Scenario
Recurrent Neural Network (RNN)	Wide range of applications with internal memory mechanism.	Difficult to parallelize training; prone to gradient vanishing or gradient explosion; computationally inefficient.	Short-to-medium time series
Convolutional Neural Network (CNN)	It is good at capturing local features and can be computed in parallel with strong robustness and generalization ability.	The number of parameters is large, and the convolution operation may lead to information loss.	Spatial data
Long Short-Term Memory Network (LSTM)	Use gating mechanism for easier convergence, stable training and flexible structure.	The model is large and the computational complexity is high and not easy to interpret.	Memorizing long time series
Temporal Convolutional Network (TCN)	Parallel computation capable; stable gradient propagation; good at capturing long-term dependencies.	Input sequences need to be formulated with a fixed length and are sensitive to sequence ordering.	Long time series
Gated Recurrent Unit (GRU)	The structure is simple, with fewer gating mechanisms, less computational effort, and higher accuracy when the dataset is small.	Weak memory, may not be able to capture long-term dependencies, weak processing of complex sequences.	Simple long time series

3.1. Design of Single-Feature Models

The prediction method with output voltage as a single feature is added to facilitate the comparison and analysis of the advantages and disadvantages of the proposed multi-feature life prediction methods. For the steady-state operating condition life prediction, two prediction methods, traditional RNN and LSTM, are selected as a comparison based on the algorithm characteristics. The RNN model is built based on TensorFlow, and in order to prevent the prediction from overfitting, the Dropout technique is introduced. This technique randomly removes some neurons and their single-step connections during the training process and at each iteration, so that the whole training process is "sparsely" sampled, which avoids the same network from being trained repeatedly and ensures the model’s generalization ability [35]. Meanwhile, in order to ensure that the learning rate

η

can be adjusted adaptively and maintain the adaptability and stability of each parameter, Adam is chosen as the optimizer, with the number of neurons in the hidden layer of 150. The probability of Dropout random deletion is 0.4, the activation function is ReLU, and the number of iterations is 2000. Considering the training speed and generalization ability, the Batch Size is formulated to be 64 at each iteration.

Based on the LSTM algorithm structure, the Dropout technique is also introduced to prevent predictive overfitting. According to the conclusions of Zhang et al. [36] the Learning Rate

η

of 0.01 and Dropout random deletion probability of 0.4 are selected to train the network. The number of neurons in the hidden layer of the LSTM is 150, the number of iterations is 2000, and the learning rate is decayed for 50 generations, with a learning rate decay rate of 0.2.

For dynamic cycling operating condition life prediction, a CNN-LSTM based PEMFC voltage prediction model is selected based on the algorithmic characteristics with the same configuration as above [37,38]. The model utilizes the advantages of the data feature extraction capability of the CNN algorithm and the suitability of the LSTM algorithm for dealing with long-term dependencies. The CNN is mainly used to extract the underlying features of the data to achieve dimensionality reduction, and the use of LSTM network is mainly to grasp the contextual information more deeply and comprehensively, to avoid the loss of contextual information due to the excessive depth of the CNN network, and at the same time, it can allow the model to focus more on the temporal attributes of the data.

3.2. Design of Multi-Feature Models

Since the essence of multi-feature fusion prediction is the processing of time series features, TCN is selected as the prediction base model in this paper. In order to improve the prediction accuracy of multi-feature data input, the TCN structure is modified, and the improved TCN multi-input model structure is shown in Figure 7. The TCN activation function is replaced with the Leaky ReLU. The Leaky ReLU function introduces a negative slope on the base of ReLU, as shown in Eq. (4). Because the Leaky ReLU function has a non-zero gradient, it allows faster convergence during training and avoids the problems of neurons not being updated and gradient vanishing. In order to enhance the model capacity and expressiveness [39], three layers of residual blocks are used in the TCN model to help the model capture the long-term correlation relationships within the time series more accurately, while avoiding too many residual blocks leading to model overfitting.

(4)

Compared to the LSTM model using three gating mechanisms, the GRU structure is simpler and has fewer hyperparameters. Hence, it has higher computational efficiency in long time series data prediction and is easier to adjust the model hyperparameters and structure [40]. Similarly, in order to better capture multiple sequence features and strengthen the model expression ability, 2 GRU modules are added to the TCN-GRU model. The improved TCN-GRU model is shown in Figure 8. After repeated experimental tuning, the appropriate hyperparameters of the TCN-GRU multi-feature input model are determined, as shown in Table 6.

4. Multi-Operating Condition Life Prediction for PEMFCs

In this paper, predictions are made according to the Training Set: Test Set = 4:6(40%), 6:4(60%), 8:2(80%) division. The influence of the experimental environment configuration on the results during the prediction process is minimized as much as possible.

4.1. Life Prediction for Steady-State Operating Condition

According to the multi-feature correlation filtering results in the previous section,

U_{t o t}

and

D_{o u t A I R}

are selected as the input values for the dual-feature prediction model.

U_{t o t}

,

D_{o u t A I R}

, and

T_{i n H 2}

are selected as the input values for the three-feature fusion prediction model.

The datasets of

U_{t o t}

and

D_{o u t A I R}

are imported into the input layer of the TCN-GRU model, and the prediction results under the conditions of 40%, 60%, and 80% of the training set ratio are obtained, respectively, as shown in Figure 9. By analyzing the prediction curves in the figure, it can be found that the results of the TCN-GRU dual-feature fusion prediction model are significantly better than those of the LSTM and RNN models, and the dual-feature fusion prediction fitting performance is better as the number of training sets increases.

The datasets of

U_{t o t}, D_{o u t A I R}

and

T_{i n H 2}

datasets are imported into the input layer of the TCN-GRU model, and the prediction results are obtained under the conditions of 40%, 60%, and 80% of the training set ratio, respectively, as shown in Figure 10. Analyzing the prediction curves in the figure, it can be observed that the three-feature fusion prediction results are close to the original data at 40%, 60%, and 80% of the training set ratio, indicating that the TCN-GRU model can characterize the degradation inside the stacks more rapidly and accurately and make prediction after more stacks’ feature parameters are introduced.

4.2. Life Prediction for Dynamic Cycling Condition

According to the multi-feature correlation filtering results in the previous section,

U_{o u t}

and

U_{m i n}

are selected as the input values for the dual-feature prediction model.

U_{o u t}

,

U_{m i n}

, and

P_{i n A I R}

are selected as the input values for the three-feature fusion prediction model.

The datasets of

U_{o u t}

and

U_{m i n}

are imported into the input layer of the TCN-GRU model, and the prediction results under the conditions of 40%, 60%, and 80% of the training set ratio are obtained, respectively, as shown in Figure 11. The dynamic cycling condition of automotive PEMFC has more uncertainties compared with the steady-state operating condition. Theoretically, the multi-feature fusion prediction has better robustness by combining the trends of multi-feature parameters to comprehensively analyze the degradation characteristics of PEMFC, and weakening the influence of single-feature parameter anomalies and noise. By qualitatively analyzing Figure 11, it can be seen that the dual-feature fusion prediction is more desirable for dynamic life prediction. From the local zoom in the figure, it can be noticed that the dual-feature prediction curve can better reflect the trend of the voltage value compared with CNN-LSTM.

The datasets of

U_{o u t}, U_{m i n}

and

P_{i n A I R}

datasets are imported into the input layer of the TCN-GRU model, and the prediction results are obtained under the conditions of 40%, 60%, and 80% of the training set ratio, respectively, as shown in Figure 12. As can be seen from the figure, the triple-feature fusion prediction is slightly better than the dual-feature fusion prediction, especially when the training set ratio is at 60%. From the local zoom in the figure, the three-feature fusion prediction is closer to the dual-feature fusion prediction, and the fitting effect is better. As the training data volume is larger, the prediction trend for the remaining lifetime is more accurate. At 80% of the training set ratio, the prediction results are almost the same as the original data.

5. Result and Discussion

In this paper, predictions are made according to the Training Set: Test Set = 4:6(40%), 6:4(60%), 8:2(80%) division. The influence of the experimental environment configuration on the results during the prediction process is minimized as much as possible. In order to quantitatively assess model performance to visualize the degree of model fitting and prediction ability, it is necessary to use appropriate model evaluation indicators to select, adjust, interpret and improve the model [41]. According to the characteristics of operating conditions, this paper adopts Root Mean Square Error (RMSE) and Determination Coefficient (R²) as the evaluation indexes of model prediction results.

5.1. Analysis of Life Prediction Results for Steady-State Operating Condition

The prediction evaluation indicators RMSE and R² of RNN, LSTM, TCN-GRU (dual-feature) and TCN-GRU (three-feature) models are shown in Table 7 and Figure 13. It can be concluded as follows:

TCN-GRU (three-feature) has the highest prediction accuracy, with the RMSE value of 3.27×10^-3 and the R² value of 0.965 at 80% of the training set ratio, almost perfectly fitting the original data, which is significantly better than the other two prediction models. It indicates that for the multi-feature fusion prediction model’s anti-overfitting and generalization ability is better than RNN and LSTM models, and it is more suitable for the prediction of long time series and data with multiple parameters
TCN-GRU (three-feature) prediction results are better than TCN-GRU (dual-feature), compared with the RMSE value reduced by at least 3.82%, and even reduced by 23.6% in the training set ratio of 80%, which indicates that in the long time series prediction, the more features involved in the fusion of the prediction of the more the model’s ability to resist the abnormal data points or noise, and the better the stability of the model;
Comparing the TCN-GRU multi-feature fusion prediction results horizontally, with the increase of the training set, the RMSE value of TCN-GRU (three-feature) decreases by 12.3% and 26.02% in turn, and the R² value improves by 1.18% and 2.33%, and the prediction effect gets better with the increase of the training set, which proves the multi-feature fusion prediction model can meet the requirements of the PEMFC life prediction.

5.2. Analysis of Life Prediction Results for Dynamic Cycling Condition

The prediction evaluation indicators RMSE and R² of CNN- LSTM, TCN-GRU (dual-feature) and TCN-GRU (three-feature) models are shown in Table 8 and Figure 14. It can be concluded as follows:

TCN-GRU (three-feature) has the highest prediction accuracy, which is slightly better than TCN-GRU (dual-feature). Comparing the RMSE values, the TCN-GRU (three-feature) model reduces at least 6.79% compared to the CNN-LSTM model, which indicates that the multi-feature fusion prediction model is more resistant to noise and has better robustness in the long time series prediction of dynamic cycling conditions;
Comparing the R² values, it can be found that the fitting ability of the TCN-GRU multi-feature fusion prediction model has a disconnected improvement compared with the traditional deep learning network. It further shows that a single deep learning network cannot meet the requirements of time series prediction with large data volume, and it is necessary to flexibly combine multiple algorithms, give full play to the advantages of each algorithm, and reasonably build a joint model according to the data volume and data characteristics, in order to achieve better prediction results.

6. Conclusion

In this paper, a multi-feature fusion method for life prediction of PEMFC based on TCN-GRU is proposed. This model is based on the TCN algorithm that possesses a three-layer residual block, and two GRU modules are added to better capture multiple sequential features and strengthen the model expression capability. To evaluate the model, two commonly used datasets are used for model training under steady-state operating conditions and dynamic cyclic conditions, respectively. Strongly correlated feature parameters are imported into the model for prediction after correlation filtering. Compared with the single-feature prediction model, the multi-feature model has many advantages such as strong anti-interference, stability and high prediction accuracy. In the future research, we will continue to optimize the multi-feature fusion life prediction model to improve the prediction accuracy and efficiency. Meanwhile, we continue to try to design other advanced life prediction algorithms to enhance the practicability and generalization under different operating conditions and scenarios to meet the needs of practical applications.

Author Contributions

Conceptualization, Jiaming Zhang, Fuwu Yan, Changqing Du, Yiming Zhang, Chao Zheng and Jinhai Wang; methodology, Jiaming Zhang, Fuwu Yan, Changqing Du and Yiming Zhang; software, Jiaming Zhang, Yiming Zhang, Chao Zheng and Jinhai Wang; validation, Jiaming Zhang, Yiming Zhang and Jinhai Wang; formal analysis, Fuwu Yan and Changqing Du; investigation, Yiming Zhang and Chao Zheng; resources, Jiaming Zhang, Changqing Du and Yiming Zhang; data curation, Jiaming Zhang, Changqing Du, Yiming Zhang and Chao Zheng; writing—original draft preparation, Jiaming Zhang, Yiming Zhang and Chao Zheng; writing—review and editing, Jiaming Zhang, Fuwu Yan, Changqing Du and Jinhai Wang; visualization, Jiaming Zhang, Fuwu Yan, Changqing Du, Yiming Zhang, Chao Zheng and Jinhai Wang; supervision, Fuwu Yan and Changqing Du; project administration, Fuwu Yan and Changqing Du; funding acquisition, Fuwu Yan and Changqing Du. All authors have read and agreed to the published version of the manuscript. Jiaming Zhang, Fuwu Yan, Changqing Du, Yiming Zhang, Chao Zheng, Jinhai Wang

Funding

This research was funded by National Key R&D Program of China, grant number 2022YFB4003703; Foshan Xianhu Laboratory of the Advanced Energy Science and Technology Guangdong Laboratory, grant number XHRD2024-11233100-01; Key R&D project of Hubei Province China, grant number 2021AAA006.

Conflicts of Interest

The authors declare no conflict of interest.

References

Liu, W.; Sun, L.; Li, Z.; et al. Trends and future challenges in hydrogen production and storage research. Environmental Science and Pollution Research. 2020, 27(25), 31092–31104. [Google Scholar] [CrossRef] [PubMed]
Wang, J.; Du, C.; Yan, F.; et al. Hierarchical Rewarding Deep Deterministic Policy Gradient Strategy for Energy Management of Hybrid Electric Vehicles. IEEE Transactions on Transportation Electrification. 2024, 10, 1802–1815. [Google Scholar] [CrossRef]
Guo, L.; Zhao, L.; Jing, D.; et al. Solar hydrogen production and its development in China. Energy. 2009, 34(9), 1073–1090. [Google Scholar] [CrossRef]
Zhang, J.; Yan. F.; Du. C.; et al. Model-based performance optimization of thermal management system of proton exchange membrane fuel cell. Energies. 2023, 16(9), 3952. [CrossRef]
Zhao, S.; Gao, Z.; Li, X.; et al. Research on Energy Management Strategy of Fuel Cell Tractor Hybrid Power System. World Electric Vehicle Journal. 2024, 15(2), 61. [Google Scholar] [CrossRef]
Mo, T.; Li, Y.; Luo, Y. Advantages and Technological Progress of Hydrogen Fuel Cell Vehicles. World Electric Vehicle Journal. 2023, 14(6), 162. [Google Scholar]
Chen, K.; Laghrouche, S.; Djerdir, A. Aging prognosis model of proton exchange membrane fuel cell in different operating conditions. International Journal of Hydrogen Energy. 2020, 45, 11761–11772. [Google Scholar]
Hua, Z.; Zheng, Z.; Pahon, E.; et al. A review on lifetime prediction of proton exchange membrane fuel cells system. Journal of Power Sources. 2022, 529. [Google Scholar]
Ou, M.; Zhang, R.; Shao, Z.; et al. A novel approach based on semi-empirical model for degradation prediction of fuel cells. Journal of Power Sources. 2021, 488. [Google Scholar] [CrossRef]
Lechartier, E.; Laffly, E.; Péra, M.-C.; et al. Proton exchange membrane fuel cell behavioral model suitable for prognostics. International Journal of Hydrogen Energy. 2015, 40, 8384–8397. [Google Scholar] [CrossRef]
Ao, Y.; Laghrouche, S.; Depernet, D.; et al. Proton Exchange Membrane Fuel Cell Prognosis Based on Frequency-Domain Kalman Filter. IEEE Transactions on Transportation Electrification. 2021, 7, 2332–2343. [Google Scholar] [CrossRef]
Deng, S.; Zhang, J.; Zhang, C.; et al. Prediction and optimization of gas distribution quality for high-temperature PEMFC based on data-driven surrogate model. Applied Energy. 2022, 327. [Google Scholar] [CrossRef]
Wu, Y.; Breaz, E.; Gao, F.; et al. A Modified Relevance Vector Machine for PEM Fuel-Cell Stack Aging Prediction. IEEE Transactions on Industry Applications. 2016, 52, 2573–2581. [Google Scholar] [CrossRef]
Liu, H.; Chen, J.; Hissel, D.; et al. Short-Term Prognostics of PEM Fuel Cells: A Comparative and Improvement Study. IEEE Transactions on Industrial Electronics. 2019, 66, 6077–6086. [Google Scholar] [CrossRef]
Mezzi, R.; Yousfi-Steiner, N.; Péra, M.C.; et al. An Echo State Network for fuel cell lifetime prediction under a dynamic micro-cogeneration load profile. Applied Energy. 2021, 283. [Google Scholar] [CrossRef]
Zuo, J.; Lv, H.; Zhou, D.; et al. Deep learning based prognostic framework towards proton exchange membrane fuel cell for automotive application. Applied Energy. 2021, 281. [Google Scholar] [CrossRef]
Wang, Y.; Wu, K.; Zhao, H.; et al. Degradation prediction of proton exchange membrane fuel cell stack using semi-empirical and data-driven methods. Energy and AI. 2023, 11. [Google Scholar] [CrossRef]
Pan, R.; Yang, D.; Wang, Y.; et al. Performance degradation prediction of proton exchange membrane fuel cell using a hybrid prognostic approach. International Journal of Hydrogen Energy. 2020, 45, 30994–31008. [Google Scholar] [CrossRef]
Liu, H.; Chen, J.; Hissel, D.; et al. Remaining useful life estimation for proton exchange membrane fuel cells using a hybrid method. Applied Energy. 2019, 237, 910–919. [Google Scholar] [CrossRef]
Zhu, W.; Guo, B.; Li, Y.; et al. Uncertainty quantification of proton-exchange-membrane fuel cells degradation prediction based on Bayesian-Gated Recurrent Unit. eTransportation 2023, 16. [Google Scholar] [CrossRef]
Zhang, C.; Hu, H.; Ji, J.; et al. An evolutionary stacked generalization model based on deep learning and improved grasshopper optimization algorithm for predicting the remaining useful life of PEMFC. Applied Energy. 2023, 330. [Google Scholar] [CrossRef]
Gouriveau, R.; Hilairet, M.; Hissel, D.; et al. IEEE PHM 2014 data challenge: Outline, experiments, scoring of results, winners. In Proceedings of the Proc. IEEE Conf. Prognostics Health Manage; 2014; pp. 1–6. [Google Scholar]
Chen, K.; Laghrouche, S.; Djerdir, A. Degradation prediction of proton exchange membrane fuel cell based on grey neural network model and particle swarm optimization. Energy Conversion and Management. 2019, 195, 810–818. [Google Scholar] [CrossRef]
China Automotive Technology and Research Center, Co. Ltd. China automotive test cycle-Part 1: Light-duty vehicles. 2019, GB/T 38146.1-2019, 36.
Jouin, M.; Gouriveau, R.; Hissel, D.; et al. Prognostics of PEM fuel cell in a particle filtering framework. International Journal of Hydrogen Energy 2014, 39, 481–494. [Google Scholar] [CrossRef]
Wang, J.; Du, C.; Yan, F.; et al. Energy Management of a Plug-in Hybrid Electric Vehicle Using Bayesian Optimization and Soft Actor-Critic Algorithm. IEEE Transactions on Transportation Electrification. 2024, 1–1. [Google Scholar] [CrossRef]
Dufera, A.G.; Liu, T.; Xu, J. Regression models of Pearson correlation coefficient. Statistical Theory and Related Fields. 2023, 7, 97–106. [Google Scholar] [CrossRef]
Yu, H.; Hutson, A.D. A robust Spearman correlation coefficient permutation test. Communications in Statistics - Theory and Methods. 2022, 53, 2141-2153. [CrossRef]
Gao, D.; Zhou, Y.; Wang, T.; et al. A Method for Predicting the Remaining Useful Life of Lithium-Ion Batteries Based on Particle Filter Using Kendall Rank Correlation Coefficient. Energies. 2020, 13. [Google Scholar] [CrossRef]
Lu, Y.; Tang, X.; Wang, H. An Effective Stock Clustering Method Based on Hybrid Correlation Coefficient; 2018.
Ding, R.; Zhang, S.; Chen, Y.; et al. Application of Machine Learning in Optimizing Proton Exchange Membrane Fuel Cells: A Review. Energy and AI. 2022, 9. [Google Scholar] [CrossRef]
Ibrahim, M.; Steiner, N.; Jemei, S.; et al. Wavelets-based approach for online Fuel Cells Remaining Useful lifetime Prediction. IEEE Transactions on Industrial Electronics 2016, 1–1. [Google Scholar] [CrossRef]
Luo, J.; Chen, T.; Xiao, F.; et al. Remaining useful life prediction of PEMFC based on CNN-Birnn model. International Journal of Green Energy. 2023, 20, 1729–1740. [Google Scholar] [CrossRef]
Campbell, J.N.A.; Dais Ferreira, M.; Isenor, A.W. Generation of Vessel Track Characteristics Using a Conditional Generative Adversarial Network (CGAN). Applied Artificial Intelligence. 2024, 38. [Google Scholar] [CrossRef]
Wang, F.-K.; Cheng, X.-B.; Hsiao, K.-C. Stacked long short-term memory model for proton exchange membrane fuel cell systems degradation. Journal of Power Sources. 2020, 448. [Google Scholar] [CrossRef]
Zhang, C.; Ji, C.; Hua, L.; et al. Evolutionary quantile regression gated recurrent unit network based on variational mode decomposition, improved whale optimization algorithm for probabilistic short-term wind speed prediction. Renewable Energy. 2022, 197, 668–682. [Google Scholar] [CrossRef]
Peng, Y.; Chen, T.; Xiao, F.; et al. Remaining useful lifetime prediction methods of proton exchange membrane fuel cell based on convolutional neural network-long short-term memory and convolutional neural network-bidirectional long short-term memory. Fuel Cells 2022, 23, 75–87. [Google Scholar] [CrossRef]
Liu, C.; Shen, J.; Dong, Z.; et al. Accuracy improvement of fuel cell prognostics based on voltage prediction. International Journal of Hydrogen Energy. 2024, 58, 839–851. [Google Scholar] [CrossRef]
Yang, J.; Wang, L.; Zhang, B.; et al. Remaining useful life prediction of vehicle-oriented PEMFC systems based on IGWO-BP neural network under real-world traffic conditions. Energy. 2024, 291. [Google Scholar] [CrossRef]
Tao, Z.; Zhang, C.; Xiong, J.; et al. Evolutionary gate recurrent unit coupling convolutional neural network and improved manta ray foraging optimization algorithm for performance degradation prediction of PEMFC. Applied Energy. 2023, 336. [Google Scholar] [CrossRef]
Ma, R.; Xie, R.; Xu, L.; et al. A Hybrid Prognostic Method for PEMFC With Aging Parameter Prediction. IEEE Transactions on Transportation Electrification. 2021, 7, 2318–2331. [Google Scholar] [CrossRef]

Figure 1. 2014 IEEE PHM Data Challenge durability test bench [22,23].

Figure 2. 60kW fuel cell test bench.

Figure 3. Matrix of correlation coefficients of feature parameters for steady-state condition: (a) Comparison of Spearman correlation coefficient; (b) Comparison of Kendall correlation coefficient.

Figure 4. Comparison of DWT smoothing noise reduction results for feature parameters: (a) The original

U_{t o t}

; (b) Results of

U_{t o t}

after smoothing noise reduction; (c) The original

D_{o u t A I R}

; (d) Results of

D_{o u t A I R}

after smoothing noise reduction; (e) The original

T_{i n H 2}

; (f) Results of

T_{i n H 2}

after smoothing noise reduction;

Figure 4. Comparison of DWT smoothing noise reduction results for feature parameters: (a) The original

U_{t o t}

; (b) Results of

U_{t o t}

after smoothing noise reduction; (c) The original

D_{o u t A I R}

; (d) Results of

D_{o u t A I R}

after smoothing noise reduction; (e) The original

T_{i n H 2}

; (f) Results of

T_{i n H 2}

after smoothing noise reduction;

Figure 5. Matrix of correlation coefficients of feature parameters for dynamic cycling condition: (a) Comparison of Spearman correlation coefficient; (b) Comparison of Kendall correlation coefficient.

Figure 6. Comparison of reconstruction results of feature parameters based on CGAN: (a) The original

U_{o u t}

; (b) Results of

U_{o u t}

; after reconstruction; (c) The original

U_{m i n}

; (d) Results of

U_{m i n}

after reconstruction; (e) The original

P_{i n A I R}

; (f) Results of

P_{i n A I R}

after reconstruction;

Figure 6. Comparison of reconstruction results of feature parameters based on CGAN: (a) The original

U_{o u t}

; (b) Results of

U_{o u t}

; after reconstruction; (c) The original

U_{m i n}

; (d) Results of

U_{m i n}

after reconstruction; (e) The original

P_{i n A I R}

; (f) Results of

P_{i n A I R}

after reconstruction;

Figure 7. Schematic of the multi-input structure of the TCN model.

Figure 8. Schematic of the structure of multi-feature fusion life prediction model based on TCN-GRU.

Figure 9. Results of dual-feature fusion life prediction for steady-state operating condition: (a) 40% of the training set ratio; (b) 60% of the training set ratio; (c) 80% of the training set ratio.

Figure 10. Results of three-feature fusion life prediction for steady-state operating condition: (a) 40% of the training set ratio; (b) 60% of the training set ratio; (c) 80% of the training set ratio.

Figure 11. Results of dual-feature fusion life prediction for dynamic cycling condition: (a) 40% of the training set ratio; (b) 60% of the training set ratio; (c) 80% of the training set ratio.

Figure 12. Results of three-feature fusion life prediction for dynamic cycling condition: (a) 40% of the training set ratio; (b) 60% of the training set ratio; (c) 80% of the training set ratio.

Figure 13. Evaluation of life prediction results for steady-state operating condition: (a) Comparison of RMSE results; (b) Comparison of R² results.

Figure 14. Evaluation of life prediction results for dynamic cycling condition: (a) Comparison of RMSE results; (b) Comparison of R² results.

Table 1. 2014 IEEE PHM Data Challenge durability test stack parameters [22,23].

Parameters	Value	Unit
Active area	100	cm²
Operating temperature	60	℃
Pressure	1.5	bar
Relative humidity	50	%
Load current	70	A
Rated current density	0.7	A/cm²

Table 2. 60kW fuel cell test bench stack parameters.

Parameters	Value	Unit
Active area	304	cm²
Operating temperature	80	℃
Pressure	2	bar
Relative humidity	80	%

Table 4. Feature parameters for dynamic cycling condition.

Feature Parameters	Symbol	Unit
Stack output voltage	$U_{o u t}$	V
Stack output power	$P_{o u t}$	W
Inlet temperature of hydrogen	$T_{i n H 2}$	℃
Inlet temperature of air	$T_{i n A I R}$	℃
Outlet temperature of air	$T_{o u t A I R}$	℃
Inlet temperature of coolant	$T_{i n W A T}$	℃
Outlet temperature of coolant	$T_{o u t W A T}$	℃
Temperature difference of coolant	$T_{W A T D i f}$	℃
Inlet pressure of hydrogen	$P_{i n H 2}$	MPa
Inlet pressure of air	$P_{i n A I R}$	MPa
Outlet pressure of air	$P_{o u t A I R}$	MPa
Inlet pressure of coolant	$P_{i n W A T}$	MPa
Outlet pressure of coolant	$P_{o u t W A T}$	MPa
Pressure difference of cathode and anode	$P_{i n D i f}$	MPa
Speed of air compressor	$n_{A C P S p d}$	rpm
Minimum single voltage	$U_{m i n}$	V

Table 6. Table 6. Model hyperparameter settings.

Parameter	Value	Parameter	Value
Neurons	30	Bach size	128
Filters	32	Learning rate	0.001
Sizes	3	Dropout	0.5
Dilations	2	Epoch	30

Table 7. Evaluation of life prediction results for steady-state operating condition.

Name of Model	RMSE(x10^-3)			R²
Name of Model	40%	60%	80%	40%	60%	80%
RNN	23.91	13.22	11.35	0.277	0.403	0.455
LSTM	12.77	10.51	10.71	0.541	0.616	0.647
TCN-GRU (dual-feature)	5.24	5.67	4.28	0.908	0.924	0.938
TCN-GRU (three-feature)	5.04	4.42	3.27	0.932	0.943	0.965

Table 8. Evaluation of life prediction results for dynamic cycling condition.

Name of Model	RMSE(x10^-3)			R²
Name of Model	40%	60%	80%	40%	60%	80%
CNN-LSTM	5.60	6.95	3.66	0.678	0.685	0.789
TCN-GRU (dual-feature)	5.30	4.23	2.94	0.866	0.881	0.887
TCN-GRU (three-feature)	5.22	4.04	2.91	0.885	0.890	0.924

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

A Multi-feature Fusion Method for Life Prediction of Automotive Proton Exchange Membrane Fuel Cell Based on TCN-GRU

Abstract

Keywords:

Subject:

1. Introduction

2. Multi-Feature Correlation Analysis and Data Processing

2.1. Source of Data

2.2. Correlation Coefficient Analysis Method

2.3. Multi-Feature Filtering And Data Processing

2.3.1. Steady-State Condition Multi-Feature Filtering and Data Processing

2.3.2. Dynamic Cycling Condition Multi-Feature Filtering and Data Processing

3. Design of Life Prediction Model

3.1. Design of Single-Feature Models

3.2. Design of Multi-Feature Models

4. Multi-Operating Condition Life Prediction for PEMFCs

4.1. Life Prediction for Steady-State Operating Condition

4.2. Life Prediction for Dynamic Cycling Condition

5. Result and Discussion

5.1. Analysis of Life Prediction Results for Steady-State Operating Condition

5.2. Analysis of Life Prediction Results for Dynamic Cycling Condition

6. Conclusion

Author Contributions

Funding

Conflicts of Interest

References

MDPI Initiatives

Important Links

Subscribe