1. Introduction
Although air pollution levels, including those of particulate matter (
), have shown some improvement over time, the aging population poses a significant challenge. If current emission levels are maintained, the number of premature deaths attributed to
is expected to continue rising due to the increasing vulnerability of older adults[
1,
2,
3]. Therefore, additional efforts to reduce emissions are necessary to counteract the rise in premature deaths associated with population aging. This study aims to address this issue by establishing a method for predicting
concentrations in response to emission reductions and estimating premature deaths based on these predicted concentrations. Specifically, we develop a model that incorporates the aging population’s impact on premature death rates and determines the additional emission reductions required to offset the increase in premature deaths.
The complex nature of
pollution in South Korea presents a significant challenge for air quality management[
4,
5,
6].
levels in South Korea are influenced by a combination of domestic emissions and transboundary pollution, primarily from China. While domestic emissions are a crucial target for mitigation efforts, the impact of foreign sources, which is beyond the country’s direct control, must also be considered[
5,
7,
8,
9,
10]. To address this, the South Korean government is implementing a comprehensive strategy that includes regional assessments, advanced
concentration modeling, and dynamic emission reduction policies[
11]. Emergency measures are also in place for severe pollution events[
12]. Effective management requires the ability to rapidly simulate and evaluate multiple emission scenarios under varying conditions, including the impact of transboundary pollution. Therefore, there is a need for a
concentration prediction model that can account for the influence of boundary conditions from neighboring countries, such as China. Developing such models is essential for establishing rational, responsive, and effective policies for managing fine particulate matter and safeguarding public health in South Korea.
Figure 1.
A graphical abstract of the pipeline that uses the model we developed. From top to bottom, the pipeline we present in this paper is as follows: (1) Compute the minimum data required to train the model via CMAQ simulation, (2) Train an emulator that mimics CMAQ by training a conditional U-Net model using the computed data. (3) Finally, we use this emulator to conduct and utilize various studies. The details of the model’s operation are explained in the Methods section.
Figure 1.
A graphical abstract of the pipeline that uses the model we developed. From top to bottom, the pipeline we present in this paper is as follows: (1) Compute the minimum data required to train the model via CMAQ simulation, (2) Train an emulator that mimics CMAQ by training a conditional U-Net model using the computed data. (3) Finally, we use this emulator to conduct and utilize various studies. The details of the model’s operation are explained in the Methods section.
One of the most well-known air pollutant concentration simulation tools is the Community Multiscale Air Quality Modeling System (CMAQ), developed by the U.S. Environmental Protection Agency (EPA)[
13]. CMAQ accurately simulates pollutant dispersion based on physical and chemical processes and is widely used by governments and organizations, including those in South Korea, for air quality management. While CMAQ offers the advantage of incorporating complex chemical reactions and providing sophisticated simulations, its high data management complexity, substantial computational demands, and lengthy processing times for each emission scenario pose significant challenges for efficient policy evaluation across multiple air quality scenarios[
14]. To address these limitations, research has focused on developing surrogate models using statistical techniques like polynomial chaos and response surface modeling (pf-RSM) and deep response surface modeling (deepRSM)[
15,
16,
17,
18], as well as deep learning approaches such as Convolutional Neural Networks (CNN) and U-Net models [
19,
20]. These surrogate models have demonstrated the ability to emulate CMAQ with high performance. However, they still face challenges in terms of data handling complexity, which can limit their accessibility for non-experts in air pollution simulation—such as those focused on evaluating health impacts and formulating air pollution policies.
In this study, we present an innovative methodology that bridges the gap between complex air quality modeling and practical policy evaluation. Our approach integrates a high-speed CMAQ surrogate model based on a conditional U-Net architecture with the widely used Environmental Benefits Mapping and Analysis Program (BenMAP) health impact assessment tool [
21,
22]. This combined framework allows policymakers to swiftly assess the health impacts of various precursor emission strategies. The surrogate model, designed to emulate annual
concentrations in South Korea, uses adjustable precursor emission rates and boundary conditions as inputs. It demonstrates high performance and scientific validity, as confirmed by input-output relationship analysis. Notably, the model significantly reduces computational time, enhancing efficiency for specific applications. By leveraging this surrogate model and BenMAP pipeline, we can evaluate how human health impacts shift under different annual emission scenarios and boundary conditions, providing decision-makers with a more efficient and accurate tool for policy development.
The paper is organized as follows:
Section 2 (Materials and Methods) provides details on data preparation, explains the conditional U-Net architecture, describes the training and evaluation processes, and outlines the health impact assessment approach using BenMAP.
Section 3 (Results) presents the performance metrics of the surrogate model and the outcomes of the health impact assessments. Finally,
Section 4 (Discussion) explores the implications of our findings, acknowledges the study’s limitations, and proposes directions for future research. This work represents a significant advancement toward more responsive and effective air quality management, especially in regions dealing with complex
pollution challenges.
3. Results
The results of this study can be divided into two main categories. One is the confirmation of the emulation performance of the CMAQ emulator, and the other is the results of the health effect assessment using this emulator and its analysis. In this section, we discuss these results.
3.1. Emulation Performance Evaluation
We first verify the emulation performance of the CMAQ simulation results for the conditional U-Net structure used in our methodology. This is an important step in determining whether the health effects assessment using this conditional U-Net is reliable. The performance is compared from three perspectives: (1) for quantitative evaluation, simulation and emulation results are compared by scatterplotting all grid verbs, (2) the mean absolute error (MAE), normalized mean absolute error (NMAE), and coefficient of determination (), (3) for qualitative evaluation, the importance between model inputs and outputs is analyzed using SHAP values to see how well the model mimics the physical and chemical processes of CMAQ, and finally, (4) the computation time was evaluated.
3.1.1. Emulation Results
Figure 3 presents a comprehensive comparative analysis of the Conditional U-Net model’s inference results against CMAQ simulations, using both training and test data sets. The scatter plots are arranged side-by-side, with the left panel showing the model’s performance on the training data and the right panel demonstrating its capability on the unseen test data. Both scenarios show remarkable agreement with the CMAQ simulations, achieving an impressive
value of 0.99. This exceptional correlation underscores the model’s ability to emulate CMAQ with near-perfect accuracy over a wide range of
concentration values. The consistently high performance across both training and test data sets not only validates the model’s robust learning, but also confirms its ability to effectively generalize to new scenarios. This consistency is particularly noteworthy because it eliminates concerns about overfitting, a common challenge for machine learning models. The tight clustering of points along the diagonal in both plots further illustrates the model’s precision in capturing the nuances of CMAQ simulations. This visual and statistical evidence reinforces the reliability of the Conditional U-Net as an efficient and accurate surrogate for CMAQ simulations, potentially revolutionizing the speed and scale at which air quality scenarios can be evaluated for policy-making purposes.
Table 2 presents a quantitative evaluation of the performance of the conditional U-Net model, which emulates the CMAQ simulation of
concentrations in South Korea in 2019. The accuracy of the model was evaluated using three main metrics: mean absolute error (MAE), normalized mean absolute error (NMAE), and coefficient of determination (
). The results showed high accuracy of the conditional U-Net in reproducing CMAQ results, with significantly lower MAE values (training: 0.222
, test: 0.221
) and NMAE values (training: 1.788%, test: 1.762%). The
value of 0.996 for both the training and test sets is particularly impressive, indicating that the conditional U-Net explains 99.6% of the variance in the CMAQ predictions. This near perfect correlation indicates an exceptional level of agreement between the surrogate model and CMAQ.
The consistency of the performance metrics between the training and test sets provides strong evidence that the model effectively learned the underlying patterns without overfitting. Overall, these results confirm the ability of the conditional U-Net to serve as a highly accurate and efficient surrogate model for CMAQ in predicting annual average concentrations across Korea, and may provide important computational advantages for rapid policy scenario evaluation.
3.1.2. Input Contribution Analysis
The effect of input parameters on the output concentration was analyzed using SHAP values to evaluate whether the emulators used in this study accurately reproduce the physical and chemical processes of CMAQ as well as simple emulation performance. First, because our conditional U-Net structure has two major inputs, domestic emissions expressed in terms of emissions of the five precursors and international emissions expressed in terms of activity at the boundary, we evaluated domestic and international impacts on changes in concentrations.
Figure 4 shows the percentage of domestic and international SHAP values for each region. The six regions shown here are the national average, Seoul, Daejeon, Gwangju, Ulsan, and Jeju. In the case of domestic impact assessment, the SHAP values for all input parameters representing domestic precursor emissions were obtained by summing the SHAP values.
Figure 4 shows that international influence is greater than or nearly equal to domestic influence in every region. For the national average impact, boundary is rated as more influential, and for the localized results in other inland regions, domestic and boundary are rated similarly. In particular, Jeju Island, located in the southern part of the Korean peninsula far from the mainland, is known to be a particularly foreign-influenced region, with weak influence from land-based precursor emissions and low emission activity of its own, and the emulation model reflects this well [
36]. These results are generally consistent with the analysis of factors affecting
concentrations in Korea reported in the study by N. Kumar et al. [
9] despite differences in the target time period, indicating that the model successfully mimics the scientific process of CMAQ.
The second qualitative assessment examined the nonlinear response between
and precursor emissions. Since nonlinear behavior of
and its precursors,
and
, has been reported in previous studies based on simulations and actual measurements [
37,
38], it is necessary to confirm whether the emulator model we developed behaves in a similar nonlinear manner.
Figure 5 presents a comprehensive visualization of the nonlinear response of
concentrations to variations in
and
emission activity ratios over South Korea and five major cities, as in the previous evaluation. The contour maps show different patterns for different regions, highlighting the spatial heterogeneity in
formation dynamics. The local trend shows that
concentrations generally increase with higher emission ratios of both
and
, ranging from about 14 to 17
. Seoul shows the highest
levels (19-25
) with strong sensitivity to both precursors, while Daejeon and Gwangju show intermediate levels with a more pronounced response to
emissions. Ulsan shows a relatively uniform
distribution (13-16
), and Jeju Island shows significantly lower concentrations (7-9
) with minimal variation across emission scenarios.
These contour maps underscore the complexity of chemistry and the importance of considering both and emissions in air quality management plans. The observed non-linear responses and regional variations highlight the need for tailored emission control strategies in different areas of South Korea. This detailed analysis provides valuable insights for policy makers and environmental scientists, enabling more targeted and effective approaches to reduce concentrations and improve air quality in different urban and rural areas across the country. Importantly, these results are consistent with trends reported in previous studies, confirming the validity of our findings and reinforcing the established understanding of formation dynamics in relation to precursor emissions in different geographical contexts in South Korea.
3.1.3. Computational Performance
Table 3 presents a comparative analysis of the computational time required for CMAQ simulations and the Conditional U-Net model, highlighting the significant efficiency gains offered by the latter. The CMAQ v4.7 [
39] simulation running on one CPU requires approximately 24 hours to complete a single scenario. In stark contrast, the Conditional U-Net demonstrates significantly faster performance in both the training and prediction phases. The Conditional U-Net model was implemented using a custom-developed code based on Tensorflow v2.14 [
40], running on Python 3.10.13 within an Ubuntu 22.04 operating system environment, leveraging this powerful software stack to optimize performance and functionality.
During the training phase, the Conditional U-Net processes an epoch in about 10 seconds using a CPU, and this time is further reduced to about 1 second using a GPU, both with a batch size of 256. The most striking efficiency gain is observed in the prediction phase, where the Conditional U-Net can produce results for a single scenario in as little as 10 milliseconds on a CPU and an impressive 1 millisecond on a GPU, with a batch size of 32.
It’s important to note that while the Conditional U-Net offers significant time savings per scenario, it requires initial training on CMAQ simulation data. However, once trained, the model’s ability to generate predictions in milliseconds represents a transformative improvement in computational efficiency compared to the 24-hour runtime of CMAQ simulations. This dramatic reduction in processing time could significantly increase the capacity for rapid scenario analysis and policy evaluation in air quality management.
3.2. Health Impact Assessment
Health benefits were evaluated separately for cardiovascular diseases (CVD) and respiratory diseases (RD), with premature mortality estimated under two boundary conditions: 100% and 50%.
Figure 6 illustrates the projected premature mortality due to CVD and RD from 2019 to 2050. Assuming that the predicted
concentrations from the 2019 scenario remain constant over time, a continuous increase in premature mortality is observed.
Under the 100% boundary condition, the combined premature mortality for CVD and RD was estimated at 15,934 (95% CI: 10,460-21,855) in 2019, rising to 53,146 (95% CI: 34,216–73,392) by 2050. Under the 50% boundary condition, the combined premature mortality was 11,106 (95% CI: 7,241–15,418) in 2019, increasing to 37,043 (95% CI: 23,679-51,803) by 2050. This represents an approximate 330% increase in premature mortality by 2050 compared to 2019, regardless of the boundary condition.
The substantial rise in premature mortality is largely attributed to the projected growth in the population segments with higher mortality rates, particularly among those aged 65 and older. Specifically, the population in the 65-69 age group is expected to increase by 153%, and the 80+ age group by 434% between 2019 and 2050.
In
Figure 7, the projected differences in premature mortality from 2020 to 2050 are compared at five-year intervals, using the scenario with 100% pollutant emissions in 2019 as the baseline. Under scenarios where emissions are reduced to 10% of 2019 levels, premature mortality is projected to remain below the baseline under both the 100% and 50% boundary conditions until 2050. Excluding the 10% emission scenario, premature mortality under the 100% boundary condition surpasses the baseline across all other scenarios starting from 2030. Similarly, under the 50% boundary condition, the 25% emission scenario shows fewer premature deaths compared to the baseline until 2040, after which mortality begins to exceed the baseline. These findings indicate that reducing emission TO 10-25% is necessary to counteract the effects of population aging and maintain current premature mortality rates.
reduction strategies are typically developed with five-year targets.
Figure 8 outlines the required reduction targets for each region from 2025 to 2050, segmented into five-year intervals, to prevent an increase in premature mortality due to an aging population. To maintain baseline levels of premature mortality from 2019 to 2025, a reduction of 22-42%, averaging 31%, is necessary across 17 provinces. If the 2025 reduction target is met, an additional reduction of 11-35%, averaging 22%, will be required from 2025 to 2030. Similar reductions are needed for the subsequent periods: 24% from 2030 to 2035, 26% from 2035 to 2040, 19% from 2040 to 2045, and 11% from 2045 to 2050. These five-year reduction percentages are consistent under both the 100% and 50% boundary conditions.
4. Discussion and Conclusions
4.1. Development of a CMAQ Emulator Using a Conditional U-Net
In this study, we developed an emulation model that accurately and rapidly simulates the physicochemical processes of CMAQ using deep learning techniques with a conditional U-Net architecture. We also established and validated a series of pipelines that leverage this emulation model for health impact assessments and policy-making applications related to precursor emissions. While CMAQ is one of the most reliable and widely used air pollution simulation tools, it requires extensive background knowledge and suffers from limitations due to its long computational times. These limitations hinder the assessment of air pollution across multiple scenarios and the formulation of effective policies for mitigating respiratory and other health impacts associated with air pollution. The emulation methodology presented here addresses these challenges by linking CMAQ with health impact assessments, thereby facilitating simpler, faster, and accurate air quality evaluations. This advancement supports more efficient policy-making and enhances the ability to assess health impacts and develop strategies to improve public health outcomes.
For the emulation model that predicts the annual average
concentration in Korea for a given precursor emission activity, the
concentration calculated from the simulation of CMAQ using the emissions, climate, and weather conditions of 2019 as the baseline for this study was used as training data, and a deep learning model with a conditional U-Net structure specialized in calculating the annual average pollutant concentration described in the text was employed. The emulation model predicts the annual average
concentration with an MAE of 0.221
, NMAE of 1.762%, and
of 0.996 based on the test data. Also, as shown in
Table 3, for the trained model predicting a scenario, based on the machine we used, we were able to perform high-speed computations on the order of 10ms using CPU and 1ms using GPU, which is a significant improvement considering the time required to precalculate the data needed for training with CMAQ. These results demonstrate that the emulation model and methodology we developed significantly contribute to overcoming the limitations of CMAQ and are well suited for the applications presented in this study.
Additional contribution analyses performed using SHAP values to increase the reliability of the emulation model, as well as an analysis of the relationship between input and output values, also show that the methodology does a good job of simulating the physicochemical computational processes behind CMAQ and their associated properties, giving policy makers more confidence in using the methodology we have developed. It is important to note that the SHAP value analysis is qualitative rather than quantitative. The conditional U-Net model uniquely accommodates both domestic precursor emissions and transboundary emissions as inputs, enabling the evaluation of their combined effects on the model’s annual
concentration output.
Figure 4 illustrates the impacts of domestic and boundary emissions on annual
concentrations for both national and provincial averages. The evaluated domestic and boundary influences align well with findings from previous studies, validating that the emulation model’s computational processes are consistent with those observed in previous simulations and CMAQ results. Furthermore, the emulation model accurately reflects the nonlinear relationship between
and precursor emissions.
Figure 5 highlights the nonlinear interactions between
and
emissions and
concentrations, which corroborates the nonlinear behavior reported in previous research. This validation supports the use of the conditional U-Net as an effective emulation model.
4.2. Health Impact Assessment and Policy Implications
The health impact assessment results, made possible by CMAQ’s rapid simulation of annual concentrations, highlight the critical role of emission reductions in reducing premature mortality in South Korea. While premature deaths are expected to increase significantly from 2019 to 2050 due to an aging population, reducing emission TO 10-25% from current levels could counteract this increase. The study recommends an average reduction of 31% by 2025, followed by further reductions of 11% to 26% every five years until 2050. This innovative approach allows for the rapid and accurate evaluation of multiple emissions scenarios, providing policymakers with a powerful tool for real-time decision making. By integrating rapid CMAQ emulation with comprehensive health impact assessments, the methodology significantly advances environmental health research and policy planning. It also benefits society by facilitating more informed, timely, and effective air quality management strategies, potentially saving thousands of lives and improving the quality of life for millions. This multidisciplinary approach can be applied to air quality challenges around the world and represents a major step forward in the integration of environmental science, public health, and policy making.
4.3. Limitations and Future Research
The CMAQ emulator developed in this study successfully mimics the annual average concentrations in Korea, but it has several challenges and limitations. First, when evaluating concentrations and their impact on human health, it is important to consider not only annual, but also monthly and daily average concentrations, and while CMAQ is capable of simulating these time resolutions, our model is limited by its structural limitations in that it can only infer annual average concentrations. While CMAQ can simulate these time resolutions, our model is limited by its structural limitations in that it can only infer annual average concentrations. Also, because our model was designed to simply predict the annual average concentration of from given precursor emissions and boundary conditions, the meteorological and geographic information necessary to simulate actual concentrations is not used to predict concentrations. This is because the model predictions are not based on SHAP values. This means that although the causal relationship of the model’s prediction results from the SHAP values has been confirmed, the possibility that inferences are being made using a process that differs from the actual behavior of CMAQ cannot be ruled out.
Based on the above limitations, future research directions are as follows. First, the model could be improved to accommodate data with different temporal resolutions. Health impact assessments based on concentrations are generally conducted at different time units, and this improvement is essential to enable more diverse assessments in the future. Further analysis of the input-output variables of the model is also needed. In particular, as mentioned above, unlike the original simulation of CMAQ, which is the object to be emulated, our model has a structure that does not take into account any meteorological or geographic information. A deeper investigation of whether the emulation results from this structure accurately mimic the CMAQ process would be essential to increase the credibility of our methodology. In addition, having meteorological and geographic information as input means that we will be able to comprehensively explore different scenarios, time periods, and years, which will allow us to develop a more general surrogate model for CMAQ. Finally, since our model can in principle predict concentrations of air pollutants other than , such as and , we will confirm the performance of the emulation and various case studies for these substances in future studies.
4.4. Overall Summary and Conclusion
This study introduces a novel methodology that bridges the gap between complex air quality modeling and practical policy evaluation. We developed a high-speed surrogate model based on a conditional U-Net architecture to predict concentrations and integrated it with the BenMAP health impact assessment tool to evaluate health impacts efficiently. The surrogate model, trained using annual data from South Korea, demonstrated impressive performance with an MAE of 0.221 , NMAE of 1.762%, and an of 0.996, significantly reducing computational time compared to traditional CMAQ simulations. This model allows rapid predictions and assessments of health impacts under various emission scenarios.
Furthermore, this research established a method for predicting the number of premature deaths reflecting population aging levels, and developed a technique to calculate the additional emission reductions needed to offset the health impacts of increased premature mortality due to aging. However, the study’s scope is limited to , and future research should extend this approach to other pollutants such as , , and . Developing models to predict various metrics like daily averages and incorporating advanced dose-response models that consider age-specific sensitivities and threshold values will offer a more comprehensive evaluation of health impacts. Addressing these areas will enhance air quality management and public health protection, providing more robust tools for policymakers.
Author Contributions
YH.L. conceived the study, developed the methodology, performed formal analysis, wrote the original draft, and created the visualizations. JH.P. contributed to the methodology, conducted formal analysis, and reviewed and edited the manuscript. JS.K. developed the software, collected resources, and performed validation. JH.W. conducted the investigation, collected resources, and administered the project. JH.L. performed formal analysis, reviewed and edited the manuscript, and supervised the study.
Figure 2.
Schematic diagram of conditional U-Net architecture used in this study. Each parameters listed next to each block in the figure represent the block dimensions set in this study, and these are hyperparameters that can be changed as appropriate depending on the target region, other parameters to be entered, and the number of data used in the study.
Figure 2.
Schematic diagram of conditional U-Net architecture used in this study. Each parameters listed next to each block in the figure represent the block dimensions set in this study, and these are hyperparameters that can be changed as appropriate depending on the target region, other parameters to be entered, and the number of data used in the study.
Figure 3.
Scatter plots for emulation performance evaluation. left: plot for training data, right: plot for test data. The left plot is for the training data and the right plot is for the test data. Also, both the training and test data show the same performance, indicating that no overfitting has occurred.
Figure 3.
Scatter plots for emulation performance evaluation. left: plot for training data, right: plot for test data. The left plot is for the training data and the right plot is for the test data. Also, both the training and test data show the same performance, indicating that no overfitting has occurred.
Figure 4.
Results of the graphical representation of the contribution of input variables to the concentration of in two categories, national and international, using SHAP values. The results show that the impact of precursor emissions from outside the country is equal to or greater than the domestic impact, indicating that the simulation results and the actual situation on the Korean peninsula are properly mimicked. From top left to bottom right, the panels show the contribution of boundary and domestic emissions to annual concentrations in the whole country, Seoul, Daejeon, Gwangju, Ulsan, and Jeju Island, respectively.
Figure 4.
Results of the graphical representation of the contribution of input variables to the concentration of in two categories, national and international, using SHAP values. The results show that the impact of precursor emissions from outside the country is equal to or greater than the domestic impact, indicating that the simulation results and the actual situation on the Korean peninsula are properly mimicked. From top left to bottom right, the panels show the contribution of boundary and domestic emissions to annual concentrations in the whole country, Seoul, Daejeon, Gwangju, Ulsan, and Jeju Island, respectively.
Figure 5.
Contour map showing the non-linear response of to and emissions, showing, from top left, the response of concentrations for the entire country, Seoul, Daejeon, Gwangju, Ulsan, and Jeju Island.
Figure 5.
Contour map showing the non-linear response of to and emissions, showing, from top left, the response of concentrations for the entire country, Seoul, Daejeon, Gwangju, Ulsan, and Jeju Island.
Figure 6.
Annual premature mortality due to various diseases and total diseases (cardiovascular diseases + respiratory diseases) based on concentrations. Panel (a) illustrates the results under a boundary condition of 100%, while panel (b) shows the results under a boundary condition of 50%. Scenarios were applied where precursor emissions were reduced from 100% to 10%. Premature mortality for the entire population of South Korea was calculated annually from 2019 to 2050.
Figure 6.
Annual premature mortality due to various diseases and total diseases (cardiovascular diseases + respiratory diseases) based on concentrations. Panel (a) illustrates the results under a boundary condition of 100%, while panel (b) shows the results under a boundary condition of 50%. Scenarios were applied where precursor emissions were reduced from 100% to 10%. Premature mortality for the entire population of South Korea was calculated annually from 2019 to 2050.
Figure 7.
Differences in premature mortality relative to 2019 under scenarios of precursor emissions at 100%. Panel (a) shows results with a boundary condition of 100%, while Panel (b) shows results with a boundary condition of 50%. The scenarios applied involve reductions in precursor emissions from 100% to 10%. The differences in premature mortality for the entire population of South Korea from 2020 to 2050, calculated in 5-year intervals, are shown relative to premature mortality at 100% precursor emissions in 2019.
Figure 7.
Differences in premature mortality relative to 2019 under scenarios of precursor emissions at 100%. Panel (a) shows results with a boundary condition of 100%, while Panel (b) shows results with a boundary condition of 50%. The scenarios applied involve reductions in precursor emissions from 100% to 10%. The differences in premature mortality for the entire population of South Korea from 2020 to 2050, calculated in 5-year intervals, are shown relative to premature mortality at 100% precursor emissions in 2019.
Figure 8.
Reduction percentages of precursor emissions across 17 provinces over five-year intervals to sustain premature mortality rates in 2019. The x-axis represents the average reduction percentages across the 17 provinces. The y-axis denotes the percentage reduction in precursor emissions required from the start year to the end year, based on the emission levels at the start year. This assumes that the emission reduction targets for the previous year were achieved.
Figure 8.
Reduction percentages of precursor emissions across 17 provinces over five-year intervals to sustain premature mortality rates in 2019. The x-axis represents the average reduction percentages across the 17 provinces. The y-axis denotes the percentage reduction in precursor emissions required from the start year to the end year, based on the emission levels at the start year. This assumes that the emission reduction targets for the previous year were achieved.
Table 1.
Example of random control matrix scenario. Each scenario is sampled with Latin Hyper Sampling method. Each column indicates the precursor emission activity of each emission source and each row indicates the activity of each region of Korea.
Table 1.
Example of random control matrix scenario. Each scenario is sampled with Latin Hyper Sampling method. Each column indicates the precursor emission activity of each emission source and each row indicates the activity of each region of Korea.
Region name |
|
|
VOC |
|
|
Activity |
Seoul |
0.514 |
0.927 |
0.945 |
0.692 |
1.109 |
- |
Incheon |
0.611 |
1.087 |
0.949 |
0.546 |
0.778 |
- |
Busan |
0.504 |
1.364 |
1.046 |
1.192 |
1.249 |
- |
Daegu |
1.274 |
0.951 |
0.708 |
1.247 |
0.786 |
- |
Gwangju |
0.872 |
1.069 |
0.621 |
0.840 |
1.497 |
- |
Gyeonggi-do |
0.574 |
1.379 |
1.436 |
0.842 |
1.177 |
- |
Gangwon-do |
1.479 |
1.167 |
0.540 |
1.098 |
1.173 |
- |
Chungbuk-do |
1.134 |
0.710 |
0.725 |
1.410 |
0.503 |
- |
Chungnam-do |
0.520 |
0.562 |
0.812 |
1.021 |
0.994 |
- |
Gyeongbuk-do |
1.063 |
1.073 |
1.192 |
1.343 |
1.045 |
- |
Gyeongnam-do |
0.581 |
1.337 |
1.057 |
0.811 |
0.671 |
- |
Jeonbuk-do |
0.702 |
1.286 |
0.580 |
1.150 |
1.063 |
- |
Jeonnam-do |
1.311 |
1.187 |
1.001 |
1.281 |
0.805 |
- |
Jeju-do |
1.034 |
1.224 |
1.395 |
0.520 |
0.752 |
- |
Daejeon |
1.173 |
0.718 |
1.386 |
0.762 |
0.864 |
- |
Ulsan |
1.197 |
0.919 |
0.528 |
1.284 |
0.534 |
- |
Sejong |
0.996 |
1.394 |
0.549 |
0.787 |
1.343 |
- |
Boundary |
- |
- |
- |
- |
- |
1.000 |
Table 2.
Quantitative comparison of the performance of Conditional U-Net emulation results. MAE, NMAE and metrics were evaluated here. The results indicate that the Conditional U-Net model emulates CMAQ with very high performance, and the comparison of the results for the training and test data shows that the model also learns successfully without overfitting problems.
Table 2.
Quantitative comparison of the performance of Conditional U-Net emulation results. MAE, NMAE and metrics were evaluated here. The results indicate that the Conditional U-Net model emulates CMAQ with very high performance, and the comparison of the results for the training and test data shows that the model also learns successfully without overfitting problems.
Target year |
Target Region |
Metrics [Units] |
Data category |
Score |
2019 |
South Korea |
MAE [] |
Training set |
0.222 |
|
|
|
Test set |
0.221 |
|
|
NMAE [%] |
Training set |
1.788 |
|
|
|
Test set |
1.762 |
|
|
[-] |
Training set |
0.996 |
|
|
|
Test set |
0.996 |
Table 3.
Time required to compute each method. Compared to the time required to simulate each scenario in CMAQ, the time saved by the conditional U-Net method is significantly shorter, regardless of the processor type. It should be noted, however, that the CMAQ simulation data must be computed in advance in order to train the conditional U-Net model.
Table 3.
Time required to compute each method. Compared to the time required to simulate each scenario in CMAQ, the time saved by the conditional U-Net method is significantly shorter, regardless of the processor type. It should be noted, however, that the CMAQ simulation data must be computed in advance in order to train the conditional U-Net model.
Method |
Processor |
Batch size |
Time consumed |
CMAQ v4.7 |
CPU (simulation) |
- |
∼ 24h / scenario |
Conditional U-Net |
CPU (training) |
256 |
∼ 10s / epoch |
|
GPU (training) |
256 |
∼ 1s / epoch |
|
CPU (prediction) |
32 |
∼ 10ms / scenario |
|
GPU (prediction) |
32 |
∼ 1ms / scenario |