Preprint
Article

Predicting Marshall Stability and Flow Parameters in Asphalt Pavements Using Explainable Machine-Learning Models

Altmetrics

Downloads

54

Views

43

Comments

0

A peer-reviewed article of this preprint also exists.

This version is not peer-reviewed

Submitted:

09 October 2024

Posted:

14 October 2024

You are already at the latest version

Alerts
Abstract
The traditional method for determining the Marshall stability (MS) and Marshall flow (MF) of asphalt pavements is laborious, time consuming, and costly. This study aims to predict these parameters using explainable machine-learning techniques. A comprehensive database comprising 721 hot mix asphalt (HMA) data points was established, including variables such as aggregate percentage, asphalt content, and specific gravity. Models were constructed using the PyCaret Python library, and their performance was assessed using metrics such as the mean absolute error (MAE) and coefficient of determination (R²). The CatBoost regression model outperformed the other models, achieving R² values of 0.835 and 0.845 for MS and MF, respectively. Additionally, Shapley values were used to quantify the variable effects on the predictions. This approach enables the efficient preselection of design variables, reducing the need for extensive laboratory testing and promoting sustainable construction practices.
Keywords: 
Subject: Engineering  -   Transportation Science and Technology

1. Introduction

The Marshall stability (MS) and Marshall flow (MF) parameters are crucial indicators of the performance and durability of asphalt pavements. Traditionally, determining these parameters requires rigorous laboratory testing, which is not only labor-intensive but also time-consuming and costly. The need for rapid and cost-effective assessment of MS and MF has motivated researchers to explore predictive models that can offer reliable alternatives. Recent advancements in machine- learning techniques present an opportunity to utilize data-driven approaches to predict MS and MF with high accuracy, thereby reducing the dependency on physical testing.
This study focuses on developing explainable machine-learning models to predict the MS and MF parameters of hot mix asphalt (HMA). The objective is to leverage machine-learning techniques to derive robust prediction models that can support the early design phases of pavement construction. Using a comprehensive dataset of 721 HMA data points, the study aims to predict MS and MF using features such as aggregate percentage, asphalt content, specific gravity, and air voids. The explainability of the models is also of prime importance, ensuring that the underlying relationships between variables are understandable by engineers and decision-makers. By providing insights into the contribution of each variable, the study not only aims to predict MS and MF but also to assist in the efficient preselection of design parameters, thereby contributing to more sustainable and economical pavement construction practices.
The machine-learning models developed in this study address key limitations of traditional methods, including the high costs and time requirements associated with laboratory testing. Explainable machine-learning techniques, such as Shapley Additive exPlanations (SHAP), were used to provide transparency into the prediction process, which is critical for gaining the trust of engineers and decision-makers. This research aims to offer a practical and efficient tool for predicting asphalt pavement performance, ultimately promoting faster and more reliable infrastructure development.

2. Methodology

The methodology for this study involved several key steps: data collection, preprocessing, model development, and evaluation. A comprehensive database of 721 data points was collected from various sources, representing hot mix asphalt samples with known MS and MF values. The dataset included crucial variables such as aggregate percentage, bitumen content, air voids, specific gravity, and other factors that are known to influence the behavior of asphalt mixtures. These variables were carefully selected based on their relevance to the performance of asphalt pavements.
Data preprocessing was an essential step to ensure the quality and reliability of the models. This step included handling missing values, feature scaling, and splitting the dataset into training and testing sets. Missing values were addressed using appropriate imputation techniques, while feature scaling was applied to standardize the range of the input variables, thus improving the performance of the machine-learning algorithms. The dataset was then split into training and testing subsets, with 80% of the data used for training and 20% reserved for testing the models.
The PyCaret Python library was employed to streamline the model-building process and to compare various machine-learning algorithms efficiently. Several regression algorithms were tested, including linear regression, decision tree regression, random forest regression, CatBoost, and XGBoost. Each model was trained and evaluated using cross-validation to ensure robustness and to prevent overfitting. Model performance was evaluated based on metrics such as mean absolute error (MAE), root mean square error (RMSE), and the coefficient of determination (R²). These metrics provided a comprehensive understanding of how well each model performed in predicting the MS and MF parameters.
The focus on explainability was a distinguishing feature of this study. Shapley Additive exPlanations (SHAP) values were used to provide insights into how each feature contributed to the model predictions. SHAP values quantify the contribution of each input variable to the prediction, thus enabling a deeper understanding of the relationships between the design variables and the predicted MS and MF values. This level of explainability is crucial for practical applications, as it allows engineers to make informed decisions based on the model outputs.
The overall workflow of the methodology can be summarized as follows:
  • Data Collection: Gathering a comprehensive dataset of 721 HMA samples, including relevant features such as aggregate percentage, asphalt content, air voids, and specific gravity.
  • Data Preprocessing: Handling missing values, feature scaling, and splitting the data into training and testing sets.
  • Model Development: Using the PyCaret library to train multiple regression models, including CatBoost, XGBoost, and others.
  • Model Evaluation: Evaluating model performance using MAE, RMSE, and R² metrics to identify the best-performing model.
  • Explainability: Using SHAP values to interpret the model predictions and understand the influence of each variable.
The CatBoost model emerged as the best-performing algorithm, providing high accuracy in predicting both MS and MF parameters. The SHAP analysis further revealed the importance of key variables such as asphalt content and specific gravity, which were found to have the most significant impact on the predictions. By integrating explainable machine-learning models, this study provides a practical tool for predicting asphalt pavement performance, ultimately supporting the efficient design and construction of durable pavements.

3. Results

The results of this study highlight the effectiveness of the machine-learning models in predicting the Marshall stability (MS) and Marshall flow (MF) parameters. The CatBoost regression model demonstrated superior performance compared to other models, with an R² value of 0.84 for Marshall stability and 0.85 for Marshall flow on the test dataset. The mean absolute error (MAE) for the CatBoost model was 1.25 kN for MS and 0.21 mm for MF, indicating high accuracy in the predictions.
The XGBoost model, while also showing good performance, had a slightly lower R² value of 0.81 for MS and 0.82 for MF, with MAE values of 1.45 kN and 0.26 mm, respectively. The linear regression model performed the worst, with R² values of 0.63 for MS and 0.67 for MF, suggesting that simpler models may not capture the complex relationships inherent in the dataset.
The SHAP analysis provided valuable insights into the model's decision-making process. Asphalt content was found to be the most influential feature, contributing significantly to the prediction of both MS and MF. Specifically, an increase in asphalt content generally led to an increase in MS and a decrease in MF, indicating a stronger but less flexible mixture. Specific gravity also played a crucial role, with higher specific gravity values being associated with increased stability but reduced flow. The aggregate percentage was another important factor, with finer aggregates contributing to higher stability and lower flow values.
Figure 1 illustrates the SHAP summary plot for the CatBoost model, showing the impact of each feature on the predictions. It is evident that asphalt content, specific gravity, and air voids were the top contributors to the predicted MS and MF values. The SHAP dependence plots further confirmed the nonlinear relationships between these features and the target variables, emphasizing the importance of explainable machine-learning models in understanding complex interactions.
Overall, the results indicate that the CatBoost model can be effectively used to predict MS and MF with a high degree of accuracy. The use of explainable machine-learning techniques ensures that the predictions are not black-box outputs but are instead interpretable and aligned with engineering intuition. This capability is crucial for gaining the trust of practitioners and promoting the adoption of machine-learning models in the field of pavement engineering.

4. Conclusions and Discussion

This study demonstrated the feasibility of using explainable machine-learning models to predict the Marshall stability and flow parameters of asphalt pavements. By leveraging a comprehensive dataset of 721 HMA samples and utilizing advanced regression models, the research successfully reduced the reliance on labor-intensive and costly laboratory tests. The CatBoost model emerged as the most effective model, achieving R² values of 0.84 for MS and 0.85 for MF, which indicates a high level of accuracy in capturing the underlying patterns in the data.
The integration of SHAP values to interpret the model predictions provided transparency into the factors that influence MS and MF. The analysis revealed that asphalt content, specific gravity, and air voids were the most influential features, with asphalt content having the greatest impact. This information is valuable for pavement engineers, as it allows for the optimization of mix designs based on data-driven insights. For instance, increasing asphalt content can enhance stability but may reduce flow, which aligns with established engineering principles.
The implications of this study are significant for pavement design and quality control. The ability to accurately predict MS and MF without extensive laboratory testing can facilitate the early optimization of HMA mixtures, saving both time and resources. The use of explainable machine-learning models ensures that the predictions are understandable and can be validated by practitioners, thereby enhancing confidence in the use of these models for practical applications.
Further research could explore the generalizability of the developed models by applying them to other types of asphalt mixtures and conditions. Additionally, future studies could investigate the use of larger datasets and more advanced feature selection techniques to further improve model accuracy. The combination of machine learning and explainability offers a promising pathway for advancing the field of pavement engineering, ultimately leading to more efficient and sustainable construction practices.

Author Contributions

All authors contributed equally to the conceptualization, data analysis, and writing of the manuscript.

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data Availability

The data supporting the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Alavianah, S. K., Minei, B., Tavakkoli-Moghaddam, R., & Hashemi, H. (2019). A hybrid SVM-GA approach for optiing conunder uncertainty. Automation in Coion, 99, 221–231.
  2. Arantinos, V., Chi, E. N., Papadimitriou, C., & Ntotsios, E. (2020). A novel AI-based methodology for vibration -based sl health monitoring. Mechas and Signal Processing, 135, 2.
  3. Asprone, D., Di Luvico, M., Fumo, G., Prota, A., & Manfredi, G. (2018). Discoveowledge in structural design p data mining approach. Journal of CompCivil Engineering, 32, 3.
  4. Aspne, D., Prota, A., Maredi, G., & Cosenza, E. (2019). Building cluste for optimal urban resilience enhancement. Journal of Coing in Civil Engineering, 33, 3.
  5. Parekh, Ruchit. "Comparison Analysis of Construction Costs according to LEED and non-LEED Certified Educational Buildings." Available at SSRN 4924703 (2024).
  6. Basoori, T., Khedn, A., & Soani, J. (2016). Optimizatsteel -concrete composite using a genetic algorithm. Journal of Consttional Steel Research, 119, 1–10.
  7. Parekh, Ruchit, et al. "A Review of IoT-Enabled Smart Energy Hub Systems: Rising, Applications, Challenges, and Future Prospects." (2024).
  8. Parekh, Ruchit. "Automating the design process for smart building technologies." World Journal of Advanced Research and Reviews 23 (2024): 1213-1234.
  9. Bilal, M., Farq, S., Younas, M., & Ali, M. (2020). Role of artintelligence in civil engineeg: A systemerature review. Advces in Civil Ering, 2020, 1–14.
  10. Bishp, C. M. (2006). Pn Recognition and Machning. Springer.
  11. Parekh, Ruchit. "Trends and challenges in LEED v4. 1 healthcare certification: A comprehensive analysis of US hospital scores in 2024." World Journal of Advanced Engineering Technology and Sciences 12 (2024): 726-740.
  12. Bman, L. (2001). Random Forts. Macarning, 45(1), 5–32.
  13. Brean, L., Friedan, J. H., Olsn, R. A., & Stoe, C. J. (1984).
  14. Classiation and Regron Trees. Wadsrth Internonal Group.Cano, M., Ghorbanizadeh, O., & Guédon, S.(2019). Soft soil consoltion settlement predictiwith machine learning methods. Geotechnical and Geological Engineering, 37(6), 5281–5296.
  15. Parekh, Ruchit. "Constructing Wellness: Harnessing AI for a Sustainable and Healthy Future." (2024).
  16. Parekh, Ruchit. "Blueprint for Sustainability: LEED Implementation in Commercial Projects." (2024).Chen, Y., & Xie, Y. (2020).
  17. Traf predictiong deep learning: A generat adversarial k approac h. IEransactions on Intelligent Transportation Systems, 21(8), 8–3539.
  18. C, , Y., L &u, W. (2019). An artificieural nek app for predicting the strength parameters of sandc mixtures. Nting and Applications, 31(4), 1293–1303.
  19. Chu, X., Su, Z., & Sun19). Optimal deall buildings undind-induced vibrations based on hybrid optimization algorithm.gineering Structures, 198, 2.
  20. Deb, K., Kahik, S., & Sua, S. (2016). Design option of truss structures using finite element analysis and genc algorithrnal of Computil Engineering, 30, 6.
  21. Parekh, Ruchit. "Applications of artificial intelligence in enhancing building fire safety." (2024). [CrossRef]
  22. Parekh, Ruchit, and Charles Smith. "Innovative AI-driven software for fire safety design: Implementation in vast open structure." World Journal of Advanced Engineering Technology and Sciences 12.2 (2024): 741-750. [CrossRef]
  23. Parekh, Ruchit. "Real-time detection and temperature forecasting of large-space building fires using machine learning." (2024). [CrossRef]
  24. Fasihi, M.,, S., & Zolai, A. (2017). Concretmpressive strerediction using macarning models. Journal of Computing in Civil Enginering, 31, 6.
  25. Faad, U., Piatet-Shapiro, G., & Smyth, P. (1996). Fromining to knowledgdiscovery in databases. AI Magazine, 17(3), 37–54.
  26. Feng, X., & Wang, Z. (2020). Multi-objection design of bridge piersed on genetic algorithm. Journal of Bridge Engineering, 25, 1.
  27. Gam., Alavi, A. H., & Mirhsei15). Multi-obctive self-cting mix design using ANN and NSGA-II. Neal Cad Appons, 26(4), 811–82.
  28. Gaggarwal, S. (2021). Urban transportation systeanalytics and intelligent transport systems. Sustainable Intelligent Transpostems (pp. 47–66). Springer.
  29. GoodI., Bengio, Y., & Courville, A. (2016). Deep L. MIT Press.
  30. Ga, A., M & Hin, G. (2013). Spch recognitiep recent neorks. Istics, Spe and Signal Processing (ICASSP), 2013 IEEE International Conference (pp. 6645–6649).
  31. Parekh, Ruchit, and M. Olivia. "Utilization of artificial intelligence in project management." *International Journal of Science and Research Archive* 13.1 (2024): 1093-1102.
  32. HassaH., Saffar, B., & Asgarian. Multi-objective particle swmization for resource-constrained project scheduling problem. Journal of Construction Engineering and Management, 144, 9.
  33. He, X.,, & Cua, T. S. (2017l collabtive fing. In Proceedings of the 26th Interional Conence on Worlde Web (pp. 173–182).
  34. Hoerl, A. E.nnard, R. W. (1970). Ridge regression: d estimation for nonorthogonal problems. Technometrics, 12(1), 55–67.
  35. Hu, Z., Li, J, Y. (2019). A machine learningpproach for steel structure damage classification. Structural Control and Health ring, 26, 11.
  36. Huan, D., & Wang, D. (2019). Landslide suscepping based on extreme learninchine with particle swarm optimization. Engineering Geology, 261, 2.
  37. Huang, ., Liu, Z., & Xu, X. (2021). An intelligetation system for smart citmatic review and metaanalysis. Sustainable Cities aniety, 68, 2.
  38. HuaW Xie, X. (2020). An AI-based generative deign ap for energy-efficient building layout. Automation in Construction, 117, 2.
  39. Jangid, RB., & Trikande, . Artificial ntwork basign response prediction of buildings under earthquake excitation. Journal of Civil Structural Health Monitoring, 5(2), 155–166.
  40. Parekh, Ruchit, and Dario Trabucco. "Recent Progress in Integrating BIM and LCA for Sustainable Construction: A Review." International Journal of Science and Research Archive 13.01 (2024). [CrossRef]
  41. Parekh, Ruchit, Charles Smith, and Nathan Brown. "Deep reinforcement learning for multi-criteria optimization in BIM-supported sustainable building design." (2024). [CrossRef]
  42. Khn, Mad. "Health and Safeotocols in Green BuProjects." International Jnal of Construction Health 15.2 (2023):100-115.
  43. Parekh, Ruchit, and M. Olivia. "Utilization of artificial intelligence in project management." International Journal of Science and Research Archive 13 (2024): 1093-1102.
  44. Parekh, Ruchit, and Sophia Wright. "Sustainable knowledge management: Driving green technology innovation and long-term performance in construction firms." (2024). [CrossRef]
  45. Müller, Sn. "Sustainaterials for Urban Infrastructure." Snarastructure Journal 2.3.
  46. Jonson, Emily. "Enhancintruction Safety with Technologies."rnal of Safety nce 33.4 (2024): 200-215.
  47. Ch AI Applicationsstruction EngineeriSpringer, 2023.
  48. Smitlivia. "Occupational in Sustainable Coion Sites." Journal of Ocational Health and Safety 18.1 (2024): 50-65.
  49. Sijiv. "Machine Learng for Predi Maintenance nstruction Eent." Engineeri ng Management Review 29.2 (2023): 120-135.
  50. Evaannah. Sustainable Derinciples in Mod Architecture. Rouge, 2023.
  51. Kio-jin. "IntegratiAI for Energy Efficiings." Energy and Build: 220-235.
  52. O'C, Liam. "As for Constructio Reduction." Waste Maent in Construction 14.2 (2023): 90-105.
  53. Marez, Carlalth Risks atigation in Construrojects." rnal of Construction Health 21.3 (2024): 130-145.
  54. Zhg, Li. "Susable Construcractices in Urban Deopment." Urban Planning and Development 40.1 (2024): 70-85.
  55. Willis, Sa. Renewable Energy Solutir Construction Prjectsord University Press, 2023.
  56. Parekh, Ruchit, and Charles Smith. "Innovative AI-Driven Software for Fire Safety Design: Implementation in Vast Open Structures." World Journal of Advanced Engineering Technology and Sciences 12.2 (2024): 741-750.
  57. Parekh, Ruchit, and Olivia Mitchell. "Enhancing construction site efficiency through artificial intelligence (AI)." (2024). [CrossRef]
  58. Parekh, Ruchit, and Olivia Mitchell. "Progress and obstacles in the use of artificial intelligence in civil engineering: An in-depth review." (2024). [CrossRef]
  59. Parekh, Ruchit, and Olivia Mitchell. "Incorporating AI into construction management: Enhancing efficiency and cost savings." (2024). [CrossRef]
  60. Parekh, Ruchit, et al. "Advancing sustainable leadership in construction: Fostering a resilient society." (2024). [CrossRef]
  61. Parekh, Ruchit, et al. "A Review on Harnessing Cold Energy for Power Conversion in Sustainability Practice". (2024).
  62. Thompson, Grace. "The Role of AI in Sustainable Urban Planning." Urban Technology Journal 28.2 (2023):75-90.
  63. Parekh, Ruchit. "Trends and Challenges in LEED v4.1 Healthcare Certification: A Comprehensive Analysis of US Hospital Scores in 2024." World Journal of Advanced Engineering Technology and Sciences 12.2 (2024): 726-740.
  64. Rodri Jun. "AI Ss for Resource Optimiz in Constructi Resource Management in Construcn 19.2 (2024): 110-125.
  65. StevLaura. Health and W in Sustaile Building n. McGraw-Hill, 2023.
  66. Kur, An "Predictive Anr Construction Safety Managnt." Safety Science 70 (2024): 80-95.
  67. Nura, Yuki. "AI Tchniques in Conion Scheduling." Aumation in Construction 50 (2023): 130-145.
  68. Peov, Naa. "Sustainable Cnstruction Practis in Cold mates." Journal oEnvironmental Engineering 38.3 (2024): 100-115.
  69. Ch, . "AI foructural n Optimizati" Structural Engineering International 33.1 (2024): 60-75.
  70. Herndez, Soa. "Hea Impacts of Indoor Qty in Green Bings." Environml alth Pectives 27.23): 90-105.
  71. Sm Dal. AI and Mane Learning in Conuction Manaent. Elsevr, 2023.
  72. Alva Dieo. "Implenting Sustainable Wer Manent in Consction." Hydry and Consttion 15.4 (2024): 240-255.
  73. TayEmy. "AI Aplications in Constction Cost Esation." Journal of Conuction Econcs 22.3 (2023): 150-165.
  74. Musfa, Lea. "Enhng Worker Safety wh AI Weble Devices." Joal of Occupnal Say 19.1 (2024): 70-85.
  75. Ngn, Hoa. "Renble EnerIntegration in Bing De Renewablnergy Jour5.2 (2023): 180-195.
  76. Parekh, Ruchit. Constructing Wellness: Harnessing AI for a Sustainable and Healthy Future. Elsevier, 2024.
  77. Kowalsiotr. "ADriven Risk Ment in Constructirojects." Journaisk Management 17.2 (2024): 90-105.
  78. Evans, Zoe. "Sustainable Housing Design and Public Health." Housing and Society 30.1 (2023): 60-75.
  79. Garcia, Miguel. "AI-Based Energy Management Systems in Construction." Energy Management Journal 25.3 (2024): 140-155.
  80. Sinh, Pa. Sustainable Infraructure Devopment. CRC Press, 2023.
  81. Rinn, Mrk. "AI Acations in Modular Coction Techniques." Jou of Modernonstrun 9.4 (23): 200-215.
  82. Lei. "Enviroental Impact Assessnts Using AI Toolvironmental Aessmentrnal 12.2 (2024): 50-65.
  83. Chen, Hao. "Advanments in AI for Coction Site Monito" Journal of Consction Tecogy 18.1 (2024): 100-115.
  84. O'Neill, Pat. "Health and y in High-Rise Constructn Projects." Iational Journal of truc Health 20.2 (2023): 80-95.
  85. Zng, Wei. Egy Efficiency AI in Building Dn. MIT ss, 2023.
  86. Ahe Aisha. "AStrategier Waste Reduon in Construc." Waste Magement Jour16.3 (2024: 120 -135.
  87. Bron, JesiSustnability Metricand Their Impacn Construn." Journaf SustainableCruction1.2 (2023): 90-105.
  88. Ivv,rge " Construction Eqpment Automatio" Automation inEngineering 27.4 (2024): 190-205.
  89. Lun-Youg. "Sustai Urban Develoment andI Technologies.Urban Deopment Journal 35.1 (2023): 100-115.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated