Preprint Article Version 1 This version is not peer-reviewed

Predicting Pregnancy Risk Levels Using Ensemble Machine Learning Techniques and Oversampling Methods

Version 1 : Received: 2 November 2024 / Approved: 5 November 2024 / Online: 6 November 2024 (07:43:29 CET)

How to cite: Uraimzhan Kyzy, A.; Mekuria, R. R. Predicting Pregnancy Risk Levels Using Ensemble Machine Learning Techniques and Oversampling Methods. Preprints 2024, 2024110371. https://doi.org/10.20944/preprints202411.0371.v1 Uraimzhan Kyzy, A.; Mekuria, R. R. Predicting Pregnancy Risk Levels Using Ensemble Machine Learning Techniques and Oversampling Methods. Preprints 2024, 2024110371. https://doi.org/10.20944/preprints202411.0371.v1

Abstract

Accurate prediction of pregnancy risk levels is essential for preventing maternal and fetal complications. This study explores the application of ensemble machine learning models combined with oversampling techniques to predict pregnancy risk levels. We utilized a publicly available dataset from the UCI Machine Learning Repository, performing extensive feature engineering, including the introduction of new features like Pulse Pressure and Mean Arterial Pressure. To address class imbalance, we employed the Adaptive Synthetic Sampling (ADASYN) method. We conducted comprehensive hyperparameter tuning to enhance model performance and achieve optimal predictive results. Additional evaluation metrics, including sensitivity, specificity, and precision-recall curves, were used to assess model per- formance comprehensively. Our findings demonstrate that the Voting Classifier, particularly when combined with ADASYN oversampling and optimized hyperpa- rameters, achieves an accuracy of 87.19% and a macro F1 score of 87.66%, effec- tively distinguishing between ’low risk’ and ’mid risk’ pregnancy cases. This work contributes to the field by enhancing prediction accuracy, providing insights into important features influencing pregnancy risk, and addressing ethical considerations in deploying machine learning models in healthcare.

Keywords

Pregnancy Risk Prediction; Machine Learning; ADASYN; Feature Engineering; Ensemble Methods; Ethical Considerations

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.