Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Leveraging Climate Data for Dengue Forecasting in Ba Ria Vung Tau Province, Vietnam: An Advanced Machine Learning Approach

Version 1 : Received: 18 September 2024 / Approved: 19 September 2024 / Online: 19 September 2024 (13:18:24 CEST)
Version 2 : Received: 23 September 2024 / Approved: 23 September 2024 / Online: 23 September 2024 (14:31:09 CEST)
Version 3 : Received: 23 September 2024 / Approved: 23 September 2024 / Online: 23 September 2024 (14:39:40 CEST)

How to cite: Anh Tuan, D.; Dang, T. N. Leveraging Climate Data for Dengue Forecasting in Ba Ria Vung Tau Province, Vietnam: An Advanced Machine Learning Approach. Preprints 2024, 2024091535. https://doi.org/10.20944/preprints202409.1535.v1 Anh Tuan, D.; Dang, T. N. Leveraging Climate Data for Dengue Forecasting in Ba Ria Vung Tau Province, Vietnam: An Advanced Machine Learning Approach. Preprints 2024, 2024091535. https://doi.org/10.20944/preprints202409.1535.v1

Abstract

Dengue fever is a persistent public health issue in tropical regions, including Vietnam, where climate variability plays a crucial role in disease transmission dynamics. This study focuses on developing climate-based machine learning models to forecast dengue outbreaks in Ba Ria Vung Tau (BRVT) province, Vietnam, using meteorological data from 2003 to 2022. We utilized four predictive models—Negative Binomial Regression (NBR), Seasonal AutoRegressive Integrated Moving Average with Exogenous Regressors (SARIMAX), Extreme Gradient Boosting (XGBoost), and Long Short-Term Memory (LSTM)—to predict weekly dengue incidence. Key climate variables, including temperature, humidity, precipitation, and wind speed, were integrated into these models, with lagged variables included to capture delayed climatic effects on dengue transmission. The NBR model demonstrated the best performance in terms of predictive accuracy, achieving the lowest Mean Absolute Error (MAE) compared to other models. The inclusion of lagged climate variables significantly enhanced the model’s ability to predict dengue cases. SARIMAX and LSTM models, although effective in capturing seasonal trends, struggled with overfitting and failed to accurately predict short-term outbreaks. XGBoost exhibited moderate predictive power but was sensitive to overfitting, particularly without fine-tuning. Our findings confirm that climate-based machine learning models, particularly the NBR model, offer valuable tools for forecasting dengue outbreaks in BRVT. However, improving the models’ ability to predict short-term peaks remains a challenge. The integration of meteorological data into early warning systems is crucial for public health authorities to plan timely and effective interventions. This research contributes to the growing body of literature on climate-based disease forecasting and underscores the need for further model refinement to address the complexities of dengue transmission in highly endemic regions.

Keywords

Dengue fever; Machine learning; Climate forecasting; Negative Binomial Regression; SARIMAX; XGBoost; LSTM; Ba Ria Vung Tau; Vietnam

Subject

Public Health and Healthcare, Health Policy and Services

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.