Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Proposing Machine Learning Models Suitable for Predicting Open Data Utilization

Version 1 : Received: 19 June 2024 / Approved: 19 June 2024 / Online: 19 June 2024 (07:36:26 CEST)

A peer-reviewed article of this Preprint also exists.

Jeong, J.; Cho, K. Proposing Machine Learning Models Suitable for Predicting Open Data Utilization. Sustainability 2024, 16, 5880. Jeong, J.; Cho, K. Proposing Machine Learning Models Suitable for Predicting Open Data Utilization. Sustainability 2024, 16, 5880.

Abstract

As the digital transformation accelerates in our society, open data is being increasingly recognized as a key resource for digital innovation in the public sector. This study explores the following two research questions: 1) Can a machine learning approach be appropriately used for measuring and evaluating open data utilization? 2) Should different machine learning models be applied for measuring open data utilization depending on open data attributes (field and usage type)? This study used single-model (Random Forest, XGBoost, LightGBM, CatBoost) and multi-model (Stacking Ensemble) machine learning methods. A key finding is that the best-performing models differed depending on open data attributes (field and type of use). The applicability of the machine learning approach for measuring and evaluating open data utilization in advance was also confirmed. This study contributes to open data utilization and to the application of its intrinsic value to society.

Keywords

open data; open government data; open data utilization

Subject

Business, Economics and Management, Business and Management

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.