Daily thousands of people and goods move along Brazilian Federal highways. Traffic accidents are numerous on these highways and have a significant impact, whether on the economy or the health system. Identifying predictor variables, the probability of an event occurring and how to mitigate them are of paramount importance for the actions of the transit authorities that manage these roads. The main contribution of this study is the development of a predictive machine learning model which uses open data to shows graphically the critical points in the highways. This model is fully reproducible and can be applied to any region worldwide helping to minimize the number of accidents and to prevent deaths by automotive collisions. For this study, 43 variables were analyzed supporting the identification of the causes of accidents with fatal victims on the main highways in the south of Brazil. RoadLytics is proposed as a supervised machine learning model, using the Random Forest algorithm to analyze about 33 thousand occurrences between 2017 and 2020. An exploratory analysis of the data was carried out to support the modeling and to facilitate data visualization. In this sense, heat maps were developed to support the analysis and identification of potential risk areas. The results show that BR386 highway registers the highest number of fatal occurrences, regardless of the season. Additionally, concerning the weather conditions, the analysis shows that 52% of accidents occurred in favorable conditions, such as clear skies, victimizing 501 people. The driver’s lack of attention is the main reason for the accidents’ occurrences. Applying the developed model, an accuracy of 77% was achieved for the classification of fatal accidents.
Keywords:
Subject: Computer Science and Mathematics - Information Systems
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.