(Publisher of Peer Reviewed Open Access Journals)

International Journal of Advanced Technology and Engineering Exploration (IJATEE)

ISSN (Print):2394-5443    ISSN (Online):2394-7454
Volume-10 Issue-108 November-2023
Full-Text PDF
Paper Title : Analysis of the severity of transport vehicle accidents by a comparative study of machine learning models
Author Name : Mensouri Houssam, Azmani Abdellah and Azmani Monir
Abstract :

Traffic accidents pose a significant global threat to public safety, with the World Health Organization (WHO) estimating that they claim the lives of approximately 1.25 million individuals each year. Without intervention, traffic accidents are projected to become the leading cause of death by 2030. Predicting accident severity and understanding their underlying causes represent crucial steps in developing effective strategies to prevent accidents and enhance overall traffic safety. This paper presented an in-depth analysis of accident severity prediction, considering a wide range of factors, including the vehicle, driver, environmental conditions, and more. The study aims to predict the extent of the severity of traffic accidents using a comprehensive dataset comprising over 4 million incidents that occurred across 49 states in the United States of America (USA) between February 2016 and December 2020. Various machine learning models, including logistic regression (LR), support vector machine (SVM), decision tree (DT), and random forest (RF), were implemented and rigorously evaluated against multiple performance metrics. The achieved results reveal that the RF model stands out with the highest accuracy of 91% in predicting accident severity. Additionally, this model demonstrates excellent performance across additional evaluation metrics, including a precision rate of 89%, a recall rate of 91%, a root mean square error (RMSE) of 18%, and an F1 score of 89%. These findings emphasize the exceptional predictive power and robustness of the RF model, making it a highly promising approach for real-world traffic accident scenarios. This research provides valuable insights into predicting accident severity, which is crucial for the development of effective accident prevention strategies and improvements in traffic safety.

Keywords : Accident, Severity prediction, Machine learning, RMSE.
Cite this article : Houssam M, Abdellah A, Monir A. Analysis of the severity of transport vehicle accidents by a comparative study of machine learning models. International Journal of Advanced Technology and Engineering Exploration. 2023; 10(108):1431-1448. DOI:10.19101/IJATEE.2023.10101879.
References :
[1]Moosavi S, Samavatian MH, Parthasarathy S, Teodorescu R, Ramnath R. Accident risk prediction based on heterogeneous sparse data: new dataset and insights. In proceedings of the 27th SIGSPATIAL international conference on advances in geographic information systems 2019 (pp. 33-42). ACM.
[Crossref] [Google Scholar]
[2]Mills BN, Andrey J, Hambly D. Analysis of precipitation-related motor vehicle collision and injury risk using insurance and police record information for Winnipeg, Canada. Journal of Safety Research. 2011; 42(5):383-90.
[Crossref] [Google Scholar]
[3]Thompson JP, Baldock MR, Dutschke JK. Trends in the crash involvement of older drivers in Australia. Accident Analysis & Prevention. 2018; 117:262-9.
[Crossref] [Google Scholar]
[4]Hao W, Kamga C, Daniel J. The effect of age and gender on motor vehicle driver injury severity at highway-rail grade crossings in the United States. Journal of Safety Research. 2015; 55:105-13.
[Crossref] [Google Scholar]
[5]Zhao L, Wang C, Yang H, Wu X, Zhu T, Wang J. Exploring injury severity of non-motor vehicle riders involving in traffic accidents using the generalized ordered logit model. Ain Shams Engineering Journal. 2023; 14(5):101962.
[Crossref] [Google Scholar]
[6]Pereira V, Bamel U, Paul H, Varma A. Personality and safety behavior: an analysis of worldwide research on road and traffic safety leading to organizational and policy implications. Journal of Business Research. 2022; 151:185-96.
[Crossref] [Google Scholar]
[7]Wu J, Lu Y, Shi S, Zhou R, Liu Y. Research on the prediction model of hazardous chemical road transportation accidents. Journal of Loss Prevention in the Process Industries. 2023:105103.
[Crossref] [Google Scholar]
[8]Alkheder S, AlRukaibi F, Aiash A. Risk analysis of traffic accidents’ severities: An application of three data mining models. ISA Transactions. 2020; 106:213-20.
[Crossref] [Google Scholar]
[9]Ji A, Levinson D. Injury severity prediction from two-vehicle crash mechanisms with machine learning and ensemble models. IEEE Open Journal of Intelligent Transportation Systems. 2020; 1:217-26.
[Crossref] [Google Scholar]
[10]Lane MN. Pricing risk transfer transactions1. ASTIN Bulletin: The Journal of the IAA. 2000; 30(2):259-93.
[Google Scholar]
[11]Miyajima C, Nishiwaki Y, Ozawa K, Wakita T, Itou K, Takeda K, et al. Driver modeling based on driving behavior and its evaluation in driver identification. Proceedings of the IEEE. 2007; 95(2):427-37.
[Crossref] [Google Scholar]
[12]Chang LY. Analysis of freeway accident frequencies: negative binomial regression versus artificial neural network. Safety Science. 2005; 43(8):541-57.
[Crossref] [Google Scholar]
[13]Kumar S, Toshniwal D. A data mining framework to analyze road accident data. Journal of Big Data. 2015; 2(1):1-8.
[Crossref] [Google Scholar]
[14]Wenqi L, Dongyu L, Menghua Y. A model of traffic accident prediction based on convolutional neural network. In 2nd international conference on intelligent transportation engineering 2017 (pp. 198-202). IEEE.
[Crossref] [Google Scholar]
[15]Yu L, Du B, Hu X, Sun L, Han L, Lv W. Deep spatio-temporal graph convolutional network for traffic accident prediction. Neurocomputing. 2021; 423:135-47.
[Crossref] [Google Scholar]
[16]Yuan Z, Zhou X, Yang T. Hetero-convlstm: a deep learning approach to traffic accident prediction on heterogeneous spatio-temporal data. In proceedings of the 24th international conference on knowledge discovery & data mining 2018 (pp. 984-92). ACM.
[Crossref] [Google Scholar]
[17]Wahab L, Jiang H. A comparative study on machine learning based algorithms for prediction of motorcycle crash severity. PLoS one. 2019; 14(4):1-17.
[Crossref] [Google Scholar]
[18]Zhu L, Lu L, Zhang W, Zhao Y, Song M. Analysis of accident severity for curved roadways based on bayesian networks. Sustainability. 2019; 11(8):1-17.
[Crossref] [Google Scholar]
[19]Kopelias P, Papadimitriou F, Papandreou K, Prevedouros P. Urban freeway crash analysis: geometric, operational, and weather effects on crash number and severity. Transportation Research Record. 2007; 2015(1):123-31.
[Crossref] [Google Scholar]
[20]Soderstrom CA, Dischinger PC, Kufera JA, Ho SM, Shepard A. Crash culpability relative to age and sex for injured drivers using alcohol, marijuana or cocaine. In annual proceedings/association for the advancement of automotive medicine 2005 (pp. 327-31). Association for the Advancement of Automotive Medicine.
[Google Scholar]
[21]Zajac SS, Ivan JN. Factors influencing injury severity of motor vehicle–crossing pedestrian crashes in rural connecticut. Accident Analysis & Prevention. 2003; 35(3):369-79.
[Crossref] [Google Scholar]
[22]Beirness DJ, Simpson HM, Williams AF. Role of cannabis and benzodiazepines in motor vehicle crashes. Drugs and Traffic. 2005:12-21.
[Google Scholar]
[23]Zhang G, Yau KK, Chen G. Risk factors associated with traffic violations and accident severity in China. Accident Analysis & Prevention. 2013; 59:18-25.
[Crossref] [Google Scholar]
[24]Islam M. Multi-vehicle crashes involving large trucks: a random parameter discrete outcome modeling approach. Journal of the Transportation Research Forum. 2015; 54(1):77-104.
[Crossref] [Google Scholar]
[25]Rifaat SM, Tay R, De BA. Effect of street pattern on the severity of crashes involving vulnerable road users. Accident Analysis & Prevention. 2011; 43(1):276-83.
[Crossref] [Google Scholar]
[26]Christie SM, Lyons RA, Dunstan FD, Jones SJ. Are mobile speed cameras effective? a controlled before and after study. Injury Prevention. 2003; 9(4):302-6.
[Crossref] [Google Scholar]
[27]Moore DN, Schneider IVWH, Savolainen PT, Farzaneh M. Mixed logit analysis of bicyclist injury severity resulting from motor vehicle crashes at intersection and non-intersection locations. Accident Analysis & Prevention. 2011; 43(3):621-30.
[Crossref] [Google Scholar]
[28]Al-ghamdi AS. Experimental evaluation of fog warning system. Accident Analysis & Prevention. 2007; 39(6):1065-72.
[Crossref] [Google Scholar]
[29]Edwards JB. The relationship between road accident severity and recorded weather. Journal of Safety Research. 1998; 29(4):249-62.
[Crossref] [Google Scholar]
[30]Behnood A, Al-bdairi NS. Determinant of injury severities in large truck crashes: a weekly instability analysis. Safety Science. 2020; 131:104911.
[Crossref] [Google Scholar]
[31]Moosavi S, Samavatian MH, Parthasarathy S, Ramnath R. A countrywide traffic accident dataset. ArXiv Preprint Server. 2019:1-6.
[Crossref] [Google Scholar]
[32]http://data-seattlecitygis.opendata.arcgis.com/datasets/5b5c745e0f1f48e7a53acec63a0022ab_0. Accessed 24 October 2023.
[33]https://data.brla.gov/Transportation-and-Infrastructure/Baton-Rouge-Traffic-Incidents/2tu5-7kif. Accessed 24 October 2023.
[34]Zong F, Xu H, Zhang H. Prediction for traffic accident severity: comparing the Bayesian network and regression models. Mathematical Problems in Engineering. 2013; 2013:1-10.
[Crossref] [Google Scholar]
[35]Satri J, El MC, Hachimi H. Artificial intelligence and machine learning for a better decision making in the public sector. In 8th international conference on optimization and applications 2022 (pp. 1-5). IEEE.
[Crossref] [Google Scholar]
[36]Hidayat TH, Ruldeviyani Y, Aditama AR, Madya GR, Nugraha AW, Adisaputra MW. Sentiment analysis of twitter data related to Rinca Island development using Doc2Vec and SVM and logistic regression as classifier. Procedia Computer Science. 2022; 197:660-7.
[Crossref] [Google Scholar]
[37]Güven I, Şimşir F. Demand forecasting with color parameter in retail apparel industry using artificial neural networks (ANN) and support vector machines (SVM) methods. Computers & Industrial Engineering. 2020; 147:106678.
[Crossref] [Google Scholar]
[38]Boser BE, Guyon IM, Vapnik VN. A training algorithm for optimal margin classifiers. In proceedings of the fifth annual workshop on computational learning theory. 1992 (pp. 144-52). ACM.
[Crossref] [Google Scholar]
[39]Cortes C, Vapnik V. Support-vector networks. Machine Learning. 1995; 20:273-97.
[Crossref] [Google Scholar]
[40]Vapnik VN. An overview of statistical learning theory. IEEE Transactions on Neural Networks. 1999; 10(5):988-99.
[Crossref] [Google Scholar]
[41]Li K, Zhou G, Yang Y, Li F, Jiao Z. A novel prediction method for favorable reservoir of oil field based on grey wolf optimizer and twin support vector machine. Journal of Petroleum Science and Engineering. 2020; 189:1-11.
[Crossref] [Google Scholar]
[42]Sahana M, Rehman S, Sajjad H, Hong H. Exploring effectiveness of frequency ratio and support vector machine models in storm surge flood susceptibility assessment: a study of Sundarban Biosphere Reserve, India. Catena. 2020; 189:104450.
[Crossref] [Google Scholar]
[43]Yu Y, Shao M, Jiang L, Ke Y, Wei D, Zhang D, et al. Quantitative analysis of multiple components based on support vector machine (SVM). Optik. 2021; 237:166759.
[Crossref] [Google Scholar]
[44]Hunt EB, Marin J, Stone PJ. Experiments in induction. Academic Press. 1996.
[Google Scholar]
[45]Jin C, Li F, Ma S, Wang Y. Sampling scheme-based classification rule mining method using decision tree in big data environment. Knowledge-Based Systems. 2022; 244:108522.
[Crossref] [Google Scholar]
[46]Chang MY, Chiang RD, Wu SJ, Chan CH. Mining unexpected patterns using decision trees and interestingness measures: a case study of endometriosis. Soft Computing. 2016; 20:3991-4003.
[Crossref] [Google Scholar]
[47]Clarke DD, Forsyth R, Wright R. Machine learning in road accident research: decision trees describing road accidents during cross-flow turns. Ergonomics. 1998; 41(7):1060-79.
[Crossref] [Google Scholar]
[48]https://www.analyticsvidhya.com/blog/2021/04/beginners-guide-to-decision-tree-classification-using-python/. Accessed 24 October 2023.
[49]Shorabeh SN, Samany NN, Minaei F, Firozjaei HK, Homaee M, Boloorani AD. A decision model based on decision tree and particle swarm optimization algorithms to identify optimal locations for solar power plants construction in Iran. Renewable Energy. 2022; 187:56-67.
[Crossref] [Google Scholar]
[50]Maimon OZ, Rokach L. Data mining with decision trees: theory and applications. World Scientific; 2014.
[Google Scholar]
[51]Breiman L. Random forests. Machine Learning. 2001; 45:5-32.
[Crossref] [Google Scholar]
[52]Dietterich TG. An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization. Machine Learning. 2000; 40:139-57.
[Crossref] [Google Scholar]
[53]Ho TK. The random subspace method for constructing decision forests. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1998; 20(8):832-44.
[Crossref] [Google Scholar]
[54]Hosseinpour M, Ghaemi S, Khanmohammadi S, Daneshvar S. A hybrid high‐order type‐2 FCM improved random forest classification method for breast cancer risk assessment. Applied Mathematics and Computation. 2022; 424:127038.
[Crossref] [Google Scholar]
[55]Chaudhary A, Kolhe S, Kamal R. An improved random forest classifier for multi-class classification. Information Processing in Agriculture. 2016; 3(4):215-22.
[Crossref] [Google Scholar]
[56]Kim Y, Kim Y. Explainable heat-related mortality with random forest and SHapley Additive exPlanations (SHAP) models. Sustainable Cities and Society. 2022; 79:103677.
[Crossref] [Google Scholar]
[57]Egwim CN, Alaka H, Toriola-coker LO, Balogun H, Sunmola F. Applied artificial intelligence for predicting construction projects delay. Machine Learning with Applications. 2021; 6:1-15.
[Crossref] [Google Scholar]
[58]Mensouri D, Azmani A. A new marketing recommendation system using a hybrid approach to generate smart offers. Applied Computer Systems. 2022; 27(2):149-58.
[Crossref] [Google Scholar]
[59]Alkheder S, Taamneh M, Taamneh S. Severity prediction of traffic accident using an artificial neural network. Journal of Forecasting. 2017; 36(1):100-8.
[Crossref] [Google Scholar]
[60]Shaik ME, Islam MM, Hossain QS. A review on neural network techniques for the prediction of road traffic accident severity. Asian Transport Studies. 2021; 7:1-11.
[Crossref] [Google Scholar]