Journal title: Karbala International Journal of Modern Science
Print ISSN: 2405-609X
Electronic ISSN: 2405-609X
Publication year: 2022
Volume: 8
Issue: 1
Pages: 1-19
DOI: 10.33640/2405-609X.3197
Language: English
Publisher: Elsevier
Abstract: The crime rate in India is increasing considerably day by day. Consequently, the data associated with crime is also increasing, opening doors for data-driven approaches that extract insightful knowledge from these data, which can help police and other law enforcement organizations of the country in crime control and prevention. Crime prediction using machine learning algorithms on crime data can predict region-wise crime counts. In this paper, a machine learning-based soft computing regression analysis approach for Indian Crime Data Analysis (ICDA) is proposed. Different regression algorithms, namely Simple Linear Regression (SLR), Multiple Linear Regression (MLR), Decision Tree Regression (DTR), Support Vector Regression (SVR), and Random Forest Regression (RFR), are used to build regression models. These regression models can predict the total number of Indian Penal Code (IPC) crimes and the counts of different types of crime (murder, rape, kidnapping and abduction, riots, to name a few) region-wise, state-wise, and for the whole country for a given year. The adjusted R squared value and the Mean Absolute Percentage Error (MAPE) are used to evaluate and compare the proposed regression models. The proposed approach for ICDA uses district-wise spatio-temporal crime data for the years 2001 to 2012, collected from the official website of NCRB. For the chosen data, it is concluded that for region-wise total IPC crime prediction the RFR model fits best, with an adjusted R squared value of 0.9631551 and an error of 0.2027437, whereas for region-wise theft count prediction the RFR model also fits best, with an adjusted R squared value of 0.966604 and an error of 0.16571.
Keywords: Indian Penal Code (IPC); Support Vector Regression (SVR); Random Forest Regression (RFR); Decision Tree Regression (DTR); Multiple Linear Regression (MLR); Machine Learning algorithms; Indian Crime Data Analysis (ICDA)
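
Note on the evaluation metrics: the abstract names adjusted R squared and MAPE but does not give their formulas. The following is a minimal sketch of the standard textbook definitions, not the authors' code. The function names, the sample numbers, and the choice to report MAPE as a fraction rather than a percentage (suggested by the magnitude of the reported errors, but not confirmed by the record) are all assumptions made for illustration.

```python
# Illustrative sketch only: standard definitions of the two metrics named
# in the abstract. Names and sample data are hypothetical.

def adjusted_r2(y_true, y_pred, n_predictors):
    """Adjusted R squared: penalises R squared for the number of predictors."""
    n = len(y_true)
    mean_y = sum(y_true) / n
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))  # residual sum of squares
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)             # total sum of squares
    r2 = 1 - ss_res / ss_tot
    return 1 - (1 - r2) * (n - 1) / (n - n_predictors - 1)


def mape(y_true, y_pred):
    """Mean Absolute Percentage Error, returned as a fraction (assumption).

    Undefined when any true value is zero.
    """
    return sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical crime counts for four regions and one model's predictions.
actual = [100, 200, 300, 400]
predicted = [110, 190, 310, 390]

print(adjusted_r2(actual, predicted, n_predictors=1))
print(mape(actual, predicted))
```

Under these definitions, a model is preferred when its adjusted R squared is closer to 1 and its MAPE is closer to 0, which matches how the abstract ranks the RFR model.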