Homicide Prediction Model in Bogotá Using the Decision Tree Regression Algorithm
Keywords:
Machine Learning, Homicides, Bogota, Decision TreeAbstract
Bogota is the capital of Colombia, and like many capitals in the world it faces challenges related to security and specifically homicides. Throughout the city's existence, the homicide rate has varied due to multiple political, social, and economic factors. These indicators have always been high and quite significant and a constant concern for the city authorities. However, the use of Machine Learning algorithms to predict homicides is a controversial application, but one of growing interest for authorities and experts in Data Mining. For this reason, the development of a Regression algorithm is proposed, specifically the Decision Tree algorithm that predicts the number of homicides in the city of Bogotá, applicable to any city in Colombia, seeking to identify the potential that this tool may have in the planning of prevention strategies. The design and validation of the algorithm yielded an accuracy between 70% and 75%, which is not a desired percentage, but neither can be ruled out in the framework of the use of prediction algorithms. Finally, it is important to point out that this issue should be approached with caution and responsibility and not fall into the promotion of profiles based on stereotypes or the reinforcement of negative stereotypes.
Downloads
References
I. H. Witten, E. Frank, and M. Hall, Data Mining: Practical Machine Learning Tools and Techniques, Burlington: Elsevier Inc, 2011.
S. H. Fredys A., B. G. Fabian, L. Jesser, T. C. Wilfred, P. R. Jairo, and B. G. Lugo, "Air Quality Index Prediction Model for the City of Bogotá, DC," Advances in Mechanics, vol. 9, no. 3, pp. 542-553, 2021.
S. H. Fredys A., H. B. Miguel, A. T. Andrés, P. R. Jairo, B. G. Fabián, O. D. Camilo, and B. G. Lugo, "Application of the Polynomial Regression Algorithm to Predict Covid-19 Cases Per Day in Colombia," Advances in Mechanics, vol. 9, no. 3, pp. 49-61, 2021.
R. F. R. Forradellas, S. L. Alonso Náñez, M. L. Ródriguez, and J. J. Vásquez, "Applied machine learning in social sciences: Neural networks and crime prediction," Social Sciences, vol. 10, no. 4, pp. 1-20, 2021.
G. M. Campedelli, "Explainable machine learning for predicting homicide clearance in the United States," Journal of Criminal Justice, vol. 79, no. 1, pp. 1-10, 2022.
F. A. Simanca H., J. A. Cortés Méndez, A. Abuchar Porras, F. Blanco Garrido, J. A. Páez Páez, and J. A. Páez Páez, "Algorithm for predicting the most frequent causes of mortality by analyzing age and gender variables.," Journal of Positive Psychology & Wellbeing, vol. 6, no. 1, p. 1419 – 1429, 2022.
H. A. Ordoñez Eraso, C. J. Pardo Calvache, and C. A. Cobos Lozada, "Detection of Homicide Trends in Colombia Using Machine Learning," Journal of the Faculty of Engineering, vol. 29, no. 54, pp. 1-20, 2020.
J. Gironés Roig, J. Casas Roma, J. Minguillón Alfonso and R. Caihuelas Quiles, Data Mining: Models and Algorithms, Barcelona: Editorial UOC, 2017.
National Police of Colombia, "National Police of Colombia," National Police of Colombia, 10 8 2023. [Online]. Available: https://www.policia.gov.co/grupo-informacion-criminalidad/estadistica-delictiva. [Accessed 10 8 2023].
J. Z. Mohammed and M. Wagner, Data Mining and Analysis: Fundamental Concepts and Algorithms, Cambridge: Cambridge University, 2013.
E. Russano and E. Ferreira Avelino, Fundamentals of Machine Learning using Python, Oakville: Arcler Press, 2020.
Pandas, 5 October 2020. [Online]. Available: https://pandas.pydata.org/.
"LearnAI," 2020. [Online]. Available: https://aprendeia.com/introduccion-a-numpy-python-1/.
Matplotlib, "Matplotlib visualization with Python," 2012. [Online]. Available: https://matplotlib.org/.
U. d. Alcalá, "SCIKIT-LEARN, A BASIC TOOL FOR DATA SCIENCE IN PYTHON," 2020. [Online]. Available: https://www.master-data-scientist.com/scikit-learn-data-science/.
E. Ribas, «IBES,» 08 JANUARY 2018. [Online]. Available: https://www.iebschool.com/blog/data-mining-mineria-datos-big-data/#:~:text=El%20Data%20Mining%20es%20un,el%20comportamiento%20de%20estos%20datos.
G. . E. Chanchí Golondrino, L. . M. Sierra Martinez and W. Y. Campo Muñoz, "Application of polynomial regression for the characterization of the COVID-19 curve, using machine learning techniques," Research and Innovation in Engineering, vol. 8, no. 2, pp. 87-105, 2020.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.