Homicide Prediction Model in Bogotá Using the Decision Tree Regression Algorithm

Authors

  • Simanca H. Fredys A. Universidad Cooperativa de Colombia
  • Abuchar Porras Alexandra Universidad Distrital Francisco José de Caldas
  • Anzola John Fundación Universitaria Los Libertadores
  • Palacios Jairo Jamith Docente de Planta, Colegio Mayor de Cundinamarca
  • Suarez Roldán Carolina Universidad Cooperativa de Colombia
  • Lugo Manuel Barbosa Guerrero Docente de Planta, Colegio Mayor de Cundinamarca

Keywords:

Machine Learning, Homicides, Bogota, Decision Tree

Abstract

Bogota is the capital of Colombia, and like many capitals in the world it faces challenges related to security and specifically homicides. Throughout the city's existence, the homicide rate has varied due to multiple political, social, and economic factors. These indicators have always been high and quite significant and a constant concern for the city authorities. However, the use of Machine Learning algorithms to predict homicides is a controversial application, but one of growing interest for authorities and experts in Data Mining. For this reason, the development of a Regression algorithm is proposed, specifically the Decision Tree algorithm that predicts the number of homicides in the city of Bogotá, applicable to any city in Colombia, seeking to identify the potential that this tool may have in the planning of prevention strategies. The design and validation of the algorithm yielded an accuracy between 70% and 75%, which is not a desired percentage, but neither can be ruled out in the framework of the use of prediction algorithms. Finally, it is important to point out that this issue should be approached with caution and responsibility and not fall into the promotion of profiles based on stereotypes or the reinforcement of negative stereotypes.

Downloads

Download data is not yet available.

References

I. H. Witten, E. Frank, and M. Hall, Data Mining: Practical Machine Learning Tools and Techniques, Burlington: Elsevier Inc, 2011.

S. H. Fredys A., B. G. Fabian, L. Jesser, T. C. Wilfred, P. R. Jairo, and B. G. Lugo, "Air Quality Index Prediction Model for the City of Bogotá, DC," Advances in Mechanics, vol. 9, no. 3, pp. 542-553, 2021.

S. H. Fredys A., H. B. Miguel, A. T. Andrés, P. R. Jairo, B. G. Fabián, O. D. Camilo, and B. G. Lugo, "Application of the Polynomial Regression Algorithm to Predict Covid-19 Cases Per Day in Colombia," Advances in Mechanics, vol. 9, no. 3, pp. 49-61, 2021.

R. F. R. Forradellas, S. L. Alonso Náñez, M. L. Ródriguez, and J. J. Vásquez, "Applied machine learning in social sciences: Neural networks and crime prediction," Social Sciences, vol. 10, no. 4, pp. 1-20, 2021.

G. M. Campedelli, "Explainable machine learning for predicting homicide clearance in the United States," Journal of Criminal Justice, vol. 79, no. 1, pp. 1-10, 2022.

F. A. Simanca H., J. A. Cortés Méndez, A. Abuchar Porras, F. Blanco Garrido, J. A. Páez Páez, and J. A. Páez Páez, "Algorithm for predicting the most frequent causes of mortality by analyzing age and gender variables.," Journal of Positive Psychology & Wellbeing, vol. 6, no. 1, p. 1419 – 1429, 2022.

H. A. Ordoñez Eraso, C. J. Pardo Calvache, and C. A. Cobos Lozada, "Detection of Homicide Trends in Colombia Using Machine Learning," Journal of the Faculty of Engineering, vol. 29, no. 54, pp. 1-20, 2020.

J. Gironés Roig, J. Casas Roma, J. Minguillón Alfonso and R. Caihuelas Quiles, Data Mining: Models and Algorithms, Barcelona: Editorial UOC, 2017.

National Police of Colombia, "National Police of Colombia," National Police of Colombia, 10 8 2023. [Online]. Available: https://www.policia.gov.co/grupo-informacion-criminalidad/estadistica-delictiva. [Accessed 10 8 2023].

J. Z. Mohammed and M. Wagner, Data Mining and Analysis: Fundamental Concepts and Algorithms, Cambridge: Cambridge University, 2013.

E. Russano and E. Ferreira Avelino, Fundamentals of Machine Learning using Python, Oakville: Arcler Press, 2020.

Pandas, 5 October 2020. [Online]. Available: https://pandas.pydata.org/.

"LearnAI," 2020. [Online]. Available: https://aprendeia.com/introduccion-a-numpy-python-1/.

Matplotlib, "Matplotlib visualization with Python," 2012. [Online]. Available: https://matplotlib.org/.

U. d. Alcalá, "SCIKIT-LEARN, A BASIC TOOL FOR DATA SCIENCE IN PYTHON," 2020. [Online]. Available: https://www.master-data-scientist.com/scikit-learn-data-science/.

E. Ribas, «IBES,» 08 JANUARY 2018. [Online]. Available: https://www.iebschool.com/blog/data-mining-mineria-datos-big-data/#:~:text=El%20Data%20Mining%20es%20un,el%20comportamiento%20de%20estos%20datos.

G. . E. Chanchí Golondrino, L. . M. Sierra Martinez and W. Y. Campo Muñoz, "Application of polynomial regression for the characterization of the COVID-19 curve, using machine learning techniques," Research and Innovation in Engineering, vol. 8, no. 2, pp. 87-105, 2020.

Downloads

Published

24.11.2023

How to Cite

Fredys A., S. H. ., Alexandra, A. P. ., John, A. ., Jamith, P. J. ., Carolina, S. R. ., & Guerrero, L. M. B. . (2023). Homicide Prediction Model in Bogotá Using the Decision Tree Regression Algorithm. International Journal of Intelligent Systems and Applications in Engineering, 12(5s), 521–529. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3961

Issue

Section

Research Article