Reconnoitering Static Analysis Metrics for Predicting Software Component Reusability Using Ensemble Model

Authors

  • Srishti Bhugra, Department of Computer Science & Engineering, SRM University, Delhi NCR, Sonepat, India
  • Puneet Goswami, Department of Computer Science & Engineering, SRM University, Delhi NCR, Sonepat, India

Keywords:

Ensemble Model, Machine Learning, Prediction, Reusability, Software

Abstract

Data mining and machine learning have opened new avenues for software development and research. The software development life cycle (SDLC) has also benefited from incorporating machine learning, which opens up prospects for efficient and well-planned development. Software reuse is a key part of the SDLC, and managing it well reduces the cost and time needed to produce a software application. A central difficulty is evaluating a software component's reusability, that is, how suitable it is for reuse. Machine learning offers some of the most effective methods for determining whether a particular software component is reusable. This study aims to create an ensemble machine learning model for evaluating reusability that integrates Support Vector Machine, K-Nearest Neighbour, Decision Tree, Artificial Neural Network, and Naive Bayes classifiers. A publicly accessible benchmark dataset is pre-processed and then used for experimentation. Compared with the base classifiers, the proposed model delivered the most favourable results, with accuracy, precision, recall, and F1-score values of 89.48%, 0.9406, 0.9484, and 0.9445, respectively. The evaluation indicates that the approach can accurately assess reusability as perceived by engineers.
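
The abstract does not state how the five base classifiers are combined, so the following is only a minimal sketch that assumes simple majority voting over binary reusable/not-reusable labels. The prediction lists are made-up illustration data, not results from the study, and the function names are hypothetical. The last line checks that the reported F1-score is consistent with the reported precision and recall.

```python
from collections import Counter

def majority_vote(predictions):
    """Combine per-classifier label lists into one label per sample."""
    combined = []
    # zip(*predictions) yields the votes of all classifiers for one sample
    for sample_votes in zip(*predictions):
        # With five voters and two classes a tie cannot occur
        combined.append(Counter(sample_votes).most_common(1)[0][0])
    return combined

def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F1 for a binary labelling."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1_from(precision, recall),
    }

def f1_from(precision, recall):
    """F1 is the harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0

# Hypothetical labels: 1 = reusable, 0 = not reusable
y_true = [1, 1, 0, 1, 0, 1]
base_predictions = [
    [1, 1, 0, 0, 0, 1],  # stand-in for SVM output
    [1, 0, 0, 1, 1, 1],  # stand-in for KNN output
    [1, 1, 1, 1, 0, 0],  # stand-in for Decision Tree output
    [0, 1, 0, 1, 0, 1],  # stand-in for ANN output
    [1, 1, 0, 0, 1, 1],  # stand-in for Naive Bayes output
]

ensemble = majority_vote(base_predictions)
ensemble_metrics = binary_metrics(y_true, ensemble)

# Consistency check on the values reported in the abstract
reported_f1 = round(f1_from(0.9406, 0.9484), 4)
```

In this toy data every base classifier makes at least one mistake, yet the vote recovers all six labels, which illustrates why an ensemble can outperform its members. Other combination schemes, such as weighted voting or stacking, would need a different `majority_vote` step.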

Published

16.07.2023

How to Cite

Bhugra, S., & Goswami, P. (2023). Reconnoitering Static Analysis Metrics for Predicting Software Component Reusability Using Ensemble Model. International Journal of Intelligent Systems and Applications in Engineering, 11(3), 150–158. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3152

Section

Research Article