Reconnoitering Static Analysis Metrics for Predicting Software Component Reusability Using Ensemble Model
Keywords:
Ensemble Model, Machine Learning, Prediction, Reusability, SoftwareAbstract
Data mining and machine learning have created new avenues for creating software and investigation. The software development life cycle (SDLC) has also benefited from incorporating machine learning, opening up prospects for efficient and well-planned growth. The SDLC includes the reusability of software as a key component. Software reuse management thus assumes an active part in the SDLC. It reduces the expense and time needed to produce a software application. Evaluating a software component's reusability, or how appropriate it is for reuse, is a key difficulty in this scenario. The most effective methods for determining whether a particular software part is reusable or not come from machine learning to evaluate reusability; this study aims to create an ensemble machine learning model that integrates Support Vector Machine, K-Nearest Neighbour, Decision Tree, Artificial Neural Network, and Naive Bayes. After pre-processing, the publicly accessible benchmark dataset is used for experimentation. Compared with base classifiers, the suggested model delivered the most favourable results, with accuracy, precision, recall, and f1-score values of 89.48%, 0.9406, 0.9484, and 0.9445, respectively. According to the assessment of our technique, our approach can accurately evaluate reusability as experienced by engineers.
Downloads
References
R. Feldt, F. de Oliveira Neto and R. Torkar, “Ways of applying artificial intelligence in software engineering,” in Proceedings of the 6th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering, 2018.
S. Martínez-Fernández, J. Bogner, X. Franch, M. Oriol, J. Siebert, A. Trendowicz, A. Vollmer and S. Wagner, “Software engineering for AI-based systems: a survey,” ACM Transactions on Software Engineering and Methodology (TOSEM), vol. 31, no. 2, pp. 1-59, 2022.
R. Ma, E. Sun and J. Zou, “A spectral method for assessing and combining multiple data visualizations,” Nature Communications, vol. 14, no. 1, p. 780, 2023.
P. Goswami, A. Noorwali, A. Kumar, M. Khan, P. Srivastava and S. Batra, “Appraising Early Reliability of a Software Component Using Fuzzy Inference,” Electronics, vol. 12, p. 1137, 2023.
D. Wangoo, “Artificial intelligence techniques in software engineering for automated software reuse and design,” in 2018 4th International Conference on Computing Communication and Automation (ICCCA), 2018.
“ISO/IEC 25010:2011,” [Online]. Available: https://www.iso.org/obp/ui/#iso:std:iso-iec:25010:ed-1:v1:en. [Accessed 23 March 2023].
S. Pfleeger and J. Atlee, Software engineering: theory and practice, Pearson Education India, 2010.
M. Papamichail, T. Diamantopoulos and A. Symeonidis, “Measuring the reusability of software components using static analysis metrics and reuse rate information,” Journal of Systems and Software, vol. 158, p. 110423, 2019.
R. Qayyum, J. Rubaab, U. Riaz and F. Arif, “Role of Data Mining and Machine Learning in Software Reusability,” in 2021 International Conference on Innovative Computing (ICIC), 2021.
A. Singh and P. Tomar, “Estimation of component reusability through reusability metrics,” International Journal of Computer and Information Engineering, vol. 8, no. 11, pp. 2018-2025, 2014.
L. Heinemann, F. Deissenboeck, M. Gleirscher, B. Hummel and M. Irlbeck, “On the extent and nature of software reuse in open source java projects,” in Top Productivity through Software Reuse: 12th International Conference on Software Reuse, ICSR 2011, Pohang, South Korea, June 13-17, 2011. Proceedings 12, 2011.
I. Mojica, B. Adams, M. Nagappan, S. Dienst, T. Berger and A. Hassan, “A large-scale empirical study on software reuse in mobile apps,” IEEE software, vol. 31, no. 2, pp. 78-86, 2013.
J. Bansiya and C. Davis, “A hierarchical model for object-oriented design quality assessment,” IEEE Transactions on software engineering, vol. 28, no. 1, pp. 4-17, 2002.
O. Rotaru and M. Dobre, “Reusability metrics for software components,” in The 3rd ACS/IEEE International Conference onComputer Systems and Applications, 2005, 2005.
S. Amershi, A. Begel, C. Bird, R. DeLine, H. Gall, E. Kamar, N. Nagappan, B. Nushi and T. Zimmermann, “Software engineering for machine learning: A case study,” in 2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP), 2019.
H. Ammar, W. Abdelmoez and M. Hamdi, “Software engineering using artificial intelligence techniques: Current state and open problems,” in Proceedings of the First Taibah University International Conference on Computing and Information Technology (ICCIT 2012), Al-Madinah Al-Munawwarah, Saudi Arabia, 2012.
L. Kaur and A. Mishra, “An empirical analysis for predicting source code file reusability using meta-classification algorithms,” in Advanced Computational and Communication Paradigms: Proceedings of International Conference on ICACCP 2017, Singapore, 2018.
N. Chythanya and L. Rajamani, “Neural Network Approach for Reusable Component Handling,” in 2017 IEEE 7th International Advance Computing Conference (IACC), 2017.
M. Papamichail, T. Diamantopoulos and A. Symeonidis, “Software reusability dataset based on static analysis metrics and reuse rate information,” Data in brief, vol. 27, p. 104687, 2019.
“SourceMeter,” [Online]. Available: https://www.sourcemeter.com/. [Accessed 25 March 2023].
S. Batra and S. Sachdeva, “Organizing standardized electronic healthcare records data for mining,” Health Policy and Technology, vol. 5, no. 3, pp. 226-242, 2016.
S. Batra and S. Sachdeva, “Pre-processing highly sparse and frequently evolving standardized electronic health records for mining,” in Handbook of Research on Disease Prediction Through Data Analytics and Machine Learning, IGI Global, 2021, pp. 8-21.
S. Mishra, P. Mallick, H. Tripathy, A. Bhoi and A. González-Briones, “Performance evaluation of a proposed machine learning model for chronic disease datasets using an integrated attribute evaluator and an improved decision tree classifier,” Applied Sciences, vol. 10, no. 22, p. 8137, 2020.
H. Polat, H. Danaei Mehr and A. Cetin, “Diagnosis of chronic kidney disease based on support vector machine by feature selection methods,” Journal of medical systems, vol. 41, pp. 1-11, 2017.
A. Pathak, S. Batra and H. Chaudhary, “Imputing Missing Data in Electronic Health Records,” in Proceedings of 3rd International Conference on Machine Learning, Advances in Computing, Renewable Energy and Communication: MARC 2021, Singapore, 2022.
A. Pathak, S. Batra and V. Sharma, “An Assessment of the Missing Data Imputation Techniques for COVID-19 Data,” in Proceedings of 3rd International Conference on Machine Learning, Advances in Computing, Renewable Energy and Communication: MARC 2021, Singapore, 2022.
K. Chandel, V. Kunwar, S. Sabitha, T. Choudhury and S. Mukherjee, “A comparative study on thyroid disease detection using K-nearest neighbor and Naive Bayes classification techniques,” CSI transactions on ICT, vol. 4, pp. 313-319, 2016.
S. Sachdeva, D. Batra and S. Batra, “Storage Efficient Implementation of Standardized Electronic Health Records Data,” in 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2020.
B. Boukenze, A. Haqiq and H. Mousannif, “Predicting chronic kidney failure disease using data mining techniques,” in Advances in Ubiquitous Networking 2: Proceedings of the UNet’16 2, 2017.
S. Sachdeva, M. Singh, N. Kumar and P. Goswami, “Personalized e-learning based on ant colony optimization,” International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, vol. 30, no. 1, 2022.
S. Perveen, M. Shahbaz, A. Guergachi and K. Keshavjee, “Performance analysis of data mining classification techniques to predict diabetes,” Procedia Computer Science,, vol. 82, pp. 115-121, 2016.
R. Dhruvi, P. Yavnika and R. Nutan, “Prediction of Probability of Chronic Diseases and Providing Relative Real-Time Statistical Report using data mining and machine learning techniques,” International Journal of Science, Engineering and Technology Research (IJSETR), vol. 5, no. 4, 2016.
R. Ani, G. Sasi, U. Sankar and O. Deepa, “Decision support system for diagnosis and prediction of chronic renal failure using random subspace classification,” in 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), 2016.
S. Ramya and N. Radha, “Diagnosis of chronic kidney disease using machine learning algorithms,” International Journal of Innovative Research in Computer and Communication Engineering, vol. 4, no. 1, pp. 812-820, 2016.
B. Deekshatulu and P. Chandra, “Classification of heart disease using k-nearest neighbor and genetic algorithm,” Procedia technology, vol. 10, pp. 85-94, 2013.
S. Bashir, U. Qamar, F. Khan and L. Naseem, “HMV: A medical decision support framework using multi-layer classifiers for disease prediction,” Journal of Computational Science, vol. 13, pp. 10-25, 2016.
S. Bashir, U. Qamar, F. Khan and M. Javed, “MV5: a clinical decision support framework for heart disease prediction using majority vote based classifier ensemble,” Arabian Journal for Science and Engineering, vol. 39, pp. 7771-7783, 2014.
S. Batra, H. Sharma, W. Boulila, V. Arya, P. Srivastava, M. Z. Khan and M. Krichen, “An Intelligent Sensor Based Decision Support System for Diagnosing Pulmonary Ailment through Standardized Chest X-ray Scans,” Sensors, vol. 22, no. 19, p. Sensors, 2022.
S. Batra, R. Khurana, M. Z. Khan, W. Boulila, A. Koubaa and P. Srivastava, “A Pragmatic Ensemble Strategy for Missing Values Imputation in Health Records,” Entropy, vol. 24, no. 4, p. 533, 20
Ch.Sarada, C., Lakshmi, K. V. ., & Padmavathamma, M. . (2023). MLO Mammogram Pectoral Masking with Ensemble of MSER and Slope Edge Detection and Extensive Pre-Processing. International Journal on Recent and Innovation Trends in Computing and Communication, 11(3), 135–144. https://doi.org/10.17762/ijritcc.v11i3.6330
Chang Lee, Deep Learning for Speech Recognition in Intelligent Assistants , Machine Learning Applications Conference Proceedings, Vol 1 2021.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.