Enhanced Software Defect Prediction Through Homogeneous Ensemble Models

Authors

  • R. Mamatha Research Scholar, Department of CSE, Koneru Lakshmaiah Education Foundation, Hyderabad-500075, Telangana, India
  • P. Lalitha Surya Kumari Professor, Department of CSE, Koneru Lakshmaiah Education Foundation, Hyderabad-500075, Telangana, India
  • A. Sharada Professor, Department of CSE, G. Narayanamma Institute of Technology & Science, Hyderabad, Telangana, India

Keywords:

effective, methods, Boosting, accelerating, isolating

Abstract

The authors of this paper recommend employing state-of-the-art Machine Learning methods for fault prediction in computer programmes. The Promise software engineering repository, where NASA stores its data, serves as an example. The basic objective of software defect prediction is the early discovery of software flaws. Machine learning algorithms can help with this by making predictions based on historical data. The initial experiments' results suffered from low precision and recall since they relied on outdated machine learning methods. Modern machine learning methods such as Naive Bayes, Boosting, and Grid Search were incorporated to increase the model's accuracy. The performance of the software defect prediction model has been greatly enhanced through the use of state-of-the-art machine learning techniques.   The precision and recall rates, two measures of how well a system can forecast errors, have also grown.   Naive Bayes, Boosting, and Grid Search are just a few of the modern machine learning methods that helped improve the software defect prediction model.   The algorithms' increased accuracy and recall rates show how effective they are at finding and predicting software defects.   The importance of using state-of-the-art machine learning methods to the task of defect prediction is emphasised. Using techniques such as Naive Bayes, Boosting, and Grid Search can significantly increase the model's effectiveness. These methods have improved software development processes by accelerating and better isolating bugs.

Downloads

Download data is not yet available.

References

Tsai, H.-T., H. Moskowitz, and L.-H. Lee, Human resource selection for software development projects using Taguchi’s parameter design. European Journal of Operational Research, 2003. 151(1): p. 167-180.

Di Penta, M., M. Harman, and G. Antoniol, The use of search‐based optimization techniques to schedule and staff software projects: an approach and an empirical study. Software: Practice and Experience, 2011. 41(5): p. 495-519.

Goel, L., Sharma, M., Khatri, S.K., Damodaran, D.: Implementation of data sampling in class imbalance learning for cross project defect prediction: an empirical study. In: 2018 Fifth International Symposium on Innovation in Information and Communication Technology (ISIICT), pp. 1–6. IEEE (2018)

Peixoto, D.C., G.R. Mateus, and R.F. Resende. The Issues of Solving Staffing and Scheduling Problems in Software Development Projects. in Computer Software and Applications Conference (COMPSAC), 2014 IEEE 38th Annual. 2014. IEEE.

Balogun, A.O., Basri, S., Abdulkadir, S.J., Adeyemo, V.E., Imam, A.A., Bajeh, A.O.: Software defect prediction: analysis of class imbalance and performance stability. J. Eng. Sci. Technol. 14, 3294–3308 (2019)

Alba, E. and J.F. Chicano, Software project management with GAs. Information Sciences, 2007. 177(11): p. 2380- 2401.

Rodriguez, D., Herraiz, I., Harrison, R., Dolado, J., Riquelme, J.C.: Preliminary comparison of techniques for dealing with imbalance in software defect prediction. In: Proceedings of the 18th International Conference on Evaluation and Assessment in Software Engineering, pp. 1–10 (2014)

Uddin, J., Ghazali, R., Deris, M.M., Naseem, R., Shah, H.: A survey on bug prioritization. Artif Intell Rev 47(2), 145–180 (2017). DOI 10.1007/s10462-016-9478-6. URL http://link.springer. com/10.1007/s10462-016-9478-6

Singh, V., Misra, S., Sharma, M.: Bug severity assessment in cross-project context and identifying training candidates. J. Inf. Knowl. Manag. 16, 1750005 (2017)

El-Shorbagy, S.A., El-Gammal, W.M., Abdelmoez, W.M.: Using SMOTE and heterogeneous stacking in ensemble learning for software defect prediction. In: Proceedings of the 7th International Conference on Software and Information Engineering, pp. 44–47 (2018)

Ferrucci, F., M. Harman, and F. Sarro, Search-Based Software Project Management, in Software Project Management in a Changing World, G. Ruhe and C. Wohlin, Editors. 2014, Springer Berlin Heidelberg. p. 373-399.

J. Xuan, H. Jiang, Z. Ren, J. Yan, and Z. Luo, “Automatic bug triage using semi-supervised text classification,” in Proc. Intl. Conf. Software Engineering & Knowledge Engineering (SEKE 10), 2010, pp. 209–214.

Laradji, I.H., Alshayeb, M., Ghouti, L.: Software defect prediction using ensemble learning on selected features. Inf. Softw. Technol. 58, 388–402 (2015)

Singh, V., Misra, S., Sharma, M.: Bug severity assessment in cross-project context and identifying training candidates. J. Inf. Knowl. Manag. 16, 1750005 (2017)

Song, Q., Guo, Y., Shepperd, M.: A comprehensive investigation of the role of imbalanced learning for software defect prediction. IEEE Trans. Softw. Eng. 45, 1253–1269 (2018).

A. Tamrawi, T. Nguyen, J. Al-Kofahi, and T. Nguyen, “Fuzzy set and cache-based approach for bug triaging,” in Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering. ACM, 2011, pp. 365–375.

Sun, B., Chen, S., Wang, J., Chen, H.: A robust multi-class AdaBoost algorithm for mislabeled noisy data. Knowl.-Based Syst. 102, 87–102 (2016) [18] J. Anvik, “Automating bug report assignment,” in Proceedings of the 28th international conference on Software engineering. ACM, 2006, pp. 937–940.

Yang, X., Lo, D., Xia, X., Sun, J.: TLEL: a two-layer ensemble learning approach for justin-time defect prediction. Inf. Softw. Technol. 87, 206–220 (2017)

N. Bettenburg, S. Just, A. Schroter, C. Weiss, R. Premraj, and T. Zim- ¨ mermann, “What makes a good bug report?” in Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of software engineering. ACM, 2008, pp. 308–318.

Goel, L., Sharma, M., Khatri, S.K., Damodaran, D.: Implementation of data sampling in class imbalance learning for cross project defect prediction: an empirical study. In: 2018 Fifth International Symposium on Innovation in Information and Communication Technology (ISIICT), pp. 1–6. IEEE (2018)

Hamdy, A., El-Laithy, A.: SMOTE and Feature Selection for More Effective Bug Severity Prediction. Int. J. Softw. Eng. Knowl. Eng. 29, 897–919 (2019)

Iqbal, A., Aftab, S.: A classification framework for software defect prediction using multifilter feature selection technique and MLP. Int. J. Mod. Educ. Comput. Sci. 12(1), 18–25 (2020). https://doi.org/10.5815/ijmecs.2020.01.03

Chang, C.K., M.J. Christensen, and T. Zhang, Genetic algorithms for project management. Annals of Software Engineering, 2001. 11(1): p. 107-139.

Izadi, M., Ganji, S., Heydarnoori, A., Gousios, G.: Topic recommendation for software repositories using multi-label classification algorithms (2020)

Ghotra, B., McIntosh, S., Hassan, A.E.: A large-scale study of the impact of feature selection techniques on defect classification models. In: 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), pp. 146–157. IEEE (2017)

Xu, Z., Liu, J., Yang, Z., An, G., Jia, X.: The impact of feature selection on defect prediction performance: an empirical comparison. In: 2016 IEEE 27th International Symposium on Software Reliability Engineering (ISSRE), pp. 309–320. IEEE (2016)

Gupta, A., Suri, B., Misra, S.: A systematic literature review: code bad smells in java source code. In: Gervasi, O., et al. (eds.) ICCSA 2017. LNCS, vol. 10408, pp. 665–682. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-62404-4_49

J. Anvik, L. Hiew, and G. Murphy, “Who should fix this bug?” in Proceedings of the 28th international conference on Software engineering. ACM, 2006, pp. 361–370.

D. Cubrani ˇ c and G. C. Murphy, “Automatic bug triage using text categorization,” in In SEKE 2004: Proceedings of the Sixteenth International Conference on Software Engineering. Citeseer, 2004, pp. 92–97.

Balogun, A., Oladele, R., Mojeed, H., Amin-Balogun, B., Adeyemo, V.E., Aro, T.O.: Performance analysis of selected clustering techniques for software defects prediction. Afr. J. Comput. ICT 12, 30–42 (2019)

Ghotra, B., McIntosh, S., Hassan, A.E.: A large-scale study of the impact of feature selection techniques on defect classification models. In: 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), pp. 146–157. IEEE (2017)

N. Bettenburg, S. Just, A. Schroter, C. Weiss, R. Premraj, and T. Zim- ¨ mermann, “What makes a good bug report?” in Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of software engineering. ACM, 2008, pp. 308–318.

J.-W. Park, M.-W. Lee, J. Kim, S. won Hwang, and S. Kim, “Costriage: A cost-aware triage algorithm for bug reporting systems.” in AAAI, W. Burgard and D. Roth, Eds. AAAI Press, 2011.

El-Shorbagy, S.A., El-Gammal, W.M., Abdelmoez, W.M.: Using SMOTE and heterogeneous stacking in ensemble learning for software defect prediction. In: Proceedings of the 7th International Conference on Software and Information Engineering, pp. 44–47 (2018)

Kumar, L., Misra, S., Rath, S.K.: An empirical analysis of the effectiveness of software metrics and fault prediction model for identifying faulty classes. Comput. Stand. Interfaces 53, 1–32 (2017)

Ms. Nora Zilam Runera. (2014). Performance Analysis On Knowledge Management System on Project Management. International Journal of New Practices in Management and Engineering, 3(02), 08 - 13. Retrieved from http://ijnpme.org/index.php/IJNPME/article/view/28

Ghazaly, N. M. . (2020). Secure Internet of Things Environment Based Blockchain Analysis. Research Journal of Computer Systems and Engineering, 1(2), 26:30. Retrieved from https://technicaljournals.org/RJCSE/index.php/journal/article/view/8

Downloads

Published

10.11.2023

How to Cite

Mamatha, R. ., Kumari, P. L. S. ., & Sharada, A. . (2023). Enhanced Software Defect Prediction Through Homogeneous Ensemble Models. International Journal of Intelligent Systems and Applications in Engineering, 12(4s), 676–684. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3849

Issue

Section

Research Article