Data Mining for Software Repositories with Data Analytics for Feature Extraction and Classification Using Deep Learning Model

Authors

  • M. Jeevana Sujitha JNTUK, CSE, SRKR ENGINEERING COLLEGE Bhimavaram
  • Ms. Divya Paikaray Assistant Professor, Department of Computer Science Arka Jain University, Jamshedpur, Jharkhand, India. https://orcid.org/0000-0001-7886-1538
  • Tatikonda Kavya GITAM (Deemed to be University), Computer Science and Engineering GITAM School of Technology, Visakhapatnam
  • Badria Sulaiman Alfurhood Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Saudi Arabia
  • Akula VS Siva Rama Rao Associate Professor, Dept of CSE, Sasi Institute of Technology & Engineering, Tadepalligudem
  • Srinivasan Sriramulu Professor, Department of CSE, Galgotias University, Greater Noida, Uttar Pradesh, India

Keywords:

software repositories, data mining, data analytics, feature extraction, classification, deep learning

Abstract

In the era of digital media, the rapidly growing volume and complexity of multimedia data cause many problems in storing, processing, and querying data in a reasonable time. Large repositories of source code create new challenges and opportunities for statistical machine learning. Here we first develop Sourcerer, an infrastructure for the automated crawling, parsing, and database storage of open source software. This research proposes a novel technique for data mining and data analytics in software repositories, with feature extraction and classification using deep learning. Data mining of the software repositories is carried out using a Markov Chain Monte Carlo model. Data analytics is then carried out for feature extraction and classification using a heuristic Gaussian Bayes neural network with principal component analysis. The experimental analysis has been carried out on various datasets in terms of accuracy, precision, recall, MSE, and MAP. The proposed technique attained an accuracy of 96%, precision of 85%, recall of 79%, MSE of 66%, and MAP of 63%.
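
The abstract reports accuracy, precision, recall, and MSE without stating how they were computed. The sketch below shows the standard textbook definitions for a binary classification task; it is an illustration only, not the authors' evaluation code. The function names `binary_metrics` and `mean_squared_error` and the toy labels are hypothetical, and the sketch assumes labels are encoded as 0 and 1. MAP is omitted because the abstract does not describe the ranking setup it was measured on.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision, and recall for 0/1 labels (hypothetical helper)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)

    return {
        "accuracy": (tp + tn) / len(y_true),
        # Guard against division by zero when no positives are predicted or present.
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


def mean_squared_error(y_true: Sequence[float], y_score: Sequence[float]) -> float:
    """Mean of squared differences between targets and predicted scores."""
    if len(y_true) != len(y_score) or len(y_true) == 0:
        raise ValueError("y_true and y_score must be non-empty and of equal length")
    return sum((t - s) ** 2 for t, s in zip(y_true, y_score)) / len(y_true)


if __name__ == "__main__":
    # Toy data, invented for illustration; not taken from the paper.
    labels = [1, 0, 1, 1, 0, 1]
    predictions = [1, 0, 0, 1, 1, 1]
    scores = [0.9, 0.2, 0.4, 0.8, 0.6, 0.7]
    print(binary_metrics(labels, predictions))
    print(mean_squared_error(labels, scores))
```

Precision and recall are kept separate from accuracy because they can diverge sharply on imbalanced data, which is consistent with the gap between the 96% accuracy and the 79% recall reported above.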

Published

19.12.2022

How to Cite

M. Jeevana Sujitha, Ms. Divya Paikaray, Tatikonda Kavya, Badria Sulaiman Alfurhood, Akula VS Siva Rama Rao, & Srinivasan Sriramulu. (2022). Data Mining for Software Repositories with Data Analytics for Feature Extraction and Classification Using Deep Learning Model. International Journal of Intelligent Systems and Applications in Engineering, 10(2s), 291 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/2403

Section

Research Article