Classification of Android Applications and Performance Evaluation of Machine Learning Model with HFST

Authors

  • Umesh V. Nikam Department of Computer Science & Engineering, Prof. Ram Meghe Institute of Technology &research, Badnera, India
  • Vaishali M. Deshmukh Department of Computer Science & Engineering, Prof. Ram Meghe Institute of Technology &research, Badnera, India

Keywords:

Machine learning algorithms, Hybrid features, Classification and detection, Performance evaluation

Abstract

In the dynamic landscape of mobile applications, Android devices seamlessly integrate into our daily lives, offering diverse functionalities through various applications. However, the surge in malware and malicious software poses a significant challenge for security professionals and users alike. To address this issue, researchers and cyber security experts actively explore innovative methods. This research paper delves into the crucial domain of Android application categorization, evaluating the performance of a novel machine learning model incorporating a Hybrid Feature Selection Technique (HFST). Initially, the study identifies the top twenty significant features through information gain, feature selection, and chi-square methods. The classifier's performance is then assessed using these features. Subsequently, all 60 features selected through the three techniques are merged and out of the 60 features, 11 are identified as hybrid features that are common in at least two techniques. The study re-evaluates machine learning classifiers' performance using these 11 hybrid features, comparing the results with existing state-of-the-art techniques to illustrate the superiority of the proposed HFST-based technique. Furthermore, the study measures its impact on various performance metrics, including classification accuracy, precision, and f-measure, revealing notable enhancements across these parameters when employing the HFST. The application of this hybrid feature selection technique significantly improves the classification process, ultimately achieving an impressive classification accuracy of 98.11%, with precision at 97.56 and an f-measure of 97.99 for distinguishing between malicious and benign apps.

Downloads

Download data is not yet available.

References

E. Odat and Q. M. Yaseen, "A Novel Machine Learning Approach for Android Malware Detection Based on the Co-Existence of Features," in IEEE Access, vol. 11, pp. 15471-15484, 2023, doi: 10.1109/ACCESS.2023.3244656.

X. Zhang, J. Wang, J. Xu and C. Gu, "Detection of Android Malware Based on Deep Forest and Feature Enhancement," in IEEE Access, vol. 11, pp. 29344-29359, 2023, doi: 10.1109/ACCESS.2023.3260977.

N. Z. Gorment, A. Selamat, L. K. Cheng and O. Krejcar, "Machine Learning Algorithm for Malware Detection: Taxonomy, Current Challenges and Future Directions," in IEEE Access, doi: 10.1109/ACCESS.2023.3256979.

Gopinath M., Sibi Chakkaravarthy Sethuraman, A comprehensive survey on deep learning based malware detection techniques, Computer Science Review, Volume 47, 2023, 100529, ISSN 1574-0137,https://doi.org/10.1016/j.cosrev.2022.100529.

Liu, Tianyue & Li, Zhenwan & Long, Haixia & Bilal, Anas. (2023). NT-GNN: Network Traffic Graph for 5G Mobile IoT Android Malware Detection. Electronics.12.10.3390/electronics12040789.

Dhalaria, Meghna & Gandotra, Ekta. (2020). A Hybrid Approach for Android Malware Detection and Family Classification. International Journal of Interactive Multimedia and Artificial Intelligence. In Press. 1. 10.9781/ijimai.2020.09.001.

P. Agrawal and B. Trivedi, "Evaluating Machine Learning Classifiers to detect Android Malware," 2020 IEEE International Conference for Innovation in Technology (INOCON), Bangluru, India, 2020, pp. 1-6, doi: 10.1109/INOCON50539.2020.9298290.

AlOmari, Hani & Yaseen, Qussai & Al-Betar, Mohammed. (2023). A Comparative Analysis of Machine Learning Algorithms for Android Malware Detection. Procedia Computer Science. 220. 763-768. 10.1016/j.procs.2023.03.101.

Mimura, M., Ito, R. Applying NLP techniques to malware detection in a practical environment. Int. J. Inf. Secur. 21, 279–291 (2022).

Taher, Dr. Fatma & AlFandi, Omar & Kfairy, Mousa & Al Hamadi, Hussam & Alrabaee, Saed. (2023). DroidDetectMW: A Hybrid Intelligent Model for Android Malware Detection. 10.20944/preprints202305.0333.v1.

Bherde, Gajanan & Pund, Mahendra. (2020). Strategy and Knowledge-Based XML Attack Detection Systems using Ontology. International Journal of Recent Technology and Engineering (IJRTE). 8. 798-801. 10.35940/ijrte.E5786.018520.

A. A. Darem, F. A. Ghaleb, A. A. Al-Hashmi, J. H. Abawajy, S. M. Alanazi and A. Y. Al-Rezami, "An Adaptive Behavioral-Based Incremental Batch Learning Malware Variants Detection Model Using Concept Drift Detection and Sequential Deep Learning," in IEEE Access, vol. 9, pp. 97180-97196, 2021, doi: 10.1109/ACCESS.2021.3093366.

Agrawal, Prerna & Trivedi, Bhushan. (2021). AndroHealthCheck: A Malware Detection System for Android Using Machine Learning. 10.1007/978-981-16-0965-7_4.

Shhadat, Ihab & Al-bataineh, Bara & Hayajneh, Amena & Al-Sharif, Ziad. (2020). The Use of Machine Learning Techniques to Advance the Detection and Classification of Unknown Malware. Procedia Computer Science. 170. 917-922. 10.1016/j.procs.2020.03.110.

Li, Shanxi & Zhou, Qingguo & Zhou, Rui & Lv, Qingquan. (2022). Intelligent malware detection based on graph convolutional network. The Journal of Supercomputing. 78. 10.1007/s11227-021-04020-y.

Zhang, Xiao-Lei & Xu, Menglong. (2022). AUC optimization for deep learning-based voice activity detection. EURASIP Journal on Audio, Speech, and Music Processing. 2022. 10.1186/s13636-022-00260-9.

Akhtar, Muhammad Shoaib & Feng, Tao. (2022). Malware Analysis and Detection Using Machine Learning Algorithms. Symmetry. 14. 2304. 10.3390/sym14112304.

Kim, Jinsung & Ban, Younghoon & Ko, Eunbyeol & Cho, Haehyun & Yi, Jeong. (2022). MAPAS: a practical deep learning-based android malware detection system. International Journal of Information Security. 21. 1-14. 10.1007/s10207-022-00579-6.

Peng, T., Hu, B., Liu, J., Huang, J., Zhang, Z., He, R., & Hu, X. (2022). A Lightweight Multi-Source Fast Android Malware Detection Model. Applied Sciences, 12(11), 5394. MDPI AG. Retrieved from http://dx.doi.org/10.3390/app12115394

Taha, Altyeb. (2016). Classification of Android Malware Applications using Feature Selection and Classification Algorithms. VAWKUM Transactions on Computer Sciences. 10. 1. 10.21015/vtcs.v10i1.412.

Aljanabi, Maryam & Altamimi, Ahmad. (2022). A Comparative Analysis of Machine Learning Techniques for Classification and Detection of Malware.

Diptiben Ghelani. A perspective study on Malware detection and protection, A review. Authorea. September 13, 2022. DOI: 10.22541/au.166308976.63086986/v1

Sunil Gupta, Kamal Saluja, Ankur Goyal, Amit Vajpayee, Vipin Tiwari,Comparing the performance of machine learning algorithms using estimated accuracy, Measurement: Sensors,Volume24,2022,100432,ISSN 26659174,https://doi.org/10.1016/j.measen.2022.100432.

Asam, Dr & Khan, Saddam & Akbar, Altaf & Bibi, Sameena & Jamal, Tauseef & Khan, Asifullah & Ghafoor, Usman & Bhutta, Raheel. (2022). IoT malware detection architecture using a novel channel boosted and squeezed CNN. Scientific Reports. 12. 10.1038/s41598-022-18936-9.

P. R. K. Varma, K. P. Raj and K. V. S. Raju, "Android mobile security by detecting and classification of malware based on permissions using machine learning algorithms," 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), Palladam, India, 2017, pp. 294-299, doi: 10.1109/I-SMAC.2017.8058358.

V. J. Raymond and R. J. R. Raj, "Investigation of android malware with machine learning classifiers using enhanced pca algorithm," Computer Systems Science and Engineering, vol. 44, no.3, pp. 2147–2163, 2023.

Albakri, Ashwag & Alhayan, Fatimah & Alturki, Nazik & Ahamed, Saahirabanu & Shamsudheen, Shermin. (2023). Metaheuristics with Deep Learning Model for Cybersecurity and Android Malware Detection and Classification. Applied Sciences. 13. 2172. 10.3390/app13042172.

U. V. Nikam and V. M. Deshmuh, "Performance Evaluation of Machine Learning Classifiers in Malware Detection," 2022 IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics (ICDCECE), Ballari, India, 2022, pp. 1-5, doi: 10.1109/ICDCECE53908.2022.9793102.

U. Nikam and V. M. Deshmukh, “Hybrid Feature Selection Technique to classify Malicious Applications using Machine Learning approach”, J Integr Sci Technol, vol. 12, no. 1, p. 702, Aug. 2023.

Nikam, U.V., Deshmukh, V.M. (2023). Efficient Approach for Malware Detection Using Machine Learning Classifier. In: Tiwari, R., Pavone, M.F., Saraswat, M. (eds) Proceedings of International Conference on Computational Intelligence. ICCI 2022. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-99-2854-5_14

M. K. A. Abuthawabeh and K. W. Mahmoud, "Android Malware Detection and Categorization Based on Conversation-level Network Traffic Features," 2019 International Arab Conference on Information Technology (ACIT), Al Ain, United Arab Emirates, 2019, pp. 42-47, doi: 10.1109/ACIT47987.2019.8991114.

L. Taheri, A. F. A. Kadir and A. H. Lashkari, "Extensible Android Malware Detection and Family Classification Using Network-Flows and API-Calls," 2019 International Carnahan Conference on Security Technology (ICCST), Chennai, India, 2019, pp. 1-8, doi: 10.1109/CCST.2019.8888430.

A. H. Lashkari, A. F. A. Kadir, L. Taheri and A. A. Ghorbani, "Toward Developing a Systematic Approach to Generate Benchmark Android Malware Datasets and Classification," 2018 International Carnahan Conference on Security Technology (ICCST), Montreal, QC, Canada, 2018, pp. 1-7, doi: 10.1109/CCST.2018.8585560.

I. Aljarah et al., "A Robust Multi-Objective Feature Selection Model Based on Local Neighborhood Multi-Verse Optimization," in IEEE Access, vol. 9, pp. 100009-100028, 2021, doi: 10.1109/ACCESS.2021.3097206.

Downloads

Published

29.01.2024

How to Cite

Nikam, U. V. ., & Deshmukh, V. M. . (2024). Classification of Android Applications and Performance Evaluation of Machine Learning Model with HFST. International Journal of Intelligent Systems and Applications in Engineering, 12(13s), 168–178. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/4583

Issue

Section

Research Article