Accuracy Improvement using Machine Learning by Objects Count Based Feature selection method on Biological Data of Human Ancestors

Authors

  • Maradana Durga Venkata Prasad, Srikanth T., N. Srivani, Balakrishna Gudla

Keywords:

Feature Selection,dimensionality reduction, Filter Method, Wrapper Method, Embedded Method, Hybrid feature selection.

Abstract

In the present era huge data is generated by the IOT devices, mobiles, laptops and systems and stored either in data basesor files. Technique of Clustering is a used to extract the data from the data baseor files. Improving clustering accuracy always depends on the feature selection method. So feature selection always depends on choose of best feature selection method like wrapper, filter, embedded and hybrid.The original data set or data source's redundant, unnecessary, and noisy features can also be eliminated using feature selection techniques. Feature selection methods are used to reduce the computational costs, increases the accuracy, dimensionality is reduced and model is predictable.  

Downloads

Download data is not yet available.

References

V. K and G. Dayalan, "Framework for Improving the Accuracy of the Machine Learning Model in Predicting Future Values," 2023 IEEE 8th International Conference for Convergence in Technology (I2CT), Lonavla, India, 2023, pp. 1-8, DOI: 10.1109/I2CT57861.2023.10126236. [2]. A. Benkessirat and N. Benblidia, "Fundamentals of Feature Selection: An Overview and Comparison," 2019 IEEE/ACS 16th International Conference on Computer Systems and Applications (AICCSA), Abu Dhabi, United Arab Emirates, 2019, pp. 1-6, DOI: 10.1109/AICCSA47632.2019.9035281.

A. V. Veligosha, D. I. Kaplun, D. M. Klionskiy, V. V. Gulvanskiy, D. V. Bogaevskiy and I. I. Kanatov, "Model of computation accuracy in modular digital filters," 2017 XX IEEE International Conference on Soft Computing and Measurements (SCM), St. Petersburg, Russia, 2017, pp. 275-277, DOI: 10.1109/SCM.2017.7970559.

S. Sun et al., "Investigation of Prediction Accuracy, Sensitivity, and Parameter Stability of Large-Scale Propagation Path Loss Models for 5G Wireless Communications," in IEEE Transactions on Vehicular Technology, vol. 65, no. 5, pp. 2843-2860, May 2016, DOI: 10.1109/TVT.2016.2543139.

B. P. Salmon, W. Kleynhans, C. P. Schwegmann and J. C. Olivier, "Proper comparison among methods using a confusion matrix," 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Milan, Italy, 2015, pp. 3057-3060, DOI: 10.1109/IGARSS.2015.7326461.

S. Gupta, W. Zhang and F. Wang, "Model Accuracy and Runtime Tradeoff in Distributed Deep Learning: A Systematic Study," 2016 IEEE 16th International Conference on Data Mining (ICDM), Barcelona, Spain, 2016, pp. 171-180, DOI: 10.1109/ICDM.2016.0028.

W. Wang and L. Bi, "Research on strategies to improve model accuracy based on incomplete time series data," 2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT), Haikou, China, 2021, pp. 45-52, DOI: 10.1109/ACAIT53529.2021.9731336.

S. Tabassum, M. B. Sampa, R. Islam, F. Yokota, N. Nakashima and A. Ahmed, "A Data Enhancement Approach to Improve Machine Learning Performance for Predicting Health Status Using Remote Healthcare Data," 2020 2nd International Conference on Advanced Information and Communication Technology (ICAICT), Dhaka, Bangladesh, 2020, pp. 308-312, DOI: 10.1109/ICAICT51780.2020.9333506.

V. N. G. Raju, K. P. Lakshmi, V. M. Jain, A. Kalidindi and V. Padma, "Study the Influence of Normalization/Transformation process on the Accuracy of Supervised Classification," 2020 Third International Conference on Smart Systems and Inventive Technology (ICSSIT), Tirunelveli, India, 2020, pp. 729-735, DOI: 10.1109/ICSSIT48917.2020.9214160.

M. Seliem, Muhammad. (2022). Handling Outlier Data as Missing Values by Imputation Methods: Application of Machine Learning Algorithms. Turkish Journal of Computer and Mathematics Education (TURCOMAT). 13. 273-286.

J. Heaton, "An empirical analysis of feature engineering for predictive modeling," SoutheastCon 2016, Norfolk, VA, USA, 2016, pp. 1-6, DOI: 10.1109/SECON.2016.7506650. [12]. A. Kusiak, "Feature transformation methods in data mining," in IEEE Transactions on Electronics Packaging Manufacturing, vol. 24, no. 3, pp. 214-221, July 2001, DOI: 10.1109/6104.956807.

Aparna U.R. and S. Paul, "Feature selection and extraction in data mining," 2016 Online International Conference on Green Engineering and Technologies (IC-GET), Coimbatore, India, 2016, pp. 1-3, DOI: 10.1109/GET.2016.7916845.

N. Sharma and K. Saroha, "Study of dimension reduction methodologies in data mining," International Conference on Computing, Communication & Automation, Greater Noida, India, 2015, pp. 133-137, DOI: 10.1109/CCAA.2015.7148359.

K. M. A. Patel and P. Thakral, "The best clustering algorithms in data mining," 2016 International Conference on Communication and Signal Processing (ICCSP), Melmaruvathur, India, 2016, pp. 2042-2046, DOI: 10.1109/ICCSP.2016.7754534.

T. T. Chikohora and E. Chikohora, "An Algorithm for Selecting a Data Mining Technique," 2021 3rd International Multidisciplinary Information Technology and Engineering Conference (IMITEC), Windhoek, Namibia, 2021, pp. 1-7, DOI:. 10.1109/IMITEC52926.2021.9714525.

Z. M. Fadhil and R. A. Jaleel, "Multiple Efficient Data Mining Algorithms with Genetic Selection for Prediction of SARS-CoV2," 2022 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2022, pp. 2016-2020, DOI: 10.1109/ICACITE53722.2022.9823757.

F. Arden and C. Safitri, "Hyperparameter Tuning Algorithm Comparison with Machine Learning Algorithms," 2022 6th International Conference on Information Technology, Information Systems and Electrical Engineering (ICITISEE), Yogyakarta, Indonesia, 2022, pp. 183-188, DOI: 10.1109/ICITISEE57756.2022.10057630.

T. N. Rincy and R. Gupta, "Ensemble Learning Techniques and its Efficiency in Machine Learning: A Survey," 2nd International Conference on Data, Engineering and Applications (IDEA), Bhopal, India, 2020, pp. 1-6, DOI: 10.1109/IDEA49133.2020.9170675.

D. P. Gaikwad and R. C. Thool, "Intrusion Detection System Using Bagging Ensemble Method of Machine Learning," 2015 International Conference on Computing Communication Control and Automation, Pune, India, 2015, pp. 291-295, DOI: 10.1109/ICCUBEA.2015.61.

L. Cherif and A. Kortebi, "On using eXtreme Gradient Boosting (XGBoost) Machine Learning algorithm for Home Network Traffic Classification," 2019 Wireless Days (WD), Manchester, UK, 2019, pp. 1-6, DOI: 10.1109/WD.2019.8734193.

T. Gunasegaran and Y. -N. Cheah, "Evolutionary cross validation," 2017 8th International Conference on Information Technology (ICIT), Amman, Jordan, 2017, pp. 89-95, DOI: 10.1109/ICITECH.2017.8079960.

T. R. N and R. Gupta, "Feature Selection Techniques and its Importance in Machine Learning: A Survey," 2020 IEEE International Students' Conference on Electrical,Electronics and Computer Science (SCEECS), Bhopal, India, 2020, pp. 1-6, DOI:. 10.1109/SCEECS48394.2020.189.

M. S. De Sousa, C. E. L. Veiga, R. D. O. Albuquerque and W. F. Giozza, "Information Gain applied to reduce model-building time in decision-tree-based intrusion detection system," 2022 17th Iberian Conference on Information Systems and Technologies (CISTI), Madrid, Spain, 2022, pp. 1-6, DOI: 10.23919/CISTI54924.2022.9820579.

S. Rosidin, Muljono, G. FajarShidik, A. ZainulFanani, F. Al Zami and Purwanto, "Improvement with Chi Square Selection Feature using Supervised Machine Learning Approach on Covid-19 Data," 2021 International Seminar on Application for Technology of Information and Communication (iSemantic), Semarangin, Indonesia, 2021, pp. 32-36, DOI: 10.1109/iSemantic52711.2021.9573196.

Zhongzhi Shi, Youping Huang and Sulan Zhang, "Fisher Score Based Naive Bayesian Classifier," 2005 International Conference on Neural Networks and Brain, Beijing, China, 2005, pp. 1616-1621, DOI: 10.1109/ICNNB.2005.1614941.

-h. Yang, G. -l. Shan and L. -l. Zhao, "Correlation Coefficient Method for Support Vector Machine Input Samples," 2006 International Conference on Machine Learning and Cybernetics, Dalian, China, 2006, pp. 2857-2861, DOI: 10.1109/ICMLC.2006.259069.

M. Peña, M. Cerrada, D. Cabrera and R. -V. Sánchez, "Fast feature selection based on cluster validity index applied on data-driven bearing fault detection," 2020 IEEE ANDESCON, Quito, Ecuador, 2020, pp. 1-6, DOI: 10.1109/ANDESCON50619.2020.9272146.

U. Haq, J. Li, M. H. Memon, M. HunainMemon, J. Khan and S. M. Marium, "Heart Disease Prediction System Using Model of Machine Learning and Sequential Backward Selection Algorithm for Features Selection," 2019 IEEE 5th International Conference for Convergence in Technology (I2CT), Bombay, India, 2019, pp. 1-4, DOI: 10.1109/I2CT45611.2019.9033683.

Raj, A. Kumar, V. Sharma, S. Rani, A. K. Shanu and T. Singh, "Applications of Genetic Algorithm with Integrated Machine Learning," 2023 3rd International Conference on Innovative Practices in Technology and Management (ICIPTM), Uttar Pradesh, India, 2023, pp. 1-6, DOI: 10.1109/ICIPTM57143.2023.10118328.

H. Osman, M. Ghafari and O. Nierstrasz, "Automatic feature selection by regularization to improve bug prediction accuracy," 2017 IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE), Klagenfurt, Austria, 2017, pp. 27-32, DOI: 10.1109/MALTESQUE.2017.7882013.

Suto, S. Oniga and P. P. Sitar, "Comparison of wrapper and filter feature selection algorithms on human activity recognition," 2016 6th International Conference on Computers Communications and Control (ICCCC), Oradea, Romania, 2016, pp. 124-129, DOI: 10.1109/ICCCC.2016.7496749.

T. Hamed, R. Dara and S. C. Kremer, "An Accurate, Fast Embedded Feature Selection for SVMs," 2014 13th International Conference on Machine Learning and Applications, Detroit, MI, USA, 2014, pp. 135-140, DOI: 10.1109/ICMLA.2014.104.

Downloads

Published

09.07.2024

How to Cite

Maradana Durga Venkata Prasad. (2024). Accuracy Improvement using Machine Learning by Objects Count Based Feature selection method on Biological Data of Human Ancestors. International Journal of Intelligent Systems and Applications in Engineering, 12(22s), 657 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6536

Issue

Section

Research Article