Integrating Features and Unlabeled Data with Modified Support Vector Machines for Improved Lung Cancer Detection


  • Suman Antony Lasrado, G. N. K. Suresh Babu


Modified Support Vector Machines, Lung cancer classification, Feature importance, Unlabeled data integration, Predictive accuracy


This research explores the application of Modified Support Vector Machines (MSVMs) as a potent classifier for the effective diagnosis of lung cancer, aiming to enhance the accuracy and performance compared to conventional Support Vector Machines (SVMs). While SVMs have been widely employed, their limitation lies in treating all features equally, potentially affecting the precision of disease detection. In response to this, MSVMs introduce a novel approach by incorporating both labeled and unlabeled data into the learning process, gradually searching for the optimal separating hyper plane. The key innovation lies in the assignment of weights to a kernel function, measuring the importance of individual features and addressing the shortcomings of traditional SVMs. By acknowledging the varying significance of features, MSVMs offer a more explored and efficient classification process. The newly formulated kernel function enables the integration of labeled and unlabeled data, contributing to a more robust learning model. The proposed modification not only enhances the classifier's ability to discern between malignant and benign lung tissues but also opens avenues for improved pattern recognition indicative of lung cancer. The research investigates the comparative performance of MSVMs against different SVMs, with preliminary results indicating promising outcomes. The integration of both labeled and unlabeled data, combined with the consideration of feature importance through weighted kernel functions, demonstrates the potential of MSVMs as a breakthrough in the accurate classification of lung cancer. While further validation with larger datasets is essential, this study suggests that MSVMs could emerge as a significant advancement in the field of lung cancer diagnosis, offering heightened 93% accuracy and 99% specificity in predicting and classifying the lung cancer disease.


Download data is not yet available.


Ahmad Taher Azar H, Hannah Inbarani and Renuga Devi K (2017): Improved dominance rough set-based classification system, Neural Computing Applications, vol. 28, pp. 2231-2246.

Alali AMF, Padmaja DL, Soni M, Khan MA, Khan F, Ofori I (2023): A data mining technique for detecting malignant mesothelioma cancer using multiple regression analysis, Open Life Sciences, Volume. 18(1):20220746, 10.1515/biol-2022-0746.

Amma TA, Sunny AR, Biji KP and Mohanan M (2020): Lung Cancer Identification and Prediction Based on VGG Architecture, International Journal of Research in Engineering, Science and Management, Vol. 3(7), pp. 88-92.

Bhattacharjee A and Majumder S (2019): Automated computer-aided lung cancer detection system, In Advances in Communication, Devices, and Networking, pp. 425-433, Springer

C Venkatesh, J Chinnababu, Ajmeera Kiran, C H Nagaraju and Manoj Kumar (2023): A hybrid model for lung cancer prediction using patch processing and deeplearning on CT images. Multimedia Tools Applications.

Cassim S, Chepulis L, Keenan R, Kidd J, Firth M and Lawrenson R (2019): Patient and carer perceived barriers to early presentation and diagnosis of lung cancer: a systematic review, Bmc Cancer, vol. 19, no. 1, pp. 1-14.

Dalwadi SM, Szeja SS, Bernicker EH, Butler EB, Teh BS and Farach AM (2018): Practice patterns and outcomes in elderly stage I non–small-cell lung cancer: A 2004 to 2012 SEER analysis, Clinical lung cancer, vol. 19, no. 2, pp. 269-276.

David Dooling, Angela Kim, Barbara McAneny and Jennifer Webster (2016): Personalized prognostic models for oncology: A machine learning approach, Innovative Oncology Business Solutions, pp. 1-28.

Detterbeck F C (2018): The eighth edition TNM stage classification for lung cancer: What does it mean on main street?, The Journal of Thoracic and Cardiovascular Surgery, 155(1):356–359.

F Leena Vinmalar, Dr A Kumar Kombaiya (2019): Prediction of Lung Cancer using Data Mining Techniques, International Journal of Engineering Research and Technology, Vol. 7, No. 1, (Volume 7 – Issue 01), 1-4

G Ashwin Shanbhag, K Anurag Prabhu, N V Subba Reddy and B Ashwath Rao (2021): Prediction of Lung Cancer using Ensemble Classifiers, Journal of Physics: Conference Series, Volume 2161, 10.1088/1742-6596/2161/1/012007

Gayathri K and Vaidhehi V (2019): An Automatic Identification of Lung Cancer from different types of Medical Images, Research Journal of Pharmacy and Technology, Vol. 12(5), pp. 2109-2115.

Ioannis E, Livieris Andreas Kanavos, Vassilis Tampakas and Panagiotis Pintelas (2019): A Weighted Voting Ensemble Self-Labeled Algorithm for the Detection of Lung Abnormalities from X-Rays, MDPI Algorithms, vol. 64, no. 12, pp. 1-15.

Kaviarasi R and Gandhi Raj R (2021): Prediction System for the Lung Cancer Patients and Classification Accuracy Enhancement Using Ensemble Method, Journal of Medical Imaging and Health Informatics, vol. 11, no. 3, pp. 856-862

Kim H, Goo J M, Lee K H, Kim Y T, and Park C M, (2020), Preoperative CT-Based Deep Learning Model for Predicting Disease-Free Survival in Patients with Lung Adenocarcinomas, Radiology, 296(1), 216–224.

Kumar Vinod and Brijesh Bakariya (2021): Identification of Lung Cancer Malignancy Using Artificial Intelligence, In Artificial Intelligence, Machine Learning, and Data Science, Technologies, pp. 37-71.

Liu C and Pang M (2020): Automatic lung segmentation based on image decomposition and wavelet transform. Biomedical Signal Processing and Control, 61:102032.

M Sumalatha, Dr Latha Parthiban (2022): Predictive Analytics Framework for Lung Cancer with Data Mining Methods, Lecture Notes in Networks and Systems (Springer Nature), Volume 300, 783–800.

Mafarja M, Heidari AA, Faris H, Mirjalili S and Aljarah I (2020): Dragonfly algorithm: theory, literature review, and application in feature selection, Nature-Inspired Optimizers, 47-67.

MohanaPriya R and Venkatesan P (2021): An efficient image segmentation and classification of lung lesions in pet and CT image fusion using DTWT incorporated SVM, Microprocessors and Microsystems, 82:103958.

Mostafa Langarizadeh and Fateme Moghbeli (2016): Applying Naive Bayesian Networks to Disease Prediction: A Systematic Review, ACTA Informatica Medica, vol. 24, no. 5, pp. 364-369.

Nagra A A, Mubarik I, Asif M M, Masood, Ghamdi M A A, Almotiri S H (2022): Hybrid GA-SVM Approach for Postoperative Life Expectancy Prediction in Lung Cancer Patients, Applied Sciences, Volume. 12, 10927.

Naik A (2021): Lung nodule classification on computed tomography images using deep learning, Wireless Pers Commun, 116:655–690.

Palani D and Venkatalakshmi K (2019): An IoT based predictive modelling for predicting lung cancer using fuzzy cluster based segmentation and classification, Journal of medical systems, vol. 43, no. 2, pp. 1-12.

Patra R (2020): Prediction of δung Cancer Using machine learning Classifier, International Conference on Computing Science, Communication and Security, 132-142.

Qiong P, Iao Y, Hao P, He X and Hui C (2019): A self-adaptive step glowworm swarm optimization approach, International Journal of Computational Intelligence and Applications, vol. 18, no. 01, pp. 1-11.

Sannasi Chakravarthy SR and Rajaguru H (2019): Lung Cancer Detection using Probabilistic Neural Network with modified Crow-Search Algorithm, Asian Pacific journal of cancer prevention (APJCP), vol. 20, no. 7, pp. 2159-2166.

Shalini Wankhade, Vigneshwari S (2023): A novel hybrid deep learning method for early detection of lung cancer using neural networks, Healthcare Analytics, Volume 3, 100195, ISSN 2772-4425,

Shanid M and Anitha A (2020): An Exhaustive Study on the Lung Cancer Risk Models, International Journal of Bioinformatics Research and Applications, Vol. 16(2), 151-172.

Thamilselvan Piriyatharisini (2022): Lung Cancer Prediction and Classification Using Adaboost Data Mining Algorithm, International Journal of Computer Theory and Engineering, 254337572, 10.7763/ijcte.2022.v14.1322

Tian PF (2019): Current status and prospects of biomarkers in early diagnosis of lung cancer, Precision medicine Research, vol. 1, no. 2, pp. 61-65.

Tiwari L, Raja R, Awasthi V, Miri R, Sinha G, Alkinani M H, and Polat K (2021): Detection of lung nodule and cancer using novel mask-3 FCM and TWEDLNN algorithms, Measurement, 172:108882

Trailokya Ojha (2023): Machine Learning based Classification and Detection of Lung Cancer, Journal of Artificial Intelligence and Capsule Networks, Volume 5, 110 -128. 10.36548/jaicn.2023.2.003.

V Sreeprada, Dr K Vedavathi (2023): Lung Cancer Detection from X-Ray Images using Hybrid Deep Learning Technique, Procedia Computer Science, Volume 230, 467 - 474,

Vinod Kumar and Brijesh Bakariya (2021): An Empirical Identification of Pulmonary Nodules using Deep Learning, Design Engineering, Volume 2021, Issue 07, 13468-13486




How to Cite

G. N. K. Suresh Babu, S. A. L. . (2024). Integrating Features and Unlabeled Data with Modified Support Vector Machines for Improved Lung Cancer Detection. International Journal of Intelligent Systems and Applications in Engineering, 12(21s), 1503–1515. Retrieved from



Research Article