An Airline Passenger Satisfaction Prediction by Genetic-Algorithm-Based Hybrid AutoEncoder and Machine Learning Models

Authors

  • Lee Ye Hean, Olanrewaju Victor Johnson, Chew XinYing, Teoh Wei Lin, Chong Zhi Lin, Khaw Khai Wah

Keywords:

Airline, Autoencoder, Genetic Algorithm, Customer Experience, Optimization.

Abstract

The evolving landscape of the Business-to-Client (B2C) model across the globe is reshaping service delivery paradigms and transforming consumer perceptions of service providers, thereby revolutionizing customer experiences. This paradigm shift directly impacts airline companies that offer multiple service tiers, necessitating ongoing promotional strategies to attract and retain customers. Moreover, in addition to attracting new passengers, it is equally vital for airlines to retain existing ones. Therefore, comprehensive research is imperative to comprehend customers’ perceptions and conduct post-flight customer satisfaction surveys to delve into the factors influencing their decision-making processes. By gaining insights into these crucial causal factors, airlines can tailor their services to better meet customer expectations and enhance overall satisfaction levels. To address these challenges, this paper proposes a hybrid model comprising Deep Autoencoder (DAE) and Genetic Algorithm (GA) techniques for optimizing feature extraction. Utilizing eleven Machine Learning (ML) models as baseline predictors, the study endeavors to forecast passenger satisfaction levels. Furthermore, each ML model is intricately combined with the AE-GA optimization framework to conduct in-depth customer satisfaction experiments. Conducting a 5-fold cross-validation analysis in each experimental setup, the study highlights the efficacy of the proposed optimization strategy in significantly enhancing the predictive performance of ML methods in forecasting customer satisfaction levels.

Downloads

Download data is not yet available.

References

S. Leon and J. C. Martín, “A fuzzy segmentation analysis of airline passengers in the U.S. based on service satisfaction,” Res. Transp. Bus. Manag., 2020.

V. Leninkumar, “The relationship between customer satisfaction and customer trust on customer loyalty,” Int. J. Acad. Res. Bus. Soc. Sci., vol. 7, no. 4, 2017.

A. C. Y. Hong, K. W. Khaw, C. XinYing, and W. C. Yeong, “Prediction of US airline passenger satisfaction using machine learning algorithms,” Data Anal. Manag., vol. 4, no. 1, 2023.

M. Al-Mashraie, S. H. Chung, and H. W. Jeon, “Customer switching behavior analysis in the telecommunication industry via push-pull-mooring framework: A machine learning approach,” Comput. Ind. Eng., vol. 144, Apr. 2020, 10.1016/j.cie.2020.106476.

L. Hibović, S. Smajić, and E. Yaman, “Predicting satisfaction of airline passengers using classification,” International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT), 2022, 10.1109/ISMSIT56059.2022.9932850.

I. V. Pustokhina, D. A. Pustokhin, P. T. Nguyen, M. Elhoseny, and K. Shankar, “Multi-objective rain optimization algorithm with WELM model for customer churn prediction in telecommunication sector,” Complex Intell. Syst., 2021, 10.1007/s40747-021-00353-6.

A. C. Kae, C. XinYing, O. V. Johnson, and K. W. Khaw, “Employee Turnover Prediction by Machine Learning Techniques,” J. Telecommun. Electron. Comput. Eng., vol. 13, no. 4, pp. 49-56, 2021. [Online]. Available: https://jtec.utem.edu.my/jtec/article/view/6148/4063.

B. Bhargav and R. T. Prabu, “Airline passenger satisfaction prediction using novel hybrid random forest model comparison with k-nearest neighbour model,” Eighth International Conference on Science Technology Engineering and Mathematics (ICONSTEM), 2023.

S. Li, G. Xia, and X. Zhang, “Customer Churn Combination Prediction Model Based on Convolutional Neural Network and Gradient Boosting Decision Tree,” ACM Int. Conf. Proc. Ser., 2022, 10.1145/3579654.3579666.

J. A. Nichols, H. W. Chan, and M. A. B. Baker, “Machine learning: applications of artificial intelligence to imaging and diagnosis,” Biophys. Rev., vol. 11, no. 1, pp. 111-118, 2019, 10.1007/s12551-018-0449-9.

O. V. Johnson, C. XinYing, K. W. Khaw, and L. H. Ming, “A Cost-Based Dual ConvNet-Attention Transfer Learning Model for ECG Heartbeat Classification,” J. Inform. Web Eng., vol. 2, no. 2, pp. 90-110, 2023, 10.33093/jiwe.2023.2.2.7.

F. Balducci, D. Impedovo, and G. Pirlo, “Machine learning applications on agricultural datasets for smart farm enhancement,” Machines, vol. 6, no. 3, 2018, 10.3390/machines6030038.

B. K. Chan, O. V. Johnson, X. Chew, K. W. Khaw, M. H. Lee, and A. Alnoor, “Proposed Bayesian optimization based LSTM-CNN model for stock trend prediction,” Comput. Inform., vol. 43, no. 1, pp. 38–63, 2024, 10.31577/cai_2024_1_38.

M. Pesaresi, V. Syrris, and A. Julea, “A New Method for Earth Observation Data Analytics Based on Symbolic Machine Learning,” Remote Sens., vol. 8, no. 5, p. 399, 2016, 10.3390/rs8050399.

E. Domingos, B. Ojeme, and O. Daramola, “Experimental analysis of hyperparameters for deep learning-based churn prediction in the banking sector,” Computation, vol. 9, no. 3, 2021, 10.3390/computation9030034.

O. M. Ajinaja, O. S. Egwuche, and O. V. Johnson, “Prediction of Graduating Students for Tertiary Institutions Using Data Mining Technique,” 2nd International Conference on Education and Development, 2019, pp. 58-62.

H. Ramchoun, M. Amine, J. Idrissi, Y. Ghanou, and M. Ettaouil, “Multilayer Perceptron: Architecture Optimization and Training,” Int. J. Interact. Multimed. Artif. Intell., vol. 4, no. 1, p. 26, 2016, 10.9781/ijimai.2016.415.

R. Sudharsan and E. N. Ganesh, “A Swish RNN based customer churn prediction for the telecom industry with a novel feature selection strategy,” Connect. Sci., vol. 34, no. 1, pp. 1855-1876, 2022, 10.1080/09540091.2022.2083584.

S. J. Haddadi, O. M. Mohammadi, M. Bahrami, E. Khoeini, M. Beygi, and M. H. Khoshkar, “Customer churn prediction in the Iranian banking sector,” International Conference on Applied Artificial Intelligence, 2022, pp. 1-6, 10.1109/ICAPAI55158.2022.9801574.

O. M. Mirza, G. J. Moses, R. Rajender, E. L. Lydia, S. Kadry, C. Me-Ead, and O. Thinnukool, “Optimal Deep Canonically Correlated Autoencoder-Enabled Prediction Model for Customer Churn Prediction,” Comput. Mater. Continua, vol. 73, no. 2, pp. 3757-3769, 2022, 10.32604/cmc.2022.030428.

H. Gao, B. Qiu, R. J. D. Barroso, W. Hussain, Y. Xu, and X. Wang, “TSMAE: A novel anomaly detection approach for Internet of Things time series data using memory-augmented autoencoder,” IEEE Trans. Netw. Sci. Eng., vol. 10, no. 5, pp. 2978-2990, 2023, 10.1109/TNSE.2022.3163144.

K. G. Lore, A. Akintayo, and S. Sarkar, “LLNet: A deep autoencoder approach to natural low-light image enhancement,” Pattern Recognit., vol. 61, pp. 650-662, 2017, 10.1016/j.patcog.2016.06.008.

B. Garimella, G. V. S. N. R. V. Prasad, and M. H. M. K. Prasad, “Churn prediction using optimized deep learning classifier on huge telecom data,” J. Ambient Intell. Human Comput., vol. 14, no. 3, pp. 2007-2028, 2023, 10.1007/s12652-021-03413-4.

V. Haridasan, K. Muthukumaran, and K. Hariharanath, “Arithmetic optimization with deep learning enabled churn prediction model for telecommunication industries,” Intell. Autom. Soft Comput., vol. 35, no. 3, pp. 3531-3544, 2023, 10.32604/iasc.2023.030628.

C. K. Praseeda and B. L. Shivakumar, “Fuzzy particle swarm optimization (FPSO) based feature selection and hybrid kernel distance-based possibilistic fuzzy local information C-means (HKD-PFLICM) clustering for churn prediction in telecom industry,” SN Appl. Sci., vol. 3, no. 6, 2021, 10.1007/s42452-021-04576-7.

S. Ouf, “An optimized deep learning approach for improving airline services,” Comput. Mater. Continua, vol. 75, no. 1, pp. 1213-1233, 2023, 10.32604/cmc.2023.037276.

M. Guimarães, C. Soares, and R. Ventura, “Decision support models for predicting and explaining airport passenger connectivity from data,” IEEE Trans. Intell. Transp. Syst., vol. 23, no. 9, pp. 16005-16015, 2022, 10.1109/TITS.2022.3148830.

S. Chen, D. L. Xu, and W. Jiang, “High value passenger identification research based on federated learning,” 12th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2020, vol. 1, pp. 107-110.

Z. Wang, X. Han, Y. Chen, X. Ye, K. Hu, and D. Yu, “Prediction of willingness to pay for airline seat selection based on improved ensemble learning,” Aerospace, vol. 9, no. 2, p. 47, 2022, 10.3390/aerospace9020047.

R. Pranav and H. S. Gururaja, “Explainable stacking machine learning ensemble for predicting airline customer satisfaction,” Congress on Intelligent Systems, 2022, pp. 41-56, Springer Nature Singapore, 10.1007/978-981-19-3812-4_5.

A. Mottini and R. Acuna-Agost, “Deep choice model using pointer networks for airline itinerary prediction,” Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2017, pp. 1575-1583, https://doi.org/10.1145/3097983.3098094.

I. Rahmany, N. Dhahri, T. Moulahi, and A. Alabdulatif, “Optimized stacked auto-encoder for unnecessary data reduction in cloud of things,” International Wireless Communications and Mobile Computing (IWCMC), 2022, pp. 110-115, https://doi.org/10.1109/IWCMC55113.2022.9824846.

A. Lambora, K. Gupta, and K. Chopra, “Genetic algorithm - a literature review,” International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), 2019, pp. 380-384.

S. Ardabili, A. Mosavi, and A. R. Várkonyi-Kóczy, “Advances in machine learning modeling reviewing hybrid and ensemble methods,” International Conference on Global Research and Education, 2019, pp. 215-227, Springer International Publishing, https://doi.org/10.1007/978-3-030-20393-1_20.

H. H. Huang, Z. Wang, and W. Chung, “Efficient parameter selection for SVM: The case of business intelligence categorization,” 2017 IEEE International Conference on Intelligence and Security Informatics (ISI), 2017, pp. 158-160, https://doi.org/10.1109/ISI.2017.8004895.

W. Baswardono, D. Kurniadi, A. Mulyani, and D. M. Arifin, “Comparative analysis of decision tree algorithms: Random forest and C4.5 for airlines customer satisfaction classification,” J. Phys. Conf. Ser., vol. 1402, p. 066055, 2019, https://doi.org/10.1088/1742-6596/1402/6/066055.

S. Mitrofanov and E. Semenkin, “An approach to training decision trees with the relearning of nodes,” 2021 International Conference on Information Technologies (InfoTech), 2021, pp. 1-5, https://doi.org/10.1109/InfoTech52442.2021.9586108

T. Noviantoro and J. P. Huang, “Investigating airline passenger satisfaction: Data mining method,” Res. Transp. Bus. Manag., vol. 43, p. 100726, 2022, https://doi.org/10.1016/j.rtbm.2022.100726.

S. Khan, Z. A. Khan, Z. Noshad, S. Javaid, and N. Javaid, “Short term load and price forecasting using tuned parameters for k-nearest neighbors,” Sixth HCT Information Technology Trends (ITT), 2019, pp. 89-93.

C. Zhang, X. Pan, H. Li, A. Gardiner, I. Sargent, J. Hare, and P. M. Atkinson, “A hybrid MLP-CNN classifier for very fine resolution remotely sensed image classification,” ISPRS J. Photogramm. Remote Sens., vol. 140, pp. 133-144, 2018, https://doi.org/10.1016/j.isprsjprs.2017.07.014.

X. Zou, Y. Hu, Z. Tian, and K. Shen, “Logistic regression model optimization and case analysis,” 2019 IEEE 7th International Conference on Computer Science and Network Technology (ICCSNT), 2019, pp. 135-139, https://doi.org/10.1109/ICCSNT47585.2019.8962464.

X. Jiang, Y. Zhang, and Y. Li, “Forecast and analysis of aircraft passenger satisfaction based on RF-RFE-LR model,” Sci. Rep., vol. 12, p. 11174, 2022, https://doi.org/10.1038/s41598-022-15341-6.

B. Spoorthi, S. S. Kumar, A. P. Rodrigues, R. Fernandes, and N. Balaji, “Comparative analysis of bank loan defaulter prediction using machine learning techniques,” IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER), 2021, pp. 24-29, https://doi.org/10.1109/DISCOVER52568.2021.9624056.

T. Wahyuningrum, S. Khomsah, S. Suyanto, S. Meliana, P. E. Yunanto, and W. F. Al Maki, “Improving clustering method performance using k-means, mini batch k-means, birch and spectral,” 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI), 2021, pp. 206-210, https://doi.org/10.1109/ISRITI54043.2021.9702677.

D. Müllner, “Modern hierarchical, agglomerative clustering algorithms,” arXiv preprint arXiv:1109.2378, 2011, https://doi.org/10.48550/arXiv.1109.2378.

A. Nugraha, M. A. H. Perdana, H. A. Santoso, J. Zeniarja, A. Luthfiarta, and A. Pertiwi, “Determining the senior high school major using agglomerative hierarchical clustering algorithm,” 2018 International Seminar on Application for Technology of Information and Communication, 2018, pp. 225-228, https://doi.org/10.1109/iSemantic.2018.8579985.

D. John, “Passenger satisfaction,” US airline passenger satisfaction. Available at: https://www.kaggle.com/datasets/johndddddd/customer-satisfaction. Accessed 21 Nov 2023.

K. Pordar, T. S. Pardawala, and C. D. Pai, “A comparative study of categorical variable encoding techniques for neural network classifiers,” Int. J. Comput. Appl., vol. 175, no. 4, pp. 7-9, 2017, https://doi.org/10.5120/ijca2017914706.

D. Singh and B. Singh, “Investigating the impact of data normalization on classification performance,” Appl. Soft Comput., vol. 97, p. 105524, 2020, https://doi.org/10.1016/j.asoc.2020.105524.

Downloads

Published

23.07.2024

How to Cite

Lee Ye Hean. (2024). An Airline Passenger Satisfaction Prediction by Genetic-Algorithm-Based Hybrid AutoEncoder and Machine Learning Models. International Journal of Intelligent Systems and Applications in Engineering, 12(4), 1974–1985. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6516

Issue

Section

Research Article