Enhancing House Price Forecasting with Stacking Regression and Multiple Machine Learning Approaches

Authors

  • Kelvin Leomitro Computer Science Department - Bina Nusantara Graduate Program - Master of Computer Science, Jakarta, Indonesia, 11480
  • Alexander Agung Santoso Gunawan Computer Science Department - School of Computer Science, Bina Nusantara University, Jakarta, Indonesia, 11480

Keywords:

House Price Prediction, Machine Learning, Stacking Regression, Supervised Learning

Abstract

This paper introduces a comprehensive algorithmic model for predicting house prices, addressing the absence of a standard reference for property valuation. To counter this, we utilize a set of machine learning techniques that consider various house attributes and features, thereby providing a more standardized approach to house pricing. The dataset used in this study is obtained from Kaggle. A range of algorithms, including Gradient Boost, Support Vector Regression, Decision Tree, Random Forest, Bagging Tree, Ridge, Lasso, Elastic Net, and Stacking Regression, are applied to improve prediction accuracy. Stacking Regression demonstrates the potential for achieving superior prediction scores compared to conventional algorithms. Our experimental results reveal that a Stacking Regression model incorporating Gradient Boosting, Bagging Tree, and Ridge as input algorithms outperforms the other methods, yielding a lower RMSE score. After parameter tuning, the best RMSE score attained with general algorithms was 0.11157 using the Ridge algorithm. In contrast, the Stacking Regression model delivered the best RMSE score of 0.10954, highlighting its enhanced predictive capabilities for house prices.

Downloads

Download data is not yet available.

References

Ahtesham, M., Bawany, N. Z., & Fatima, K., “House Price Prediction using Machine Learning Algorithm - The Case of Karachi City, Pakistan. Proceedings – 2020,” 21st International Arab Conference on Information Technology, ACIT 2020, 7-11.

Subudhi, S., Verma, A., & Patel, A. B., “Prognostic machine learning models for COVID-19 to facilitate decision making,” International Journal of Clinical Practice, 74(12), 4–7.

Zhang, Y., & Haghani, A., “A gradient boosting method to improve travel time prediction,” Transportation Research Part C: Emerging Technologies, 58, 308–324.

Belgiu, M., & Drăgu, L., “Random Forest in remote sensing: A review of applications and future directions,” ISPRS Journal of Photogrammetry and Remote Sensing, 114, 24–31.

Borde, S., Rane, A., Shende, G., & Shetty, S., “Real Estate Investment Advising Using Machine Learning,” International Research Journal of Engineering and Technology (IRJET), 4(3), 1821–1825.

Zhang, L., Xiong, J., Zhao, H., Hong, H., Zhu, X., & Li, C., “Sleep stages classification by CW Doppler radar using bagged trees algorithm,” 2017 IEEE Radar Conference, RadarConf 2017, 0788–0791.

Melkumova, L. E., and S. Ya Shatskikh. "Comparing Ridge and LASSO estimators for data analysis." Procedia engineering 201 (2017): 746-755.

Awad, M., & Khanna, R., “Support Vector Regression,” 2011 1st International EConference on Computer and Knowledge Engineering, ICCKE 2011, Vc, 28–33.

PROCESS Corporation, Boston, MA, USA. Intranets: Internet technologies deployed behind the firewall for corporate productivity. Presented at INET96 Annual Meeting. [Online]. Available: http://home.process.com/Intranets/wp2.htp

Pavlyshenko, B., “Using Stacking Approaches for Machine Learning Models,” Proceedings of the 2018 IEEE 2nd International Conference on Data Stream Mining and Processing, DSMP 2018, 255–258.

Lu, S., Li, Z., Qin, Z., Yang, X., & Goh, R. S. M., “A hybrid regression technique for house prices prediction,” IEEE International Conference on Industrial Engineering and Engineering Management, 2017-Decem, 319–323.

Abbasi, S., “Advanced Regression Techniques Based Housing Price Prediction Model,” 13th International Conference of Iranian Operations Research Society, February, 2020, 1–10.

Parasich, A. V., Parasich, V. A., Kaftannikov, I. L., & Parasich, I. V., “Predicting sales prices of the houses using regression methods of machine learning,” RPC 2018 - Proceedings of the 3rd Russian-Pacific Conference on Computer Technology and Applications, 3–7.

Aggarwal, C. C., “Outlier Analysis. In Outlier Analysis,” Motorola Semiconductor Data Manual, Motorola Semiconductor Products Inc., Phoenix, AZ, USA, 1989.

Aggarwal, V., Gupta, V., Singh, P., Sharma, K., & Sharma, N., “Detection of spatial outlier by using improved Z-score test,” Proceedings of the International Conference on Trends in Electronics and Informatics, ICOEI 2019, 2019-April(Icoei), 788–790.

Yang, L., & Shami, A., “On hyperparameter optimization of machine learning algorithms: Theory and practice. Neurocomputing,” 415, 295–316.

Log Transformation Result

Downloads

Published

17.02.2023

How to Cite

Leomitro, K. ., & Gunawan, A. A. S. . (2023). Enhancing House Price Forecasting with Stacking Regression and Multiple Machine Learning Approaches. International Journal of Intelligent Systems and Applications in Engineering, 11(2), 898 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/2967

Issue

Section

Research Article