Analogousness Enhanced Rainfall Predictor using XGBoost Backbone


  • Govardhana Meti Computer Science & Engineering, BGS Institute of Technology, Karnataka, India
  • Ravi Kumar G. K. Computer Science & Engineering, BGS College of Engineering and Technology, Karnataka, India


Rainfall prediction, Machine Learning, Classification, Extreme Gradient Boosting, Data Imbalance


The forecasting of intense rainfall presents a significant challenge for the meteorological department because of the strong connection between rain and the economy as well as human lives, but extreme climate shifts have made it more complicated than ever before to estimate precipitation accurately. In a country that relies heavily on agriculture, the precision of rainfall forecasts is vital. Predicting rainfall is a common application for machine learning systems. By figuring out the hidden patterns in weather data from the past, these methods can almost accurately predict when it will rain. This study proposes a novel machine learning method called Analogousness Enhanced Rainfall Predictor using XGBoost Backbone to foretell rainfall. The proposed method uses the basis of XGBoost and tunes parameters for it to get higher accuracy for the outcomes. This study uses a large dataset of weather observations collected over ten years in various places in Australia. This model successfully deals with the issue of the data-class imbalance issue.


Download data is not yet available.


C. Wu and K.-W. J. E. a. o. a. i. Chau, "Prediction of rainfall time series using modular soft computingmethods," vol. 26, no. 3, pp. 997-1007, 2013.

K. W. Chau and C. J. J. o. H. Wu, "A hybrid model coupled with singular spectrum analysis for daily rainfall prediction," vol. 12, no. 4, pp. 458-473, 2010.

J. Wu, J. Long, and M. J. N. Liu, "Evolving RBF neural networks for rainfall prediction using hybrid particle swarm optimization and genetic algorithm," vol. 148, pp. 136-142, 2015.

A. Parmar, K. Mistree, and M. Sompura, "Machine learning techniques for rainfall prediction: A review," in International Conference on Innovations in information Embedded and Communication Systems, 2017, vol. 3.

S. Aftab et al., "Rainfall prediction in Lahore City using data mining techniques," vol. 9, no. 4, 2018.

M. A. Nayak, S. J. T. Ghosh, and a. climatology, "Prediction of extreme rainfall event using weather pattern recognition and support vector machine classifier," vol. 114, no. 3, pp. 583-603, 2013.

T. Yue, S. Zhang, J. Zhang, B. Zhang, and R. J. J. o. E. M. Li, "Variation of representative rainfall time series length for rainwater harvesting modelling in different climatic zones," vol. 269, p. 110731, 2020.

N. Mishra, H. K. Soni, S. Sharma, A. J. J. o. I. R. Upadhyay, and Applications, "A Comprehensive Survey of Data Mining Techniques on Time Series Data for Rainfall Prediction," vol. 11, no. 2, 2017.

M. Ahmad, S. Aftab, and I. J. I. J. C. A. Ali, "Sentiment analysis of tweets using svm," vol. 177, no. 5, pp. 25-29, 2017.

M. Ahmad, S. J. I. J. o. M. E. Aftab, and C. Science, "Analyzing the performance of SVM for polarity detection with different datasets," vol. 9, no. 10, p. 29, 2017.

M. Ahmad, S. Aftab, I. Ali, and N. J. I. J. M. S. E. Hameed, "Hybrid tools and techniques for sentiment analysis: A review," vol. 8, no. 3, pp. 29-33, 2017.

N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. J. J. o. a. i. r. Kegelmeyer, "SMOTE: synthetic minority over-sampling technique," vol. 16, pp. 321-357, 2002.

T. Zhu, Y. Lin, and Y. J. P. R. Liu, "Synthetic minority oversampling technique for multiclass imbalance problems," vol. 72, pp. 327-340, 2017.

S. Barua, M. Islam, and K. Murase, "A novel synthetic minority oversampling technique for imbalanced data set learning," in International Conference on Neural Information Processing, 2011, pp. 735-744: Springer.

T. J. S. Fushiki and Computing, "Estimation of prediction error by using K-fold cross-validation," vol. 21, no. 2, pp. 137-146, 2011.

P. Refaeilzadeh, L. Tang, and H. J. E. o. d. s. Liu, "Cross-validation," vol. 5, pp. 532-538, 2009.

G. J. Sawale and S. R. J. I. J. C. S. A. Gupta, "Use of artificial neural network in data mining for weather forecasting," vol. 6, no. 2, pp. 383-387, 2013.

J. N. Liu, B. N. Li, T. S. J. I. T. o. S. Dillon, Man,, and P. C. Cybernetics, "An improved naive Bayesian classifier technique coupled with a novel input solution method [rainfall prediction]," vol. 31, no. 2, pp. 249-256, 2001.

K. Abhishek, A. Kumar, R. Ranjan, and S. Kumar, "A rainfall prediction model using artificial neural network," in 2012 IEEE Control and System Graduate Research Colloquium, 2012, pp. 82-87: IEEE.

N. S. Philip, K. B. J. C. Joseph, and Geosciences, "A neural network tool for analyzing trends in rainfall," vol. 29, no. 2, pp. 215-223, 2003.

C. Wu, K. W. Chau, and C. J. J. o. H. Fan, "Prediction of rainfall time series using modular artificial neural networks coupled with data-preprocessing techniques," vol. 389, no. 1-2, pp. 146-167, 2010.

J. Joseph and T. J. I. J. o. C. A. Ratheesh, "Rainfall prediction using data mining techniques," vol. 83, no. 8, 2013.

A. Grover, A. Kapoor, and E. Horvitz, "A deep hybrid model for weather forecasting," in Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, 2015, pp. 379-386.

S. Zainudin, D. S. Jasim, and A. A. J. I. J. A. S. E. I. T. Bakar, "Comparative analysis of data mining techniques for Malaysian rainfall prediction," vol. 6, no. 6, pp. 1148-1153, 2016.

K. Lu and L. Wang, "A novel non-linear combination model based on support vector machine for rainfall prediction," in 2011 Fourth International Joint Conference on Computational Sciences and Optimization, 2011, pp. 1343-1346: IEEE.

A. Fernández, S. Garcia, F. Herrera, and N. V. J. J. o. a. i. r. Chawla, "SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary," vol. 61, pp. 863-905, 2018.

P. Skryjomski and B. Krawczyk, "Influence of minority class instance types on SMOTE imbalanced data oversampling," in first international workshop on learning with imbalanced domains: theory and applications, 2017, pp. 7-21: PMLR.

P. Jeatrakul, K. W. Wong, and C. C. Fung, "Classification of imbalanced data by combining the complementary neural network and SMOTE algorithm," in International Conference on Neural Information Processing, 2010, pp. 152-159: Springer.

M. N. Triba et al., "PLS/OPLS models in metabolomics: the impact of permutation of dataset rows on the K-fold cross-validation quality parameters," vol. 11, no. 1, pp. 13-19, 2015.

The Flow Diagram of Methodology for proposed work.




How to Cite

Meti, G. ., & Kumar G. K., R. . (2023). Analogousness Enhanced Rainfall Predictor using XGBoost Backbone. International Journal of Intelligent Systems and Applications in Engineering, 11(2), 329–335. Retrieved from



Research Article