Comparative Analysis of Machine Learning Algorithms for Crop Variety Prediction: Performance Metrics, Data Requirements, and Methodological Insight

Authors

  • Shivani Rastogi, Ranjana Sharma

Keywords:

Crop variety prediction, machine learning, CNN, Random Forests, Deep Neural Networks, SVM, performance metrics, agricultural data analysis.

Abstract

The prediction of crop variety performance is crucial for agricultural planning and decision- making. Traditional methods often fall short in handling the complexity and volume of data required for accurate predictions. Recently, machine learning algorithms have shown promise in improving prediction accuracy. This study aims to compare various machine learning methods in terms of their performance metrics, data requirements, and methodological strengths and limitations in the context of crop variety prediction. A comprehensive meta- analysis was conducted, reviewing 40 studies that applied different machine learning algorithms, including Convolutional Neural Networks (CNN), Random Forests (RF), Deep Neural Networks (DNN), Support Vector Machines (SVM), and more. Performance metrics such as RMSE, and accuracy were standardized for comparison. The studies covered a range of crops including corn, soybean, rice, and wheat, with test sample sizes varying from 80 to 2500 samples. The results indicate that RF and DNN generally perform well across various metrics, while CNN methods excel particularly in classification tasks. Data requirements and performance varied significantly, with CNN-based methods requiring larger datasets compared to traditional models. This meta-analysis highlights the potential of machine learning algorithms to enhance crop variety prediction accuracy. RF and DNN are robust performers across diverse datasets, while CNNs are particularly effective for specific applications. The study underscores the importance of selecting appropriate algorithms based on the specific prediction task and available data. Future research should focus on optimizing data collection and preprocessing to further improve prediction accuracy and applicability of these methods in real-world agricultural settings.

Downloads

Download data is not yet available.

References

Pawlak, K., & Kołodziejczak, M. (2020). The Role of Agriculture in Ensuring Food Security in Developing Countries: Considerations in the Context of the Problem of Sustainable Food Production. Sustainability, 12(13), 5488. https://doi.org/10.3390/su12135488

Araújo, S. O., Peres, R. S., Ramalho, J. C., Lidon, F., & Barata, J. (2023a). Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives. Agronomy, 13(12), 2976. https://doi.org/10.3390/agronomy13122976

Dhanaraju, M., Chenniappan, P., Ramalingam, K., Pazhanivelan, S., & Kaliaperumal,R. (2022). Smart Farming: Internet of Things (IoT)-Based Sustainable Agriculture. Agriculture, 12(10), 1745. https://doi.org/10.3390/agriculture12101745

Elbasi, E., Zaki, C., Topcu, A. E., Abdelbaki, W., Zreikat, A. I., Cina, E., Shdefat, A., & Saker, L. (2023a). Crop Prediction Model Using Machine Learning Algorithms. Applied Sciences, 13(16), 9288. https://doi.org/10.3390/app13169288

Grannis, S. J., Williams, J. L., Kasthuri, S., Murray, M., & Xu, H. (2022). Evaluation of real-world referential and probabilistic patient matching to advance patient identification strategy. Journal of the American Medical Informatics Association, 29(8), 1409–1415. https://doi.org/10.1093/jamia/ocac068

Araújo, S. O., Peres, R. S., Ramalho, J. C., Lidon, F., & Barata, J. (2023b). Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives. Agronomy, 13(12), 2976. https://doi.org/10.3390/agronomy13122976

Chicco, D., Warrens, M. J., & Jurman, G. (2021). The coefficient of determination R- squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ. Computer Science, 7, e623. https://doi.org/10.7717/peerj-cs.623

Pukrongta, N., Taparugssanagorn, A., & Sangpradit, K. (2024a). Enhancing Crop Yield Predictions with PEnsemble 4: IoT and ML-Driven for Precision Agriculture. Applied Sciences, 14(8), 3313. https://doi.org/10.3390/app14083313

Pukrongta, N., Taparugssanagorn, A., & Sangpradit, K. (2024b). Enhancing Crop Yield Predictions with PEnsemble 4: IoT and ML-Driven for Precision Agriculture. Applied Sciences, 14(8), 3313. https://doi.org/10.3390/app14083313

Kandasamy, A., & M, V. S. (2024). Data Analytics in Crop Decision Support System. ResearchGate. https://www.researchgate.net/publication/377811660_Data_Analytics_in_Crop_Decis ion_Support_System

Kossmeier, M., Tran, U. S., & Voracek, M. (2020). Charting the landscape of graphical displays for meta-analysis and systematic reviews: a comprehensive review, taxonomy, and feature analysis. BMC Medical Research Methodology, 20(1). https://doi.org/10.1186/s12874-020-0911-9

Cravero, A., Pardo, S., Sepúlveda, S., & Muñoz, L. (2022). Challenges to Use Machine Learning in Agricultural Big Data: A Systematic Literature Review. Agronomy, 12(3), 748. https://doi.org/10.3390/agronomy12030748

Schlemitz, A., & Mezhuyev, V. (2024). Approaches for data collection and process standardization in smart manufacturing: systematic literature review. Journal of Industrial Information Integration, 38, 100578. https://doi.org/10.1016/j.jii.2024.100578

Elbasi, E., Zaki, C., Topcu, A. E., Abdelbaki, W., Zreikat, A. I., Cina, E., Shdefat, A., & Saker, L. (2023b). Crop Prediction Model Using Machine Learning Algorithms. Applied Sciences, 13(16), 9288. https://doi.org/10.3390/app13169288

Araújo, S. O., Peres, R. S., Ramalho, J. C., Lidon, F., & Barata, J. (2023c). Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives. Agronomy, 13(12), 2976. https://doi.org/10.3390/agronomy13122976

Althnian, A., AlSaeed, D., Al-Baity, H., Samha, A., Dris, A. B., Alzakari, N., Elwafa,A., & Kurdi, H. (2021). Impact of Dataset Size on Classification Performance: An Empirical Evaluation in the Medical Domain. Applied Sciences, 11(2), 796. https://doi.org/10.3390/app11020796

Figure 3: Box-plots summarizing the performance of the algorithms. . . (n.d.). ResearchGate. https://www.researchgate.net/figure/Box-plots-summarizing-the- performance-of-the-algorithms-measured-using-the-Jaccard_fig1_256467933

Dettori, J. R., Norvell, D. C., & Chapman, J. R. (2021a). Seeing the Forest by Looking at the Trees: How to Interpret a Meta-Analysis Forest Plot. Global Spine Journal, 11(4), 614–616. https://doi.org/10.1177/21925682211003889

Dettori, J. R., Norvell, D. C., & Chapman, J. R. (2021b). Seeing the Forest by Looking at the Trees: How to Interpret a Meta-Analysis Forest Plot. Global Spine Journal, 11(4), 614–616. https://doi.org/10.1177/21925682211003889

A forest plot displaying the effect sizes and confidence intervals of. . . (n.d.). ResearchGate. https://www.researchgate.net/figure/A-forest-plot-displaying-the- effect-sizes-and-confidence-intervals-of-studies-included-in_fig3_316993227

Kim, N., Ha, K. J., Park, N. W., Cho, J., Hong, S., & Lee, Y. W. (2019). A Comparison Between Major Artificial Intelligence Models for Crop Yield Prediction:Case Study of the Midwestern United States, 2006–2015. ISPRS International Journal of Geo-information, 8(5), 240. https://doi.org/10.3390/ijgi8050240

Iqbal, S., Qureshi, A. N., Ullah, A., Li, J., & Mahmood, T. (2022). Improving the Robustness and Quality of Biomedical CNN Models through Adaptive Hyperparameter Tuning. Applied Sciences, 12(22), 11870. https://doi.org/10.3390/app122211870

Vyshnavi, P., Challagulla, S. P., Adamu, M., Vicencio, F., Jameel, M., Ibrahim, Y. E., & Ahmed, O. S. (2023). Utilizing Artificial Neural Networks and Random Forests to Forecast the Dynamic Amplification Factors of Non-Structural Components. Applied Sciences, 13(20), 11329. https://doi.org/10.3390/app132011329

Joshi, A., Pradhan, B., Gite, S., & Chakraborty, S. (2023). Remote-Sensing Data and Deep-Learning Techniques in Crop Mapping and Yield Prediction: A Systematic Review. Remote Sensing, 15(8), 2014. https://doi.org/10.3390/rs15082014

Ahmed, S. F., Alam, M. S. B., Hassan, M., Rozbu, M. R., Ishtiak, T., Rafa, N., Mofijur, M., Ali, A. B. M. S., & Gandomi, A. H. (2023). Deep learning modelling techniques: current progress, applications, advantages, and challenges. Artificial Intelligence Review, 56(11), 13521–13617. https://doi.org/10.1007/s10462-023- 10466-8

Guido, R., Ferrisi, S., Lofaro, D., & Conforti, D. (2024). An Overview on the Advancements of Support Vector Machine Models in Healthcare Applications: A Review. Information, 15(4), 235. https://doi.org/10.3390/info15040235

Downloads

Published

09.07.2024

How to Cite

Shivani Rastogi. (2024). Comparative Analysis of Machine Learning Algorithms for Crop Variety Prediction: Performance Metrics, Data Requirements, and Methodological Insight. International Journal of Intelligent Systems and Applications in Engineering, 12(22s), 1013 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6583

Issue

Section

Research Article