Comparative Analysis of Machine Learning Algorithms for Crop Variety Prediction: Performance Metrics, Data Requirements, and Methodological Insight
Keywords:
Crop variety prediction, machine learning, CNN, Random Forests, Deep Neural Networks, SVM, performance metrics, agricultural data analysis.Abstract
The prediction of crop variety performance is crucial for agricultural planning and decision- making. Traditional methods often fall short in handling the complexity and volume of data required for accurate predictions. Recently, machine learning algorithms have shown promise in improving prediction accuracy. This study aims to compare various machine learning methods in terms of their performance metrics, data requirements, and methodological strengths and limitations in the context of crop variety prediction. A comprehensive meta- analysis was conducted, reviewing 40 studies that applied different machine learning algorithms, including Convolutional Neural Networks (CNN), Random Forests (RF), Deep Neural Networks (DNN), Support Vector Machines (SVM), and more. Performance metrics such as RMSE, and accuracy were standardized for comparison. The studies covered a range of crops including corn, soybean, rice, and wheat, with test sample sizes varying from 80 to 2500 samples. The results indicate that RF and DNN generally perform well across various metrics, while CNN methods excel particularly in classification tasks. Data requirements and performance varied significantly, with CNN-based methods requiring larger datasets compared to traditional models. This meta-analysis highlights the potential of machine learning algorithms to enhance crop variety prediction accuracy. RF and DNN are robust performers across diverse datasets, while CNNs are particularly effective for specific applications. The study underscores the importance of selecting appropriate algorithms based on the specific prediction task and available data. Future research should focus on optimizing data collection and preprocessing to further improve prediction accuracy and applicability of these methods in real-world agricultural settings.
Downloads
References
Pawlak, K., & Kołodziejczak, M. (2020). The Role of Agriculture in Ensuring Food Security in Developing Countries: Considerations in the Context of the Problem of Sustainable Food Production. Sustainability, 12(13), 5488. https://doi.org/10.3390/su12135488
Araújo, S. O., Peres, R. S., Ramalho, J. C., Lidon, F., & Barata, J. (2023a). Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives. Agronomy, 13(12), 2976. https://doi.org/10.3390/agronomy13122976
Dhanaraju, M., Chenniappan, P., Ramalingam, K., Pazhanivelan, S., & Kaliaperumal,R. (2022). Smart Farming: Internet of Things (IoT)-Based Sustainable Agriculture. Agriculture, 12(10), 1745. https://doi.org/10.3390/agriculture12101745
Elbasi, E., Zaki, C., Topcu, A. E., Abdelbaki, W., Zreikat, A. I., Cina, E., Shdefat, A., & Saker, L. (2023a). Crop Prediction Model Using Machine Learning Algorithms. Applied Sciences, 13(16), 9288. https://doi.org/10.3390/app13169288
Grannis, S. J., Williams, J. L., Kasthuri, S., Murray, M., & Xu, H. (2022). Evaluation of real-world referential and probabilistic patient matching to advance patient identification strategy. Journal of the American Medical Informatics Association, 29(8), 1409–1415. https://doi.org/10.1093/jamia/ocac068
Araújo, S. O., Peres, R. S., Ramalho, J. C., Lidon, F., & Barata, J. (2023b). Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives. Agronomy, 13(12), 2976. https://doi.org/10.3390/agronomy13122976
Chicco, D., Warrens, M. J., & Jurman, G. (2021). The coefficient of determination R- squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ. Computer Science, 7, e623. https://doi.org/10.7717/peerj-cs.623
Pukrongta, N., Taparugssanagorn, A., & Sangpradit, K. (2024a). Enhancing Crop Yield Predictions with PEnsemble 4: IoT and ML-Driven for Precision Agriculture. Applied Sciences, 14(8), 3313. https://doi.org/10.3390/app14083313
Pukrongta, N., Taparugssanagorn, A., & Sangpradit, K. (2024b). Enhancing Crop Yield Predictions with PEnsemble 4: IoT and ML-Driven for Precision Agriculture. Applied Sciences, 14(8), 3313. https://doi.org/10.3390/app14083313
Kandasamy, A., & M, V. S. (2024). Data Analytics in Crop Decision Support System. ResearchGate. https://www.researchgate.net/publication/377811660_Data_Analytics_in_Crop_Decis ion_Support_System
Kossmeier, M., Tran, U. S., & Voracek, M. (2020). Charting the landscape of graphical displays for meta-analysis and systematic reviews: a comprehensive review, taxonomy, and feature analysis. BMC Medical Research Methodology, 20(1). https://doi.org/10.1186/s12874-020-0911-9
Cravero, A., Pardo, S., Sepúlveda, S., & Muñoz, L. (2022). Challenges to Use Machine Learning in Agricultural Big Data: A Systematic Literature Review. Agronomy, 12(3), 748. https://doi.org/10.3390/agronomy12030748
Schlemitz, A., & Mezhuyev, V. (2024). Approaches for data collection and process standardization in smart manufacturing: systematic literature review. Journal of Industrial Information Integration, 38, 100578. https://doi.org/10.1016/j.jii.2024.100578
Elbasi, E., Zaki, C., Topcu, A. E., Abdelbaki, W., Zreikat, A. I., Cina, E., Shdefat, A., & Saker, L. (2023b). Crop Prediction Model Using Machine Learning Algorithms. Applied Sciences, 13(16), 9288. https://doi.org/10.3390/app13169288
Araújo, S. O., Peres, R. S., Ramalho, J. C., Lidon, F., & Barata, J. (2023c). Machine Learning Applications in Agriculture: Current Trends, Challenges, and Future Perspectives. Agronomy, 13(12), 2976. https://doi.org/10.3390/agronomy13122976
Althnian, A., AlSaeed, D., Al-Baity, H., Samha, A., Dris, A. B., Alzakari, N., Elwafa,A., & Kurdi, H. (2021). Impact of Dataset Size on Classification Performance: An Empirical Evaluation in the Medical Domain. Applied Sciences, 11(2), 796. https://doi.org/10.3390/app11020796
Figure 3: Box-plots summarizing the performance of the algorithms. . . (n.d.). ResearchGate. https://www.researchgate.net/figure/Box-plots-summarizing-the- performance-of-the-algorithms-measured-using-the-Jaccard_fig1_256467933
Dettori, J. R., Norvell, D. C., & Chapman, J. R. (2021a). Seeing the Forest by Looking at the Trees: How to Interpret a Meta-Analysis Forest Plot. Global Spine Journal, 11(4), 614–616. https://doi.org/10.1177/21925682211003889
Dettori, J. R., Norvell, D. C., & Chapman, J. R. (2021b). Seeing the Forest by Looking at the Trees: How to Interpret a Meta-Analysis Forest Plot. Global Spine Journal, 11(4), 614–616. https://doi.org/10.1177/21925682211003889
A forest plot displaying the effect sizes and confidence intervals of. . . (n.d.). ResearchGate. https://www.researchgate.net/figure/A-forest-plot-displaying-the- effect-sizes-and-confidence-intervals-of-studies-included-in_fig3_316993227
Kim, N., Ha, K. J., Park, N. W., Cho, J., Hong, S., & Lee, Y. W. (2019). A Comparison Between Major Artificial Intelligence Models for Crop Yield Prediction:Case Study of the Midwestern United States, 2006–2015. ISPRS International Journal of Geo-information, 8(5), 240. https://doi.org/10.3390/ijgi8050240
Iqbal, S., Qureshi, A. N., Ullah, A., Li, J., & Mahmood, T. (2022). Improving the Robustness and Quality of Biomedical CNN Models through Adaptive Hyperparameter Tuning. Applied Sciences, 12(22), 11870. https://doi.org/10.3390/app122211870
Vyshnavi, P., Challagulla, S. P., Adamu, M., Vicencio, F., Jameel, M., Ibrahim, Y. E., & Ahmed, O. S. (2023). Utilizing Artificial Neural Networks and Random Forests to Forecast the Dynamic Amplification Factors of Non-Structural Components. Applied Sciences, 13(20), 11329. https://doi.org/10.3390/app132011329
Joshi, A., Pradhan, B., Gite, S., & Chakraborty, S. (2023). Remote-Sensing Data and Deep-Learning Techniques in Crop Mapping and Yield Prediction: A Systematic Review. Remote Sensing, 15(8), 2014. https://doi.org/10.3390/rs15082014
Ahmed, S. F., Alam, M. S. B., Hassan, M., Rozbu, M. R., Ishtiak, T., Rafa, N., Mofijur, M., Ali, A. B. M. S., & Gandomi, A. H. (2023). Deep learning modelling techniques: current progress, applications, advantages, and challenges. Artificial Intelligence Review, 56(11), 13521–13617. https://doi.org/10.1007/s10462-023- 10466-8
Guido, R., Ferrisi, S., Lofaro, D., & Conforti, D. (2024). An Overview on the Advancements of Support Vector Machine Models in Healthcare Applications: A Review. Information, 15(4), 235. https://doi.org/10.3390/info15040235
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.