STACKING ENSEMBLE-BASED PREDICTIVE SYSTEM FOR CROP RECOMMENDATION

  • Gilbert I. O. Aimufua, Department of Computer Science
  • Olalere Morufu
  • Muhammad Umar Abdullahi, Federal University of Technology, Owerri, Imo State
  • Umar Faruk Usman
Keywords: Ensemble Learning, Crop Selection, Precision Agriculture, Predictive Modeling, Proof-of-Concept System

Abstract

Agricultural sustainability depends on crop production, yet choosing suitable crops for a specific location is difficult because environmental conditions change constantly. Traditional approaches are often limited in scope and fail to adapt to diverse soil types and environmental parameters. This study introduces a crop recommendation method based on a stacking ensemble of machine-learning models. The system was developed following a Design Science Research (DSR) methodology. The proposed model combines a wide array of base learners, including K-Nearest Neighbors, Decision Trees, Support Vector Machines, Naive Bayes, Logistic Regression, and Extreme Gradient Boosting. A Random Forest meta-model integrates their predictions. The model was trained and validated on a large dataset obtained from Kaggle, covering a wide variety of crops and environmental characteristics. Performance was evaluated using Accuracy, Precision, Recall, and F1-Score. The model achieved an accuracy of 99.8%, along with high precision, recall, and F1 scores, outperforming previous research by a significant margin. Data flow diagrams illustrate how data is processed within the system. The implementation uses the Python programming language, with MongoDB as the database. The resulting proof-of-concept system demonstrates the practical applicability of the model by providing reliable crop recommendations from environmental data. This research advances the optimization of crop management strategies through predictive modeling and offers farmers a robust tool for making informed decisions, ultimately enhancing agricultural productivity and sustainability.
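The stacking arrangement described in the abstract can be sketched with scikit-learn as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the data is synthetic (the Kaggle dataset is not reproduced here), all hyperparameters are placeholders, the function names build_stack and evaluate are hypothetical, and scikit-learn's GradientBoostingClassifier stands in for Extreme Gradient Boosting so that the example needs no extra dependency. Metrics are macro-averaged, which is also an assumption.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (
    GradientBoostingClassifier,
    RandomForestClassifier,
    StackingClassifier,
)
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, precision_recall_fscore_support
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier


def build_stack(seed=0):
    """Six base learners whose predictions feed a Random Forest meta-model."""
    base_learners = [
        # Distance- and margin-based models get feature scaling.
        ("knn", make_pipeline(StandardScaler(), KNeighborsClassifier())),
        ("dt", DecisionTreeClassifier(random_state=seed)),
        ("svm", make_pipeline(StandardScaler(), SVC(random_state=seed))),
        ("nb", GaussianNB()),
        ("lr", make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))),
        # Assumption: stand-in for XGBoost (a different gradient boosting implementation).
        ("gb", GradientBoostingClassifier(n_estimators=30, random_state=seed)),
    ]
    meta_model = RandomForestClassifier(n_estimators=50, random_state=seed)
    # cv=3: the meta-model is trained on out-of-fold base-learner predictions.
    return StackingClassifier(estimators=base_learners, final_estimator=meta_model, cv=3)


def evaluate(seed=0):
    """Train on synthetic multi-class data and report the four metrics."""
    # Assumption: synthetic stand-in for crop data (features -> crop label).
    X, y = make_classification(
        n_samples=600, n_features=7, n_informative=5, n_redundant=0,
        n_classes=4, class_sep=2.0, random_state=seed,
    )
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=seed
    )
    model = build_stack(seed).fit(X_train, y_train)
    y_pred = model.predict(X_test)
    precision, recall, f1, _ = precision_recall_fscore_support(
        y_test, y_pred, average="macro", zero_division=0
    )
    return {
        "accuracy": float(accuracy_score(y_test, y_pred)),
        "precision": float(precision),
        "recall": float(recall),
        "f1": float(f1),
    }


if __name__ == "__main__":
    for name, value in evaluate().items():
        print(f"{name}: {value:.3f}")
```

Scores from this sketch reflect the synthetic data only and say nothing about the 99.8% accuracy reported above. Training the meta-model on out-of-fold predictions (the cv argument) keeps it from seeing base-learner outputs on samples those learners were fitted on.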

Published
2024-12-31
How to Cite
Aimufua, G. I. O., Morufu, O., Abdullahi, M. U., & Usman, U. F. (2024). STACKING ENSEMBLE-BASED PREDICTIVE SYSTEM FOR CROP RECOMMENDATION. FUDMA JOURNAL OF SCIENCES, 8(6), 72-83. https://doi.org/10.33003/fjs-2024-0806-2856