A HYBRID PREDICTION MODEL FOR CLASSIFYING STUDENT’S ACADEMIC PERFORMANCE USING VOTING ENSEMBLE METHOD

Authors

  • Jane Ngozi Oruh Michael Okpara University of Agriculture, Umudike
  • Basil Chukwuemeka Iwuji Abia State University, Uturu

DOI:

https://doi.org/10.33003/fjs-2026-1003-4751

Keywords:

Hybrid Prediction, Classification, K-Nearest Neighbour (K-NN), Naïve Bayes (NB), Ensemble Method, Academic Performance

Abstract

Single classification algorithms for predicting students’ academic performance often face limitations in accuracy and robustness, especially with high-dimensional educational data. This study proposes a hybrid prediction model that integrates K-Nearest Neighbour (K-NN) and Naïve Bayes (NB) algorithms using a soft voting ensemble technique to classify students’ academic performance into high, average, and low categories. The Cross-Industry Standard Process for Data Mining (CRISP-DM) and Object-Oriented Analysis and Design Methodology (OOADM) guided the study. Primary data were collected from 240 third-year nursing students of Abia State University, Uturu, across two academic semesters, using 38 demographic, academic, social, and prior performance features. The models were implemented in Python using Scikit-learn and related libraries, with an 80:20 train-test split after preprocessing and feature scaling. Performance was evaluated using accuracy, precision, recall, F1-score, ROC-AUC, and confusion matrices. Results show that the hybrid model achieved superior performance, attaining 100% accuracy and outperforming the standalone K-NN model. The classification results indicated that 51.7% of students were high performers, 10.8% average, and 37.5% low. The deployed system provides performance-based advisory support and demonstrates strong potential for early intervention, personalized learning, and academic planning in educational institutions.

Author Biographies

  • Jane Ngozi Oruh, Michael Okpara University of Agriculture, Umudike

    Department of Computer Science,

    Michael Okpara University of Agriculture, Umudike

     

  • Basil Chukwuemeka Iwuji, Abia State University, Uturu

    School of Postgraduate Studies,

    Abia State University, Uturu

References

Al-Barrak, M. A., & Al-Razgan, M. (2016). Predicting students’ final GPA using decision trees: A case study. International Journal of Information and Education Technology,6(7), 528-533.http://doi.org/10.7763/IJET.2016.V6.745

Amin. Z., Refik, C., Yau, H., & Hernandez-Torrano, D., (2017). Predicting Students‟ GPA and Developing Intervention Strategies Based on Self-Regulatory Learning Behaviors. 2017, IEEE

Amrieh, E., Hamtini, T., & Aljarah, I., (2016). Mining educational data to predict

student’cademic performance using ensemble methods. International Journal of DatabaseTheory and Application 9(8), 119-136.

Baker, R. S., & Inventado, P. S. (2014). Educational data mining and learning analytics. In J. A. Larusson & B. White (Eds.), Learning analytics (pp. 61–75). Springer. https://doi.org/10.1007/978-1-4614-3305-7_4

Baradwaj, B., & Kumar, B., (2011). Mining Educational Data to Analyze Students

Performance.International Journal of Advanced Computer Science and Applications (IJACSA), 2(6),2011, 63-69

Bunkar, K., Sharma, C., Singh, U. P., & Bunkar, M. (2012). Data mining: Prediction for performance improvement of graduate students using classification. International Journal of Computer Applications, 41(3), 1–5. https://doi.org/10.5120/5594-7670

Devasia T, Vinushree T., & Hegde V., (2016): “Prediction of Students Performance using Educational Data Mining”, 2016, IEEE

Dewan M., Zhang L., Rahman C., Hossain A., & Strachan R., (2014). Hybrid decision tree and Naive bayes classifiers for multi-class classification tasks. Elsevier, Expert systems with applications, pp) 1937-1946, 2014

Dietterich, T. G. (2000). Ensemble methods in machine learning. In International workshop on multiple classifier systems (pp. 1–15). Springer.

Gray, S., & Rogers, M., Martinussen, R., & Tannock, R. (2014). Longitudinal relations among inattention, working memory, and academic achievement: Testing mediation and the moderating role of gender. PeerJ. 3. e939. 10.7717/peerj.939

Khan, M. Z., Wasid, M., & Jabin, S. (2019). Machine learning algorithms in analyzing and predicting students’ performance: A review. Procedia Computer Science, 172, 282–289. https://doi.org/10.1016/j.procs.2020.05.037

Kabakchieva, D., (2013). Predicting Student Performance by Using Data Mining Methods for Classification. Cybernetics and information technologies Volume 13, No 1 Sofia 2013 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.2478/cait-2013-0006

Kotsiantis, S., Pierrakeas, C., & Pintelas, P. (2004). Prediction of Student’s Performance in Distance Learning Using Machine Learning Techniques. Applied Artificial Intelligence,Vol. 18, No. 5, 2004, pp. 411-426

Merchan Robiano, S.M., & Duarte Garcia, J.A. (2016). Analysis of data mining techniques for constructing a predictive model for academic performance. IEEE Latin America Transactions, 14(6), 2783-2788

Minaei-Bidgoli, B., Kashy, D., Kortemeyer, G., & Punch, W. (2003). Predicting student performance: An application of data mining methods with an educational web-based system. Proceedings of the 33rd Annual Conference on Frontiers in Education, Nov. 5-8, IEEE Computer Society, Washington, DC, USA., pp: 13-18

Nakayama, M., Kouichi, M., Hiroh, Y. (2018). Contributions of Student’s Assessment of Reflections on the Prediction of Learning Performance. 2018, IEEE

Naren., J. (2014). Application of Data Mining in Educational Database for Predicting Behavioral Patterns of the Students. International Journal of Computer Science and Information Technologies, Vol.5 No.03 2014.

Nikam, S. (2015). A Comparative Study of Classification Techniques in Data Mining Algorithms. Oriental Journal of Computer Science and Technology ;8(1), 2015, ISSN 0974-6471 Online ISSN : 2320-8481

Nikolovski, V., Stojanov, R., Mishkovski, I., Chorbev, I., & Madjarov., G. (2015).

Educational Data Mining: Case Study for Predicting Student Dropout in Higher Education. https://www.researchgate.net/publication/282333827 Conference Paper. April 2015 187

Omisore, O., & Azeez, N. (2016). Predicting Academic Performance of Students with KNN Classifier. Conference: ACM International Conference on Computer Science Research and Innovations (CoSRI 2015).

Osmanbegovic, E., & Suljic, M., (2012). Data mining approach for predicting studentperformance. Economic Review Journal of Economics and Business. Volume 10(1), 2012

Pandey, M., & Taruna, S. (2014). An Empirical Analysis of Classification Techniques for Predicting Academic Performance. 10.1109/IAdCC.2014.6779379.188

Paris. I., Affendey, L., & Mustapha, N.(2010). Improving academic performance prediction using voting technique in data mining. World Academy of Science, Engineering and Technology, vol. 4, pp. 820--823, 2010

Romero, C., & Ventura, S. (2010). Educational Data Mining: A Review of the State of the Art. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 40(6), pp.601–618

Safri, Y.F., Arifudiri, R., & Muslim, M.A. (2018). K-Nearest Neighbour and Naïve Bayes classifier algorithm in deterging the classification of Health Indonesia CardRecipients.ScientificJournalofInformatics,5(1). https://doi.org/10.15294/sji.v5i1.12057

Sivasakthi, M. (2017). Classification and Prediction based Data Mining Algorithms to Predict Students‟ Introductory programming Performance. Proceedings of the International Conference on Inventive Computing and Informatics (ICICI 2017)

Ulyani, N., Mohd, N., Nor-Aini, Y., & Amin A. (2017). Service Quality Performance of Student Housing: The Effects on Students Behavioural Intentions. 2017 IEEE 15th StudentConference on Research and Development (SCOReD)

Ward, A., Howard, S., & Murray-Ward, M. (1996). Achievement and Ability Tests –definition of the Domain. Educational Measurement, 2, University Press of America, pp. 2–5, ISBN 978-0-7618-0385-0

Yang, F., & Li, F.W. B (2018). Study on student performance estimation, student progress analysis, and student potential prediction based on data mining. Computers&Education.123,97-108. https://doi.org/10.1016/j.compedu.2018.04.006

Zhou, Z. H. (2012). Ensemble methods: Foundations and algorithms. CRC Press

Ziedner, M. (1998). Test anxiety: The state of the art”. New York: New York: Plenum Press. p. 259. ISBN 9780306471452. OCLC 757106093 194.

Extracting of Knowledge Using Naïve Bayes Classifier (Dake amd Gyimah, 2017)

Downloads

Published

13-02-2026

How to Cite

Oruh, J. N., & Iwuji, B. C. (2026). A HYBRID PREDICTION MODEL FOR CLASSIFYING STUDENT’S ACADEMIC PERFORMANCE USING VOTING ENSEMBLE METHOD. FUDMA JOURNAL OF SCIENCES, 10(3), 364-376. https://doi.org/10.33003/fjs-2026-1003-4751