A HYBRID PREDICTION MODEL FOR CLASSIFYING STUDENT’S ACADEMIC PERFORMANCE USING VOTING ENSEMBLE METHOD
DOI:
https://doi.org/10.33003/fjs-2026-1003-4751Keywords:
Hybrid Prediction, Classification, K-Nearest Neighbour (K-NN), Naïve Bayes (NB), Ensemble Method, Academic PerformanceAbstract
Single classification algorithms for predicting students’ academic performance often face limitations in accuracy and robustness, especially with high-dimensional educational data. This study proposes a hybrid prediction model that integrates K-Nearest Neighbour (K-NN) and Naïve Bayes (NB) algorithms using a soft voting ensemble technique to classify students’ academic performance into high, average, and low categories. The Cross-Industry Standard Process for Data Mining (CRISP-DM) and Object-Oriented Analysis and Design Methodology (OOADM) guided the study. Primary data were collected from 240 third-year nursing students of Abia State University, Uturu, across two academic semesters, using 38 demographic, academic, social, and prior performance features. The models were implemented in Python using Scikit-learn and related libraries, with an 80:20 train-test split after preprocessing and feature scaling. Performance was evaluated using accuracy, precision, recall, F1-score, ROC-AUC, and confusion matrices. Results show that the hybrid model achieved superior performance, attaining 100% accuracy and outperforming the standalone K-NN model. The classification results indicated that 51.7% of students were high performers, 10.8% average, and 37.5% low. The deployed system provides performance-based advisory support and demonstrates strong potential for early intervention, personalized learning, and academic planning in educational institutions.
References
Al-Barrak, M. A., & Al-Razgan, M. (2016). Predicting students’ final GPA using decision trees: A case study. International Journal of Information and Education Technology,6(7), 528-533.http://doi.org/10.7763/IJET.2016.V6.745
Amin. Z., Refik, C., Yau, H., & Hernandez-Torrano, D., (2017). Predicting Students‟ GPA and Developing Intervention Strategies Based on Self-Regulatory Learning Behaviors. 2017, IEEE
Amrieh, E., Hamtini, T., & Aljarah, I., (2016). Mining educational data to predict
student’cademic performance using ensemble methods. International Journal of DatabaseTheory and Application 9(8), 119-136.
Baker, R. S., & Inventado, P. S. (2014). Educational data mining and learning analytics. In J. A. Larusson & B. White (Eds.), Learning analytics (pp. 61–75). Springer. https://doi.org/10.1007/978-1-4614-3305-7_4
Baradwaj, B., & Kumar, B., (2011). Mining Educational Data to Analyze Students
Performance.International Journal of Advanced Computer Science and Applications (IJACSA), 2(6),2011, 63-69
Bunkar, K., Sharma, C., Singh, U. P., & Bunkar, M. (2012). Data mining: Prediction for performance improvement of graduate students using classification. International Journal of Computer Applications, 41(3), 1–5. https://doi.org/10.5120/5594-7670
Devasia T, Vinushree T., & Hegde V., (2016): “Prediction of Students Performance using Educational Data Mining”, 2016, IEEE
Dewan M., Zhang L., Rahman C., Hossain A., & Strachan R., (2014). Hybrid decision tree and Naive bayes classifiers for multi-class classification tasks. Elsevier, Expert systems with applications, pp) 1937-1946, 2014
Dietterich, T. G. (2000). Ensemble methods in machine learning. In International workshop on multiple classifier systems (pp. 1–15). Springer.
Gray, S., & Rogers, M., Martinussen, R., & Tannock, R. (2014). Longitudinal relations among inattention, working memory, and academic achievement: Testing mediation and the moderating role of gender. PeerJ. 3. e939. 10.7717/peerj.939
Khan, M. Z., Wasid, M., & Jabin, S. (2019). Machine learning algorithms in analyzing and predicting students’ performance: A review. Procedia Computer Science, 172, 282–289. https://doi.org/10.1016/j.procs.2020.05.037
Kabakchieva, D., (2013). Predicting Student Performance by Using Data Mining Methods for Classification. Cybernetics and information technologies Volume 13, No 1 Sofia 2013 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.2478/cait-2013-0006
Kotsiantis, S., Pierrakeas, C., & Pintelas, P. (2004). Prediction of Student’s Performance in Distance Learning Using Machine Learning Techniques. Applied Artificial Intelligence,Vol. 18, No. 5, 2004, pp. 411-426
Merchan Robiano, S.M., & Duarte Garcia, J.A. (2016). Analysis of data mining techniques for constructing a predictive model for academic performance. IEEE Latin America Transactions, 14(6), 2783-2788
Minaei-Bidgoli, B., Kashy, D., Kortemeyer, G., & Punch, W. (2003). Predicting student performance: An application of data mining methods with an educational web-based system. Proceedings of the 33rd Annual Conference on Frontiers in Education, Nov. 5-8, IEEE Computer Society, Washington, DC, USA., pp: 13-18
Nakayama, M., Kouichi, M., Hiroh, Y. (2018). Contributions of Student’s Assessment of Reflections on the Prediction of Learning Performance. 2018, IEEE
Naren., J. (2014). Application of Data Mining in Educational Database for Predicting Behavioral Patterns of the Students. International Journal of Computer Science and Information Technologies, Vol.5 No.03 2014.
Nikam, S. (2015). A Comparative Study of Classification Techniques in Data Mining Algorithms. Oriental Journal of Computer Science and Technology ;8(1), 2015, ISSN 0974-6471 Online ISSN : 2320-8481
Nikolovski, V., Stojanov, R., Mishkovski, I., Chorbev, I., & Madjarov., G. (2015).
Educational Data Mining: Case Study for Predicting Student Dropout in Higher Education. https://www.researchgate.net/publication/282333827 Conference Paper. April 2015 187
Omisore, O., & Azeez, N. (2016). Predicting Academic Performance of Students with KNN Classifier. Conference: ACM International Conference on Computer Science Research and Innovations (CoSRI 2015).
Osmanbegovic, E., & Suljic, M., (2012). Data mining approach for predicting studentperformance. Economic Review Journal of Economics and Business. Volume 10(1), 2012
Pandey, M., & Taruna, S. (2014). An Empirical Analysis of Classification Techniques for Predicting Academic Performance. 10.1109/IAdCC.2014.6779379.188
Paris. I., Affendey, L., & Mustapha, N.(2010). Improving academic performance prediction using voting technique in data mining. World Academy of Science, Engineering and Technology, vol. 4, pp. 820--823, 2010
Romero, C., & Ventura, S. (2010). Educational Data Mining: A Review of the State of the Art. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 40(6), pp.601–618
Safri, Y.F., Arifudiri, R., & Muslim, M.A. (2018). K-Nearest Neighbour and Naïve Bayes classifier algorithm in deterging the classification of Health Indonesia CardRecipients.ScientificJournalofInformatics,5(1). https://doi.org/10.15294/sji.v5i1.12057
Sivasakthi, M. (2017). Classification and Prediction based Data Mining Algorithms to Predict Students‟ Introductory programming Performance. Proceedings of the International Conference on Inventive Computing and Informatics (ICICI 2017)
Ulyani, N., Mohd, N., Nor-Aini, Y., & Amin A. (2017). Service Quality Performance of Student Housing: The Effects on Students Behavioural Intentions. 2017 IEEE 15th StudentConference on Research and Development (SCOReD)
Ward, A., Howard, S., & Murray-Ward, M. (1996). Achievement and Ability Tests –definition of the Domain. Educational Measurement, 2, University Press of America, pp. 2–5, ISBN 978-0-7618-0385-0
Yang, F., & Li, F.W. B (2018). Study on student performance estimation, student progress analysis, and student potential prediction based on data mining. Computers&Education.123,97-108. https://doi.org/10.1016/j.compedu.2018.04.006
Zhou, Z. H. (2012). Ensemble methods: Foundations and algorithms. CRC Press
Ziedner, M. (1998). Test anxiety: The state of the art”. New York: New York: Plenum Press. p. 259. ISBN 9780306471452. OCLC 757106093 194.
Downloads
Published
Issue
Section
Categories
License
Copyright (c) 2026 Jane Ngozi Oruh, Basil Chukwuemeka Iwuji

This work is licensed under a Creative Commons Attribution 4.0 International License.