PREDICTION ACCURACY ANALYSIS OF MACHINE LEARNING CLASSIFIERS ON STUDENT COURSE ASSESSMENT METHODS

Authors

  • Godwin A. Otu
    Air Force Institute of Technology Kaduna image/svg+xml
  • Oludele Awodele
  • Sola A. Adeniji
  • Henry O. Mafua
  • Kehinde A. Olayanju
  • Adeniyi U. Adedayo
  • Suleiman A. Usman
  • Samson Adeyinka
  • Aisha Ramalan
  • Maryam Masari

Keywords:

Prediction Accuracy, Machine Learning Algorithms, Student Performance, Course Assessment, Educational Data Mining

Abstract

There is growing need to improve the quality of education through an effective service delivery from educators. Also, educational institutions are searching for ways to reduce student failure rate. The rapid growth in size and availability of student data and robust algorithms to generate machine learning models, more accurate predictions and tailored learning interventions can be factored. The research investigates the prediction accuracy of machine learning algorithms, including Logistic Regression, Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Naive Bayes, XGBoost, and Gradient Boosting, applied to student learning attributes and course assessment. The aim is to evaluate the effectiveness of these algorithms in predicting student performance on various metrics. A dataset encompassing student learning attributes and assessment modes were analyzed. Each algorithm's predictive capabilities was assessed using accuracy, precision, recall and F1-score metrics. Logistic regression had the highest accuracy score of 0.93, SVM and XGBoost both achieved an accuracy 0f 0.90 while Random Forest, KNN and Naive Bayes achieved same accuracy score of 0.88 while Gradient Boosting achieved an accuracy score of 0.85 each which was the lowest. RF, SVM, KNN got the same F-score, recall and precision of 0.93, 0.97 and 0.90 respectively while Naive Bayes, XGBoost, and Gradient Boosting achieved the same recall of 0.94 while KNN had a recall of 0.97. Gradient Boosting had a precision of 0.89, and an F-score of 0.92, the F-score of Naïve Bayes was 0.93. This research underscores the potential of advanced machine learning techniques in enhancing educational outcomes.

Dimensions

Al-Shabandar R, Hussain A, Laws A, et al. (2017). Machine learning approaches to predict learningoutcomes in Massive open online courses[C]//2017 International Joint Conference on NeuralNetworks (IJCNN). IEEE, 2017: 713-720.

Alturki, S., Cohausz, L., & Stuckenschmidt, H. (2022). Predicting Masters students academic performance: An empirical study in germany. Smart Learning Environments, 9(1), 38. doi:https://doi.org/10.1186/s40561-022-00220-y

Balakrishnan G, Coetzee D. (2013). Predicting student retention in massive open online courses using hidden markov models [J]. Electrical Engineering and Computer Sciences University of California at Berkeley, 2013, 53: 57-58.

Burgos C, Campanario M L, de la Pena D, et al. (2018). Data mining for modeling students

performance: A tutoring action plan to prevent academic dropout [J]. Computers & ElectricalEngineering, 2018, 66: 541-556.

Chao Ma. (2024). Improving the prediction of student performance by integrating a random forest classifier with meta-heuristic optimization algorithms. International Journal of Advanced Computer Science and Applications, 15(6) https://doi.org/10.14569/IJACSA.2024.01506106

Chen J, Feng J, Sun X, et al. (2019). MOOC dropout prediction using a hybrid algorithm based on

decision tree and extreme learning machine [J]. Mathematical Problems in Engineering,

, 2019

Chen, Min & Wu, Lifa. (2021). A dropout prediction method based on time series model in MOOCs. Journal of Physics: Conference Series. 1774. 012065. https://doi.org/10.1088/1742-6596/1774/1/012065 .

Dobson, J. E. (2024). On the confusion matrix. Configurations, 32(4), 331-350. https://doi.org/10.1353/con.2024.a942087

Elvira Vidya Berliana, Mardhani Riasetiawan. (2024). Comparative analysis of nave bayes classifier, support vector machine and decision tree in rainfall classification using confusion matrix. International Journal of Advanced Computer Science and Applications, 15(7) https://doi.org/10.14569/IJACSA.2024.0150755

Hadyaoui, A., & Cheniti-Belcadhi, L. (2023). Ontology-based group assessment analytics framework for performances prediction in project-based collaborative learning. Smart Learning Environments, 10(1), 43. https://doi.org/10.1186/s40561-023-00262-w

Kaur, A., & Sarmadi, M. (2024). Comparative analysis of data preprocessing methods, feature selection techniques and machine learning models for improved classification and regression performance on imbalanced genetic data. Ithaca: https://doi.org/10.1007/s40745-024-00575-8

Kloft M, Stiehler F, Zheng Z, et al. (2014). Predicting MOOC dropout over weeks using machine

learning methods[C]//Proceedings of the EMNLP 2014 workshop on analysis of large scale social interaction in MOOCs. 2014: 60-65.

Lalitha, R., J, D. L., Kavitha, K., RamaReddy, T., Srinivas, R., & Sujana, C. (2021). Prediction and analysis of corona virus disease (COVID-19) using cubist and OneR. IOP Conference Series. Materials Science and Engineering, 1074(1) https://doi.org/10.1088/1757-899X/1074/1/012022

Liang J, Li C, Zheng L. (2014). Machine learning application in MOOCs: Dropout prediction[C]//2016 11th International Conference on Computer Science & Education (ICCSE). IEEE, 2016: 52-57.

LIU T, Xiu L I. (2017). Finding out reasons for low completion in MOOC environment: an explicable approach using hybrid data mining methods [J]. DEStech Transactions on Social Science,Education and Human Science, 2017 (meit).

Ortiz, B. L., Gupta, V., Kumar, R., Jalin, A., Cao, X., Ziegenbein, C., . . . Choi, S. W. (2024). Data preprocessing techniques for AI and machine learning readiness: Scoping review of wearable sensor data in cancer care. JMIR mHealth and uHealth, 12 https://doi.org/10.2196/59587

Otu, Godwin & Usman, Suleiman & Ugbe, Raphael & Iheagwara, Stephen & Okafor, Akudo & Okonkwo, Fredrick & Shukurah, Oyebanji & Adeniyi, Usman. (2023). COMPARATIVE ANALYSIS OF AGGREGATION AND INHERITANCE STRATEGIES IN INCREMENTAL PROGRAM DEVELOPMENT. FUDMA JOURNAL OF SCIENCES. 7. 57-64. https://doi.org/10.33003/fjs-2023-0702-1710 .

Prakash, K., & Kalaiarasan, C. (2024). Web services performance prediction with confusion matrix and K-fold cross validation to provide prior service quality characteristics. Journal of Electrical Systems, 20(2), 284-292. Retrieved from https://www.proquest.com/scholarly-journals/web-services-performance-prediction-with/docview/3078054634/se-2

Qiu J, Tang J, Liu T X, et al. (2016). Modeling and predicting learning behavior in MOOCs[C]//Proceedings of the ninth ACM international conference on web search and data mining. ACM, 2016: 93-102.

Sharma, N., Appukutti, S., Garg, U., Mukherjee, J., & Mishra, S. (2023). Analysis of Students academic performance based on their time spent on extra-curricular activities using machine learning techniques. International Journal of Modern Education and Computer Science, 15(1), 46. doi:https://doi.org/10.5815/ijmecs.2023.01.04

Song, Q. (2022). Generation and research of online english course learning evaluation model based on genetic algorithm improved neural set network. Computational Intelligence and Neuroscience : CIN, 2022 https://doi.org/10.1155/2022/7281892

Swaminathan, Sathyanarayanan & Tantri, B Roopashri. (2024). Confusion Matrix-Based Performance Evaluation Metrics. African Journal of Biomedical Research. 27. 4023-4031. https://doi.org/10.53555/AJBR.v27i4S.4345 .

Tang C, Ouyang Y, Rong W, et al. (2018). Time series model for predicting dropout in massive openonline courses[C]//International Conference on Artificial Intelligence in Education. Springer, Cham, 2018: 353-357.

Tri, N. Q., Tung, N. T., Anh, D. L., Huong Pham, T. V., & Doanh, S. C. (2024). Classification of glucose-level in deionized water using machine learning models and data pre-processing technique. PLoS One, 19(12) https://doi.org/10.1371/journal.pone.0311482

Wang W, Yu H, Miao C. (2020). Deep model for dropout prediction in MOOCs [C]//Proceedings of the ICDPAM 2020 Journal of Physics: Conference Series 1774 (2021) 012065 IOP Publishing https://doi.org/10.1088/1742-6596/1774/1/0120657

Xia, X., & Qi, W. (2023). Dropout prediction and decision feedback supported by multi temporal sequences of learning behavior in MOOCs: Revista de universidad y sociedad del conocimiento. International Journal of Educational Technology in Higher Education, 20(1), 32. https://doi.org/10.1186/s41239-023-00400-x

Xia, X., & Qi, W. (2024). Driving STEM learning effectiveness: Dropout prediction and intervention in MOOCs based on one novel behavioral data analysis approach. Humanities & Social Sciences Communications, 11(1), 430. https://doi.org/10.1057/s41599-024-02882-0

Xie, Yanxin. (2024). Efficiency of Hybrid Decision Tree Algorithms in Evaluating the Academic Performance of Students. International Journal of Advanced Computer Science and Applications. 15. https://doi.org/10.14569/IJACSA.2024.0150251 .

Yang, P., Liu, Y., Luo, Y., Wang, Z., & Cai, X. (2024). Text mining and multi-attribute decision-making-based course improvement in massive open online courses. Applied Sciences, 14(9), 3654. https://doi.org/10.3390/app14093654

Zihan, Song & Sung, Sang-Ha & Park, Do-Myung & Park, Byung-Kwon. (2023). All-Year Dropout Prediction Modeling and Analysis for University Students. Applied Sciences. 13. 1143. https://doi.org/10.3390/app13021143

Published

31-12-2024

How to Cite

PREDICTION ACCURACY ANALYSIS OF MACHINE LEARNING CLASSIFIERS ON STUDENT COURSE ASSESSMENT METHODS. (2024). FUDMA JOURNAL OF SCIENCES, 8(6), 288-298. https://doi.org/10.33003/fjs-2024-0806-2927

How to Cite

PREDICTION ACCURACY ANALYSIS OF MACHINE LEARNING CLASSIFIERS ON STUDENT COURSE ASSESSMENT METHODS. (2024). FUDMA JOURNAL OF SCIENCES, 8(6), 288-298. https://doi.org/10.33003/fjs-2024-0806-2927

Most read articles by the same author(s)