PREDICTION ACCURACY ANALYSIS OF MACHINE LEARNING CLASSIFIERS ON STUDENT COURSE ASSESSMENT METHODS
Abstract
There is growing need to improve the quality of education through an effective service delivery from educators. Also, educational institutions are searching for ways to reduce student failure rate. The rapid growth in size and availability of student data and robust algorithms to generate machine learning models, more accurate predictions and tailored learning interventions can be factored. The research investigates the prediction accuracy of machine learning algorithms, including Logistic Regression, Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Naive Bayes, XGBoost, and Gradient Boosting, applied to student learning attributes and course assessment. The aim is to evaluate the effectiveness of these algorithms in predicting student performance on various metrics. A dataset encompassing student learning attributes and assessment modes were analyzed. Each algorithm's predictive capabilities was assessed using accuracy, precision, recall and F1-score metrics. Logistic regression had the highest accuracy score of 0.93, SVM and XGBoost both achieved an accuracy 0f 0.90 while Random Forest, KNN and Naive Bayes achieved same accuracy score of 0.88 while Gradient Boosting achieved an accuracy score of 0.85 each which was the lowest. RF, SVM, KNN got the same F-score, recall and precision of 0.93, 0.97 and 0.90 respectively while Naive Bayes, XGBoost, and Gradient Boosting achieved the same recall of 0.94 while KNN had a recall of 0.97. Gradient Boosting had a precision of 0.89, and an F-score of 0.92, the F-score of Naïve Bayes was 0.93. This research underscores the potential of advanced machine learning techniques in enhancing educational outcomes.
References
Al-Shabandar R, Hussain A, Laws A, et al. (2017). Machine learning approaches to predict learningoutcomes in Massive open online courses[C]//2017 International Joint Conference on NeuralNetworks (IJCNN). IEEE, 2017: 713-720. DOI: https://doi.org/10.1109/IJCNN.2017.7965922
Alturki, S., Cohausz, L., & Stuckenschmidt, H. (2022). Predicting Masters students academic performance: An empirical study in germany. Smart Learning Environments, 9(1), 38. doi:https://doi.org/10.1186/s40561-022-00220-y DOI: https://doi.org/10.1186/s40561-022-00220-y
Balakrishnan G, Coetzee D. (2013). Predicting student retention in massive open online courses using hidden markov models [J]. Electrical Engineering and Computer Sciences University of California at Berkeley, 2013, 53: 57-58.
Burgos C, Campanario M L, de la Pena D, et al. (2018). Data mining for modeling students
performance: A tutoring action plan to prevent academic dropout [J]. Computers & ElectricalEngineering, 2018, 66: 541-556. DOI: https://doi.org/10.1016/j.compeleceng.2017.03.005
Chao Ma. (2024). Improving the prediction of student performance by integrating a random forest classifier with meta-heuristic optimization algorithms. International Journal of Advanced Computer Science and Applications, 15(6) https://doi.org/10.14569/IJACSA.2024.01506106 DOI: https://doi.org/10.14569/IJACSA.2024.01506106
Chen J, Feng J, Sun X, et al. (2019). MOOC dropout prediction using a hybrid algorithm based on DOI: https://doi.org/10.1155/2019/8404653
decision tree and extreme learning machine [J]. Mathematical Problems in Engineering,
, 2019
Chen, Min & Wu, Lifa. (2021). A dropout prediction method based on time series model in MOOCs. Journal of Physics: Conference Series. 1774. 012065. https://doi.org/10.1088/1742-6596/1774/1/012065 .
Dobson, J. E. (2024). On the confusion matrix. Configurations, 32(4), 331-350. https://doi.org/10.1353/con.2024.a942087 DOI: https://doi.org/10.1353/con.2024.a942087
Elvira Vidya Berliana, Mardhani Riasetiawan. (2024). Comparative analysis of nave bayes classifier, support vector machine and decision tree in rainfall classification using confusion matrix. International Journal of Advanced Computer Science and Applications, 15(7) https://doi.org/10.14569/IJACSA.2024.0150755 DOI: https://doi.org/10.14569/IJACSA.2024.0150755
Hadyaoui, A., & Cheniti-Belcadhi, L. (2023). Ontology-based group assessment analytics framework for performances prediction in project-based collaborative learning. Smart Learning Environments, 10(1), 43. https://doi.org/10.1186/s40561-023-00262-w DOI: https://doi.org/10.1186/s40561-023-00262-w
Kaur, A., & Sarmadi, M. (2024). Comparative analysis of data preprocessing methods, feature selection techniques and machine learning models for improved classification and regression performance on imbalanced genetic data. Ithaca: https://doi.org/10.1007/s40745-024-00575-8 DOI: https://doi.org/10.1007/s40745-024-00575-8
Kloft M, Stiehler F, Zheng Z, et al. (2014). Predicting MOOC dropout over weeks using machine DOI: https://doi.org/10.3115/v1/W14-4111
learning methods[C]//Proceedings of the EMNLP 2014 workshop on analysis of large scale social interaction in MOOCs. 2014: 60-65.
Lalitha, R., J, D. L., Kavitha, K., RamaReddy, T., Srinivas, R., & Sujana, C. (2021). Prediction and analysis of corona virus disease (COVID-19) using cubist and OneR. IOP Conference Series. Materials Science and Engineering, 1074(1) https://doi.org/10.1088/1757-899X/1074/1/012022 DOI: https://doi.org/10.1088/1757-899X/1074/1/012022
Liang J, Li C, Zheng L. (2014). Machine learning application in MOOCs: Dropout prediction[C]//2016 11th International Conference on Computer Science & Education (ICCSE). IEEE, 2016: 52-57. DOI: https://doi.org/10.1109/ICCSE.2016.7581554
LIU T, Xiu L I. (2017). Finding out reasons for low completion in MOOC environment: an explicable approach using hybrid data mining methods [J]. DEStech Transactions on Social Science,Education and Human Science, 2017 (meit). DOI: https://doi.org/10.12783/dtssehs/meit2017/12893
Ortiz, B. L., Gupta, V., Kumar, R., Jalin, A., Cao, X., Ziegenbein, C., . . . Choi, S. W. (2024). Data preprocessing techniques for AI and machine learning readiness: Scoping review of wearable sensor data in cancer care. JMIR mHealth and uHealth, 12 https://doi.org/10.2196/59587 DOI: https://doi.org/10.2196/59587
Otu, Godwin & Usman, Suleiman & Ugbe, Raphael & Iheagwara, Stephen & Okafor, Akudo & Okonkwo, Fredrick & Shukurah, Oyebanji & Adeniyi, Usman. (2023). COMPARATIVE ANALYSIS OF AGGREGATION AND INHERITANCE STRATEGIES IN INCREMENTAL PROGRAM DEVELOPMENT. FUDMA JOURNAL OF SCIENCES. 7. 57-64. https://doi.org/10.33003/fjs-2023-0702-1710 . DOI: https://doi.org/10.33003/fjs-2023-0702-1710
Prakash, K., & Kalaiarasan, C. (2024). Web services performance prediction with confusion matrix and K-fold cross validation to provide prior service quality characteristics. Journal of Electrical Systems, 20(2), 284-292. Retrieved from https://www.proquest.com/scholarly-journals/web-services-performance-prediction-with/docview/3078054634/se-2 DOI: https://doi.org/10.52783/jes.1139
Qiu J, Tang J, Liu T X, et al. (2016). Modeling and predicting learning behavior in MOOCs[C]//Proceedings of the ninth ACM international conference on web search and data mining. ACM, 2016: 93-102. DOI: https://doi.org/10.1145/2835776.2835842
Sharma, N., Appukutti, S., Garg, U., Mukherjee, J., & Mishra, S. (2023). Analysis of Students academic performance based on their time spent on extra-curricular activities using machine learning techniques. International Journal of Modern Education and Computer Science, 15(1), 46. doi:https://doi.org/10.5815/ijmecs.2023.01.04 DOI: https://doi.org/10.5815/ijmecs.2023.01.04
Song, Q. (2022). Generation and research of online english course learning evaluation model based on genetic algorithm improved neural set network. Computational Intelligence and Neuroscience : CIN, 2022 https://doi.org/10.1155/2022/7281892 DOI: https://doi.org/10.1155/2022/7281892
Swaminathan, Sathyanarayanan & Tantri, B Roopashri. (2024). Confusion Matrix-Based Performance Evaluation Metrics. African Journal of Biomedical Research. 27. 4023-4031. https://doi.org/10.53555/AJBR.v27i4S.4345 . DOI: https://doi.org/10.53555/AJBR.v27i4S.4345
Tang C, Ouyang Y, Rong W, et al. (2018). Time series model for predicting dropout in massive openonline courses[C]//International Conference on Artificial Intelligence in Education. Springer, Cham, 2018: 353-357. DOI: https://doi.org/10.1007/978-3-319-93846-2_66
Tri, N. Q., Tung, N. T., Anh, D. L., Huong Pham, T. V., & Doanh, S. C. (2024). Classification of glucose-level in deionized water using machine learning models and data pre-processing technique. PLoS One, 19(12) https://doi.org/10.1371/journal.pone.0311482 DOI: https://doi.org/10.1371/journal.pone.0311482
Wang W, Yu H, Miao C. (2020). Deep model for dropout prediction in MOOCs [C]//Proceedings of the ICDPAM 2020 Journal of Physics: Conference Series 1774 (2021) 012065 IOP Publishing https://doi.org/10.1088/1742-6596/1774/1/0120657 DOI: https://doi.org/10.1088/1742-6596/1774/1/012065
Xia, X., & Qi, W. (2023). Dropout prediction and decision feedback supported by multi temporal sequences of learning behavior in MOOCs: Revista de universidad y sociedad del conocimiento. International Journal of Educational Technology in Higher Education, 20(1), 32. https://doi.org/10.1186/s41239-023-00400-x DOI: https://doi.org/10.1186/s41239-023-00400-x
Xia, X., & Qi, W. (2024). Driving STEM learning effectiveness: Dropout prediction and intervention in MOOCs based on one novel behavioral data analysis approach. Humanities & Social Sciences Communications, 11(1), 430. https://doi.org/10.1057/s41599-024-02882-0 DOI: https://doi.org/10.1057/s41599-024-02882-0
Xie, Yanxin. (2024). Efficiency of Hybrid Decision Tree Algorithms in Evaluating the Academic Performance of Students. International Journal of Advanced Computer Science and Applications. 15. https://doi.org/10.14569/IJACSA.2024.0150251 . DOI: https://doi.org/10.14569/IJACSA.2024.0150251
Yang, P., Liu, Y., Luo, Y., Wang, Z., & Cai, X. (2024). Text mining and multi-attribute decision-making-based course improvement in massive open online courses. Applied Sciences, 14(9), 3654. https://doi.org/10.3390/app14093654 DOI: https://doi.org/10.3390/app14093654
Zihan, Song & Sung, Sang-Ha & Park, Do-Myung & Park, Byung-Kwon. (2023). All-Year Dropout Prediction Modeling and Analysis for University Students. Applied Sciences. 13. 1143. https://doi.org/10.3390/app13021143 DOI: https://doi.org/10.3390/app13021143
Copyright (c) 2024 FUDMA JOURNAL OF SCIENCES
This work is licensed under a Creative Commons Attribution 4.0 International License.
FUDMA Journal of Sciences