COMPARATIVE ANALYSIS OF DATA MINING TECHNIQUES FOR MOVIE PREDICTION

  • Ayokunle A. Omotunde
  • Gbenga Ogunsanwo TASUED
  • Olubukola Adekola
  • Aaron Izang
  • Samuel B. Abel
Keywords: Comparative Analysis, Movie, Support Vector Machine (SVM), Gradient Boosting Algorithm

Abstract

The rate at which movies are being produced is increasing at exponential rates and it has become pertinent to ascertain success rate since the investment that goes into these movie creation runs in millions of dollars. A number of data mining-based methods, ranging from Support Vector Machine (SVM) to logistic regression, have been proposed with a varying level of success with SVM showing the most promising results. This paper aims to carry out a comparative analysis of the performance of Gradient Boosting and SVM algorithms in optimizing the prediction of movie success. The study developed a framework for the research methodology; the dataset used contained 33 movie attributes and 838 entries. The dataset was cleaned with six attributes; features were identified and selected from the datasets using four methods. These methods include: Analysis of Variance (ANOVA), Lasso Regularization, Combination of Lasso Regularization and Random Forest (RF). Model Formulation were done using Support Vector Machine (SVM) and Gradient Boosting Algorithm and the performance evaluation of the developed predictive models was done using accuracy, precision and recall values. The results shows that the accuracy of the Gradient Boosting algorithm is around 100%, SVM-Linear is 86 %, SVM-Poly is 88%, SVM-RBF is 88% and SVM-Sigmoid is 72%. The study concluded that Gradient Boosting algorithm is more robust in predicting movie success. Also recommended that comparison should be done with different machine learning techniques.

References

Saraee, M., White, S., &Eccleston, J. (2004). A data mining approach to analysis and prediction of movie ratings. In A. Zanasi, N. F. Ebecken, & C. A. Brebbia (Eds.), Data Mining V: Data Mining, Text mining and their Business Applications. UK: WIT Press / Computational Mechanics.

Zheng, W., &Skiena, S. (2009). Improving Movie Gross Prediction through News Analysis. International Conference on Web Intelligence and Intelligent Technology (pp. 301 - 304). Department of Computer Science Stoney Brook University.

Mestya'n, M., &Yasseri, T. K. (2013). Early Prediction of Movie Box Office Success Based on Wikipedia Activity Big Data. PLoS ONE. doi:e71226.doi:10.1371/journal.pone.0071226

Dwi, R., Rosyida, I., &dan-Santi, W. P. (2019). Predicting Popularity of Movie using Support Vector Machines. INFERENSI, 2(1), 13 -17.

Miryala, G., Gomes, R., & Dayananda, K. R. (2017). Comparative Analysis of movie recommendation System using collaborative filtering in Spark Engine. Journal of Global Research in Computer Science, 8(1).

Mahesh, J., Dipanjan, D., Kevin, G., & Noah, A. S. (2010). Movie Reviews and Revenues: An Experiment in Text Regression. Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the ACL (pp. 293 - 296). Los Angeles: Association for Computational Linguistics.

Nithin, V. R., Pranav, M., Sarath, B., &Lijiya, P. B. (2014). A Predicting Movie success based on IMDB data. International Journal of data mining and techniques, 243-254.

Komal, G., Dhiral, S., Nirav, W., Mitul, S., &Ramanand, Y. (2018). Movie Success Prediction. IOSR Journla of Engineering, 6, 66-69.

Meenakshi, K., Maragatham, G., Agarwal, N., & Ghosh, I. (2018). A Data mining Technique for analysing and predicting the success of a movie. National Conference on Mathematical Techniques and its Applications (NCMTA18).

Muhammad, L., & Afzal, H. (2016). Predicting of Movies Popularity Using Machine Learning. International Journal of Computer Science and Network Security, 16(8), 127-131

Published
2023-01-10
How to Cite
OmotundeA. A., OgunsanwoG., AdekolaO., IzangA., & AbelS. B. (2023). COMPARATIVE ANALYSIS OF DATA MINING TECHNIQUES FOR MOVIE PREDICTION. FUDMA JOURNAL OF SCIENCES, 6(6), 224 - 228. https://doi.org/10.33003/fjs-2022-0606-1117