ENHANCED SMS SPAM DETECTION USING BERNOULLI NAIVE BAYES WITH TF-IDF

Authors

  • Abdullahi Burhanuddeen Ahmed Bayero University Kano
  • Khalid Haruna Bayero University Kano

DOI:

https://doi.org/10.33003/fjs-2025-0901-3226

Keywords:

SMS Spam Detection, TF-IDF, Bernoulli Naïve Bayes, Machine Learning, Text Classification, Feature Extraction

Abstract

The use of mobile text messaging for communication is increasingly widespread, with Short Message Service (SMS) experiencing significant growth over the last decade. Consequently, the increase in SMS usage has led to a concerning rise in SMS spam, presenting substantial challenges for users and service providers. This study proposes a novel method for detecting SMS spam by combining Term Frequency-Inverse Document Frequency (TF-IDF) with Bernoulli Naïve Bayes (BNB) algorithm. The approach employs the use of TF-IDF for comprehensive feature extraction and the classification capabilities of the Bernoulli Naïve Bayes Algorithm. Through experimental validation employing TF-IDF for feature extraction and the BNB algorithm for classification, the results demonstrate high accuracy (98.36%), precision (99.19%), and a notable Matthews Correlation Coefficient (MCC) of 0.93, showcasing superior model performance compared to existing benchmarks.  Likewise, the proposed model shows efficient processing time (0.22 seconds). By combining strengths of TF-IDF and BNB, the approach offers effective SMS spam detection, surpassing the performance of traditional and deep learning classifiers. This research contributes valuable insights towards enhancing SMS security, thereby increasing trust between users and service providers.

References

Apha, (2022). Standard methods for examination of water and wastewater (24th edition). Washington; American public health association.

Cima C. A., Abdellaoui Y., Abatal M., et al. Eco-efficient biosorbent based on leucaena leucocephala residues for the simultaneous removal of Pb (II) and Cd (II) ions from water system: sorption and mechanism. Bioinorganic Chemistry and Applications. 2019; 2019:13. doi: 10.1155/2019/2814047.2814047

Fatimah O.O; Tukura B.W; Madu P.C. (2018). Assessment of heavy metal pollution of soils from farms in the vicinity of Durumi Quarry site in Mpape, Abuja Nigeria. DOI: http://dx.doi.org/10.12944/CWE.13.3.09

Haider, A.; Haider, S.; Inn-Kyu, K. (2018). A comprehensive review summarizing the effect of electrospinning parameters and potential applications of nanofibres in biomedical and biotechnology. Arabian Journal of Chemistry.Volume 11, Issue 8, December 2018, pages 1165 -1188.

Merriam – Webster Dictionary, (2016). Word definition of Tremor. In Merriam-Webster collegiate dictionary (11th ed.). Merriam-Webster, Inc.

National Space Research and Development Agency, NASRDA, (2018). Preliminary Investigation on 2018 Abuja Earthquake. Premiumtimesng.com.

Ofonime U.A.; Tahir A.Y. (2010). A review of earthquake occurrences and observations in Nigeria. Centre for Geodesy and Geodynamics, Toro.

Unamba, C.L., Chika, E., Isu, N. R. (2016). Physiochemical and bacteriological assessment of some borehole waters in the Federal Capital Territory, Nigeria. The international Research Journal of Public and Environment Health, Vol. 3(6), 140-145. https://dx.doi.org/10.15739/irjpeh. 16.018

United States Geological Survey Agency, (2003). Earthquakes and their effects on groundwater. https://wwwusgs.gov>earthquake

United States Geological Survey Agency, (2018). Seismic disturbance of groundwater level in well. https://www.usgs.gov>faqs.

Wang, C. Y.; Cheng, L. H.; Chin, C. V.; Yu, S. B. (2001). Co-seismic hydrologic response of an alluvial fan to the 1999 Chi-Chi earthquake, Taiwan. Geology 29 (9), 831 - 834.

Wang, W.; Lai, G.; Hongkui, G.; Emily, E. Brodsky.; Fuqiong Huang, (2014). Tidal response variation and recovery following the Wenchuan earthquake from water level data of multiple wells in the nearfield. Volume 619 – 620, 21 April 2014, pages 115 – 122.

Watson, John; Watson, Kathie (2009). Volcanoes and Earthquake. United States Geological Survey Agency. Retrieved May 9, 2009.

Published

2025-04-16

How to Cite

Ahmed, A. B., & Haruna, K. (2025). ENHANCED SMS SPAM DETECTION USING BERNOULLI NAIVE BAYES WITH TF-IDF. FUDMA JOURNAL OF SCIENCES, 9(1), 393 - 399. https://doi.org/10.33003/fjs-2025-0901-3226